VCS RACE Solver Wiki

Overview

VCS RACE Solver is identified in the source material as the default RACE solver used in a VCS constraint-solver workflow for generating AMD microcode stimuli and x86 opcodes. The article compares RACE with a BDD solver while evaluating different constrained-random opcode-generation architectures. [C1]

Use in constrained-random opcode generation

The cited study evaluates opcode generation approaches for the complex x86 instruction set. It reports that a simple constrained-random approach improved distribution compared with serial randomization, but the complexity of the x86 instruction set caused speed and memory limits that reduced simulation performance. [C2]

The study then describes an architecture that first chooses an opcode category before randomizing the instruction. This reduced the set of active variables and constraints because only constraints specific to the selected opcode category were present. [C3]

Performance characteristics

In runtime comparisons between a single-class and a multiple-class implementation, the multiple-class architecture was faster with both evaluated solvers and both opcodes. For the default RACE solver, the multiple-class approach showed a 4x speedup. [C4]

The article attributes the runtime and memory improvements primarily to the smaller set of variables and constraints in the new implementation. Profile data showed that the new implementation had 7x fewer constraints than the original, allowing the solver to calculate solutions more efficiently. [C5]

Memory behavior

The memory comparison in the article focuses on the BDD solver, because the memory consumed by RACE is described as typically smaller and not a limiting factor. [C6]

By contrast, the BDD solver elaborates the full solution space of a randomize call before selecting a solution; this can require large amounts of memory and time, although the solution space is cached to accelerate subsequent randomization calls. [C7]

Profiling and testcase extraction context

The VCS constraint profile views shown in the source include cumulative randomize CPU runtime, individual randomize CPU runtime, individual partition CPU runtime, and memory data. The article also states that VCS 2009.12 provided a testcase extraction feature to automatically extract the slowest partition from each randomize call. [C8]

Practical implication from the evidence

For the evaluated x86 opcode-generation workload, RACE benefited from reducing the constraint problem size through opcode-category partitioning. The evidence supports the conclusion that architectural partitioning of constrained-random generation can materially improve RACE runtime without sacrificing distribution or test-level control. [C9]