Counterexample-Driven Development Wiki

Overview

Counterexample-driven development is a processor-development workflow in which model-based testing produces reduced counterexamples that guide implementation and debugging. In the TestRIG paper, the authors state that TestRIG’s model-based testing leads to this workflow and describe it as an advancement over conventional test-driven development in processor design. [C1]

Relationship to TestRIG

The concept is explicitly tied to TestRIG: counterexample-driven development using TestRIG automatically provides reduced stimulus for basic features and can carry development through advanced interactions. The same paper concludes that TestRIG accelerates development at all stages and provides a tighter debugging loop than the authors had experienced in other processor-development paradigms. [C1][C2]

Contrast with test-driven development

The paper contrasts counterexample-driven development with typical test-driven development for processor design. In conventional test-driven development, a basic working design is needed before architectural unit tests can be used. With TestRIG, reduced counterexamples can be generated early enough to help with basic features and can remain useful as designs reach more advanced interactions. [C1]

Debugging example: overlapping memory operations

The TestRIG paper gives a debugging example in which a generator constructed addresses within the TestRIG memory range and produced random loads and stores. It discovered a bug after 42 tests and 20 rounds of shrinking. The reduced counterexample ultimately contained only three memory operations: two loads with a single store between them, all to overlapping addresses. The counterexample was found less than 10 seconds into the TestRIG run, and the bug was fixed within the hour. [C3]

The authors report that this bug had escaped the Flute processor’s development process, had not been found by the RISC-V unit-test suite, and was overwhelmingly difficult to debug from a full software trace. In contrast, it was trivial to resolve with the TestRIG counterexample. [C4]

Example: CHERI extension to Ibex

The paper identifies the CHERI extension to Ibex as a striking example of counterexample-driven development. After Ibex was extended with RVFI-DII support, a summer intern independently added full CHERI functionality to Ibex in a month; the paper attributes this to the tight cycle of reduced counterexamples provided by QCVEngine. [C5]

Significance

Within the available evidence, the significance of counterexample-driven development is its use of reduced counterexamples as short, actionable stimuli for processor work. This helps create a tight debugging loop, supports development from basic functionality to advanced interactions, and can reduce the burden of diagnosing failures from full traces. [C1][C2][C4]