UCB1 Algorithm
TechniqueUCB1 is an Upper Confidence Bound multi-armed bandit algorithm used to select among candidate virtual sequences by balancing exploitation of sequences with high observed reward and exploration of less-used sequences. In the cited UVM-based RISC-V verification flow, UCB1 selects one virtual sequence per trial, updates rewards from functional coverage data, and can reach the same coverage as random sequence selection in substantially fewer trials.
WIKI
Overview
The UCB1 Algorithm is described in the provided evidence as an Upper Confidence Bound multi-armed bandit algorithm used to orchestrate simulation tests. In the verification flow, the available virtual sequences are treated like bandit "arms" or slot machines, and the algorithm selects which sequence to run based on rewards observed in previous trials. Its goal is to maximize and speed up functional coverage while still trying sequences that have not been used often enough to rule out high payoff potential.
Role in a MAB-based verification flow
NEIGHBORHOOD
No graph connections found for this entity yet. It may appear in future ingestion runs.
explore full graph →