Bytecode

At each cycle of the RISC-V virtual machine, the current instruction (as indicated by the program counter) is "fetched" from the bytecode and decoded. In Jolt, this is proven by treating the bytecode as a lookup table, and fetches as lookups. To prove the correctness of these lookups, we use the Shout lookup argument.

One distinguishing feature of the bytecode Shout instance is that we have multiple instances of the read-checking and $raf$-evaluation sumchecks. Intuitively, the bytecode serves as the "ground truth" of what's being executed, so one would expect many virtual polynomial claims to eventually lead back to the bytecode. This holds in practice: in the Jolt sumcheck DAG diagram, five stages of read-checking claims point to the bytecode read-checking node, and two $raf$-evaluation claims are folded into stages 1 and 3.

Each stage has its own unique opening point, so having in-edges of different colors implies that multiple instances of that sumcheck must be run in parallel to prove the different claims.

Read-checking

Another distinguishing feature of the bytecode Shout instance is that we treat each entry of the lookup table as containing a tuple of values, rather than a single value. Intuitively, this is because each instruction in the bytecode encodes multiple pieces of information: opcode, operands, etc.

The figure below loosely depicts the relationship between the bytecode and the $Val$ polynomial.

[Figure: relationship between the bytecode and the $Val$ polynomial]

We start from some ELF file, compiled from the guest program. For each instruction in the ELF (raw bytes), we decode/preprocess the instruction into a structured format containing the individual witness values used in Jolt:

The instruction operands rs1, rs2, rd, imm
Circuit and lookup table flags
The instruction address

Then, we compute a Reed-Solomon fingerprint of some subset of the values in the tuple, depending on what $rv$ claims are being proven. These fingerprints serve as the coefficients of the $Val$ polynomial for that read-checking instance.
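As a rough illustration of the fingerprinting step, the sketch below combines each decoded row into a single field element using powers of a challenge, in the same shape as the stage formulas later in this section. The modulus `P`, the row layout, and the names `fingerprint` and `rows` are assumptions made for this example; they are not Jolt's field or data structures.

```python
P = 2**61 - 1  # toy prime modulus (assumption; not the field Jolt uses)

def fingerprint(values, beta):
    """Return sum_i beta^i * values[i] mod P (a random linear combination)."""
    acc, power = 0, 1
    for v in values:
        acc = (acc + power * v) % P
        power = (power * beta) % P
    return acc

# Hypothetical decoded rows: (unexpanded_pc, imm, flag_0, flag_1).
rows = [
    (0x80000000, 4, 1, 0),
    (0x80000004, 0, 0, 1),
    (0x80000004, 7, 1, 1),
]

beta = 12345  # stands in for a verifier challenge
# One fingerprint per bytecode row: these play the role of Val's coefficients.
val_coeffs = [fingerprint(row, beta) for row in rows]
```

Choosing a different subset of tuple entries (or a different challenge) gives a different set of coefficients, which is why each read-checking instance has its own $Val$ polynomial.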

$raf$-evaluation

$raf$-evaluation claims for the program counter are folded into the multi-stage read-checking sumcheck rather than being handled separately. There are two $raf$ claims:

raf claim 1: From the Spartan "outer" sumcheck, folded into Stage 1 with weight $γ^{5}$
raf claim 3: From the Spartan "shift" sumcheck, folded into Stage 3 with weight $γ^{6}$

The $raf$ polynomial in the context of bytecode is the expanded program counter ($PC$), which maps each cycle to the index of the bytecode row being executed. This is distinct from $UnexpandedPC$, which is the ELF/memory address.

Batching read-checking and $raf$-evaluation together

The bytecode read-checking sumcheck combines claims from five different sumcheck stages into a single batched sumcheck, using two levels of random linear combinations (RLCs):

Stage-level RLC (using $γ$ ): Combines the five stages together
Per-stage RLC (using $β_{s}$ ): Combines multiple claims within each stage

The five stages are:

Stage 1: Spartan outer sumcheck claims (program counter, immediate values, circuit flags)
Stage 2: Product virtualization claims (jump/branch flags, register write flags)
Stage 3: Shift sumcheck claims (instruction operand flags, virtual instruction metadata)
Stage 4: Register read-write checking claims (register addresses)
Stage 5: Register value evaluation and instruction lookup claims (register addresses, lookup table flags)

Additionally, bytecode $raf$ claims for the program counter are folded into stages 1 and 3.

Combined Sumcheck Expression

The overall sumcheck proves the following identity:

$\sum_{s = 1}^{5} γ^{s - 1} \cdot rv_{s} (r_{s}) + γ^{5} \cdot raf_{1} (r_{1}) + γ^{6} \cdot raf_{3} (r_{3}) = \sum_{j, k} ra (k, j) \cdot \left[ \sum_{s = 1}^{5} γ^{s - 1} \cdot eq_{s} (r_{s}, j) \cdot Val_{s} (k) + γ^{5} \cdot eq_{1} (r_{1}, j) \cdot Int (k) + γ^{6} \cdot eq_{3} (r_{3}, j) \cdot Int (k) \right]$

where:

$k$ ranges over bytecode indices (address space)
$j$ ranges over cycle indices (time dimension)
$ra (k, j)$ is the read access polynomial indicating cycle $j$ accesses bytecode row $k$
$Val_{s} (k)$ is the stage-specific value polynomial encoding instruction data
$Int (k)$ is the identity polynomial that converts a bytecode index from binary form $k \in \{0, 1\}^{\log K}$ to the corresponding field element $Int (k) \in \mathbb{F}$ (used for $raf$ claims)
$γ$ is the stage-folding challenge
$β_{s}$ challenges are used within each $Val_{s} (k)$ to combine multiple sub-claims

Note that the $raf$ claims treat the expanded program counter as $ra$ , not UnexpandedPC.
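To make the batched identity concrete, the following sketch evaluates both sides on a tiny made-up trace and checks that they agree. Everything here is an assumption chosen for illustration: the toy modulus `P`, the sizes `K` and `T`, the random `val` tables, and the helper `eq_table`. It only checks the algebraic identity by brute force and does not run a sumcheck.

```python
import random

P = 2**61 - 1  # toy prime modulus (assumption)

def eq_table(r):
    """Evaluations of eq(r, j) for all j in {0,1}^len(r); r[0] pairs with the low bit of j."""
    table = [1]
    for r_i in r:
        table = [t * (1 - r_i) % P for t in table] + [t * r_i % P for t in table]
    return table

rng = random.Random(0)
K, T, STAGES = 4, 8, 5
pc = [0, 1, 2, 1, 2, 3, 0, 3]  # PC(j): bytecode row read at cycle j
val = [[rng.randrange(P) for _ in range(K)] for _ in range(STAGES)]  # Val_s(k)
eq = [eq_table([rng.randrange(P) for _ in range(3)]) for _ in range(STAGES)]  # eq_s(r_s, j)
gamma = rng.randrange(P)

def rv(s):
    """Read value claim for stage index s (0-based)."""
    return sum(eq[s][j] * val[s][pc[j]] for j in range(T)) % P

def raf(s):
    """Read address claim: the PC evaluated at stage s's opening point."""
    return sum(eq[s][j] * pc[j] for j in range(T)) % P

# Left-hand side: stage claims weighted by gamma^0..gamma^4, raf claims by gamma^5, gamma^6.
lhs = (sum(pow(gamma, s, P) * rv(s) for s in range(STAGES))
       + pow(gamma, 5, P) * raf(0) + pow(gamma, 6, P) * raf(2)) % P

# Right-hand side: double sum over cycles j and bytecode rows k.
rhs = 0
for j in range(T):
    for k in range(K):
        ra = 1 if pc[j] == k else 0  # one-hot read access indicator
        inner = sum(pow(gamma, s, P) * eq[s][j] * val[s][k] for s in range(STAGES))
        inner += pow(gamma, 5, P) * eq[0][j] * k + pow(gamma, 6, P) * eq[2][j] * k  # Int(k) = k
        rhs = (rhs + ra * inner) % P
```

In this sketch `lhs == rhs` holds because `ra` selects exactly the row `pc[j]` at each cycle, so the double sum collapses to the per-stage claims.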

Stage $Val$ Polynomials

Each of the five stages has a corresponding $Val_{s} (k)$ polynomial that encodes different instruction properties. Using per-stage challenges $β_{s}$, these are defined as:

Stage 1 (Spartan outer sumcheck): $Val_{1} (k) = \text{unexpanded\_pc} (k) + β_{1} \cdot \text{imm} (k) + \sum_{t} β_{1}^{2 + t} \cdot \text{circuit\_flag}_{t} (k)$

Stage 2 (Product virtualization): $Val_{2} (k) = \text{jump\_flag} (k) + β_{2} \cdot \text{branch\_flag} (k) + β_{2}^{2} \cdot \text{is\_rd\_not\_zero} (k) + β_{2}^{3} \cdot \text{write\_lookup\_to\_rd} (k)$

Stage 3 (Shift sumcheck): $Val_{3} (k) = \text{imm} (k) + β_{3} \cdot \text{unexpanded\_pc} (k) + β_{3}^{2} \cdot \text{left\_is\_rs1} (k) + β_{3}^{3} \cdot \text{left\_is\_pc} (k) + β_{3}^{4} \cdot \text{right\_is\_rs2} (k) + β_{3}^{5} \cdot \text{right\_is\_imm} (k) + β_{3}^{6} \cdot \text{is\_noop} (k) + β_{3}^{7} \cdot \text{virtual\_instruction} (k) + β_{3}^{8} \cdot \text{is\_first\_in\_sequence} (k)$

Stage 4 (Register read-write checking): $Val_{4} (k) = eq (rd [k], r_{register}) + β_{4} \cdot eq (rs1 [k], r_{register}) + β_{4}^{2} \cdot eq (rs2 [k], r_{register})$

Stage 5 (Register value evaluation and instruction lookups): $Val_{5} (k) = eq (rd [k], r_{register}) + β_{5} \cdot \text{not\_interleaved} (k) + \sum_{i} β_{5}^{2 + i} \cdot \text{lookup\_table\_flag}_{i} (k)$

where:

$\text{unexpanded\_pc} (k)$ is the instruction's ELF/memory address (not the bytecode index $k$)
$eq (rd [k], r_{register})$ equals 1 if instruction $k$ has destination register $rd [k] = r_{register}$, and 0 otherwise (similarly for $rs1$ and $rs2$)
Various boolean flags indicate instruction properties

Instruction address

Each instruction in the bytecode has two associated "addresses":

Bytecode index $k$: its index in the expanded bytecode. "Expanded" bytecode refers to the preprocessed bytecode, after instructions are expanded to their virtual sequences. This is what the $ra (k, j)$ polynomial uses to indicate which bytecode row is accessed.
ELF/memory address $\text{unexpanded\_pc} (k)$: its memory address as given by the ELF. All the instructions in a virtual sequence are assigned the address of the "real" instruction they were expanded from. This is stored as part of the instruction data in bytecode row $k$.

The bytecode index $k$ is used for addressing within the sumcheck (the $k$ variable in the double sum). The ELF address $\text{unexpanded\_pc} (k)$ is used to enforce program counter updates in the R1CS constraints, and is treated as a part of the tuple of values in the preprocessed bytecode.

The "outer" and shift sumchecks in Spartan output claims about the virtual UnexpandedPC polynomial, which corresponds to the ELF address. These claims are proven using bytecode read-checking (specifically, they appear in the Stage 1 and Stage 3 value polynomials).
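The distinction between the two addresses can be sketched with a toy expansion step. The function `expand`, the instruction names, and the addresses below are hypothetical and chosen only to show how several bytecode rows can share one ELF address; the sketch ignores any other details of Jolt's preprocessing.

```python
def expand(program):
    """Flatten (elf_address, virtual_sequence) pairs into bytecode rows.

    Every row produced from the same real instruction keeps that
    instruction's ELF address as its unexpanded_pc.
    """
    rows = []
    for address, sequence in program:
        for name in sequence:
            rows.append({"unexpanded_pc": address, "op": name})
    return rows

# Hypothetical program: the middle instruction expands to a 3-step virtual sequence.
program = [
    (0x1000, ["ADD"]),
    (0x1004, ["VIRT_A", "VIRT_B", "VIRT_C"]),
    (0x1008, ["SUB"]),
]

bytecode = expand(program)
# The bytecode index k is simply the position of a row in the expanded list.
addresses_by_index = [row["unexpanded_pc"] for row in bytecode]
```

Here rows 1, 2, and 3 all carry the address `0x1004`, while their bytecode indices remain distinct.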

Flags

There are two types of Boolean flags used in Jolt:

Circuit flags, used in R1CS constraints
Lookup table flags, used in the instruction execution Shout

The associated flags for a given instruction in the bytecode can be computed a priori (i.e. in preprocessing), so any claims about these flags arising from Spartan or instruction execution Shout are also proven using bytecode read-checking. Circuit flags appear in Stages 1, 2, and 3, while lookup table flags appear in Stage 5.

Sumcheck Structure and Binding Order

The bytecode read-checking sumcheck proceeds in two phases:

Phase 1: Address Variables (first $\log K$ rounds)

In the first $\log K$ rounds, address variables are bound in low-to-high order. During this phase:

The $Val_{s} (k)$ polynomials for each stage are bound, eventually reducing to scalar values
Intermediate "F" polynomials are computed: $F_{s} [k] = \sum_{j : PC (j) = k} eq_{s} (r_{s}, j)$ , representing the weighted frequency that each bytecode row is accessed
These F polynomials are also bound during the address phase

The sumcheck univariate in this phase has degree 2 (quadratic).
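The role of the F polynomials can be illustrated with a small sketch: summing $eq_{s}$ over the cycles that read each row gives a table over bytecode rows, and the read value claim becomes a sum of products of two tables indexed by $k$, which is consistent with the degree-2 univariate in this phase. The modulus, the trace `pc`, and the helper names are assumptions for this example only.

```python
P = 2**61 - 1  # toy prime modulus (assumption)

def eq_table(r):
    """Evaluations of eq(r, j) for all j in {0,1}^len(r); r[0] pairs with the low bit of j."""
    table = [1]
    for r_i in r:
        table = [t * (1 - r_i) % P for t in table] + [t * r_i % P for t in table]
    return table

def frequency_table(pc, eq_evals, num_rows):
    """F[k] = sum of eq(r, j) over the cycles j with PC(j) = k."""
    F = [0] * num_rows
    for j, k in enumerate(pc):
        F[k] = (F[k] + eq_evals[j]) % P
    return F

pc = [0, 1, 2, 1, 2, 3, 0, 3]    # hypothetical trace over 8 cycles
val = [10, 20, 30, 40]           # hypothetical Val(k) for 4 bytecode rows
eq_evals = eq_table([3, 5, 7])   # stand-in for eq_s(r_s, j)

F = frequency_table(pc, eq_evals, num_rows=4)

# The claim computed through F, summing over rows only...
via_f = sum(F[k] * val[k] for k in range(4)) % P
# ...matches the claim computed directly, summing over cycles.
direct = sum(eq_evals[j] * val[pc[j]] for j in range(8)) % P
```

Computing `F` takes one pass over the trace, after which the address phase only touches tables of size $K$.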

Phase 2: Cycle Variables (final $\log T$ rounds)

In the final $\log T$ rounds, cycle variables are bound, also in low-to-high order. During this phase:

The $ra (k, j)$ polynomial is computed as a product of $d$ chunked one-hot polynomials: $ra (k, j) = \prod_{i = 0}^{d - 1} ra_{i} (k_{i}, j)$
Each stage uses a GruenSplitEqPolynomial to efficiently handle the per-stage $eq_{s}$ evaluations
The bound $Val_{s}$ and $Int$ values from Phase 1 are used as coefficients

The sumcheck univariate in this phase has degree $d + 1$.

The chunking parameter $d$ is chosen based on the bytecode size to balance prover time and proof size. Larger $d$ reduces the size of each committed $ra_{i}$ polynomial, but increases how many of them there are and the degree (and thus the cost per round) of the sumcheck.
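The chunked decomposition of $ra$ can be sketched on plain integers: split an address into $d$ chunks, one-hot encode each chunk, and multiply the per-chunk indicators. The helper names, the chunk ordering (lowest chunk first), and the parameters `d = 2`, `chunk_bits = 4` are assumptions for illustration.

```python
def chunks(k, d, chunk_bits):
    """Split index k into d chunks of chunk_bits bits each, lowest chunk first."""
    mask = (1 << chunk_bits) - 1
    return [(k >> (i * chunk_bits)) & mask for i in range(d)]

def one_hot(value, size):
    """Length-`size` vector with a single 1 at position `value`."""
    return [1 if i == value else 0 for i in range(size)]

def ra(k, accessed, d, chunk_bits):
    """Product of per-chunk one-hot indicators: 1 iff k equals the accessed row."""
    size = 1 << chunk_bits
    product = 1
    for k_i, a_i in zip(chunks(k, d, chunk_bits), chunks(accessed, d, chunk_bits)):
        product *= one_hot(a_i, size)[k_i]
    return product

# With K = 256 rows, d = 2 and 4-bit chunks, each cycle needs two one-hot
# vectors of length 16 instead of one vector of length 256.
accessed_row = 0xA7
indicator = [ra(k, accessed_row, d=2, chunk_bits=4) for k in range(256)]
```

The product has one factor per chunk, which is where the dependence of the sumcheck degree on $d$ comes from.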

One-hot checks

Jolt enforces that the $ra_{i}$ polynomials used for bytecode Shout are one-hot, using Booleanity and Hamming weight sumchecks as described in the Twist and Shout paper. The implementations follow the paper closely, with no notable deviations.
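The two properties being enforced can be stated directly on a single column of values. The sketch below checks them entry by entry on plain integers, which is only meant to show what "one-hot" requires; the protocol itself proves these properties with sumchecks rather than by inspecting entries, and the function names are hypothetical.

```python
def is_boolean(column):
    """Booleanity: every entry b satisfies b * (b - 1) == 0, i.e. b is 0 or 1."""
    return all(b * (b - 1) == 0 for b in column)

def has_hamming_weight_one(column):
    """Hamming weight: the entries sum to exactly 1."""
    return sum(column) == 1

def is_one_hot(column):
    """A column is one-hot when both checks pass."""
    return is_boolean(column) and has_hamming_weight_one(column)

valid = is_one_hot([0, 1, 0, 0])
# Sums to 1 but contains non-Boolean entries, so Booleanity rejects it.
bad_entries = is_one_hot([0, 2, -1, 0])
# All entries are Boolean but two are set, so the Hamming weight check rejects it.
bad_weight = is_one_hot([1, 1, 0, 0])
```

Neither check suffices alone: the second example passes the weight check and the third passes Booleanity, yet neither is one-hot.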
