Dory

Dory is the polynomial commitment scheme used in Jolt. It is based on the scheme described in Lee21 and implemented in the a16z/dory repository.

Background: AFGHO commitments

Dory uses a pairing-inner-product commitment to vectors of group elements, based on the structure-preserving commitment scheme of Abe-Fuchsbauer-Groth-Haralambiev-Ohkubo, 2010 (AFGHO).

Work over bilinear groups $(G_{1}, G_{2}, G_{T}, e)$ , and write the group operations additively. Given a vector of source-group elements $x = (x_{1}, \dots, x_{n}) \in G_{1}^{n}$ and public generators $Γ_{2} = (g_{2}^{(1)}, \dots, g_{2}^{(n)}) \in G_{2}^{n}$ , the core commitment is the pairing inner product:

$C = ⟨ x, Γ_{2} ⟩ = i = 1 \sum n e (x_{i}, g_{2}^{(i)}) \in G_{T}$

The construction is symmetric: vectors in $G_{2}^{n}$ can be committed with generators in $G_{1}^{n}$ . A hiding version additionally adds a random multiple of a fixed target-group element, for example $r \cdot e (H_{1}, H_{2})$ .

The important property for Dory is additive homomorphism. Since pairing is bilinear, commitments to source-group vectors can be linearly combined in $G_{T}$ :

$⟨ a x + b y, Γ_{2} ⟩ = a \cdot ⟨ x, Γ_{2} ⟩ + b \cdot ⟨ y, Γ_{2} ⟩$

For scalar polynomial coefficients, Dory first commits each row of coefficients into a $G_{1}$ element. Those row commitments then form the source-group vector that is committed with the AFGHO pairing-inner-product commitment. This is the two-tier structure that gives Dory both small commitments and efficient batched openings.

How Dory works

Matrix layout

Dory views a multilinear polynomial with $N = 2^{n}$ coefficients as a $2^{ν} \times 2^{σ}$ matrix, where $ν + σ = n$ . Concretely, the coefficient vector is arranged into rows of length $2^{σ}$ , and the commitment proceeds in two tiers:

Tier 1 (row commitments): For each row $i$ , compute a $G_{1}$ element:

$C_{1}^{(i)} = j = 1 \sum 2^{σ} v_{i, j} \cdot g_{1}^{(j)}$

Tier 2 (final commitment): Combine the row commitments with $G_{2}$ generators via pairing:

$C = i = 1 \sum 2^{ν} e (C_{1}^{(i)}, g_{2}^{(i)}) \in G_{T}$

The final commitment is a single $G_{T}$ element. The tier-1 row commitments are retained as a hint for the opening proof.

Opening proofs

To prove that a committed polynomial evaluates to a claimed value $y$ at a point $r \in F^{n}$ , Dory runs a reduction protocol. The point $r$ is split into "row" and "column" components according to the matrix layout, and an inner-product argument (derived from the AFGHO structure) is used to prove the claimed evaluation. The proof has $O (lo g n)$ group elements and can be verified with a constant number of pairings.

Setup

Dory requires a universal reference string (URS) consisting of generators in $G_{1}$ and $G_{2}$ . This URS is transparent: it is generated deterministically from a seed (using a hash-based PRG) with no trusted setup ceremony. Crucially, the URS has sublinear size in the polynomial length. Specifically, for a polynomial of length $N = 2^{n}$ , the URS contains $O (2^{n /2}) = O (N)$ generators rather than $O (N)$ , because the two-tier structure only needs generators for rows and columns independently.

Why Dory?

Jolt's Twist and Shout protocol requires the prover to commit to one-hot polynomials: vectors over ${0, 1}^{K^{1/ c} \cdot T}$ with at most one nonzero entry per block of $K^{1/ c}$ consecutive entries. These arise from representing memory-access addresses in one-hot form (see Twist and Shout: one-hot polynomials).

These polynomials have two special properties that a PCS should exploit:

Boolean coefficients. Every coefficient is either 0 or 1. A PCS that charges "per field element" wastes work: each 254-bit field multiplication is doing the job of a single bit.
Extreme sparsity. Out of $K \cdot T$ coefficients, at most $T$ are nonzero.

Dory is well-suited to Jolt because of three properties:

Sublinear key size

For a polynomial of length $N$ , Dory's URS contains $O (N)$ group elements, compared to $O (N)$ for schemes like HyperKZG. This matters in Jolt because the one-hot polynomials can be very long ( $K^{1/ c} \cdot T$ coefficients, where $K$ is the address-space size and $T$ is the number of execution cycles).

Pay-per-bit commitment costs

In the tier-1 step, each row commitment is computed via a multi-scalar multiplication (MSM). Dory (as implemented in Jolt) uses a SmallScalar trait that dispatches to specialized MSM routines for small coefficient types (booleans, u8, u16, etc.). When the coefficients are Boolean, the MSM reduces to a subset sum of generators — no scalar multiplications are needed at all. For small integer coefficients (e.g. u8), the MSM uses windowed methods with windows as small as 1--8 bits, rather than the 254-bit windows needed for full field elements.

The result is that committing to a polynomial whose coefficients are $b$ -bit integers costs roughly $b /254$ times as much as committing to the same-length polynomial with arbitrary field-element coefficients. We call this pay-per-bit commitment cost.

Efficient one-hot commitment

For one-hot polynomials specifically, Jolt further exploits the sparsity structure. Rather than running a full MSM over each row (most of whose entries are zero), the prover groups the nonzero indices by address and uses batch $G_{1}$ additions. Since each execution cycle contributes exactly one nonzero entry across all $K$ addresses, the cost of committing to a one-hot polynomial of length $K^{1/ c} \cdot T$ is proportional to $T$ group additions rather than $K^{1/ c} \cdot T$ MSM operations.

Additive homomorphism

Because Dory commitments live in $G_{T}$ and are additively homomorphic, Jolt can batch-open many committed polynomials at a common point by taking a random linear combination (RLC) of the commitments. The verifier combines commitments (which are cheap $G_{T}$ operations), and the prover combines the underlying polynomials and produces a single opening proof. This is used throughout Jolt's batched opening proof to amortize the cost of opening dozens of committed polynomials.

Streaming commitment

In Jolt, witness polynomials can be committed in a streaming fashion: rather than materializing the entire polynomial in memory and then committing, the prover generates coefficients one row at a time during witness generation and immediately computes the tier-1 row commitment for that row. After all rows have been processed, a single tier-2 aggregation step produces the final $G_{T}$ commitment. This keeps memory usage proportional to $O (2^{σ}) = O (K^{1/ c} \cdot T)$ (a single row) rather than $O (N)$ (the entire polynomial/matrix).

Implementation

The Jolt implementation of Dory lives in crates/jolt-prover-legacy/src/poly/commitment/dory/ and wraps the a16z/dory library. Key files:

commitment_scheme.rs — Implements the CommitmentScheme and StreamingCommitmentScheme traits.
dory_globals.rs — Manages per-context Dory matrix dimensions ( $ν$ , $σ$ ) and coefficient layout.
wrappers.rs — Bridges Jolt's MultilinearPolynomial types to Dory's polynomial interface, including specialized commit_tier_1 for compact scalars and one-hot polynomials.
jolt_dory_routines.rs — Custom implementations of low-level group operations (MSM, vector-scalar multiplication, folding) used by the Dory prover and verifier.

JoltBook