Read-write memory (registers and RAM)

Jolt proves the validity of registers and RAM using offline memory checking. In contrast to our usage of offline memory checking in other modules, registers and RAM are writable memory.

Memory layout

For the purpose of offline memory checking, Jolt treats registers, program inputs/outputs, and RAM as occupying one unified address space. This remapped address space is laid out as follows:

Memory layout

The zero-padding depicted above is sized so that RAM starts at a power-of-two offset (this is explained below). As noted in the diagram, the size of the witness scales with the highest memory addressed over the course of the program's execution. In addition to the zero-padding between the "Program I/O" and "RAM" sections, the end of the witness is zero-padded to a power of two.

Handling program I/O

Inputs

Program inputs and outputs (plus the panic and termination bits, which indicate whether the program has panicked or otherwise terminated, respectively) live in the same memory address space as RAM. Program inputs populate the designated input space upon initialization: init memory

The verifier can efficiently compute the MLE of this initial memory state on its own (i.e. in time proportional to the IO size, not the total memory size).

Outputs and panic

On the other hand, the verifier cannot compute the MLE of the final memory state on its own –– though the program I/O is known to the verifier, the final memory state contains values written to registers/RAM over the course of program execution, which are not known to the verifier.

The verifier is, however, able to compute the MLE of the program I/O values (padded on both sides with zeros) –– this is denoted v_io below. If the prover is honest, then the final memory state (v_final below) should agree with v_io at the indices corresponding to program I/O.

final memory

To enforce this, we invoke the sumcheck protocol to perform a "zero-check" on the difference between v_final and v_io:

final memory

This also motivates the zero-padding between the "Program I/O" and "RAM" sections of the witness. The zero-padding ensures that both input_start and ram_witness_offset are powers of two, which makes it easier for the verifier to compute the MLEs of v_init and v_io.

Timestamp range check

Registers and RAM are writable memory, which introduces additional requirements compared to offline memory checking in a read-only context.

The multiset equality check for read-only memory, typically expressed as $I \cdot W = R \cdot F$ , is not adequate for ensuring the accuracy of read values. It is essential to also verify that each read operation retrieves a value that was written in a previous step (not a future step). (Here, $I$ denotes the tuples capturing initialization of memory and $W$ the tuples capturing all of the writes to memory following initialization. $R$ denotes the tuples capturing all read operations, and $F$ denotes tuples capturing a final read pass over all memory cells).

To formalize this, we assert that the timestamp of each read operation, denoted as $read_timestamp$ , must not exceed the global timestamp at that particular step. The global timestamp starts at 0 and is incremented once per step.

The verification of $read_timestamp \leq global_timestamp$ is equivalent to confirming that $read_timestamp$ falls within the range $[0, TRACE_LENGTH)$ and that the difference $(global_timestamp - read_timestamp)$ is also within the same range.

The process of ensuring that both $read_timestamp$ and $(global_timestamp - read_timestamp)$ lie within the specified range is known as range-checking. This is the procedure implemented in timestamp_range_check.rs, using a modified version of Lasso.

Intuitively, checking that each read timestamp does not exceed the global timestamp prevents an attacker from answering all read operations to a given cell with "the right set of values, but out of order". Such an attack requires the attacker to "jump forward and backward in time". That is, for this attack to succeed, at some timestamp $t$ when the cell is read, the attacker would have to return a value that will be written to that cell in the future (and at some later timestamp t' when the same cell is read the attacker would have to return a value that was written to that cell much earlier). This attack is prevented by confirming that all values returned have a timestamp that does not exceed the current global timestamp.

Word-addressable memory

According to the RISC-V specification, the RISC-V memory is byte-addressable, i.e. load and store instructions can access any memory address, with no restrictions on whether the address is aligned to a 4-byte word. However, the specification caveats that unaligned accesses may incur performance penalties on hardware, and some RISC-V implementations may choose to disallow unaligned memory accesses entirely. This means that LW and SW must access memory addresses that are word-aligned (i.e. multiples of 4), and LH, LHU, and SH must access memory addresses that are halfword-aligned (i.e. multiples of 2). There are no such restrictions on LB, LBU and SB, as they only access a single byte.

Jolt disallows unaligned memory accesses for prover performance reasons; one cannot generate a valid Jolt proof for an execution trace that includes unaligned memory accesses. Enforcing aligned accesses allows Jolt to treat memory as word-addressable, which reduces the number of committed polynomials and instances of offline memory-checking.

In more detail, Jolt

constrains all LW and SW instructions to only allow word-aligned accesses, and constrains all LH, LHU, and SH instructions to only allow halfword-aligned accesses. This is accomplished using virtual ASSERT instructions (see Section 6.1.1 of the Jolt paper).
replaces all LH, LHU, SH, LB, LBU, and SB instructions with virtual sequences that only perform word-aligned accesses.

`LB` virtual sequence

ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
XORI $v_{0}$ , --, 0b11, $v_{3}$ // Compute the number of bytes to shift the word (in the lower 2 bits)
SLLI $v_{3}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the word (in the lower 5 bits)
SLL $v_{2}$ , $v_{3}$ , --, rd // Shift the word so that the desired byte is left-aligned
SRAI rd, --, 24, rd // Right arithmetic shift to sign-extend and right-align the byte

`LBU` virtual sequence

ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
XORI $v_{0}$ , --, 0b11, $v_{3}$ // Compute the number of bytes to shift the word (in the lower 2 bits)
SLLI $v_{3}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the word (in the lower 5 bits)
SLL $v_{2}$ , $v_{3}$ , --, rd // Shift the word so that the desired byte is left-aligned
SRLI rd, --, 24, rd // Right logical shift to zero-extend and right-align the byte

`LH` virtual sequence

ASSERT_HALFWORD_ALIGNMENT rs1, --, imm, -- // Virtual instruction to enforce aligned memory access
ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
XORI $v_{0}$ , --, 0b10, $v_{3}$ // Compute the number of bytes to shift the word (in the lower 2 bits)
SLLI $v_{3}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the word (in the lower 5 bits)
SLL $v_{2}$ , $v_{3}$ , --, rd // Shift the word so that the desired halfword is left-aligned
SRAI rd, --, 16, rd // Right arithmetic shift to sign-extend and right-align the halfword

`LHU` virtual sequence

ASSERT_HALFWORD_ALIGNMENT rs1, --, imm, -- // Virtual instruction to enforce aligned memory access
ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
XORI $v_{0}$ , --, 0b10, $v_{3}$ // Compute the number of bytes to shift the word (in the lower 2 bits)
SLLI $v_{3}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the word (in the lower 5 bits)
SLL $v_{2}$ , $v_{3}$ , --, rd // Shift the word so that the desired halfword is left-aligned
SRLI rd, --, 16, rd // Right logical shift to zero-extend and right-align the halfword

`SB` virtual sequence

ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
SLLI $v_{0}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the byte (in the lower 5 bits)
LUI --, --, 0xff, $v_{4}$ // Load the bitmask into a virtual register
SLL $v_{4}$ , $v_{3}$ , --, $v_{4}$ // Shift the bitmask into the position of the desired byte
SLL rs2, $v_{3}$ , --, $v_{5}$ // Shift the value being stored to align with the bitmask
XOR $v_{2}$ , $v_{5}$ , --, $v_{5}$
AND $v_{5}$ , $v_{4}$ , --, $v_{5}$
XOR $v_{2}$ , $v_{4}$ , --, $v_{2}$
SW $v_{1}$ , $v_{2}$ , 0, -- // Store the updated word

Instructions 8-10 use a bit-twiddling hack to mask the stored byte into the word.

`SH` virtual sequence

ASSERT_HALFWORD_ALIGNMENT rs1, --, imm, -- // Virtual instruction to enforce aligned memory access
ADDI rs1, --, imm, $v_{0}$ // Compute the memory address being accessed
ANDI $v_{0}$ , --, (1 << 32) - 4, $v_{1}$ // Mask out the lower bits to obtain the word-aligned address
LW $v_{1}$ , --, 0, $v_{2}$ // Load the full word
SLLI $v_{0}$ , --, 3, $v_{3}$ // Compute the number of bits to shift the halfword (in the lower 5 bits)
LUI --, --, 0xffff, $v_{4}$ // Load the bitmask into a virtual register
SLL $v_{4}$ , $v_{3}$ , --, $v_{4}$ // Shift the bitmask into the position of the desired halfword
SLL rs2, $v_{3}$ , --, $v_{5}$ // Shift the value being stored to align with the bitmask
XOR $v_{2}$ , $v_{5}$ , --, $v_{5}$
AND $v_{5}$ , $v_{4}$ , --, $v_{5}$
XOR $v_{2}$ , $v_{4}$ , --, $v_{2}$
SW $v_{1}$ , $v_{2}$ , 0, -- // Store the updated word

Instructions 9-11 use a bit-twiddling hack to mask the stored byte into the word.

JoltBook