perf(sqlite-native): avoid cloning cached read chunks#4634

Open
NathanFlurry wants to merge 1 commit into 04-12-perf_sqlite-native_gate_kv_operation_labels from
04-12-perf_sqlite-native_avoid_cloning_cached_read_chunks
Conversation

@NathanFlurry
Member

Description

Please include a summary of the changes and the related issue. Please also include relevant motivation and context.

Type of change

  • Bug fix (non-breaking change which fixes an issue)
  • New feature (non-breaking change which adds functionality)
  • Breaking change (fix or feature that would cause existing functionality to not work as expected)
  • This change requires a documentation update

How Has This Been Tested?

Please describe the tests that you ran to verify your changes.

Checklist:

  • My code follows the style guidelines of this project
  • I have performed a self-review of my code
  • I have commented my code, particularly in hard-to-understand areas
  • I have made corresponding changes to the documentation
  • My changes generate no new warnings
  • I have added tests that prove my fix is effective or that my feature works
  • New and existing unit tests pass locally with my changes

Member Author

NathanFlurry commented Apr 13, 2026

Warning

This pull request is not mergeable via GitHub because a downstack PR is open. Once all requirements are satisfied, merge this PR as a stack on Graphite.
Learn more

This stack of pull requests is managed by Graphite. Learn more about stacking.

@claude

claude bot commented Apr 13, 2026


PR Review: perf(sqlite-native): avoid cloning cached read chunks

Overview

This PR optimizes kv_io_read in the native SQLite VFS by eliminating heap allocations when serving chunks from the read cache or dirty buffer. The change is well-targeted and correctness-preserving.


What Changed

  1. Avoids cloning read-cached chunks: buffered_chunks changes from HashMap<usize, Vec<u8>> to HashMap<usize, &[u8]>, so chunks already in read_cache are referenced in place instead of cloned. For 4 KiB chunks this is a meaningful allocation reduction on cache-heavy read workloads.

  2. Avoids cloning dirty-buffer chunks: Previously, a batch-mode hit also cloned the chunk into buffered_chunks. Now the dirty buffer is accessed directly in the read loop via a second get, trading a 4 KiB clone for an extra O(log n) BTreeMap lookup. This is a sound tradeoff.

  3. Moves read-cache population after the copy loop: Functionally equivalent since the newly fetched data is only needed for subsequent reads. The added comment explaining the empty-resp no-op case is clear and helpful.
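The shape of changes 1 and 2 can be sketched as follows. This is an illustrative reconstruction, not the actual kv_io_read code; the function and variable names (collect_buffered, read_chunk, chunk_ids) are hypothetical, and only the map types mirror what the PR describes.

```rust
use std::collections::{BTreeMap, HashMap};

// Change 1: buffered_chunks holds borrowed slices into the read cache
// instead of cloned Vec<u8> chunks.
fn collect_buffered<'a>(
    read_cache: &'a HashMap<usize, Vec<u8>>,
    chunk_ids: &[usize],
) -> HashMap<usize, &'a [u8]> {
    let mut buffered: HashMap<usize, &'a [u8]> = HashMap::new();
    for &id in chunk_ids {
        if let Some(chunk) = read_cache.get(&id) {
            // Reference the cached chunk in place; no 4 KiB allocation.
            buffered.insert(id, chunk.as_slice());
        }
    }
    buffered
}

// Change 2: dirty-buffer chunks are looked up directly in the read loop
// rather than cloned into buffered_chunks up front.
fn read_chunk<'a>(
    dirty_buffer: &'a BTreeMap<usize, Vec<u8>>,
    buffered: &HashMap<usize, &'a [u8]>,
    id: usize,
) -> Option<&'a [u8]> {
    if let Some(chunk) = dirty_buffer.get(&id) {
        return Some(chunk.as_slice());
    }
    buffered.get(&id).copied()
}
```

In this sketch the borrow checker can verify the lifetimes because everything is safe Rust; the review's safety concern below arises only because the real code derives references inside unsafe extern "C" context.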


Issues / Observations

Potential soundness concern in the unsafe context (minor)

buffered_chunks holds &[u8] slices pointing into the Vec<u8> heap data owned by read_cache. When read_cache.insert later causes the HashMap to rehash, it moves the Vec<u8> headers (pointer, length, capacity) but not the heap-allocated data they point at, so the &[u8] slices remain valid. This is safe in practice, but it relies on the reader understanding how Vec<u8> and HashMap interact.
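The stability argument can be checked directly in safe Rust. The sketch below (illustrative only, not project code) forces a HashMap to grow and rehash, then confirms the heap buffer behind one of its Vec<u8> values has not moved; rehashing relocates the Vec headers inside the table, not the element storage.

```rust
use std::collections::HashMap;

// Returns true if a value's heap buffer keeps its address across a rehash.
fn heap_ptr_survives_rehash() -> bool {
    let mut map: HashMap<usize, Vec<u8>> = HashMap::with_capacity(1);
    map.insert(0, vec![0xAB; 4096]);
    // Address of the 4 KiB heap allocation, not of the Vec header.
    let before = map[&0].as_ptr();

    // Insert enough entries to force the table to grow and rehash.
    for i in 1..1024 {
        map.insert(i, vec![0u8; 8]);
    }

    let after = map[&0].as_ptr();
    before == after
}
```

This demonstrates the mechanism, though in the real code the borrow checker cannot see the relationship, which is why the explicit safety comment is worth adding.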

Since this is inside unsafe extern "C" code with raw-pointer-derived references (the borrow checker cannot enforce the lifetime relationship), a brief safety comment at the mutable read_cache insertion site would help future maintainers understand why this is sound. Something like:

// Safety: buffered_chunks is fully consumed above before this mutable borrow of read_cache.
if let Some(read_cache) = state.read_cache.as_mut() {

Double lookup of dirty buffer in batch mode (acceptable tradeoff)

For chunks in dirty_buffer, the new code does two lookups: contains_key in the setup loop and get in the read loop. The prior code did one get but then cloned the chunk data. Avoiding a 4 KiB clone is worth an extra O(log n) BTreeMap lookup, so the tradeoff is sound.
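The two-lookup pattern described above can be sketched as follows. This is a hedged illustration, not the actual driver code; read_with_double_lookup and its parameters are hypothetical names.

```rust
use std::collections::BTreeMap;

// Two BTreeMap lookups, zero intermediate allocations: membership check in
// the setup phase, borrow-and-copy in the read loop.
fn read_with_double_lookup(
    dirty: &BTreeMap<usize, Vec<u8>>,
    id: usize,
    out: &mut [u8],
) -> bool {
    // Setup loop: only membership matters, so contains_key suffices.
    if !dirty.contains_key(&id) {
        return false;
    }
    // Read loop: the second lookup borrows the chunk and copies straight
    // into the caller's buffer, instead of cloning it into a Vec first.
    let chunk = &dirty[&id];
    out.copy_from_slice(&chunk[..out.len()]);
    true
}
```

Each lookup is O(log n) on the BTreeMap, while the clone it replaces touched a full 4 KiB chunk, which matches the review's conclusion that the tradeoff is sound.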

Comment style nit

The new comment uses a parenthetical: // Skip fetching chunks already present in the dirty buffer (batch mode) or read cache. Per project CLAUDE.md conventions, parenthetical structures should be avoided. A cleaner form: // Skip fetching chunks already present in the dirty buffer or read cache.

No tests added

The checklist item for tests is unchecked. Since this is a pure refactor with no behavioural change, existing driver test coverage for reads in batch mode and read-cache hits should be sufficient. Worth confirming those paths are exercised before merge.


Summary

The optimization is correct and well-scoped. The main thing worth addressing before merge is a Safety: comment at the mutable read_cache insertion site to make the unsafe invariant explicit. Everything else is minor.

@NathanFlurry force-pushed the 04-12-perf_sqlite-native_gate_kv_operation_labels branch from 5482e7d to bde5c19 on April 13, 2026 at 05:38
@NathanFlurry force-pushed the 04-12-perf_sqlite-native_avoid_cloning_cached_read_chunks branch 2 times, most recently from ff117f9 to e25c1b6 on April 13, 2026 at 05:50
@NathanFlurry force-pushed the 04-12-perf_sqlite-native_gate_kv_operation_labels branch from 13c73bd to 768d9da on April 13, 2026 at 07:03
@NathanFlurry force-pushed the 04-12-perf_sqlite-native_avoid_cloning_cached_read_chunks branch from e25c1b6 to 532364f on April 13, 2026 at 07:03