fix(test): check average coverage against threshold by robobun · Pull Request #27979 · oven-sh/bun

robobun · 2026-03-10T13:51:23Z

Summary

Add average coverage check against configured coverageThreshold in test_command.zig
When all files' average function/line coverage falls below the threshold, the test runner now exits non-zero
Excludes stmts from the average check, consistent with the per-file check in CodeCoverage.zig:135
Add regression test for Bun test code coverage - Not returning exit code 1 when under threshold #7367

Root Cause

The test runner computed average coverage across all files but never checked it against the configured threshold. Only per-file checks could trigger a non-zero exit. The "All files" summary row could show below-threshold coverage while the runner exited 0.

Fix

After computing the average, check avg.functions < base_fraction.functions or avg.lines < base_fraction.lines and set failing = true if below threshold. The stmts field is intentionally excluded to match the per-file behavior.

Based on community PR #27933 by @ssing2, with fixes:

Removed unrelated file changes (docs/runtime/sqlite.mdx, packages/bun-types/overrides.d.ts)
Removed stmts from the average check for consistency with per-file check
Added regression test

Test plan

bun bd test test/regression/issue/7367.test.ts passes
Test verifies exit code 1 when coverage below threshold
Test verifies exit code 0 when coverage meets threshold
bun run zig:check compiles successfully

Closes #7367

When coverage thresholds are configured via bunfig.toml coverageThreshold, the test runner now checks the average coverage across all files against the threshold, not just individual files. This matches the documented behavior where the "All files" row should cause a non-zero exit when below threshold. Excludes stmts from the average check to be consistent with the per-file check in CodeCoverage.zig which intentionally omits stmts. Closes #7367

robobun · 2026-03-10T13:51:38Z

^{Updated 7:37 AM PT - Mar 10th, 2026}

❌ @claude, your commit b6702d4 has 8 failures in Build #39210 (All Failures):

test/js/bun/http/bun-serve-file.test.ts - timeout on 🍎 13 aarch64
test/v8/v8.test.ts - code 1 on 🪟 2019 x64-baseline
test/v8/v8.test.ts - code 1 on 🪟 11 aarch64
test/v8/v8.test.ts - code 1 on 🪟 2019 x64
test/napi/napi.test.ts - code 1 on 🪟 2019 x64
test/napi/napi.test.ts - code 1 on 🪟 2019 x64-baseline
test/napi/napi.test.ts - code 1 on 🪟 11 aarch64
test/napi/node-napi-tests/test/js-native-api/test_instance_data/do.test.ts - code 1 on 🪟 2019 x64-baseline
test/napi/node-napi-tests/test/js-native-api/test_instance_data/do.test.ts - code 1 on 🪟 2019 x64
test/napi/node-napi-tests/test/js-native-api/test_instance_data/do.test.ts - code 1 on 🪟 11 aarch64
test/napi/node-napi-tests/test/js-native-api/test_general/do.test.ts - code 1 on 🪟 2019 x64-baseline
test/napi/node-napi-tests/test/js-native-api/test_general/do.test.ts - code 1 on 🪟 2019 x64
test/napi/node-napi-tests/test/js-native-api/test_general/do.test.ts - code 1 on 🪟 11 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🪟 11 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🪟 2019 x64-baseline
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🍎 14 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🪟 2019 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🍎 14 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🍎 13 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 25.04 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 13 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 3.23 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 3.23 aarch64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 3.23 x64-baseline
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 25.04 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 13 x64
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 25.04 x64-baseline
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 13 x64-baseline
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🐧 13 x64-asan
test/js/bun/http/bun-serve-html.test.ts - code 1 on 🍎 13 aarch64
test/integration/next-pages/test/dev-server.test.ts - code 1 on 🍎 14 aarch64
test/internal/ban-words.test.ts - code 1 on 🪟 11 aarch64
test/internal/ban-words.test.ts - code 1 on 🪟 2019 x64-baseline
test/internal/ban-words.test.ts - code 1 on 🪟 2019 x64
test/internal/ban-words.test.ts - code 1 on 🍎 13 aarch64
test/internal/ban-words.test.ts - code 1 on 🍎 14 aarch64
test/internal/ban-words.test.ts - code 1 on 🍎 14 x64
test/internal/ban-words.test.ts - code 1 on 🍎 13 x64
test/internal/ban-words.test.ts - code 1 on 🐧 25.04 aarch64
test/internal/ban-words.test.ts - code 1 on 🐧 13 aarch64
test/internal/ban-words.test.ts - code 1 on 🐧 3.23 x64
test/internal/ban-words.test.ts - code 1 on 🐧 3.23 x64-baseline
test/internal/ban-words.test.ts - code 1 on 🐧 3.23 aarch64
test/internal/ban-words.test.ts - code 1 on 🐧 13 x64
test/internal/ban-words.test.ts - code 1 on 🐧 25.04 x64
test/internal/ban-words.test.ts - code 1 on 🐧 25.04 x64-baseline
test/internal/ban-words.test.ts - code 1 on 🐧 13 x64-baseline
test/internal/ban-words.test.ts - code 1 on 🐧 13 x64-asan

🧪 To try this PR locally:

bunx bun-pr 27979

That installs a local version of the PR into your bun-27979 executable, so you can run:

bun-27979 --bun

coderabbitai · 2026-03-10T13:54:15Z

No actionable comments were generated in the recent review. 🎉

ℹ️ Recent review info

⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: ASSERTIVE

Plan: Pro

Run ID: a989accb-7066-4fed-85c5-5cf8fbbe710f

📥 Commits

Reviewing files that changed from the base of the PR and between 2eb2b01 and b6702d4.

📒 Files selected for processing (2)

src/cli/test_command.zig
test/regression/issue/7367.test.ts

Walkthrough

Adds a post-aggregation check to code coverage reporting to mark coverage as failing when per-metric averages fall below configured thresholds. Includes regression tests validating exit code behavior when coverage is below or meets specified thresholds.

Changes

Cohort / File(s)	Summary
Coverage Reporting Logic `src/cli/test_command.zig`	Adds post-aggregation check to mark coverage as failing when averaged per-file fractions fall below configured baseline thresholds for functions or lines.
Regression Tests `test/regression/issue/7367.test.ts`	Adds regression tests validating exit code behavior: non-zero exit when function coverage is below 90% threshold, zero exit when coverage meets or exceeds 50% threshold.

🚥 Pre-merge checks | ✅ 4

✅ Passed checks (4 passed)

Check name	Status	Explanation
Title check	✅ Passed	The title accurately summarizes the main change: adding a check for average coverage against the configured threshold in the test command.
Description check	✅ Passed	The pull request description comprehensively covers objectives, root cause, implementation details, test plan, and links to the original issue, exceeding the template requirements.
Linked Issues check	✅ Passed	The code changes directly address issue `#7367` by implementing average coverage threshold checking and adding regression tests that verify the expected exit code behavior.
Out of Scope Changes check	✅ Passed	All code changes are directly scoped to fixing the coverage threshold issue; unrelated file changes from the upstream PR were explicitly removed per the description.

_{✏️ Tip: You can configure your own custom pre-merge checks in the settings.}

_{Comment @coderabbitai help to get the list of available commands and usage tips.}

claude

LGTM — small, correct fix with good regression test coverage.

Extended reasoning...

Overview

This PR adds a 4-line check in test_command.zig (lines 1214-1216) to compare average function/line coverage against the configured threshold, and adds a new regression test (test/regression/issue/7367.test.ts) with both failing and passing coverage threshold scenarios. The scope is minimal and focused.

Security risks

None. The change is a simple numeric comparison that sets a boolean flag, with no user input handling, no network access, and no file system operations beyond what already exists.

Level of scrutiny

Low scrutiny is appropriate. This is a 4-line logic addition to test infrastructure (not production runtime code), following an existing pattern. The regression test exercises both the below-threshold (exit 1) and meets-threshold (exit 0) cases.

Other factors

The only bug reported is a nit observing that the average check is mathematically redundant with per-file checks (since if all per-file values >= threshold, the mean must also be >= threshold). This is correct but the code is harmless defense-in-depth and aligns with user expectations. The test file is well-structured using tempDir with inline config and source files. The stmts exclusion is consistent with the per-file check in CodeCoverage.zig.

claude · 2026-03-10T14:15:44Z

src/cli/test_command.zig

+                    if (avg.functions < base_fraction.functions or avg.lines < base_fraction.lines) {
+                        failing = true;
+                    }


🟡 The average coverage threshold check (line 1214) is mathematically redundant with the per-file checks (line 1187). By the arithmetic mean property, if all per-file values are >= threshold then the average must also be >= threshold; conversely if the average is below threshold, at least one per-file value must be below it, so failing was already set to true. The code is harmless as defense-in-depth but is a no-op, and the regression test passes with or without this change since it triggers the per-file check.

Extended reasoning...

The redundancy

The new code at lines 1214-1216 checks whether the average function/line coverage across all files falls below the configured threshold, and if so sets failing = true. However, this check can never independently change failing from false to true because of a basic property of the arithmetic mean.

Mathematical proof

The per-file check works as follows: in CodeCoverage.zig, writeFormat() compares each file's function and line coverage fractions against base_fraction (the threshold). If any file's coverage is below threshold, fraction.failing is set to true, and then at test_command.zig:1187-1188, failing = true is set.

The average is computed by accumulating per-file fractions and dividing by the count. By the arithmetic mean inequality: if every value x_i >= T (threshold), then (sum of x_i) / n >= T. The contrapositive is: if the mean < T, then at least one x_i < T, which means the per-file check already set failing = true.

Step-by-step example

Consider the regression test: one file lib.ts with 10% function coverage (1/10 functions used) against a 90% threshold. The per-file check at line 1187 sees 0.1 < 0.9 and sets failing = true. The average check at line 1214 also sees 0.1 < 0.9 and sets failing = true again — but it was already true. Now consider the passing case: lib.ts with 100% coverage against a 50% threshold. Per-file: 1.0 >= 0.5, so failing stays false. Average: 1.0 >= 0.5, so the average check also does not trigger. There is no scenario where the average check triggers but per-file checks do not.

Impact

The code is harmless — it cannot cause incorrect behavior. It functions as defense-in-depth and makes the intent explicit (that average coverage should be checked). However, it is technically dead code: it can never change program behavior. The regression test validates the end-to-end threshold behavior correctly, but does not (and cannot) exercise a code path where the average check independently causes failure.

Why this is a nit

As noted by reviewers, redundant safety checks are common practice and this one aligns with user expectations about threshold checking. It also provides a safety net if the per-file checking logic ever changes in the future. The code is correct and not harmful — it is just unnecessary from a strict logical standpoint. The PR still fixes the user-facing issue (#7367) via the per-file checks that were already in place.

github-actions bot added the claude label Mar 10, 2026

claude bot reviewed Mar 10, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

fix(test): check average coverage against threshold#27979

fix(test): check average coverage against threshold#27979
robobun wants to merge 1 commit intomainfrom
claude/fix-coverage-threshold

robobun commented Mar 10, 2026

Uh oh!

robobun commented Mar 10, 2026 •

edited

Loading

Uh oh!

coderabbitai bot commented Mar 10, 2026

Uh oh!

claude bot left a comment

Uh oh!

claude bot Mar 10, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

robobun commented Mar 10, 2026

Summary

Root Cause

Fix

Test plan

Uh oh!

robobun commented Mar 10, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

coderabbitai bot commented Mar 10, 2026

Walkthrough

Changes

Uh oh!

claude bot left a comment

Choose a reason for hiding this comment

Overview

Security risks

Level of scrutiny

Other factors

Uh oh!

claude bot Mar 10, 2026

Choose a reason for hiding this comment

The redundancy

Mathematical proof

Step-by-step example

Impact

Why this is a nit

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

robobun commented Mar 10, 2026 •

edited

Loading