[Fix] Fix Method to Obtain Prefix Token ID by anzr299 · Pull Request #18317 · pytorch/executorch

anzr299 · 2026-03-19T13:13:23Z

Summary

Fix prefix_token_id to return the BOS token ID instead of the EOT token ID.

EagerEvalWrapper.prefix_token_id was incorrectly returning the EOT token ID.
Since lm-eval prepends prefix_token_id to every evaluation sequence, this caused
Llama 3's <|end_of_text|> (token 128001) to be used instead of <|begin_of_text|>
(token 128000), resulting in higher perplexity scores.

Llama 3 8B Wikitext PPL before fix: 9.18
Llama 3 8B Wikitext PPL after fix: 7.793

The post-fix result matches the expected perplexity when evaluating
the same model directly via Hugging Face.
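
To make the effect concrete, here is a minimal sketch of how a prepended prefix token changes the input the model sees. It is an illustration only: `build_eval_input` is a hypothetical helper, not lm-eval's actual code, and the sample token IDs are arbitrary placeholders. Only the two special-token IDs come from the summary above.

```python
# Special-token IDs for Llama 3, as stated in the PR summary.
BOS_ID = 128000  # <|begin_of_text|>
EOT_ID = 128001  # <|end_of_text|>


def build_eval_input(token_ids, prefix_token_id):
    """Prepend the prefix token so the first real token has something to condition on.

    Hypothetical helper: a simplified stand-in for what an evaluation
    harness does, not the harness's real implementation.
    """
    return [prefix_token_id] + list(token_ids)


sample = [9906, 1917]  # arbitrary placeholder token IDs

before_fix = build_eval_input(sample, EOT_ID)  # sequence starts with end-of-text
after_fix = build_eval_input(sample, BOS_ID)   # sequence starts with begin-of-text
```

Before the fix, every evaluated sequence would start with the end-of-text token, a context the model rarely saw at the start of a document during training, which is consistent with the higher perplexity reported above.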

pytorch-bot · 2026-03-19T13:13:27Z

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/executorch/18317

📄 Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

❌ 6 Awaiting Approval, 1 New Failure

As of commit 55fd4d1 with merge base 1925873:

AWAITING APPROVAL - The following workflows need approval before CI can run:

NEW FAILURE - The following job has failed:

Copilot code review / Cleanup artifacts (gh)
Process completed with exit code 1.

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions · 2026-03-19T13:14:09Z

This PR needs a `release notes:` label

If your change should be included in the release notes (i.e. would users of this library care about this change?), please use a label starting with release notes:. This helps us keep track and include your important work in the next release notes.

To add a label, you can comment to pytorchbot, for example
@pytorchbot label "release notes: none"

For more information, see
https://github.com/pytorch/pytorch/wiki/PyTorch-AutoLabel-Bot#why-categorize-for-release-notes-and-how-does-it-work.

Copilot

Pull request overview

Fixes Llama evaluation behavior by returning a proper prefix token id (BOS when available) instead of incorrectly defaulting to the end-of-text/end-of-sequence token, aligning perplexity results with Hugging Face’s eval flow.

Changes:

Update prefix_token_id to prefer tokenizer.bos_id when present.
Preserve prior fallback behavior by using eot_token_id when BOS is unavailable.
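
The two changes listed above could look roughly like the following. This is a sketch under stated assumptions, not the actual diff from eager_eval.py: `EvalWrapperSketch` and `FakeTokenizer` are hypothetical stand-ins, and the `eos_id` attribute name is assumed. Only `prefix_token_id`, `eot_token_id`, and `tokenizer.bos_id` are named in this PR.

```python
class FakeTokenizer:
    """Hypothetical tokenizer stub; bos_id is only set when provided."""

    def __init__(self, eos_id, bos_id=None):
        self.eos_id = eos_id  # assumed attribute name
        if bos_id is not None:
            self.bos_id = bos_id


class EvalWrapperSketch:
    """Hypothetical stand-in for the wrapper's token-ID properties."""

    def __init__(self, tokenizer):
        self._tokenizer = tokenizer

    @property
    def eot_token_id(self):
        return self._tokenizer.eos_id

    @property
    def prefix_token_id(self):
        # Prefer BOS when the tokenizer exposes one.
        bos_id = getattr(self._tokenizer, "bos_id", None)
        if bos_id is not None:
            return bos_id
        # Otherwise keep the previous behavior and fall back to EOT.
        return self.eot_token_id


with_bos = EvalWrapperSketch(FakeTokenizer(eos_id=128001, bos_id=128000))
without_bos = EvalWrapperSketch(FakeTokenizer(eos_id=2))
```

Using `getattr` with a default keeps the property safe for tokenizers that do not define `bos_id` at all, which is one way to honor the fallback described in the second bullet.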


Update eager_eval.py

55fd4d1

anzr299 requested a review from lucylq as a code owner March 19, 2026 13:13

Copilot AI review requested due to automatic review settings March 19, 2026 13:13

meta-cla bot added the CLA Signed label Mar 19, 2026

Copilot AI reviewed Mar 19, 2026

anzr299 wants to merge 1 commit into pytorch:main from anzr299:patch-4
