docs: clarify 100B refers to training tokens, not model size#443

Open
JasonOA888 wants to merge 1 commit into microsoft:main from JasonOA888:docs/clarify-100b-training-tokens

Conversation

@JasonOA888

Summary

Fixes #391 - Clarifies that the 100B mentioned in the README refers to training tokens, not model parameters.

Problem

The original text, "run a 100B BitNet b1.58 model", is frequently misinterpreted as referring to a 100B-parameter model, when it actually means a model trained on 100B tokens.

Misreading this sentence has become a yearly rite of passage for engagement-farming accounts.

Change

  • Before: "run a 100B BitNet b1.58 model"
  • After: "run a BitNet b1.58 model trained on 100B tokens"

Testing

  • Verified README.md renders correctly
  • No code changes, documentation only

Closes #391

This fixes confusion where readers misinterpret '100B model'
as referring to parameter count, when it actually refers to
training tokens. The change makes this distinction clear:
- Before: 'run a 100B BitNet b1.58 model'
- After: 'run a BitNet b1.58 model trained on 100B tokens'

Closes microsoft#391

Co-authored-by: Jason L <jason@outland.art>

Development

Successfully merging this pull request may close these issues.

Please consider rewording section about 100B training tokens