
Fix LARS/LAMB optimizer support and non-contiguous tensor handling on XPU#1902

Open
jiqing-feng wants to merge 3 commits into bitsandbytes-foundation:main from jiqing-feng:xpu

Conversation


@jiqing-feng jiqing-feng commented Mar 19, 2026

Changes

  • Enable LARS/LAMB optimizers on XPU: Added the missing `"lars"` and `"lamb"` entries to the `name2optimizer_id`, `name2optimizer_32bit_fn`, and `name2optimizer_fn` dicts in the Triton backend, and to `name2optimizer_id` in the default backend. LARS maps to `MOMENTUM` (1-state) and LAMB maps to `ADAM` (2-state).

  • Fix Triton compilation error with fp16 gradients: In `_optimizer_precondition_1state_32bit`, the MOMENTUM branch's `step == 1` path performs a direct assignment, `s1_vals = g_vals`. When gradients are fp16, this changes `s1_vals` from fp32 to fp16, conflicting with the `else` branch, where the arithmetic auto-promotes to fp32. Fixed by casting `g_vals` to fp32 at the assignment site.

  • Fix non-contiguous tensor support for blockwise quantization: Triton kernels use linear offsets to access memory, which is incorrect for non-contiguous tensors. Added `.contiguous()` calls at the `quantize_blockwise` and `dequantize_blockwise` entry points.
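The registry additions in the first bullet can be sketched as follows. The dict names come from this PR's description, but the ID constants and the dict contents shown here are illustrative placeholders, not the actual bitsandbytes source:

```python
# Illustrative sketch only: the family IDs and existing entries are
# placeholders, not the real bitsandbytes backend definitions.
MOMENTUM, ADAM = 0, 1  # hypothetical 1-state / 2-state optimizer-family IDs

name2optimizer_id = {
    "momentum": MOMENTUM,
    "adam": ADAM,
    # Newly added entries:
    "lars": MOMENTUM,  # LARS reuses the 1-state momentum kernel path
    "lamb": ADAM,      # LAMB reuses the 2-state Adam kernel path
}
```

Mapping LARS onto the momentum family and LAMB onto the Adam family means no new kernels are needed, only new registry entries.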
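The dtype mismatch in the second bullet can be reproduced outside Triton. The NumPy sketch below is only an analogy for the kernel's type-promotion behavior, not the kernel itself:

```python
import numpy as np

g_vals = np.ones(4, dtype=np.float16)    # fp16 gradient values
s1_prev = np.zeros(4, dtype=np.float32)  # fp32 optimizer state

# else-branch analogue: arithmetic with an fp32 operand promotes to fp32
s1_else = s1_prev * 0.9 + g_vals
# step == 1 analogue before the fix: plain assignment keeps fp16
s1_buggy = g_vals
# the fix: cast at the assignment site so both branches agree on fp32
s1_fixed = g_vals.astype(np.float32)
```

Because Triton requires a variable to have a single type across both branches of the conditional, the fp16/fp32 disagreement is a compile-time error there rather than a silent precision loss.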
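The third bullet's issue amounts to flat linear indexing disagreeing with logical element order on a strided view. A NumPy analogue, with `np.ascontiguousarray` standing in for PyTorch's `.contiguous()`:

```python
import numpy as np

x = np.arange(6, dtype=np.float32).reshape(2, 3)
t = x.T  # non-contiguous view (transposed strides)
assert not t.flags["C_CONTIGUOUS"]

mem_order = t.ravel(order="K")  # raw memory order: what a flat-pointer kernel reads
logical = t.ravel()             # logical row-major order: what the op should read

# For a non-contiguous view the two disagree, so a linear-offset kernel
# would quantize the wrong elements per block:
assert not np.array_equal(mem_order, logical)

# Making a contiguous copy first (analogue of tensor.contiguous()) fixes it:
tc = np.ascontiguousarray(t)
assert np.array_equal(tc.ravel(order="K"), logical)
```

The `.contiguous()` call is a no-op on tensors that are already contiguous, so the fix only costs a copy in the non-contiguous case.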

Related tests:

```
pytest -k "xpu" -ra test_ops.py::TestNonContiguousInputs::test_quantize_blockwise_non_contiguous
pytest -k "xpu" -ra test_optim.py::test_optimizer32bit
```

Hi @matthewdouglas . Would you please review this PR? Thanks!

Signed-off-by: jiqing-feng <jiqing.feng@intel.com>
