feat(nn): training options and in-memory batching by szvsw · Pull Request #35 · MITSustainableDesignLab/globi

szvsw · 2026-03-29T21:21:07Z

Summary

Extends the PyTorch NN surrogate backend with:

Early stopping improvement threshold — early_stopping_min_delta on NNTrainerConfig. Validation loss must improve by at least this amount versus the previous best to reset patience (default 0.0: any strictly lower val loss counts).
L1 regularization — l1_penalty on NNTrainerConfig. When positive, training loss is MSE plus l1_penalty * sum(|theta|) over trainable parameters. Default 0.0 disables it. Validation and early stopping still use plain MSE.
Max training time — max_training_minutes on NNTrainerConfig (default None). When set, training stops once monotonic elapsed time from the start of the first training batch of the first epoch reaches that many minutes. The current epoch is abandoned without validation if the limit is hit mid-epoch. Uses time.monotonic().
In-memory batching — Training no longer uses DataLoader. Data stays as CPU torch tensors built from NumPy; each epoch uses np.random.permutation(n_samples) and batch indices with drop_last-equivalent batch count (n // batch_size). Validation uses sequential tensor slices with the same drop-last rule and no shuffle.

L2: Unchanged — Adam/SGD weight_decay remains the L2-style term on the optimizer.

Example (YAML)

early_stopping_min_delta: 1.0e-4
l1_penalty: 1.0e-5
max_training_minutes: 120
optimizer:
  optimizer: adam
  weight_decay: 1.0e-4  # L2-style

Testing

uv run pytest — 5 passed

Notes

Local commits may use -c commit.gpgsign=false when the SSH signing helper hits a GLIBC mismatch in this environment.

…eshold Require validation loss to improve by at least min_delta vs the previous best before resetting early stopping patience. Default 0 preserves prior behavior (any strictly lower val loss counts). Co-authored-by: Sam Wolk <szvsw@users.noreply.github.com>

Trainer l1_penalty adds lambda * sum(|theta|) to the training MSE loss. L2-style regularization remains optimizer weight_decay. Validation uses plain MSE only. Co-authored-by: Sam Wolk <szvsw@users.noreply.github.com>

Stop when monotonic elapsed time from the first training batch exceeds the limit. Partial epochs are skipped without validation; best checkpoint and post-training flow unchanged. Co-authored-by: Sam Wolk <szvsw@users.noreply.github.com>

Shuffle each epoch with np.random.permutation over row indices; slice batches with drop-last semantics. Validation uses sequential tensor slices. Avoids DataLoader overhead for data already in memory. Co-authored-by: Sam Wolk <szvsw@users.noreply.github.com>

cursoragent and others added 2 commits March 29, 2026 21:20

feat(nn): add configurable L1 penalty on trainable weights

f352424

Trainer l1_penalty adds lambda * sum(|theta|) to the training MSE loss. L2-style regularization remains optimizer weight_decay. Validation uses plain MSE only. Co-authored-by: Sam Wolk <szvsw@users.noreply.github.com>

cursor bot changed the title ~~feat(nn): early stopping validation improvement threshold (min_delta)~~ feat(nn): early stopping min-delta and L1 weight penalty Mar 29, 2026

cursor bot changed the title ~~feat(nn): early stopping min-delta and L1 weight penalty~~ feat(nn): early stopping min-delta, L1 penalty, and max training time Mar 29, 2026

cursor bot changed the title ~~feat(nn): early stopping min-delta, L1 penalty, and max training time~~ feat(nn): training options and in-memory batching Mar 30, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(nn): training options and in-memory batching#35

feat(nn): training options and in-memory batching#35
szvsw wants to merge 4 commits intofeature/create-refactored-surrogate-trainingfrom
cursor/early-stopping-improvement-threshold-d64c

szvsw commented Mar 29, 2026 •

edited by cursor bot

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

szvsw commented Mar 29, 2026 • edited by cursor bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Summary

Example (YAML)

Testing

Notes

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

szvsw commented Mar 29, 2026 •

edited by cursor bot

Loading