Fix a security issue in LossNode by adrinjalali · Pull Request #506 · skops-dev/skops

adrinjalali · 2026-03-26T14:18:33Z

This fixes an issue in LossNode where we used to load module from a given file.

This is not too big of a deal since if the user has a malicious package, they're already compromised via .pth files anyway. But this can still be avoided.

adrinjalali · 2026-03-26T15:07:44Z

The failing CI against nightly build can be ignored, the AI says:

Yes — this is a known upstream issue. scikit-learn/scikit-learn#33616 was filed just yesterday (2026-03-25) about exactly this.

What happened: scipy PR #24800 merged ILP64 support for cython_blas/cython_lapack, changing function signatures from int * to blas_int *. sklearn's nightly wheels were compiled against the old signatures, so they break with scipy 1.18.dev0 nightlies.

Current state on the sklearn side:

Labeled as Blocker for milestone 1.9
@thomasjpfan has a starting point for a fix, but noted it creates the reverse problem: compiling sklearn against the new scipy then breaks with scipy 1.17.x
It's essentially an ABI break in scipy's Cython API for downstream consumers
Nothing to do on the skops side — it'll resolve once sklearn publishes updated nightly wheels.

cakedev0 · 2026-03-31T08:28:29Z

Codex found this issue too, I was about to open a PR when I saw this one.

It wrote a non-regression test, in case you're interested to add it in this PR (I'd say it's a nice-to-have):

Details

def test_loss_node_does_not_import_before_audit(monkeypatch):
    from sklearn._loss._loss import CyAbsoluteError

    dumped = dumps(CyAbsoluteError())
    buffer = io.BytesIO()

    with ZipFile(io.BytesIO(dumped), "r") as src, ZipFile(buffer, "w") as dst:
        schema = json.loads(src.read("schema.json"))
        schema["__module__"] = "malicious_mod"
        schema["__class__"] = "Payload"

        for info in src.infolist():
            if info.filename == "schema.json":
                dst.writestr("schema.json", json.dumps(schema))
            else:
                dst.writestr(info, src.read(info.filename))

    dumped = buffer.getvalue()

    def fail_gettype(*args, **kwargs):
        raise AssertionError("gettype() should not be called before audit")

    monkeypatch.setattr("skops.io._sklearn.gettype", fail_gettype)

    with pytest.raises(UntrustedTypesFoundException, match="malicious_mod.Payload"):
        loads(dumped)

adrinjalali added 5 commits March 26, 2026 15:08

SEC fix load module issue in LossNode

d1677cf

Merge remote-tracking branch 'upstream/main' into lossnode

742293f

minor fix to pyproject.toml

74253c5

remove id

8c64c4e

fix QuantileForest

766c8ab

adrinjalali added 3 commits March 26, 2026 16:22

bumping scipy since there's a known issue with lapack on windows CI

a49eaff

update pixi lock file

e3c0bc6

trigger CI

8050ce3

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix a security issue in LossNode#506

Fix a security issue in LossNode#506
adrinjalali wants to merge 8 commits intoskops-dev:mainfrom
adrinjalali:lossnode

adrinjalali commented Mar 26, 2026

Uh oh!

adrinjalali commented Mar 26, 2026 •

edited

Loading

Uh oh!

cakedev0 commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

adrinjalali commented Mar 26, 2026

Uh oh!

adrinjalali commented Mar 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cakedev0 commented Mar 31, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

adrinjalali commented Mar 26, 2026 •

edited

Loading