Make whisper an optional extra with faster-whisper by default by Dreamsorcerer · Pull Request #1877 · dimensionalOS/dimos

Dreamsorcerer · 2026-04-17T15:30:16Z

Problem

Whisper requires downloading a 150MB model and depends on torch (with GBs of CUDA downloads).

Solution

Provide faster-whisper by default (2MB) and use as a fallback when whisper is not available.
This avoids the 150MB download, and means we are one step closer to not depending on torch for a base install.

Breaking Changes

Users need to request dimos[whisper] now for full whisper feature.

Test

python -c "
from dimos.stream.audio.pipelines import stt
node = stt()
node.emit_text().subscribe(on_next=lambda t: print(f"USER: {t}"))
from dimos.stream.audio.utils import keepalive
keepalive()
"

Dreamsorcerer · 2026-04-17T15:56:18Z

TTS seems to work pretty well with faster-whisper anyway.

greptile-apps · 2026-04-17T15:57:18Z

Greptile Summary

This PR makes openai-whisper an optional extra (dimos[whisper]) and adds faster-whisper as the default audio transcription backend in dimos[agents], significantly reducing the default install footprint. The WhisperNode class now auto-detects which backend is available at import time, preferring openai-whisper if present and falling back to faster-whisper otherwise.

The UserWarning on line 36–40 fires for every default install (faster-whisper is in agents), misleading users into thinking their setup is degraded when it is the intended configuration.
faster-whisper in pyproject.toml has no lower version bound, but device=\"auto\" requires >=1.0.0, which can cause a TypeError at runtime on older installs.

Confidence Score: 4/5

Safe to merge after fixing the misleading UserWarning that fires for all default users.

One P1 finding: the UserWarning implies a degraded fallback state for every default install, which will confuse users. The missing version constraint on faster-whisper (P2) could also cause a runtime TypeError with older versions. Both are straightforward one-line fixes.

dimos/stream/audio/stt/node_whisper.py (misleading warning), pyproject.toml (missing version constraint)

Important Files Changed

Filename	Overview
dimos/stream/audio/stt/node_whisper.py	Adds faster-whisper fallback when openai-whisper is absent; misleading UserWarning fires for all default installs, and caller's modelopts dict is mutated via pop().
pyproject.toml	Moves openai-whisper to a new optional [whisper] extra and adds faster-whisper to [agents]; no version constraint on faster-whisper despite using device="auto" (requires >=1.0.0).
uv.lock	Lockfile updated to reflect new faster-whisper dependency; no manual review needed.

Flowchart

%%{init: {'theme': 'neutral'}}%%
flowchart TD
    A[import WhisperNode] --> B{try: import whisper}
    B -- success --> C[_USE_FASTER_WHISPER = False\nopenai-whisper backend]
    B -- ImportError --> D{try: from faster_whisper\nimport WhisperModel}
    D -- success --> E[UserWarning fired\n_USE_FASTER_WHISPER = True\nfaster-whisper backend]
    D -- ImportError --> F[Raise ImportError\nNo backend found]

    C --> G[WhisperNode.__init__]
    E --> G
    G --> H{_USE_FASTER_WHISPER?}
    H -- True --> I[pop fp16 → compute_type\nWhisperModel device=auto]
    H -- False --> J[whisper.load_model]

    I --> K[transcribe → segments iterator\njoin seg.text]
    J --> L[transcribe → dict\nresult text]

_{Reviews (1): Last reviewed commit: "Add warning" | Re-trigger Greptile}

Dreamsorcerer added 4 commits April 17, 2026 14:10

Move whisper to separate extra

3bfb6b7

Use faster-whisper

3d2f3a4

Include by default

98bb819

Add warning

7a332a5

Dreamsorcerer marked this pull request as ready for review April 17, 2026 15:54

greptile-apps Bot reviewed Apr 17, 2026

View reviewed changes

Comment thread dimos/stream/audio/stt/node_whisper.py Outdated

Comment thread dimos/stream/audio/stt/node_whisper.py

Comment thread pyproject.toml Outdated

Dreamsorcerer added 2 commits April 17, 2026 17:01

Minimum version

980d825

Avoid mutating original dict

c85b1d4

paul-nechifor reviewed Apr 18, 2026

View reviewed changes

Comment thread pyproject.toml Outdated

Dreamsorcerer commented Apr 21, 2026

View reviewed changes

Comment thread pyproject.toml Outdated

Dreamsorcerer added 2 commits April 21, 2026 15:49

Update node_whisper.py

500b30e

Apply suggestion from @Dreamsorcerer

1693d65

Dreamsorcerer commented Apr 21, 2026

View reviewed changes

Comment thread dimos/stream/audio/stt/node_whisper.py Outdated

Dreamsorcerer added 3 commits April 21, 2026 15:53

Apply suggestion from @Dreamsorcerer

c78c2c1

Merge branch 'dev' into sam/move-whisper

c3dc54f

Install whisper in CI

a32dc41

Dreamsorcerer commented Apr 21, 2026

View reviewed changes

Comment thread dimos/stream/audio/stt/node_whisper.py Outdated

Apply suggestion from @Dreamsorcerer

388b057

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Make whisper an optional extra with faster-whisper by default#1877

Make whisper an optional extra with faster-whisper by default#1877
Dreamsorcerer wants to merge 12 commits intodevfrom
sam/move-whisper

Dreamsorcerer commented Apr 17, 2026 •

edited

Loading

Uh oh!

Dreamsorcerer commented Apr 17, 2026

Uh oh!

greptile-apps Bot commented Apr 17, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

Dreamsorcerer commented Apr 17, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Problem

Solution

Breaking Changes

Test

Uh oh!

Dreamsorcerer commented Apr 17, 2026

Uh oh!

greptile-apps Bot commented Apr 17, 2026

Greptile Summary

Confidence Score: 4/5

Important Files Changed

Flowchart

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Dreamsorcerer commented Apr 17, 2026 •

edited

Loading