Speech Recognition¶

In restricted or air-gapped environments, a DSS administrator can preload Whisper assets in the plugin code env resources so the recipe runs without downloading files at runtime.

Recommended setup:

Download the required Whisper model file(s) (.pt) on a machine with internet access. Model URLs are listed in the Whisper repository (line 17): https://github.com/openai/whisper/blob/v20250625/whisper/__init__.py.
Create a zip archive containing the .pt files.
Open the plugin code environment settings.
If you plan on using containerized execution, in Containerized execution set Resources initialization to Copy resources from local code environment.
In Resources, upload the zip archive and make sure files are available directly under the speechrecognition-package folder.

Offline speech-recognition code env setup

Rebuild the code environment.

At runtime, models are expected under the folder speechrecognition-package. If the selected model is not available locally, the recipe will try to fetch it.

Migration from Speech-to-Text ¶

The plugin includes two maintenance runnables to replace deprecated speech-to-text CPU/GPU recipes:

Replace deprecated Speech-to-Text recipes (current project)
Replace deprecated Speech-to-Text recipes (all projects)

During replacement, the new recipe type is applied, the old weights input role is removed, and existing recipe outputs are preserved.

Both runnables provide an option to delete the old weights managed folder when it is no longer needed by the migrated recipe.