Speech-to-text is the process of transcribing audio files into text.

Dataiku provides several speech-to-text capabilities.

Native speech-to-text

The native speech-to-text capability of Dataiku transcribes English audio. It is an offline capability, meaning that it does not rely on a third-party API.

This capability is provided by the “Speech to Text” plugin, which you need to install. Please see Installing plugins.

This plugin is not supported.

Please see our Speech to Text plugin page for detailed documentation.

AWS Transcribe

The AWS Transcribe integration provides speech-to-text extraction in 40 languages.

Please see NLP using AWS APIs for more details.