Deepspeech

DeepSpeech is an open-source Speech-To-Text (STT) engine that uses a model trained by machine learning techniques. It was initially developed based on Baidu’s Deep Speech research paper and is now maintained by Mozilla.

Deepspeech

Areas of application

  • Natural Language Processing
  • Speech Recognition
  • Machine Learning
  • Audio Transcription
  • Text-to-Speech

Example

Use DeepSpeech to transcribe audio files into text, such as podcasts or lectures.