The Wayback Machine - https://web.archive.org/web/20230326081301/https://github.com/topics/asr

#

asr

Here are 739 public repositories matching this topic...

PaddlePaddle / PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

Updated Mar 24, 2023
Python

NVIDIA / NeMo

NeMo: a toolkit for conversational AI

nlp text-to-speech deep-learning neural-network machine-translation tts speech-synthesis speech-recognition speech-to-text nmt language-model speaker-recognition nlp-machine-learning asr speaker-diarization text-normalization

Updated Mar 26, 2023
Python

speechbrain / speechbrain

A PyTorch-based Speech Toolkit

Updated Mar 25, 2023
Python

alphacep / vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

Updated Mar 26, 2023
Jupyter Notebook

wukong-robot

wzpan / wukong-robot

Sponsor

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目，支持ChatGPT多轮对话能力，还可能是首个支持脑机交互的开源智能音箱项目。

alexa ai amazon-echo muse tts openai google-home unit bci speaker homeassistant snowboy asr anyq raspeberry-pi gpt3 chatgpt

Updated Mar 26, 2023
Python

xiangyuecn / Recorder

html5 js 录音 mp3 wav ogg webm amr 格式，支持pc和Android、iOS部分浏览器、Hybrid App（提供Android iOS App源码）、微信，提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

audio javascript html html5 dtmf webrtc webm mp3 wav recording recorder amr ogg record h5 asr sound-record luyin

Updated Feb 23, 2023
JavaScript

snakers4 / silero-models

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

Updated Dec 26, 2022
Jupyter Notebook

wenet-e2e / wenet

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready asr conformer e2e-models

Updated Mar 25, 2023
C++

tensorflow / lingvo

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Mar 25, 2023
Python

pytorch-kaldi

mravanelli / pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 14, 2022
Python

m-bain / whisperX

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Mar 24, 2023
Python

coqui-ai / STT

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

deep-learning tensorflow voice-recognition speech-recognition automatic-speech-recognition speech-to-text stt asr speech-recognizer speech-recognition-api

Updated Mar 24, 2023
C++

Delta-ML / delta

DELTA is a deep learning based natural language and speech processing platform.

Updated Mar 24, 2023
Python

jdepoix / youtube-transcript-api

Sponsor

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require a headless browser, like other selenium based solutions do!

python cli youtube youtube-video youtube-api captions subtitles transcript subtitle transcripts asr youtube-subtitles youtube-transcripts youtube-captions youtube-transcript translating-transcripts youtube-asr

Updated Mar 16, 2023
Python

mravanelli / SincNet

SincNet is a neural architecture for efficiently processing raw audio samples.

Updated Apr 28, 2021
Python

pykaldi / pykaldi

A Python wrapper for Kaldi

python wrapper numpy speech feature-extraction speech-recognition kaldi language-model asr openfst clif

Updated Sep 18, 2022
Python

espresso

freewym / espresso

Espresso: A Fast End-to-End Neural Speech Recognition Toolkit

python end-to-end pytorch speech-recognition kaldi asr fairseq

Updated Dec 8, 2022
Python

athena-team / athena

an open-source implementation of sequence-to-sequence based speech processing engine

deployment tensorflow tts speech-synthesis transformer speech-recognition sequence-to-sequence unsupervised-learning speaker-recognition asr ctc wfst

Updated Dec 2, 2022
C++

srvk / eesen

The official repository of the Eesen project

tensorflow speech-recognition speech-to-text kaldi asr ctc ctc-loss

Updated May 23, 2019
C++

snakers4 / open_stt

Open STT

dataset russian automatic-speech-recognition speech-to-text stt asr

Updated Mar 11, 2022
Python

Improve this page

Add a description, image, and links to the asr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the asr topic, visit your repo's landing page and select "manage topics."