The Wayback Machine - https://web.archive.org/web/20230326081301/https://github.com/topics/asr
Here are 739 public repositories matching this topic...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Updated
Mar 24, 2023
Python
NeMo: a toolkit for conversational AI
Updated
Mar 26, 2023
Python
A PyTorch-based Speech Toolkit
Updated
Mar 25, 2023
Python
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Updated
Mar 26, 2023
Jupyter Notebook
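🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project. It supports ChatGPT multi-turn conversation and may be the first open-source smart speaker project to support brain-computer interaction.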
Updated
Mar 26, 2023
Python
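HTML5/JS audio recording in mp3, wav, ogg, webm, and amr formats. Supports PC and some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat. Provides ASR speech-to-text, an H5 voice call/chat example, and DTMF encoding and decoding.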
Updated
Feb 23, 2023
JavaScript
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
Updated
Dec 26, 2022
Jupyter Notebook
Production First and Production Ready End-to-End Speech Recognition Toolkit
Updated
Mar 25, 2023
Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
Updated
Mar 14, 2022
Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
Updated
Mar 24, 2023
Python
🐸 STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.
DELTA is a deep learning based natural language and speech processing platform.
Updated
Mar 24, 2023
Python
This is a Python API that lets you get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it does not require a headless browser, as other Selenium-based solutions do!
Updated
Mar 16, 2023
Python
SincNet is a neural architecture for efficiently processing raw audio samples.
Updated
Apr 28, 2021
Python
A Python wrapper for Kaldi
Updated
Sep 18, 2022
Python
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
Updated
Dec 8, 2022
Python
An open-source implementation of a sequence-to-sequence based speech processing engine
The official repository of the Eesen project
Updated
Mar 11, 2022
Python