# speech-recognition
Here are 2,140 public repositories matching this topic...
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
machine-learning
embedded
deep-learning
offline
tensorflow
speech-recognition
neural-networks
speech-to-text
deepspeech
on-device
Updated Jun 10, 2021 - C++
kaldi-asr/kaldi is the official location of the Kaldi project.
shell
c-plus-plus
cuda
speech
speech-recognition
speech-to-text
kaldi
speaker-verification
speaker-id
Updated Jun 17, 2021 - Shell
Drench yourself in Deep Learning, Reinforcement Learning, Machine Learning, Computer Vision, and NLP by learning from these exciting lectures!!
machine-learning
natural-language-processing
deep-neural-networks
reinforcement-learning
computer-vision
deep-learning
optimization
machine-translation
deep-reinforcement-learning
medical-imaging
speech-recognition
artificial-neural-networks
pattern-recognition
probabilistic-graphical-models
bayesian-statistics
artificial-intelligence-algorithms
visual-recognition
graph-neural-networks
Updated May 21, 2021
Open
Fedora & apt-get
2
AsterYujano commented Oct 5, 2019
Specs
- Leon version: latest
- OS (or browser) version: Fedora 30
- Node.js version: 10.16.3
- Complete "npm run check" output:
➡ Here is the diagnosis about your current setup
✔ Run
✔ Run modules
✔ Reply you by texting
❗ Amazon Polly text-to-speech
❗ Google Cloud text-to-speech
❗ Watson text-to-speech
❗ Offline text-to-speech
❗ Google Cloud speech-to-text
❗ Watson spee
Updated Mar 26, 2021 - JavaScript
Facebook AI Research's Automatic Speech Recognition Toolkit
Updated Jun 2, 2021 - C++
Speech recognition module for Python, supporting several engines and APIs, online and offline.
Updated Feb 28, 2021 - Python
A Deep-Learning-Based Chinese Speech Recognition System
Updated May 16, 2021 - Python
End-to-End Speech Processing Toolkit
deep-learning
chainer
end-to-end
machine-translation
pytorch
speech-synthesis
speech-recognition
kaldi
voice-conversion
speech-separation
speech-enhancement
speech-translation
Updated Jun 16, 2021 - Python
PocketSphinx is a lightweight speech recognition engine, specifically tuned for handheld and mobile devices, though it works equally well on the desktop
Updated Jan 8, 2021 - C
NeMo: a toolkit for conversational AI
nlp
text-to-speech
deep-learning
neural-network
machine-translation
speech-synthesis
speech-recognition
speech-to-text
nmt
nlp-machine-learning
Updated Jun 18, 2021 - Jupyter Notebook
End-to-end Automatic Speech Recognition for Mandarin and English in TensorFlow
audio
deep-learning
tensorflow
paper
end-to-end
evaluation
cnn
lstm
speech-recognition
rnn
automatic-speech-recognition
feature-vector
data-preprocessing
phonemes
timit-dataset
layer-normalization
rnn-encoder-decoder
chinese-speech-recognition
Updated May 25, 2021 - Python
A PyTorch-based Speech Toolkit
audio
transformers
pytorch
voice-recognition
speech-recognition
speech-to-text
language-model
speaker-recognition
speaker-verification
speech-processing
audio-processing
asr
speaker-diarization
speechrecognition
speech-separation
speech-enhancement
spoken-language-understanding
huggingface
speech-toolkit
speechbrain
Updated Jun 17, 2021 - Python
Lingvo
nlp
research
translation
tensorflow
machine-translation
speech
distributed
tts
speech-synthesis
mnist
speech-recognition
lm
seq2seq
speech-to-text
gpu-computing
language-model
asr
Updated Jun 17, 2021 - Python
Updated Nov 20, 2018 - Python
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.
deep-neural-networks
deep-learning
speech
dnn
pytorch
recurrent-neural-networks
lstm
gru
speech-recognition
rnn
kaldi
rnn-model
asr
lstm-neural-networks
multilayer-perceptron-network
timit
dnn-hmm
Updated Mar 15, 2021 - Python
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
roadmap
neural-network
cnn
dnn
tts
speech-synthesis
speech-recognition
rnn
seq2seq
automatic-speech-recognition
papers
language-model
attention-mechanism
speaker-verification
timit-dataset
acoustic-model
recognition-synthesis
Updated Jun 16, 2021
3
nshmyrev commented Aug 4, 2020
One can use https://github.com/s-yata/marisa-trie to save a lot of space for symbols.
An attempt at tracking the state of the art and recent results (bibliography) on speech recognition.
Updated Dec 21, 2020
Updated Mar 3, 2020 - Python
Machine Learning Resources, Practice and Research
Updated Apr 11, 2021 - Python
Alias is a teachable “parasite” designed to give users more control over their smart assistants, in both customisation and privacy. Through a simple app, the user can train Alias to react to a custom wake-word or sound; once trained, Alias can take control of the user's home assistant by activating it for them.
raspberry-pi
machine-learning
hack
smarthome
microphone
speech-recognition
classification
alias
sound-synthesis
wakeword
Updated Apr 5, 2020 - Python
Kalliope is a framework that will help you to create your own personal assistant.
linux
bot
home-automation
speech-synthesis
speech-recognition
personal-assistant
bot-creation
raspberry
speech-to-text
jarvis
Updated Jun 11, 2021 - Python
DELTA is a deep learning based natural language and speech processing platform.
nlp
front-end
ops
deep-learning
text-classification
tensorflow
nlu
speech
inference
text-generation
speech-recognition
seq2seq
sequence-to-sequence
speaker-verification
asr
tensorflow-serving
emotion-recognition
custom-ops
serving
tensorflow-lite
Updated Apr 16, 2021 - Python
Toolkit for efficient experimentation with Speech Recognition, Text2Speech and NLP
text-to-speech
deep-learning
tensorflow
multi-node
speech-synthesis
speech-recognition
seq2seq
speech-to-text
neural-machine-translation
sequence-to-sequence
language-model
multi-gpu
float16
mixed-precision
Updated May 11, 2021 - Python
List of Machine Learning, AI, NLP solutions for iOS. The most recent version of this article can be found on my blog.
swift
machine-learning
natural-language-processing
computer-vision
deep-learning
neural-network
artificial-intelligence
speech-recognition
gpgpu
awesome-list
Updated Jul 30, 2018
Open-Source Large Vocabulary Continuous Speech Recognition Engine
Updated Jun 6, 2021 - C
A PaddlePaddle ASR toolkit.
speech
transformer
speech-recognition
speech-to-text
conformer
deepspeech
ngram-language-model
ctc-decode
mandarin-language
Updated Jun 17, 2021 - Jupyter Notebook
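deepxuexi commented Jun 25, 2019
A very convenient Python recording program, tailor-made for MASR:
Press Enter to start recording; when you have finished speaking, press Enter again to stop recording and display the recognition result. The recording is saved under a file name taken from the recognized text, which makes it easy to compute the recognition rate later.
Code:
https://github.com/deepxuexi/ARFASR
If you find it useful, please give it a star. Thanks!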
6


Let's use this Issue to track performance issues and enhancement requests, so it's easier to prioritize the work.
This is for pytorch transformers.
Also I will label it as a Good Difficult Issue in case someone is ready for a challenging but rewarding experience of figuring things out. If you do want to take the challenge comment in the corresponding Issue/PR that resonates with you s