Pre-trained models
The following table lists links for all pre-trained models.
Description |
URL |
Speech recognition (speech to text, ASR) |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/asr-models |
Text to speech (TTS) |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/tts-models |
VAD |
https://github.com/k2-fsa/sherpa-onnx/releases/download/asr-models/silero_vad.onnx |
Keyword spotting |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/kws-models |
Speech identification (Speaker ID) |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/speaker-recongition-models |
Spoken language identification (Language ID) |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/asr-models (multi-lingual whisper) |
Audio tagging |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/audio-tagging-models |
Punctuation |
https://github.com/k2-fsa/sherpa-onnx/releases/tag/punctuation-models |
In this section, we describe how to download and use all available pre-trained models for speech recognition.
- Online transducer models
- Zipformer-transducer-based Models
- sherpa-onnx-streaming-zipformer-korean-2024-06-16 (Korean)
- sherpa-onnx-streaming-zipformer-multi-zh-hans-2023-12-12 (Chinese)
- k2-fsa/icefall-asr-zipformer-wenetspeech-streaming-small (Chinese)
- k2-fsa/icefall-asr-zipformer-wenetspeech-streaming-large (Chinese)
- pkufool/icefall-asr-zipformer-streaming-wenetspeech-20230615 (Chinese)
- csukuangfj/sherpa-onnx-streaming-zipformer-en-2023-06-26 (English)
- csukuangfj/sherpa-onnx-streaming-zipformer-en-2023-06-21 (English)
- csukuangfj/sherpa-onnx-streaming-zipformer-en-2023-02-21 (English)
- csukuangfj/sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20 (Bilingual, Chinese + English)
- shaojieli/sherpa-onnx-streaming-zipformer-fr-2023-04-14 (French)
- sherpa-onnx-streaming-zipformer-small-bilingual-zh-en-2023-02-16 (Bilingual, Chinese + English)
- csukuangfj/sherpa-onnx-streaming-zipformer-zh-14M-2023-02-23 (Chinese)
- csukuangfj/sherpa-onnx-streaming-zipformer-en-20M-2023-02-17 (English)
- Conformer-transducer-based Models
- LSTM-transducer-based Models
- Zipformer-transducer-based Models
- Online paraformer models
- Online CTC models
- Offline transducer models
- Zipformer-transducer-based Models
- sherpa-onnx-zipformer-ru-2024-09-18 (Russian, 俄语)
- sherpa-onnx-small-zipformer-ru-2024-09-18 (Russian, 俄语)
- sherpa-onnx-zipformer-ja-reazonspeech-2024-08-01 (Japanese, 日语)
- sherpa-onnx-zipformer-korean-2024-06-24 (Korean, 韩语)
- sherpa-onnx-zipformer-thai-2024-06-20 (Thai, 泰语)
- sherpa-onnx-zipformer-cantonese-2024-03-13 (Cantonese, 粤语)
- sherpa-onnx-zipformer-gigaspeech-2023-12-12 (English)
- zrjin/sherpa-onnx-zipformer-multi-zh-hans-2023-9-2 (Chinese)
- yfyeung/icefall-asr-cv-corpus-13.0-2023-03-09-en-pruned-transducer-stateless7-2023-04-17 (English)
- k2-fsa/icefall-asr-zipformer-wenetspeech-small (Chinese)
- k2-fsa/icefall-asr-zipformer-wenetspeech-large (Chinese)
- pkufool/icefall-asr-zipformer-wenetspeech-20230615 (Chinese)
- csukuangfj/sherpa-onnx-zipformer-large-en-2023-06-26 (English)
- csukuangfj/sherpa-onnx-zipformer-small-en-2023-06-26 (English)
- csukuangfj/sherpa-onnx-zipformer-en-2023-06-26 (English)
- icefall-asr-multidataset-pruned_transducer_stateless7-2023-05-04 (English)
- csukuangfj/sherpa-onnx-zipformer-en-2023-04-01 (English)
- csukuangfj/sherpa-onnx-zipformer-en-2023-03-30 (English)
- Conformer-transducer-based Models
- NeMo transducer-based Models
- Zipformer-transducer-based Models
- Offline paraformer models
- Paraformer models
- csukuangfj/sherpa-onnx-paraformer-trilingual-zh-cantonese-en (Chinese + English + Cantonese 粤语)
- csukuangfj/sherpa-onnx-paraformer-en-2024-03-09 (English)
- csukuangfj/sherpa-onnx-paraformer-zh-small-2024-03-09 (Chinese + English)
- csukuangfj/sherpa-onnx-paraformer-zh-2024-03-09 (Chinese + English)
- csukuangfj/sherpa-onnx-paraformer-zh-2023-03-28 (Chinese + English)
- csukuangfj/sherpa-onnx-paraformer-zh-2023-09-14 (Chinese + English))
- Paraformer models
- Offline CTC models
- TeleSpeech
- Whisper
- WeNet
- Small models