ten-vad
Our support for ten-vad uses https://github.com/TEN-framework/ten-vad/pull/36 as a reference, which uses 0 for the pitch feature. This may degrade performance, but it greatly simplifies the implementation.
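To make the simplification concrete, here is a toy sketch. It assumes, purely for illustration, that each frame's feature vector is a list of spectral features followed by a single pitch value. The layout, the function name, and the `use_pitch` flag are hypothetical and are not taken from ten-vad or sherpa-onnx.

```python
def build_frame_features(spectral_features, pitch=None, use_pitch=False):
    """Assemble one frame's feature vector (hypothetical layout).

    Assumption: the pitch value occupies the last slot. With
    use_pitch=False, the slot is filled with 0.0, so no pitch
    estimator is needed at all.
    """
    if use_pitch and pitch is not None:
        pitch_value = float(pitch)
    else:
        pitch_value = 0.0  # simplified variant: pitch is always 0
    return list(spectral_features) + [pitch_value]


# Simplified variant: any pitch estimate is ignored.
simplified = build_frame_features([0.1, 0.2], pitch=120.0)

# Full variant, for comparison: the pitch estimate is kept.
full = build_frame_features([0.1, 0.2], pitch=120.0, use_pitch=True)
```

Skipping the pitch estimator removes a whole signal-processing stage from the pipeline, at the cost of feeding the model an input that differs from what it may have seen during training.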
Download model files
We have added some metadata to the original ten-vad.onnx, so please use the model files from the following table:
| Model size | Download URL |
| 324 KB | |
| 126 KB | |
If you use the model from https://github.com/TEN-framework/ten-vad/blob/main/src/onnx_model/ten-vad.onnx in sherpa-onnx, you will get runtime errors.
Note that ten-vad supports only audio sampled at 16 kHz.
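Because of the 16 kHz restriction, it can help to validate a WAV file before passing its samples to the VAD. The following is a minimal sketch using only the Python standard library. The function name `read_16k_mono_wav` is hypothetical, and the requirement of mono 16-bit PCM normalized to floats in [-1, 1) is an assumption about the input you want to feed, not something stated above.

```python
import struct
import wave

REQUIRED_SAMPLE_RATE = 16000  # ten-vad supports only 16 kHz input


def read_16k_mono_wav(path):
    """Read a WAV file and return its samples as floats in [-1, 1).

    Raises ValueError unless the file is 16 kHz, mono, 16-bit PCM
    (the mono/16-bit requirement is an assumption of this sketch).
    """
    with wave.open(path, "rb") as f:
        rate = f.getframerate()
        if rate != REQUIRED_SAMPLE_RATE:
            raise ValueError(
                f"expected {REQUIRED_SAMPLE_RATE} Hz, got {rate} Hz; "
                "resample the audio first"
            )
        if f.getnchannels() != 1 or f.getsampwidth() != 2:
            raise ValueError("expected mono 16-bit PCM")
        raw = f.readframes(f.getnframes())

    # WAV stores 16-bit PCM as little-endian signed integers.
    count = len(raw) // 2
    ints = struct.unpack(f"<{count}h", raw[: count * 2])
    return [s / 32768.0 for s in ints]
```

Audio recorded at another rate, for example 8 kHz or 44.1 kHz, needs to be resampled to 16 kHz before use; this sketch only rejects such files and does not resample them.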
Android examples
| | Source | Pre-built APK URL |
| Pure VAD | | |
| VAD + non-streaming ASR | | |
| VAD + real-time ASR | | |
For pure VAD, please see https://k2-fsa.github.io/sherpa/onnx/vad/apk.html and https://github.com/k2-fsa/sherpa-onnx/tree/master/android/SherpaOnnxVad
WebAssembly examples
| | URL |
| Huggingface space | |
| ModelScope space | |
Source code is available at https://github.com/k2-fsa/sherpa-onnx/tree/master/wasm/vad
For WebAssembly with VAD + ASR, please see https://github.com/k2-fsa/sherpa-onnx/tree/master/wasm/vad-asr
C API examples
| Filename | Comment |
| | ten-vad with SenseVoice for speech recognition with a very long file |
For APIs of different programming languages, please see https://github.com/k2-fsa/sherpa-onnx