ten-vad

Caution

Please see its license at

https://github.com/TEN-framework/ten-vad/blob/main/LICENSE

before you use it commercially.

如果你需要把它用于商业目的，请先阅读它的协议。

Our support of ten-vad uses https://github.com/TEN-framework/ten-vad/pull/36 as a reference, which use 0 for the pitch feature. It may degrade the performance, but it greatly simplifies the implementation.

Download models files

We have added some meta data to the original ten-vad.onnx, so please use the model files from the following table:

Note that ten-vad supports only 16k Hz samples.

Filename	Comment
vad-moonshine-c-api.c	ten-vad with moonshine for speech recognition with a very long file
vad-sense-voice-c-api.c	ten-vad with SenseVoice for speech recognition with a very long file
vad-whisper-c-api.c	ten-vad with Whisper for speech recognition with a very long file

For APIs of different programming languages, please see https://github.com/k2-fsa/sherpa-onnx