SenseVoice
This section describes how to use models from https://github.com/FunAudioLLM/SenseVoice.
A single model from SenseVoice supports the following languages
Chinese (Mandarin, 普通话)
Cantonese (粤语, 广东话)
English
Japanese
Korean
which is similar to what multilingual Whisper is doing.
We have converted SenseVoice to onnx and provided APIs for the following programming languages
C++
C
Python
C#
Go
Kotlin
Java
JavaScript (Support WebAssembly and Node)
Swift
Note that you can use SenseVoice with sherpa-onnx on the following platforms:
Linux (x64, aarch64, arm, riscv64)
macOS (x64, arm64)
Windows (x64, x86, arm64)
Android (arm64-v8a, armv7-eabi, x86, x86_64)
iOS (arm64)
In the following, we describe how to download pre-trained SenseVoice models and use them in sherpa-onnx.