SenseVoice

This section describes how to use models from https://github.com/FunAudioLLM/SenseVoice.

A single model from SenseVoice supports the following languages

  • Chinese (Mandarin, 普通话)

  • Cantonese (粤语, 广东话)

  • English

  • Japanese

  • Korean

which is similar to what multilingual Whisper is doing.

We have converted SenseVoice to onnx and provided APIs for the following programming languages

Note that you can use SenseVoice with sherpa-onnx on the following platforms:

  • Linux (x64, aarch64, arm, riscv64)

  • macOS (x64, arm64)

  • Windows (x64, x86, arm64)

  • Android (arm64-v8a, armv7-eabi, x86, x86_64)

  • iOS (arm64)

In the following, we describe how to download pre-trained SenseVoice models and use them in sherpa-onnx.