SenseVoice

This section describes how to use models from https://github.com/FunAudioLLM/SenseVoice.

A single SenseVoice model supports the following languages:

Chinese (Mandarin, 普通话)

Cantonese (粤语, 广东话)

English

Japanese

Korean

This multilingual coverage is similar to that of multilingual Whisper.

We have converted SenseVoice to ONNX and provide APIs for the following programming languages:

C++

C

Python

C#

Go

Kotlin

Java

JavaScript (supports WebAssembly and Node.js)

Swift

Dart (supports Flutter)

Note that you can use SenseVoice with sherpa-onnx on the following platforms:

Linux (x64, aarch64, arm, riscv64)

macOS (x64, arm64)

Windows (x64, x86, arm64)

Android (arm64-v8a, armeabi-v7a, x86, x86_64)

iOS (arm64)

The following sections describe how to download pre-trained SenseVoice models and use them with sherpa-onnx.
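As a rough preview of the Python API, the sketch below shows one way a downloaded model might be loaded and used to transcribe a wave file. It is a minimal sketch under stated assumptions, not the reference usage: the helper names `build_recognizer_kwargs` and `transcribe` are hypothetical, the file names `model.int8.onnx` and `tokens.txt` are assumed to match the contents of the model directory you downloaded, the language codes are assumed to be the ones the model accepts, and the third-party `soundfile` package is assumed for reading audio.

```python
from pathlib import Path

# Language codes assumed to be accepted by the SenseVoice model.
# "auto" asks the model to detect the language itself.
SUPPORTED_LANGUAGES = {
    "auto": "auto-detect",
    "zh": "Chinese (Mandarin)",
    "yue": "Cantonese",
    "en": "English",
    "ja": "Japanese",
    "ko": "Korean",
}


def build_recognizer_kwargs(model_dir, language="auto", use_itn=True):
    """Hypothetical helper: collect arguments for from_sense_voice()."""
    if language not in SUPPORTED_LANGUAGES:
        raise ValueError(f"unsupported language code: {language!r}")
    model_dir = Path(model_dir)
    return {
        # File names are assumptions; adjust them to your model directory.
        "model": str(model_dir / "model.int8.onnx"),
        "tokens": str(model_dir / "tokens.txt"),
        "language": language,
        "use_itn": use_itn,  # enable inverse text normalization
        "num_threads": 2,
    }


def transcribe(wav_path, model_dir, language="auto"):
    """Hypothetical helper: decode one wave file and return its text."""
    # Imported lazily so the helper above works without these packages.
    import sherpa_onnx
    import soundfile as sf

    recognizer = sherpa_onnx.OfflineRecognizer.from_sense_voice(
        **build_recognizer_kwargs(model_dir, language)
    )
    samples, sample_rate = sf.read(wav_path, dtype="float32", always_2d=True)
    samples = samples[:, 0]  # keep only the first channel

    stream = recognizer.create_stream()
    stream.accept_waveform(sample_rate, samples)
    recognizer.decode_stream(stream)
    return stream.result.text
```

With a model directory and a test wave file in place, a call such as `transcribe("test.wav", "path/to/model-dir", language="yue")` would be expected to return the recognized text for a Cantonese recording; both paths here are placeholders.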