FunASR Nano

This section describes how to use models from https://github.com/FunAudioLLM/Fun-ASR.

Fun-ASR-Nano-2512

A single Fun-ASR-Nano-2512 model supports the following languages:

Chinese

English

Japanese

Hint

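Chinese covers 7 dialects (Wu, Cantonese, Min, Hakka, Gan, Xiang, Jin) and 26 regional accents (Henan, Shanxi, Hubei, Sichuan, Chongqing, Yunnan, Guizhou, Guangdong, Guangxi, and more than 20 other regions).

English and Japanese cover multiple regional accents.

Lyrics recognition and rap speech recognition are also supported.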

We have converted Fun-ASR-Nano-2512 to ONNX and provide APIs for the following programming languages:

C++

C

Python

C#

Go

Kotlin

Java

JavaScript

Swift

Dart (supports Flutter)

Rust

Pascal

Note that you can use Fun-ASR-Nano-2512 with sherpa-onnx on the following platforms:

Linux (x64, aarch64, arm, riscv64)

macOS (x64, arm64)

Windows (x64, x86, arm64)

Android (arm64-v8a, armeabi-v7a, x86, x86_64)

iOS (arm64)
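
As a quick sanity check before downloading anything, the desktop rows of the platform list above can be encoded as a small lookup. The following is only a sketch and is not part of sherpa-onnx: the names SUPPORTED, ARCH_ALIASES, OS_ALIASES, and is_supported are hypothetical, and the alias table is an assumption about what Python's platform.machine() reports on common hosts. Android and iOS are left out because they are not typical Python hosts.

```python
import platform

# Desktop platform/architecture pairs copied from the list above.
SUPPORTED = {
    "linux": {"x64", "aarch64", "arm", "riscv64"},
    "macos": {"x64", "arm64"},
    "windows": {"x64", "x86", "arm64"},
}

# Assumed mapping from platform.machine() values to the names used above.
ARCH_ALIASES = {
    "x86_64": "x64",
    "amd64": "x64",
    "i386": "x86",
    "i686": "x86",
    "armv6l": "arm",
    "armv7l": "arm",
}

# platform.system() reports "Darwin" on macOS.
OS_ALIASES = {"darwin": "macos"}


def is_supported(system: str, machine: str) -> bool:
    """Return True if the (OS, architecture) pair appears in SUPPORTED."""
    os_name = OS_ALIASES.get(system.lower(), system.lower())
    arch = ARCH_ALIASES.get(machine.lower(), machine.lower())
    return arch in SUPPORTED.get(os_name, set())


if __name__ == "__main__":
    system, machine = platform.system(), platform.machine()
    status = "listed" if is_supported(system, machine) else "not listed"
    print(f"{system} / {machine}: {status}")
```

A result of "not listed" only means the pair is missing from the list above or from the assumed alias table; it does not prove that sherpa-onnx cannot be built for that host.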

In the following, we describe how to download pre-trained Fun-ASR-Nano-2512 models and use them in sherpa-onnx.