Configuration for a Silero VAD model.
More...
#include <c-api.h>
Definition at line 1848 of file c-api.h.
◆ max_speech_duration
| float SherpaOnnxSileroVadModelConfig::max_speech_duration |
Maximum speech duration in seconds.
When a segment exceeds this value, the detector temporarily uses a higher threshold to encourage a split.
Definition at line 1865 of file c-api.h.
◆ min_silence_duration
| float SherpaOnnxSileroVadModelConfig::min_silence_duration |
Minimum silence duration in seconds used to close a speech segment.
Definition at line 1854 of file c-api.h.
◆ min_speech_duration
| float SherpaOnnxSileroVadModelConfig::min_speech_duration |
Minimum speech duration in seconds to keep a detected segment.
Definition at line 1856 of file c-api.h.
◆ model
| const char* SherpaOnnxSileroVadModelConfig::model |
Path to silero_vad.onnx.
Definition at line 1850 of file c-api.h.
◆ threshold
| float SherpaOnnxSileroVadModelConfig::threshold |
Speech probability threshold. Frames above this value are speech.
Definition at line 1852 of file c-api.h.
◆ window_size
| int32_t SherpaOnnxSileroVadModelConfig::window_size |
Input window size in samples. A common value is 512.
Definition at line 1858 of file c-api.h.
The documentation for this struct was generated from the following file: