名称 最后更新
..
src/websocketsrv 正在载入提交数据...
.gitignore 正在载入提交数据...
AddPunctuation.java 正在载入提交数据...
AudioTaggingCEDFromFile.java 正在载入提交数据...
AudioTaggingZipformerFromFile.java 正在载入提交数据...
InverseTextNormalizationNonStreamingParaformer.java 正在载入提交数据...
InverseTextNormalizationStreamingTransducer.java 正在载入提交数据...
KeywordSpotterFromFile.java 正在载入提交数据...
NonStreamingDecodeFileNemo.java 正在载入提交数据...
NonStreamingDecodeFileParaformer.java 正在载入提交数据...
NonStreamingDecodeFileTeleSpeechCtc.java 正在载入提交数据...
NonStreamingDecodeFileTransducer.java 正在载入提交数据...
NonStreamingDecodeFileWhisper.java 正在载入提交数据...
NonStreamingTtsCoquiDe.java 正在载入提交数据...
NonStreamingTtsPiperEn.java 正在载入提交数据...
NonStreamingTtsVitsZh.java 正在载入提交数据...
README.md 正在载入提交数据...
SpeakerIdentification.java 正在载入提交数据...
SpokenLanguageIdentificationWhisper.java 正在载入提交数据...
StreamingDecodeFileCtc.java 正在载入提交数据...
StreamingDecodeFileCtcHLG.java 正在载入提交数据...
StreamingDecodeFileParaformer.java 正在载入提交数据...
StreamingDecodeFileTransducer.java 正在载入提交数据...
VadFromMic.java 正在载入提交数据...
VadFromMicWithNonStreamingParaformer.java 正在载入提交数据...
VadFromMicWithNonStreamingWhisper.java 正在载入提交数据...
VadNonStreamingParaformer.java 正在载入提交数据...
VadRemoveSilence.java 正在载入提交数据...
run-add-punctuation-zh-en.sh 正在载入提交数据...
run-audio-tagging-ced-from-file.sh 正在载入提交数据...
run-audio-tagging-zipformer-from-file.sh 正在载入提交数据...
run-inverse-text-normalization-paraformer.sh 正在载入提交数据...
run-inverse-text-normalization-transducer.sh 正在载入提交数据...
run-kws-from-file.sh 正在载入提交数据...
run-non-streaming-decode-file-nemo.sh 正在载入提交数据...
run-non-streaming-decode-file-paraformer.sh 正在载入提交数据...
run-non-streaming-decode-file-tele-speech-ctc.sh 正在载入提交数据...
run-non-streaming-decode-file-transducer.sh 正在载入提交数据...
run-non-streaming-decode-file-whisper.sh 正在载入提交数据...
run-non-streaming-tts-coqui-de.sh 正在载入提交数据...
run-non-streaming-tts-piper-en.sh 正在载入提交数据...
run-non-streaming-tts-vits-zh.sh 正在载入提交数据...
run-speaker-identification.sh 正在载入提交数据...
run-spoken-language-identification-whisper.sh 正在载入提交数据...
run-streaming-decode-file-ctc-hlg.sh 正在载入提交数据...
run-streaming-decode-file-ctc.sh 正在载入提交数据...
run-streaming-decode-file-paraformer.sh 正在载入提交数据...
run-streaming-decode-file-transducer.sh 正在载入提交数据...
run-vad-from-mic-non-streaming-paraformer.sh 正在载入提交数据...
run-vad-from-mic-non-streaming-whisper.sh 正在载入提交数据...
run-vad-from-mic.sh 正在载入提交数据...
run-vad-non-streaming-paraformer.sh 正在载入提交数据...
run-vad-remove-slience.sh 正在载入提交数据...

Introduction

This directory contains examples for the JAVA API of sherpa-onnx.

Usage

Streaming Speech recognition

./run-streaming-decode-file-ctc.sh
./run-streaming-decode-file-ctc-hlg.sh
./run-streaming-decode-file-paraformer.sh
./run-streaming-decode-file-transducer.sh

Non-Streaming Speech recognition

./run-non-streaming-decode-file-paraformer.sh
./run-non-streaming-decode-file-transducer.sh
./run-non-streaming-decode-file-whisper.sh
./run-non-streaming-decode-file-nemo.sh

Non-Streaming text-to-speech

./run-non-streaming-tts-piper-en.sh
./run-non-streaming-tts-coqui-de.sh
./run-non-streaming-tts-vits-zh.sh

Spoken language identification

./run-spoken-language-identification-whisper.sh

Add punctuations to text

The punctuation model supports both English and Chinese.

./run-add-punctuation-zh-en.sh

Audio tagging

./run-audio-tagging-zipformer-from-file.sh
./run-audio-tagging-ced-from-file.sh

Speaker identification

./run-speaker-identification.sh

VAD with a microphone

./run-vad-from-mic.sh

VAD with a microphone + Non-streaming Paraformer for speech recognition

./run-vad-from-mic-non-streaming-paraformer.sh

VAD with a microphone + Non-streaming Whisper tiny.en for speech recognition

./run-vad-from-mic-non-streaming-whisper.sh

VAD (Remove silence)

./run-vad-remove-slience.sh

VAD + Non-streaming Paraformer for speech recognition

./run-vad-non-streaming-paraformer.sh

Keyword spotter

./run-kws-from-file.sh