readme.zh.md 6.4 KB

原文件审查历史永久链接



使用指南

适用于 Windows / macOS / Linux（以 Windows 为例说明动态库加载）


1. 前提条件


Java 1.8+ 环境
下载并准备好以下内容：


Sherpa-ONNX Java API（Maven 依赖）
Kokoro TTS 模型文件（包含 model.onnx 等）


2. 添加 Maven 依赖

在你的 pom.xml 中添加如下依赖：

<dependency>
  <groupId>com.litongjava</groupId>
  <artifactId>sherpa-onnx-java-api</artifactId>
  <version>1.0.1</version>
</dependency>


3. 获取并配置本地动态链接库（JNI）


3.1 安装 ONNX Runtime


1. Windows 11

Starting from Windows 10 v1809 and all versions of Windows 11, the system comes with built-in ONNX Runtime as part of Windows ML (WinRT API), exposed through Windows.AI.MachineLearning.dll. You can directly use WinML to load and run ONNX models without additional downloads or installations.
(run-onnx-models)[https://learn.microsoft.com/en-us/windows/ai/new-windows-ml/run-onnx-models]


2. Linux

Sherpa-ONNX 并不包含 ONNX Runtime，需要手动下载并配置：


从微软官方 GitHub Releases 下载 Linux 64 位二进制包：


   wget https://github.com/microsoft/onnxruntime/releases/download/v1.17.1/onnxruntime-linux-x64-1.17.1.tgz
   tar -xzf onnxruntime-linux-x64-1.17.1.tgz


将解压后的 libonnxruntime.so 文件复制到系统库目录，并创建软链接：


   sudo cp onnxruntime-linux-x64-1.17.1/lib/libonnxruntime.so* /usr/local/lib/
   sudo ln -sf /usr/local/lib/libonnxruntime.so.1.17.1 /usr/local/lib/libonnxruntime.so


更新共享库缓存并验证安装：


   sudo ldconfig
   ldconfig -p | grep onnxruntime


3. macOS

Sherpa-ONNX 同样不包含 ONNX Runtime，需要从官方获取并配置：


下载 macOS ARM64 版本二进制包：


   wget https://github.com/microsoft/onnxruntime/releases/download/v1.17.1/onnxruntime-osx-arm64-1.17.1.tgz
   tar -xzf onnxruntime-osx-arm64-1.17.1.tgz


将 libonnxruntime.1.17.1.dylib 复制到 /usr/local/lib：


   sudo cp onnxruntime-osx-arm64-1.17.1/lib/libonnxruntime.1.17.1.dylib /usr/local/lib/


将 /usr/local/lib 添加到 dyld 的搜索路径：


   export DYLD_LIBRARY_PATH=/usr/local/lib:$DYLD_LIBRARY_PATH


使用 otool 验证：


   otool -L /Users/ping/lib/darwin_arm64/libsherpa-onnx-jni.dylib


3.2 常见错误与排查

错误示例：

Exception in thread "main" java.lang.UnsatisfiedLinkError: no sherpa-onnx-jni in java.library.path: ...


说明 JVM 没有在 java.library.path 中找到本地库。

排查步骤：


确认下载的是与你操作系统与架构匹配的版本（如 win-x64 vs arm64 等）。
用绝对路径测试：将 .dll 放在某个目录并运行：


   java -Djava.library.path=C:\full\path\to\jni -jar your-app.jar


打印或检查 java.library.path 内容（示例代码里可输出 System.getProperty("java.library.path")）。
避免通过反射修改 sys_paths（不要尝试 hack java.library.path 的内部字段，容易引发 NoSuchFieldException: sys_paths，建议直接用 -Djava.library.path）。


4. 下载并准备 Kokoro 模型

从官方 release 获取模型包（以英文 Kokoro v0.19 为例）：

https://k2-fsa.github.io/sherpa/onnx/tts/pretrained_models/kokoro.html


# 下载（手工或脚本）
# 例如从 GitHub releases:
wget https://github.com/k2-fsa/sherpa-onnx/releases/download/tts-models/kokoro-en-v0_19.tar.bz2

# 解压
tar -xjf kokoro-en-v0_19.tar.bz2

# 查看结构
ls -lh kokoro-en-v0_19/


该目录结构示例（解压后应包含）：

LICENSE
README.md
espeak-ng-data/        # 语音数据目录
model.onnx            # TTS 模型
tokens.txt           # token 映射
voices.bin           # voice embedding


确保这些路径在你的 Java 程序中指向正确的位置（相对或绝对皆可）。


5. 测试代码（Java 示例）

package com.litongjava.linux.tts;

import com.k2fsa.sherpa.onnx.GeneratedAudio;
import com.k2fsa.sherpa.onnx.OfflineTts;
import com.k2fsa.sherpa.onnx.OfflineTtsConfig;
import com.k2fsa.sherpa.onnx.OfflineTtsKokoroModelConfig;
import com.k2fsa.sherpa.onnx.OfflineTtsModelConfig;

public class NonStreamingTtsKokoroEn {
  public static void main(String[] args) {
    String model = "./kokoro-en-v0_19/model.onnx";
    String voices = "./kokoro-en-v0_19/voices.bin";
    String tokens = "./kokoro-en-v0_19/tokens.txt";
    String dataDir = "./kokoro-en-v0_19/espeak-ng-data";
    String text = "Today as always, men fall into two groups: slaves and free men. Whoever does not have"
        + " two-thirds of his day for himself, is a slave, whatever he may be: a statesman, a"
        + " businessman, an official, or a scholar.";

    OfflineTtsKokoroModelConfig kokoroModelConfig = OfflineTtsKokoroModelConfig.builder()
        .setModel(model)
        .setVoices(voices)
        .setTokens(tokens)
        .setDataDir(dataDir)
        .build();

    OfflineTtsModelConfig modelConfig = OfflineTtsModelConfig.builder()
        .setKokoro(kokoroModelConfig)
        .setNumThreads(2)
        .setDebug(true)
        .build();

    OfflineTtsConfig config = OfflineTtsConfig.builder()
        .setModel(modelConfig)
        .build();

    OfflineTts tts = new OfflineTts(config);

    int sid = 0;
    float speed = 1.0f;
    long start = System.currentTimeMillis();
    GeneratedAudio audio = tts.generate(text, sid, speed);
    long stop = System.currentTimeMillis();

    float timeElapsedSeconds = (stop - start) / 1000.0f;
    float audioDuration = audio.getSamples().length / (float) audio.getSampleRate();
    float real_time_factor = timeElapsedSeconds / audioDuration;

    String waveFilename = "tts-kokoro-en.wav";
    audio.save(waveFilename);
    System.out.printf("-- elapsed : %.3f seconds\n", timeElapsedSeconds);
    System.out.printf("-- audio duration: %.3f seconds\n", audioDuration);
    System.out.printf("-- real-time factor (RTF): %.3f\n", real_time_factor);
    System.out.printf("-- text: %s\n", text);
    System.out.printf("-- Saved to %s\n", waveFilename);

    tts.release();
  }
}


输出说明

成功执行后会输出类似：

-- elapsed : 6.739 seconds
-- audio duration: 6.739 seconds
-- real-time factor (RTF): 0.563
-- text: ...
-- Saved to tts-kokoro-en.wav


并在当前目录生成 tts-kokoro-en.wav，可以用任意音频播放器播放验证。