Introduction

This folder contains scripts that convert ASR models for mobile platforms. The converted models support only a batch size of 1.

The advantage of fixing the batch size to 1 is that it provides more opportunities for model optimization and quantization.

As a concrete example, here are the encoder file sizes for the following model: https://k2-fsa.github.io/sherpa/onnx/pretrained_models/online-transducer/zipformer-transducer-models.html#csukuangfj-sherpa-onnx-streaming-zipformer-bilingual-zh-en-2023-02-20-bilingual-chinese-english

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 315 MB | 174 MB |
| Batch size fixed to 1 | 242 MB | 100 MB |
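The relative savings for this model can be worked out from the four sizes listed above. The snippet below is only arithmetic on those numbers; `reduction_percent` is a hypothetical helper name.

```python
def reduction_percent(before_mb, after_mb):
    """Relative size reduction in percent, rounded to one decimal place."""
    return round(100.0 * (before_mb - after_mb) / before_mb, 1)


# (dynamic batch size, batch size fixed to 1), in MB, copied from the sizes above.
sizes_mb = {
    "encoder-epoch-99-avg-1.onnx": (315, 242),
    "encoder-epoch-99-avg-1.int8.onnx": (174, 100),
}

for name, (dynamic, fixed) in sizes_mb.items():
    print(f"{name}: {reduction_percent(dynamic, fixed)}% smaller")
# encoder-epoch-99-avg-1.onnx: 23.2% smaller
# encoder-epoch-99-avg-1.int8.onnx: 42.5% smaller
```

For this model, fixing the batch size helps the int8 file noticeably more than the unquantized one.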

The following Colab notebook provides examples of how to use the above two models.

WARNING: Tested with onnxruntime==1.16.3 and onnx==1.15.0.

pip install onnxruntime==1.16.3 onnx==1.15.0

More examples

sherpa-onnx-streaming-zipformer-korean-2024-06-16

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 279 MB | 122 MB |
| Batch size fixed to 1 | 264 MB | 107 MB |

sherpa-onnx-streaming-zipformer-en-20M-2023-02-17

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 85 MB | 41 MB |
| Batch size fixed to 1 | 75 MB | 32 MB |

sherpa-onnx-streaming-zipformer-multi-zh-hans-2023-12-12

| | encoder-epoch-20-avg-1-chunk-16-left-128.onnx | encoder-epoch-20-avg-1-chunk-16-left-128.int8.onnx |
|---|---|---|
| Dynamic batch size | 249 MB | 67 MB |
| Batch size fixed to 1 | 247 MB | 65 MB |

icefall-asr-zipformer-streaming-wenetspeech-20230615

| | encoder-epoch-12-avg-4-chunk-16-left-128.onnx | encoder-epoch-12-avg-4-chunk-16-left-128.int8.onnx |
|---|---|---|
| Dynamic batch size | 250 MB | 68 MB |
| Batch size fixed to 1 | 247 MB | 65 MB |

sherpa-onnx-streaming-zipformer-en-2023-06-26

| | encoder-epoch-99-avg-1-chunk-16-left-128.onnx | encoder-epoch-99-avg-1-chunk-16-left-128.int8.onnx |
|---|---|---|
| Dynamic batch size | 250 MB | 68 MB |
| Batch size fixed to 1 | 247 MB | 65 MB |

sherpa-onnx-streaming-zipformer-en-2023-06-21

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 338 MB | 180 MB |
| Batch size fixed to 1 | 264 MB | 107 MB |

sherpa-onnx-streaming-zipformer-en-2023-02-21

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 279 MB | 122 MB |
| Batch size fixed to 1 | 264 MB | 107 MB |

sherpa-onnx-streaming-zipformer-fr-2023-04-14

| | encoder-epoch-29-avg-9-with-averaged-model.onnx | encoder-epoch-29-avg-9-with-averaged-model.int8.onnx |
|---|---|---|
| Dynamic batch size | 279 MB | 121 MB |
| Batch size fixed to 1 | 264 MB | 107 MB |

sherpa-onnx-streaming-zipformer-small-bilingual-zh-en-2023-02-16

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 85 MB | 41 MB |
| Batch size fixed to 1 | 75 MB | 32 MB |

sherpa-onnx-streaming-zipformer-zh-14M-2023-02-23

| | encoder-epoch-99-avg-1.onnx | encoder-epoch-99-avg-1.int8.onnx |
|---|---|---|
| Dynamic batch size | 40 MB | 21 MB |
| Batch size fixed to 1 | 33 MB | 15 MB |

sherpa-onnx-kws-zipformer-wenetspeech-3.3M-2024-01-01

| | encoder-epoch-12-avg-2-chunk-16-left-64.onnx | encoder-epoch-12-avg-2-chunk-16-left-64.int8.onnx |
|---|---|---|
| Dynamic batch size | 12 MB | 4.6 MB |
| Batch size fixed to 1 | 11 MB | 3.9 MB |

sherpa-onnx-kws-zipformer-gigaspeech-3.3M-2024-01-01

| | encoder-epoch-12-avg-2-chunk-16-left-64.onnx | encoder-epoch-12-avg-2-chunk-16-left-64.int8.onnx |
|---|---|---|
| Dynamic batch size | 12 MB | 4.6 MB |
| Batch size fixed to 1 | 11 MB | 3.9 MB |