Toggle navigation
Toggle navigation
此项目
正在载入...
Sign in
xuning
/
sherpaonnx
转到一个项目
Toggle navigation
项目
群组
代码片段
帮助
Toggle navigation pinning
Project
Activity
Repository
Pipelines
Graphs
Issues
0
Merge Requests
0
Wiki
Network
Create a new issue
Builds
Commits
Authored by
Fangjun Kuang
2024-10-02 18:13:34 +0800
Browse Files
Options
Browse Files
Download
Email Patches
Plain Diff
Committed by
GitHub
2024-10-02 18:13:34 +0800
Commit
66feecb2b55917788a6852e23cdbbd9489b61d30
66feecb2
1 parent
b965f14c
support whisper turbo (#1390)
隐藏空白字符变更
内嵌
并排对比
正在显示
2 个修改的文件
包含
7 行增加
和
5 行删除
.github/workflows/export-whisper-to-onnx.yaml
scripts/whisper/export-onnx.py
.github/workflows/export-whisper-to-onnx.yaml
查看文件 @
66feecb
...
...
@@ -16,7 +16,7 @@ jobs:
fail-fast
:
false
matrix
:
os
:
[
macos-latest
]
model
:
[
"
distil-medium.en"
,
"
distil-small.en"
,
"
tiny.en"
,
"
base.en"
,
"
small.en"
,
"
medium.en"
,
"
tiny"
,
"
base"
,
"
small"
,
"
medium"
,
"
medium-aishell"
,
"
large"
,
"
large-v1"
,
"
large-v2"
,
"
large-v3"
,
"
distil-large-v2"
]
model
:
[
"
turbo"
,
"
distil-medium.en"
,
"
distil-small.en"
,
"
tiny.en"
,
"
base.en"
,
"
small.en"
,
"
medium.en"
,
"
tiny"
,
"
base"
,
"
small"
,
"
medium"
,
"
medium-aishell"
,
"
large"
,
"
large-v1"
,
"
large-v2"
,
"
large-v3"
,
"
distil-large-v2"
]
# model: ["large", "large-v1", "large-v2", "large-v3", "distil-large-v2"]
python-version
:
[
"
3.8"
]
...
...
@@ -32,7 +32,8 @@ jobs:
shell
:
bash
run
:
|
python3 -m pip install torch==1.13.0 torchaudio==0.13.0 -f https://download.pytorch.org/whl/cpu/torch_stable.html
python3 -m pip install openai-whisper==20231117 onnxruntime onnx soundfile librosa
python3 -m pip install -U openai-whisper
python3 -m pip install onnxruntime onnx soundfile librosa
-
name
:
export ${{ matrix.model }}
shell
:
bash
...
...
scripts/whisper/export-onnx.py
查看文件 @
66feecb
...
...
@@ -46,7 +46,8 @@ def get_args():
choices
=
[
"tiny"
,
"tiny.en"
,
"base"
,
"base.en"
,
"small"
,
"small.en"
,
"medium"
,
"medium.en"
,
"large"
,
"large-v1"
,
"large-v2"
,
"large-v3"
,
"large-v1"
,
"large-v2"
,
"large"
,
"large-v3"
,
"turbo"
,
# these three have feature dim 128
"distil-medium.en"
,
"distil-small.en"
,
"distil-large-v2"
,
# "distil-large-v3", # distil-large-v3 is not supported!
# for fine-tuned models from icefall
...
...
@@ -76,7 +77,7 @@ def add_meta_data(filename: str, meta_data: Dict[str, Any]):
meta
.
key
=
key
meta
.
value
=
str
(
value
)
if
"large"
in
filename
:
if
"large"
in
filename
or
"turbo"
in
filename
:
external_filename
=
filename
.
split
(
".onnx"
)[
0
]
onnx
.
save
(
model
,
...
...
@@ -404,7 +405,7 @@ def main():
audio
=
whisper
.
pad_or_trim
(
audio
)
assert
audio
.
shape
==
(
16000
*
30
,),
audio
.
shape
if
args
.
model
in
(
"large"
,
"large-v3"
):
if
args
.
model
in
(
"large"
,
"large-v3"
,
"turbo"
):
n_mels
=
128
else
:
n_mels
=
80
...
...
请
注册
或
登录
后发表评论