Belle-distilwhisper-large-v2-zh open-source speech recognition model - fast Chinese recognition, fewer parameters, easy to deploy


Belle Distilwhisper Large V2 Zh

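Developed by BELLE-2

A Chinese speech recognition model fine-tuned from distilwhisper-large-v2. It runs at 5.8 times the speed of whisper-large-v2 and has 51% fewer parameters.

Speech recognition

Transformers

License: Apache-2.0 #ChineseSpeechRecognition #EfficientInference #LightweightModel

Downloads: 230

Released: 12/29/2023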

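Model overview

This model is a fine-tuned version of distilwhisper-large-v2 that specifically strengthens Chinese speech recognition. Compared with the original whisper-large-v2, it infers considerably faster while keeping accuracy at a comparable level.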

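Model features

Efficient inference

Inference runs at 5.8 times the speed of whisper-large-v2, which shortens processing time considerably.

Fewer parameters

The parameter count is 51% lower, so the model is smaller.

Optimized for Chinese

The model is tuned specifically for Chinese speech recognition; the original distilwhisper-large-v2 cannot transcribe Chinese.

Accuracy

Compared with whisper-large-v2, the relative improvement in CER on several Chinese datasets ranges from -3% to 35% (a negative value means a slight regression).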

Model capabilities

Chinese speech recognition

Efficient speech transcription

Large-scale speech processing

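Use cases

Speech transcription

Meeting minutes

Automatically transcribe recordings of Chinese-language meetings into text.

CER of 17.039% on the meeting test set (wenetspeech_meeting).

Voice notes

Convert personal voice memos into text.

Speech analytics

Customer service call analysis

Analyze the content of customer service calls.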
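
The CER figure quoted for the meeting use case is a character error rate. The page does not say how it was computed (for example, whether punctuation was removed or text was normalized first), so the following is only a minimal sketch of the standard definition: the character-level edit distance between reference and hypothesis, divided by the number of reference characters. The function names are illustrative and do not come from the BELLE code.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance between two strings, counted in characters."""
    # previous[j] holds the distance between the reference prefix seen so far
    # and the first j characters of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_char != hyp_char)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


def character_error_rate(reference: str, hypothesis: str) -> float:
    """CER = (substitutions + deletions + insertions) / reference length."""
    if not reference:
        raise ValueError("reference must not be empty")
    return edit_distance(reference, hypothesis) / len(reference)
```

With this definition, a six-character reference with one wrongly recognized character yields a CER of 1/6. Because insertions are counted, CER can exceed 100% on very poor transcripts.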

🚀 Belle-distilwhisper-large-v2-zh

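Belle-distilwhisper-large-v2-zh fine-tunes distilwhisper-large-v2 to strengthen Chinese speech recognition. Like distilwhisper-large-v2, it is 5.8 times faster than whisper-large-v2 and has 51% fewer parameters. Despite the smaller size, its relative improvement in CER over whisper-large-v2 ranges from -3% to 35%, where a negative value means a slight regression.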

🚀 Quick start

If you find this model helpful, please click "like" on the model page and star our projects at the following links:

https://github.com/LianjiaTech/BELLE
https://github.com/shuaijiang/Whisper-Finetune

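✨ Key features

Fast: Belle-distilwhisper-large-v2-zh is 5.8 times faster than whisper-large-v2.
Fewer parameters: it has 51% fewer parameters than whisper-large-v2.
Accuracy: despite the smaller size, its relative improvement in CER over whisper-large-v2 ranges from -3% to 35%.
Chinese support: the original distilwhisper-large-v2 cannot transcribe Chinese (it outputs English only), whereas this model does.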

💻 Usage examples

Basic usage

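from transformers import pipeline

# Load the fine-tuned model into an automatic speech recognition pipeline.
transcriber = pipeline(
  "automatic-speech-recognition",
  model="BELLE-2/Belle-distilwhisper-large-v2-zh"
)

# Force the decoder prompt to Chinese transcription.
transcriber.model.config.forced_decoder_ids = (
  transcriber.tokenizer.get_decoder_prompt_ids(
    language="zh",
    task="transcribe"
  )
)

# The pipeline returns a dict; the recognized text is under the "text" key.
transcription = transcriber("my_audio.wav")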
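
For the meeting and customer-service scenarios mentioned earlier, the basic usage is typically applied to many files at once. The following is a minimal sketch of such a loop, not part of the original model card: `transcribe_files` and `build_transcriber` are hypothetical helper names, and the only library behavior assumed is that the Transformers ASR pipeline returns a dict with a `"text"` key. `build_transcriber` repeats the setup from the basic usage example.

```python
from typing import Callable, Dict, Iterable

MODEL_ID = "BELLE-2/Belle-distilwhisper-large-v2-zh"


def transcribe_files(
    transcriber: Callable[[str], dict], paths: Iterable[str]
) -> Dict[str, str]:
    """Run an ASR callable over several audio files and collect the text."""
    results = {}
    for path in paths:
        output = transcriber(path)  # expected shape: {"text": "..."}
        results[path] = output["text"].strip()
    return results


def build_transcriber():
    """Create the pipeline exactly as in the basic usage example."""
    # Imported lazily so transcribe_files can be used and tested
    # without transformers installed.
    from transformers import pipeline

    transcriber = pipeline("automatic-speech-recognition", model=MODEL_ID)
    transcriber.model.config.forced_decoder_ids = (
        transcriber.tokenizer.get_decoder_prompt_ids(
            language="zh", task="transcribe"
        )
    )
    return transcriber
```

A call such as `transcribe_files(build_transcriber(), ["my_audio.wav"])` would return a mapping from file path to recognized text. Passing the transcriber in as an argument keeps the loop independent of how the model is loaded, which also makes it easy to substitute a stub in tests.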

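🔧 Technical details

Fine-tuning information

Property	Details
Model type	Belle-distilwhisper-large-v2-zh
Sampling rate	16 kHz
Training datasets	AISHELL-1, AISHELL-2, WenetSpeech, HKUST
Fine-tuning method	Full fine-tuning

To fine-tune this model on your own dataset, see the GitHub repository.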
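
Since the table lists a 16 kHz sampling rate, it can be useful to check recordings before feeding raw waveforms to the model. The sketch below uses only Python's standard `wave` module and works for uncompressed PCM WAV files; the helper names are hypothetical. Note that when the pipeline is given a file path it normally decodes and resamples the audio itself, so this check matters mostly when you load audio yourself.

```python
import io
import wave

TARGET_SAMPLE_RATE = 16_000  # sampling rate from the fine-tuning table


def wav_sample_rate(source) -> int:
    """Return the sample rate in Hz of a PCM WAV (path or binary file object)."""
    with wave.open(source, "rb") as wav_file:
        return wav_file.getframerate()


def needs_resampling(source, target_rate: int = TARGET_SAMPLE_RATE) -> bool:
    """True when the file's sample rate differs from the model's expected rate."""
    return wav_sample_rate(source) != target_rate


def make_silent_wav(sample_rate: int, seconds: float = 0.1) -> io.BytesIO:
    """Build a small in-memory mono 16-bit WAV, for demonstration only."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(1)
        wav_file.setsampwidth(2)  # 2 bytes per sample = 16-bit PCM
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * int(sample_rate * seconds))
    buffer.seek(0)
    return buffer
```

For example, `needs_resampling(make_silent_wav(8000))` flags an 8 kHz telephone-style recording, while a 16 kHz file passes. Actual resampling is left to an audio library of your choice.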

Character error rate (CER)

Model	Parameters (M)	Language tag	aishell_1_test (↓)	aishell_2_test (↓)	wenetspeech_net (↓)	wenetspeech_meeting (↓)	HKUST_dev (↓)
whisper-large-v2	1550	Chinese	8.818%	6.183%	12.343%	26.413%	31.917%
distilwhisper-large-v2	756	Chinese	-	-	-	-	-
Belle-distilwhisper-large-v2-zh	756	Chinese	5.958%	6.477%	12.786%	17.039%	20.771%
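
The relative figures quoted on this page can be recomputed from the table. The short sketch below copies the CER values and parameter counts from the rows above and computes the relative reduction with respect to whisper-large-v2; the variable and function names are illustrative, and the results may differ slightly from the rounded range quoted in the text.

```python
# CER values (percent) and parameter counts (millions) copied from the table.
BASELINE_CER = {
    "aishell_1_test": 8.818,
    "aishell_2_test": 6.183,
    "wenetspeech_net": 12.343,
    "wenetspeech_meeting": 26.413,
    "HKUST_dev": 31.917,
}
FINETUNED_CER = {
    "aishell_1_test": 5.958,
    "aishell_2_test": 6.477,
    "wenetspeech_net": 12.786,
    "wenetspeech_meeting": 17.039,
    "HKUST_dev": 20.771,
}
BASELINE_PARAMS_M = 1550
FINETUNED_PARAMS_M = 756


def relative_reduction(baseline: float, new: float) -> float:
    """Relative reduction of `new` with respect to `baseline`, in percent."""
    return (baseline - new) / baseline * 100.0


# Positive values mean the fine-tuned model has a lower (better) CER.
cer_reduction = {
    name: relative_reduction(BASELINE_CER[name], FINETUNED_CER[name])
    for name in BASELINE_CER
}
param_reduction = relative_reduction(BASELINE_PARAMS_M, FINETUNED_PARAMS_M)

for name, value in cer_reduction.items():
    print(f"{name}: {value:+.1f}%")
print(f"parameters: {param_reduction:.1f}% fewer")
```

Under this calculation the meeting and HKUST sets show the largest gains, at roughly 35%, while aishell_2_test and wenetspeech_net come out slightly negative, which is where the lower end of the quoted range comes from. The parameter reduction works out to about 51%.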

📄 License

This project is released under the Apache-2.0 license.

📚 Citation

If you use our code, data, or model, please cite our paper and GitHub project:

@misc{BELLE,
  author = {BELLEGroup},
  title = {BELLE: Be Everyone's Large Language model Engine},
  year = {2023},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/LianjiaTech/BELLE}},
}