锐英源软件
第一信赖

精通

英语

开源

擅长

开发

培训

胸怀四海 

第一信赖

当前位置:锐英源 / 开源技术 / 语音识别开源 / dnn的10个最佳WER

服务方向

人工智能数据处理
人工智能培训
kaldi数据准备
小语种语音识别
语音识别标注
语音识别系统
语音识别转文字
kaldi开发技术服务
软件开发
运动控制卡上位机
机械加工软件
软件开发培训
Java 安卓移动开发
VC++
C#软件
汇编和破解
驱动开发

联系方式

固话:0371-63888850
手机:138-0381-0136
Q Q:396806883
微信:ryysoft

dnn的10个最佳WER

First, thanks to provide awesome speech recognition toolkit & this forum.

I am currently running my own 'isolated word dataset' recognition system in KALDI.

And I am successfully train this system with DNN, and get a 1-best decoding result.

Here is my script for scoring from local/score.sh首先,感谢您提供出色的语音识别工具包和本论坛。

我目前正在KALDI中运行自己的“孤立单词数据集”识别系统。

而且我已经成功地使用DNN训练了该系统,并获得了1最好的解码结果。

这是我用于从local / score.sh评分的脚本

"$cmd LMWT=$min_lmwt:$max_lmwt $dir/scoring/log/best_path.LMWT.log \ lattice-scale --inv-acoustic-scale=LMWT "ark:gunzip -c $dir/lat.*.gz|" ark:- \| \ lattice-add-penalty --word-ins-penalty=$word_ins_penalty ark:- ark:- \| \ lattice-best-path --word-symbol-table=$symtab \ ark:- ark,t:$dir/scoring/LMWT.tra || exit 1; "

I modify this script to get 10-best result by adding 'lattice-to-nbest' line like below :

"$cmd LMWT=$min_lmwt:$max_lmwt $dir/scoring/log/best_path.LMWT.log \ lattice-scale --inv-acoustic-scale=LMWT "ark:gunzip -c $dir/lat.*.gz|" ark:- \| \ lattice-add-penalty --word-ins-penalty=$word_ins_penalty ark:- ark:- \| \ lattice-to-nbest --acoustic-scale=0.1 --n=10 ark:- ark:- \| \ lattice-best-path --word-symbol-table=$symtab \ ark:- ark,t:$dir/scoring/LMWT.tra || exit 1; "

But I got nan WER from this script.

Can anyone give me clue how to get 10-best result?但是我从这个脚本中得到了WER。

谁能给我提示如何获得10个最佳结果?

 

 

if you inspect $dir/scoring/*.tra files and compare them with the previous files, you will notice, that the utterance ID is different for the
"lattice-to-nbest | lattice-best-path", for example UTT-A-1 instead of UTT-A. The program compute-wer then cannot find the reference (as it is looking for UTT-A-1) instead of UTT-A

BTW: I find the concept of nbest wer strange --haven't heard of it yet. Even if there would exist something like that, "lattice-to-nbest |
lattice-best-path" is almost certanly not doing anything better that just lattice-best-path -- lattice-to-nbest selects n best paths through the
lattice and lattice-best-path chooses the best one from these n paths, i.e. you will end up with 1-best path again (and I think it will be exactly the same as the path obtained from lattice-best-path).如果您检查$ dir / scoring / *。tra文件并将它们与以前的文件进行比较,您会注意到,
“lattice-to-nbest | lattice-best-path”的话语ID是不同的,例如UTT- A-1代替UTT-A。
然后,程序计算人员无法找到参考(因为它正在寻找UTT-A-1)而不是UTT-A

顺便说一句:我发现最好的概念很奇怪-还没有听说过。即使存在这样的情况,“lattice-to-nbest | lattice-best-path”几乎没有做得更好,只是lattice-to-nbest | lattice-best-path选择了穿过晶格的n条最佳路径然后,battery-best-path从这n条路径中选择最佳路径,即,您将再次获得1-best路径(我认为它将与从grid-best-path获得的路径完全相同)。

 

 

I think there might be a way to compute the n-best oracle WER given existing command line tools. The program lattice-oracle computes the lattice oracle WER; and if the input lattices consist of the n-best alternatives it will give you what you want. You can achieve this by piping your lattices through lattice-to-nbest | nbest-to-lattice.我认为,鉴于现有的命令行工具,也许有一种方法可以计算出n个最佳的Oracle WER 。
程序lattice-oracle计算格形oracle的WER;如果输入晶格由n个最佳选择组成,它将为您提供所需的内容 。您可以通过将晶格排列成nbest | nbest 最好到晶格。

 

 

I will try piping lattices through lattice-to-nbest | nbest-to-lattice too.Thank you ! I find n-best decoding result in *.tra files.I can manually analyze result from these files.我将尝试通过lattice-to-nbest | nbest-to-lattice管道方法处理晶格。我在* .tra文件中找到n个最佳解码结果。我可以手动分析这些文件的结果。

友情链接
版权所有 Copyright(c)2004-2021 锐英源软件
公司注册号:410105000449586 豫ICP备08007559号 最佳分辨率 1024*768
地址:郑州大学北校区院(文化路97号院)内