论文部分内容阅读
在汉语数码语音识别( M D S R)中,“2”和“8”是最易混淆的一对语音。文章分析了“2”和“8”混淆的原因,发现可用于分辨“2”和“8”的区别特征在于其共振峰轨迹的差异。因此文章提出了基于共振峰轨迹的判决算法( F T B D)来分辨“2”和“8”。实验表明,使用 F T B D 算法,使 M D S R识别率从960% 提高到 977% ,“2”和“8”的识别率从 91% 提高到99% ,消除了这对语音的混淆,提高了 M D S R 的整体性能
In Chinese Digital Speech Recognition (MDSR), “2” and “8” are the most confusing pairs of voices. The article analyzes the reasons for the confusion between “2” and “8” and found that the distinctions that can be used to distinguish “2” and “8” are characterized by differences in their formant trajectories. Therefore, the paper proposes a formant algorithm based on formant loci (F T B D) to distinguish “2” and “8”. Experiments show that using F T B D algorithm, the recognition rate of M D S R is improved from 96.0% to 97.7%, and the recognition rate of “2” and “8” is increased from 91% to 99% Obfuscation of speech improves the overall performance of MDSR