高級檢索

數據挖掘技術在語音識別中的應用

Application of Data Mining Techniques in Speech Recognition

  • 摘要: 通過數據挖掘技術實現對語音來源的識別,從而完成對說話人身份的認證以及操作權限的分配,具有非常重要的理論和實際意義🚣🏿。主要針對相同和不同語音內容兩個類別的說話人語音識別進行了研究。通過在說話人識別領域廣泛應用的梅爾頻率倒譜系數進行語音的特征提取,並結合動態時間規整算法進行模式匹配分類。特別地,在不同的語音內容識別探究中,在采用動態時間規整算法前,結合了K-means++算法以及主成分分析算法來對梅爾頻率倒譜系數矩陣進行降維和聚類🦸🏻,以保證待匹配模板的維度相近或相同。結果表明🥷,在相同語音內容的識別過程中,選擇合適的閾值可以獲得較好的識別效果🏓。

     

    Abstract: Using the data mining techniques to recognize the speech sources, certify the speaker identities and assign the operation permissions is quite meaningful in both theoretical and practical senses. This paper mainly investigates two types of speech recognition. One is based on the same voice contents, while the other is on different voice contents. For the algorithms, the widely used Mel frequency cepstral coefficient (MFCC) algorithm is adopted for the feature extraction; and dynamic time warping algorithm are combined to classify the patterns. In particular, K-means++ algorithm and principle component analysis algorithm are added before the use of dynamic time warping algorithm for the second type. As a result, in the type of the same voice contents, once an appropriate threshold is selected, a good recognition effect can be derived.

     

/

返回文章
返回
摩臣5娱乐专业提供:摩臣5娱乐摩臣5摩臣5平台等服务,提供最新官网平台、地址、注册、登陆、登录、入口、全站、网站、网页、网址、娱乐、手机版、app、下载、欧洲杯、欧冠、nba、世界杯、英超等,界面美观优质完美,安全稳定,服务一流🏰🧑🏿‍🦲,摩臣5娱乐欢迎您。 摩臣5娱乐官網xml地圖