    Please use this permanent URL to cite or link to this item: http://163.15.40.127/ir/handle/987654321/1918


    Title: Robust Several-Speaker Speech Recognition with Highly Dependable Online Speaker Adaptation and Identification
    Authors: Shih, Po-Yi
    Lin, Po-Chuan
    Wang, Jhing-Fa
    Lin, Yuan-Ning
    林博川
    (Department of Electronics and Information, Tung Fang Design Institute)
    Contributors: Department of Electronics and Information, Tung Fang Design Institute
    Keywords: Speech recognition
    Speaker adaptation
    Speaker identification
    Dependable adaptation
    Confidence score
    Date: 2010-09
    Upload time: 2015-07-14 14:34:23 (UTC+8)
    Abstract: Current adaptation mechanisms adapt a single acoustic model to one speaker in a speaker-independent speech recognition system. However, when more users share the same speech recognizer, single-model adaptation leads to negative adaptation whenever the recognizer switches between users, which makes the adaptation undependable. Considering a smart home or an office with staff members, this paper presents speaker-specific acoustic model adaptation based on a multi-model mechanism to solve the problem of undependable adaptation. First, the identity of the current speaker is confirmed using an SVM classifier; the corresponding acoustic parameters are then extracted and integrated with the speaker-independent acoustic model to yield a speaker-dependent acoustic model, which improves speech recognition accuracy for the current speaker. To provide dependable adaptation data and achieve positive online speaker adaptation, a confidence-score mechanism verifies each recognition result and determines whether it can serve as an adaptation datum. The experimental results indicate that the proposed system increases the average speech recognition accuracy from 62% to 85%. Thus, the proposed system achieves robust several-speaker speech recognition with highly dependable online speaker adaptation and identification.
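    Relation: Journal of Network and Computer Applications, Vol. 34, No. 5, pp. 1459–1467
    Appears in Collections: [Department of Electronics and Information (Department of Game and Animation, Animation Program)] Journal Articles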
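
The multi-model mechanism described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation. The class name `MultiModelAdapter`, the confidence threshold of 0.8, and the `prior_weight` parameter are all hypothetical, and a simple MAP-style mean interpolation stands in for whatever adaptation method the paper actually uses. The speaker ID is taken as given (the paper obtains it from an SVM classifier), and the confidence score is assumed to be supplied by the recognizer.

```python
class MultiModelAdapter:
    """Keeps one set of adaptation statistics per speaker (hypothetical sketch)."""

    def __init__(self, si_mean, threshold=0.8, prior_weight=10.0):
        self.si_mean = list(si_mean)      # speaker-independent model mean
        self.threshold = threshold        # assumed confidence cut-off
        self.prior_weight = prior_weight  # how strongly to trust the SI model
        self.stats = {}                   # speaker_id -> (feature sum, count)

    def _get(self, speaker_id):
        # Unknown speakers start with empty statistics.
        return self.stats.get(speaker_id, ([0.0] * len(self.si_mean), 0))

    def observe(self, speaker_id, features, confidence):
        """Accept an utterance as adaptation data only if it is trusted."""
        if confidence < self.threshold:
            return False  # rejected: low-confidence results never adapt a model
        total, count = self._get(speaker_id)
        new_total = [t + f for t, f in zip(total, features)]
        self.stats[speaker_id] = (new_total, count + 1)
        return True

    def model_for(self, speaker_id):
        """Blend the SI mean with this speaker's accepted data (MAP-style)."""
        total, count = self._get(speaker_id)
        w = self.prior_weight
        return [(w * m + s) / (w + count) for m, s in zip(self.si_mean, total)]
```

Because statistics are stored per speaker, data accepted for one user never shifts the model used for another, which is the property the abstract calls dependable adaptation. A speaker with no accepted data simply falls back to the speaker-independent mean.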

    Files in this item:

    There are no files associated with this item.



    All items in TFIR are protected by original copyright.
