TUNG FANG Institutional Repository:Item 987654321/1911


    Please use this permanent URL to cite or link to this item: http://163.15.40.127/ir/handle/987654321/1911


    Title: Hardware/Software Co-design for Fast-trainable Speaker Identification System Based on SMO
    Authors: Peng, Jr-Shiang
    Wang, Jhing-Fa
    Wang, Jia-Ching
    Lin, Po-Chuan
    Kuan, Ta-Wen
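    林博川
    (Department of Electronics and Information, Tung Fang Design Institute)
    Contributors: Department of Electronics and Information, Tung Fang Design Institute
    Keywords: Speaker Identification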
    Hardware/Software Codesign
    Sequential Minimal Optimization (SMO)
    Date: 2011-10
    Upload time: 2015-07-14 14:23:00 (UTC+8)
    Publisher: Anchorage, Alaska, USA
    Abstract: Embedded speaker identification is a popular research topic, but most current systems cannot be trained quickly. Because computational capability is limited in embedded environments, long waiting times usually make the human-machine interface unfriendly. This paper presents a hardware/software (HW/SW) co-design solution for a fast-trainable speaker identification system. Fast training gives the embedded system high flexibility and makes it more convenient for a wide range of real-world applications. The proposed system consists of a training phase and a multiclass identification phase. The sequential minimal optimization (SMO) training algorithm carries the heaviest computational load and is realized as a dedicated VLSI module, i.e., the hardware component. The other processes, such as speech preprocessing, speech feature extraction, and the SVM voting strategy, are implemented in software. Moreover, a data-packed mechanism is presented to improve bandwidth utilization. Compared with embedded C code running on an ARM processor, our system reduces training time by 90% and achieves an 89.9% identification rate on the NIST 2010 speaker recognition database. The proposed system was tested and found to be fully functional on a Socle CDK prototype system with an AMBA-based Xilinx FPGA and an ARM926EJ processor.
    Relation: IEEE International Conference on Systems, Man, and Cybernetics conference digest, pp.1621-1625
    IEEE SMC 2011
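    Appears in Collections: [Department of Electronics and Information (Department of Game and Animation, Animation Program)] Conference Papers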
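
    The abstract mentions a software "SVM voting strategy" for the multiclass identification phase but does not say which scheme is used. As a minimal sketch only, assuming a one-vs-one "max-wins" vote over pairwise binary classifiers (an assumption, not confirmed by the record), the idea might look like the following. The names `vote`, `pairwise_decision`, and `toy_decision` are hypothetical and introduced here for illustration.

```python
from itertools import combinations


def vote(speakers, pairwise_decision, features):
    """Max-wins voting over one-vs-one binary classifiers (assumed scheme).

    pairwise_decision(a, b, features) is a hypothetical callable that returns
    a positive score when the binary classifier for the pair (a, b) favours
    speaker a, and a non-positive score when it favours speaker b.
    """
    tally = {s: 0 for s in speakers}
    for a, b in combinations(speakers, 2):
        winner = a if pairwise_decision(a, b, features) > 0 else b
        tally[winner] += 1
    # Ties go to the speaker listed first (an arbitrary choice for this sketch).
    return max(speakers, key=lambda s: tally[s])


def toy_decision(a, b, x):
    """Stand-in for a trained SVM: favours the speaker whose centroid is nearer."""
    centroids = {"alice": 0.0, "bob": 5.0, "carol": 10.0}
    return abs(x - centroids[b]) - abs(x - centroids[a])


print(vote(["alice", "bob", "carol"], toy_decision, 4.2))  # prints "bob"
```

    In a real system each pairwise decision would come from an SVM trained with SMO rather than from the toy centroid rule used here.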

    Files in this item:

    There are no files associated with this item.



    All items in TFIR are protected by original copyright.
