• 中国科技核心期刊
  • Scopus收录期刊
  • DOAJ收录期刊
  • JST收录期刊
  • Euro Pub收录期刊
CHEN Xiao-yi, WANG Ying-min. An Improved Blind Speech Separation Algorithm via Time-frequency Masking Based on a Single Acoustic Vector Sensor[J]. Journal of Unmanned Undersea Systems, 2015, 23(2): 098-103. doi: 10.11993/j.issn.1673-1948.2015.02.005
Citation: CHEN Xiao-yi, WANG Ying-min. An Improved Blind Speech Separation Algorithm via Time-frequency Masking Based on a Single Acoustic Vector Sensor[J]. Journal of Unmanned Undersea Systems, 2015, 23(2): 098-103. doi: 10.11993/j.issn.1673-1948.2015.02.005

An Improved Blind Speech Separation Algorithm via Time-frequency Masking Based on a Single Acoustic Vector Sensor

doi: 10.11993/j.issn.1673-1948.2015.02.005
  • Received Date: 2014-12-02
  • Rev Recd Date: 2015-01-05
  • Publish Date: 2015-04-20
  • An improved blind speech separation algorithm is presented based on the direction of arrival(DOA) estimation, which is obtained by the precise direction finding ability of a single acoustic vector sensor(AVS). The proposed algorithm works in time-frequency domain, in which the probability at each time-frequency unit of a specific source is estimated via an expectation-maximization(EM) algorithm based on the von Mises distribution mixture model. Because the mean value is difficult to estimate when the reverberation level is high or the sources are placed closely, a simple but effective improved algorithm is proposed, and is verified via simulation under different reverberation level, direction difference and source number. Simulation results show that the improved algorithm is superior to the binary time-frequency masking algorithm and the soft time-frequency masking algorithm in terms of signal-to-distortion ratio(SDR) and perceptual evaluation of speech quality(PESQ)

     

  • loading
  • [1]
    Comon P, Jutten C. Handbook of Blind Source Separation: Independent Component Analysis and Applications[J]. IEEE Signal Processing Magazine, 2010, 30(2):133-134.
    [2]
    Yilmaz O, Rickard S. Blind Separation of Speech Mixtures via Time-Frequency Masking[J]. IEEE Transactions on Signal Processing, 2004, 52(7): 1830-1847.
    [3]
    Nehorai A, Paldi E. Acoustic Vector-Sensor Array Processing[J]. IEEE Transaction on Signal Processing, 1994, 42(9): 2481-2489.
    [4]
    Shujau M, Ritz C H, Burnet I S. Separation of Speech Sources using an Acoustic Vector Sensor[C]//IEEE International Workshop on Multimedia Signal Processing, 2011.
    [5]
    Gunel B, Hachabiboglu H, Kondoz A M. Acoustic Source Separation of Convolutive Mixtures based on Intensity Vector Statistics[J]. IEEE Transactions on Audio, Speech and Language Processing, 2008, 16(4): 748-756.
    [6]
    Hung W L, Chang-Chien S J, Yang M S. Self-Updating Clustering Algorithm for Estimating the Parameters in Mixtures of von Mises Distributions[J]. Journal of Applied Statistics, 2012, 39(10): 2259-2274.
    [7]
    Allen J B, Berkley D A. Image Method for Efficiently Simulating Small-Room Acoustics[J]. Journal of the Acoustical Society of America, 1979, 65(4): 943-950.
    [8]
    Thiede T, Treurniet W C, Bitto R. PEAQ-The ITU Standard for Objective Measurement of Perceived Audio Quality[J]. Journal of the Audio Engineering Society, 2000, 48(1): 3-29.
  • 加载中

Catalog

    通讯作者: 陈斌, bchen63@163.com
    • 1. 

      沈阳化工大学材料科学与工程学院 沈阳 110142

    1. 本站搜索
    2. 百度学术搜索
    3. 万方数据库搜索
    4. CNKI搜索

    Article Metrics

    Article Views(941) PDF Downloads(464) Cited by()
    Proportional views
    Related
    Service
    Subscribe

    /

    DownLoad:  Full-Size Img  PowerPoint
    Return
    Return