>
Fa   |   Ar   |   En
   Binaural Speech Separation Using Binary and Ratio Time-Frequency Masks  
   
نویسنده Mahmoodzadch A. ,Abutalebi H. R. ,Soltanian-Zadeh H. ,Sheikhzadeh H.
منبع international journal of information and communication technology research - 2014 - دوره : 6 - شماره : 3 - صفحه:15 -24
چکیده    In many speech applications, the target signal is corrupted by highly correlated noise sources. separating desired speaker signals from the mixture is one of the most challenging research topics in speech signal processing. this paper proposes a binaural system combined with a monaural incoherent post processor for speech segregation. the proposed binaural system is based on spatial localization cues: lntcraural time differences (ltd) and lntcraural t ntensity differences (ltd). a target speech is sepa rated from interfering sounds by estimating time-frequency bina ry and ratio masks. the binary mask is estimated using the multi-level extension of the otsu thresholding algorithm used in image segmentation. ltd and lld are important features for mask estimation in low and high frequencies, respectively. the ratiu mask is estimated using the incoherent monaural speech separation system as the post processing stage. systematic enluations s how that the proposed system can separate the target signal with acceptance quality.
کلیدواژه lnteraural intensity differences; interaural time differences; speech separation; time-frequency hinary mask; ratio mask
آدرس yazd university, Research Lab Elec and Comp Eng Dept , ایران, yazd university, Research Lab Elec and Comp Eng Dept , ایران, university of tehran, Control and Intelligent Processing Center of Excellence, ایران, amirkabir university of technology, Elec Eng Dept , ایران
پست الکترونیکی hsheikhzadeh@aut.ac.ir
 
     
   
Authors
  
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved