>
Fa   |   Ar   |   En
   voice activity detection using clustering-based method in spectro-temporal features space  
   
نویسنده esfandian nafiseh ,jahani bahnamiri fatemeh ,mavaddati samira
منبع journal of ai and data mining - 2022 - دوره : 10 - شماره : 3 - صفحه:401 -409
چکیده    This paper proposes a novel method for voice activity detection based on clustering in the spectro-temporal domain. in the proposed algorithms, the auditory model is used in order to extract the spectro-temporal features. the gaussian mixture model and the wk-means clustering methods are used to decrease the dimensions of the spectro-temporal space. moreover, the energy and positions of the clusters are used for voice activity detection. silence/speech is recognized using the attributes of clusters and the updated threshold value in each frame. having a higher energy, the first cluster is used as the main speech section in computation. the efficiency of the proposed method is evaluated for silence/speech discrimination in different noisy conditions. displacement of the clusters in the spectro-temporal domain is considered as the criterion to determine the robustness of the features. according to the results obtained, the proposed method improves the speech/non-speech segmentation rate in comparison to the temporal and spectral features in low signal to noise ratios (snrs).
کلیدواژه spectro-temporal features ,auditory model ,gaussian mixture model ,wk-means clustering ,voice activity detection
آدرس islamic azad university, qaemshahr branch, department of electrical engineering, iran, aryan institute of science and technology, department of computer engineering, iran, university of mazandaran, faculty of engineering and technology, department of electrical engineering, iran
پست الکترونیکی s.mavaddati@umz.ac.ir
 
     
   
Authors
  
 
 

Copyright 2023
Islamic World Science Citation Center
All Rights Reserved