|
|
Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems
|
|
|
|
|
نویسنده
|
Eslami M. ,Sayadian A.
|
منبع
|
aut journal of modeling and simulation - 2011 - دوره : 43 - شماره : 2 - صفحه:11 -17
|
چکیده
|
This article aims to examine methods of optimizing gmm-based voice conversion systems performancein which gmm method is introduced as the basic method for improvement of voice conversion systemsperformance. in the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality reduction. in this paper, after introducing gmm2 method, several gmm models will be used to model each phoneme. furthermore, in the stage of corresponding the clusters of each state, before applying dynamic time warping algorithm, we use a lmr conversion for further correspondence among the parameters of two corresponding states of two speakers. another reason for quality reduction in voice conversion system is that the precision of speech signal parameters was underestimated. in order to overcome such a problem, generalized harmonic model is introduced which is replaced by sinusoid harmonic model applied in gmm2 giving another method called gmm3. finally, we will present gmm4 method, the objective of which is to promote the system performance with limited data and a restricted number of demi-syllables to train conversion functions.
|
کلیدواژه
|
High quality voice conversion ,Gaussian mixed model (GMM) ,Generalized Harmonic Model (GHM) ,spectral conversion
|
آدرس
|
amirkabir university of technology, Department of Electrical Engineering, ایران, Tamin Telecom Co.(3G mobile operator), Department of Product and Services, ایران
|
پست الکترونیکی
|
m.eslami@tamintelecom.ir
|
|
|
|
|
|
|
|
|
|
|
|
Authors
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|