Performance Improvement of Language Identification Using Transcription Based Sequential Approaches & Sequential Kernels Based SVM

Fa | Ar | En

Performance Improvement of Language Identification Using Transcription Based Sequential Approaches & Sequential Kernels Based SVM


نویسنده	Hosseini Amereei Seyed Abbas ,Homayounpour Mohammad Mehdi
منبع	international journal of information and communication technology research - 2012 - دوره : 4 - شماره : 2 - صفحه:37 -45
چکیده	abstract— in this paper a generative frontend based on both phonetic and prosodic features, and also a couple of approaches based on phonetic transcription- aggregated phone recognizer followed by language models (aprlm) and generalized phone recognizer followed by language models (gprlm), are investigated. aprlm and gprlm have few disadvantages since they need phonetic transcription of speech data, and also they use fewer level of information while the generative frontend built upon an ensemble of gaussian densities uses prosodic and phonetic information altogether. furthermore, no transcription of speech data is needed in support vector machine (svm)-based approaches, and they showed better performances in our experiments too. in addition, aprlm and gprlm are more time consuming than svm-based approaches. we used mel-frequency cepstral coefficients (mfcc) in aprlm and gprlm, and shifted delta cepstrum (sdc) and pitch contour polynomial approximation (pcpa) features in svm-based methods. probabilistic sequence kernel (psk) and generalized linear discriminant sequence (glds) kernels are used in svm experiments. svm using glds and psk kernels outperforms gmm in all our lid experiments conducted by applying pcpa features and lid performance improved about 2.1% and 5.9% respectively. the combination of probabilistic characteristic vector using pcpa (pcv-pcpa) and probabilistic characteristic vector using sdc (pcv-sdc) provides further improvements.
کلیدواژه	Language Identification ,Probabilistic Characteristic Vector ,Pitch Contour Polynomial Approximation ,Probabilistic Sequence Kernel ,Generalized Linear Discriminant Analysis ,APRLM and GPRLM
آدرس	amirkabir university of technology, Laboratory for Intelligent Sound & Speech Processing, ایران, amirkabir university of technology, Laboratory for Intelligent Sound &Speech Processing, ایران
پست الکترونیکی	homayoun@aut.ac.ir



Authors