شکل‌دهنده پرتو mvdr مرتبه کسری و مبتنی بر آنالیز مقادیر ویژه برای بهسازی گفتار

Fa | Ar | En

شکل‌دهنده پرتو mvdr مرتبه کسری و مبتنی بر آنالیز مقادیر ویژه برای بهسازی گفتار


نویسنده	علویان شهری مصطفی ,ابوطالبی حمیدرضا
منبع	مهندسي برق دانشگاه تبريز - 1402 - دوره : 53 - شماره : 1 - صفحه:37 -47
چکیده	یکی از پرکاربردترین الگوریتم‌های شکل‌دهی‌پرتو در بهسازی گفتار، الگوریتم حداقل واریانس بدون اعوجاج است. در محاسبه‌ ضرائب این شکل‌دهنده فرض ناهمبسته بودن نویز و سیگنال مطلوب با توجه به وجود پژواک و نویزهای با طیفی مشابه سیگنال مطلوب برقرار نیست. در نتیجه، ماتریس‌های کواریانس و به‌ تبع آن، ضرائب شکل‌دهنده دارای دقت کافی نخواهد بود. در این مقاله، به‌عنوان اولین تغییر در شکل‌دهنده‌پرتو حداقل واریانس، با اعمال آنالیز مقادیر ویژه به ماتریس کواریانس سیگنال مطلوب و حذف مقادیر ویژه کوچک، دقت ضرائب شکل‌دهنده بهبود داده می‌شود. در دومین تغییر، از تبدیل فوریه مرتبه کسری به‌عنوان حالت تعمیم‌یافته تبدیل فوریه استاندارد استفاده شده است. در ادامه تغییرات پیشنهادی فرمول‌بندی شده و با بدست آوردن پارامتر بهینه تبدیل فوریه کسری به‌صورت تجربی، اثر هریک از دو تغییر با روشهای پایه مقایسه ‌می‌شود. نتایج نشان می‌دهد که روش‌های پیشنهادی ضمن پایداری نسبت به تغییر پارامترها و شرایط محیطی، در سناریوهای مختلف به‌نسبت توان سیگنال به نویزی بین تا دست می‌یابند. در مقابل، روش پایه عملکردی در محدوده تا دارد. همچنین اگرچه هریک از تغییرات فوق به‌تنهایی نیز منجر به بهبود کارایی می‌شوند، اما بهترین عملکرد در حالتی بدست می‌آید که هر دو تغییر پیشنهادی با هم اعمال شود.
کلیدواژه	بهسازی گفتار، آرایه میکروفونی، آنالیز مقادیر ویژه، تبدیل فوریه زمان-کوتاه کسری
آدرس	دانشگاه یزد, دانشکده مهندسی برق, ایران, دانشگاه یزد, دانشکده مهندسی برق, ایران
پست الکترونیکی	habutalebi@yazd.ac.ir

fractional eigen based mvdr beamformer for speech enhancement

Authors	alaviyan shahri m. ,abutalebi h. r.
Abstract	one of the most widely used beamforming algorithms for the application of speech enhancement is the minimum variance distortionless response (mvdr) technique. the optimal coefficients of the mvdr beamformer are calculated based on the incoherence assumption of environmental interferences and the desired signal. due to the nature of noise and speech signals, this assumption is not valid in many practical situations. this, in turn, results in inaccurateness of derived coefficients of the mvdr. in this paper, as the first change in the mvdr beamformer, by applying the eigenvalue analysis to the desired signal covariance matrix and removing small eigenvalues, the accuracy of the beamformer coefficients is improved. as the second contribution, we use a generalized version of the short-time fourier transform (stft), namely the short-time fractional fourier transform (stfrft), to calculate the mvdr beamformer weights. in this research, after obtaining the optimal value of stfrft parameter experimentally, the effect of each of the above two changes on the performance is investigated and compared with the basic methods. the results show that the proposed methods, while being stable to the changes of parameters and environmental conditions, achieve signal-to-noise ratio (snr) values between and , while the performance of the baseline method is in the range of . although each of the above changes alone improves the performance, it is noted that the superior performance is obtained when both changes are applied together on the beamformer.
Keywords	speech enhancement ,microphone array ,eigen value analysis ,short-time fractional fourier transform