Online adaptive optimal control of continuous-time bilinear systems with unknown dynamics



Authors	Manoochehri Rahbar Nafiseh, Pariz Naser, Ramezani-al Mohammad Reza, Heydari Aghileh
Source	Control (كنترل) - 1402 - Volume: 17 - Issue: 4 - Pages: 75-87
Abstract	Designing an optimal controller for continuous-time bilinear systems according to Bellman's principle of optimality has high computational complexity even when the system dynamics are known, and controller design generally relies on approximate methods that depend on knowledge of those dynamics. When the system dynamics are unknown, the problem becomes considerably more complex. The first approach that comes to mind for solving this problem is to identify the bilinear system using system identification methods. As is well known, identification methods provide the designer with a linearized model, based on the input and output data of the system, on which to design the controller. In this paper, using an online adaptive procedure, a new iterative method is proposed for designing an optimal controller for a bilinear system whose dynamics are unknown. In the proposed iterative method, the optimal controller is designed adaptively from online input information and state measurements instead of from knowledge of the bilinear system dynamics. Furthermore, by applying noise as an input to the system over a specific time interval, the need to re-measure the states in subsequent iterations is eliminated. The convergence of the adaptive iterative method to the optimal controller is stated as a theorem and proved.
Keywords	optimal control, bilinear systems, unknown dynamics, adaptive, policy iteration
Address	Payame Noor University, Tehran Center, Department of Mathematics, Iran; Ferdowsi University of Mashhad, Faculty of Engineering, Department of Electrical Engineering, Iran; Quchan University of Technology, Faculty of Electrical and Computer Engineering, Department of Electrical Engineering, Iran; Payame Noor University, Tehran Center, Department of Mathematics, Iran
Email	a_heidari@pnu.ac.ir

An Online Policy Iteration for Adaptive Optimal Control of Unknown Bilinear Systems

Authors	Manoochehri Rahbar Nafiseh, Pariz Naser, Ramezani-al Mohammad Reza, Heydari Aghileh
Abstract	Designing an optimal controller for continuous-time bilinear systems via Bellman's principle of optimality has a high computational complexity even when the system dynamics are known. As a result, controller design typically relies on approximation techniques that depend on knowledge of the system dynamics. The problem becomes more challenging when the system dynamics are unknown. The first remedy that comes to mind is to identify the bilinear system dynamics through system identification techniques. It is well known that identification methods give the designer a linearized model, based on the input and output data of the system, to use in the controller design. This paper proposes a new iterative method for designing an optimal controller for a bilinear system whose dynamics are unknown, using an online adaptive policy iteration. In the proposed method, the optimal controller is designed from online input information and state measurements rather than from knowledge of the bilinear system dynamics. In addition, applying noise as an input to the system over a certain time interval eliminates the need to re-measure the states in subsequent iterations. The convergence of the adaptive iterative process to the optimal controller is stated and proved in a theorem.
Keywords	optimal control, bilinear systems, unknown dynamics, adaptive policy iteration (PI)
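
For orientation, the problem class named in the abstract can be written in a common textbook form. This is only a sketch under assumed notation: the symbols A, B, N_i, Q, R, V_k and u_k are not taken from the paper, and the iteration shown is the generic model-based policy iteration for input-affine systems, not the authors' data-driven algorithm, which by the abstract's description avoids using A, B and N_i altogether.

```latex
% Assumed standard form of a continuous-time bilinear system with m inputs
\dot{x}(t) = A x(t) + B u(t) + \sum_{i=1}^{m} u_i(t)\, N_i\, x(t)
           = A x(t) + g(x(t))\, u(t),
\qquad g(x) = B + \begin{bmatrix} N_1 x & \cdots & N_m x \end{bmatrix}

% Assumed infinite-horizon quadratic cost
J(x_0, u) = \int_{0}^{\infty} \left( x^{\top} Q x + u^{\top} R u \right) dt,
\qquad Q \succeq 0,\ R \succ 0

% Generic model-based policy iteration, starting from a stabilizing u_0:
% (1) policy evaluation: solve for V_k with V_k(0) = 0
0 = x^{\top} Q x + u_k(x)^{\top} R\, u_k(x)
    + \nabla V_k(x)^{\top} \left( A x + g(x)\, u_k(x) \right)

% (2) policy improvement
u_{k+1}(x) = -\tfrac{1}{2}\, R^{-1} g(x)^{\top} \nabla V_k(x)
```

Both steps above need the model through A and g(x), which is the dependence on known dynamics that the abstract describes replacing with online input and state measurements.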