طراحی سامانه کنترل برای هواناو و تنظیم بهینه آن با الگوریتم یادگیری تقویتی

Fa | Ar | En

طراحی سامانه کنترل برای هواناو و تنظیم بهینه آن با الگوریتم یادگیری تقویتی


نویسنده	علیزاده محمدحسین ,طلوعی علیرضا ,قاسمی رضا
منبع	دريا فنون - 1403 - دوره : 11 - شماره : 2 - صفحه:1 -12
چکیده	مجموعه کنترلی در بسیاری از وسایل متحرک در آب با یک نظام تداخلی و غیر خطی روبرو است. به صورت خاص، در هواناو به دلیل اختصاص عملگرهای ناقص و ناکافی، اثر کنترل به صورت تداخلی در کانال‌ها دیده می‌شود. در این وسیله در هر مانور (سرعت‌های متفاوت) رفتار تغییر بزرگی می‌کند. با انحراف سکان در حرکت سمتی، کانال طولی تاثیر می‌پذیرد و همچنین با تغییر سرعت طولی، رفتار کانال سمت کاملا متحول می‌شود. در این تحقیق با شناسایی رفتار مورد نظر، ابتدا مدل خطی شده استخراج می‌شود. برای تابع تبدیل بدست آمده با الزامات مشخص، کنترل‌گر برای کانال‌های سرعت طولی، سمتی و سرعت زاویه‌ای طراحی می‌شود. چون این کنترل‌گر برای حالتی خاص طراحی شده‌است و برای تمامی مانورها و یا عدم قطعیت‌های هواناو رفتار لزوما مطلوبی ندارد. ناگزیر با استفاده از روش یادگیری تقویتی بهره‌های کنترل‌گر اولیه به صورت خارج از خط، برای تمام حالات ممکن (مانور و عدم قطعیت) تنظیم می‌گردند. گفتنی است تمامی بار محاسباتی به صورت خارج از خط صورت گرفته، نتیجه حاصل به صورت بهره‌های اصلاحی به کنترل‌گر اولیه اعمال شده و به صورت برخط استفاده می‌گردد. نتایج تحقیق نشان می‌دهد، با تنظیم کنترل‌گر با کمک یادگیری ماشین، در تعقیب فرامین هدایتی حداقل %25 بهبود در خطای تعقیب حاصل می‌شود.
کلیدواژه	هواناو، کنترل گر تناسبی، یادگیری ماشین، یادگیری تقویتی، یادگیری مونت کارلو
آدرس	دانشگاه شهید بهشتی, دانشکده فناوری‌های نوین و مهندسی هوافضا, ایران, دانشگاه شهید بهشتی, دانشکده فناوری‌های نوین و مهندسی هوافضا, ایران, دانشگاه قم, دانشکده مهندسی برق, ایران
پست الکترونیکی	r.ghasemi@qom.ac.ir

designing a control system for a hovercraft and optimal adjusting with reinforcement learning algorithm

Authors	alizadeh mohammad hossein ,toloei alireza ,ghasemi reza
Abstract	the dynamics of many mobile devices in water are interactive and none linear. in particular, due to the assignment of incomplete and insufficient actuators, complex dynamics can be seen clearly in the hovercraft. in this device, the dynamics changes greatly in each maneuver (different speeds). by turning the rudder for lateral movement, the longitudinal channel is affected and also by changing the longitudinal speed, the dynamics of the lateral channel is completely transformed. in this research, by identifying the dynamics of the system, a non-interference linear model is first extracted. for the obtained transfer function, with certain requirements, the controller is designed for the longitudinal, lateral velocity channel and angular velocity channel. as stated, this controller is designed for a specific situation and does not necessarily behave optimally for all maneuvers or uncertainties in the hovercraft’s system specifications. inevitably, with the reinforcement learning method and specifically with the monte-carlo learning algorithm, the primary controller gains are adjusted offline for all possible situations (maneuvers and uncertainties). it should be mentioned that all the calculation load is done off-line, the result is applied to the primary control system as correction gains and is used on-line for the control process. the research results show that by adjusting the controller with the help of machine learning, at least %25 improvement in the tracking error is prosecuted.