زمانبندی توزیع‏ شده وظایف در سیستم ‏های سرویس‏ دهی مبتنی بر gpu بر حسب تقاضا

Fa | Ar | En

زمانبندی توزیع‏ شده وظایف در سیستم ‏های سرویس‏ دهی مبتنی بر gpu بر حسب تقاضا


نویسنده	جهانی آرزو ,مومنی لیلا سادات
منبع	مهندسي برق دانشگاه تبريز - 1403 - دوره : 54 - شماره : 2 - صفحه:143 -152
چکیده	زمانبندی بهینه منابع بر روی سرورهای مبتنی بر gpu که برای وظایف موازی مناسب هستند، بسیار ضروری است. این منابع معمولاً دارای سرعت بالایی بوده و بنابراین هزینه بالایی نیز دارند. جهت استفاده بهینه از این منابع، مراکز ارائه دهنده خدمات، باید بتوانند به ازای هر درخواست، بهترین نوع ماشین مجازی، بهترین نوع پردازندهgpu و همچنین بهترین تعداد این نوع پردازنده را انتخاب نمایند. چنین مسئله‏ای، یک مسئله بهینه‏سازی نامیده می‏شود. مقاله حاضر، ضمن مدلسازی مسئله تخصیص منابع به عنوان یک مسئله بهینه‏سازی خطی، روش جدیدی را برای توزیع درخواست‏ها ارایه می‏دهد. روش پیشنهادی از یک صف مرکزی استفاده نموده و سپس درخواست‏ها را با استفاده از یک روش نوین توزیع درخواست، بین چندین صف محلی توزیع می‏کند. سپس وظایف موجود در هر صف محلی را ‏به صورت موازی زمانبندی و اجرا می‏کند. زمانبندی در هر صف محلی، تعیین می‏کند که به ازای هر درخواست: (1) بهترین نوع ماشین مجازی (2) بهترین نوع پردازنده gpu و (3) بهترین تعداد پردازنده‏های gpu کدام است. مقایسه روش پیشنهادی با آخرین روش‏های موجود، نشانگر کاهش زمان اجرا، کاهش زمان پاسخ و همچنین کاهش چشمگیر هزینه استفاده از منابع در روش پیشنهادی است.
کلیدواژه	زمانبندی وظایف، سرورهای مبتنی بر gpu، توزیع درخواست‏ها، صف محلی
آدرس	دانشگاه صنعتی سهند, دانشکده مهندسی برق, ایران, دانشگاه صنعتی سهند, دانشکده مهندسی برق, ایران
پست الکترونیکی	l_momeni400@sut.ac.ir

distributed job scheduling in on-demand gpu as a service systems

Authors	jahani a. ,momeni leila al-sadat
Abstract	optimal scheduling of resources is essential on gpu-based servers that are suitable for parallel tasks. these resources usually have a high speed and therefore have a high cost. in order to make optimal use of these resources, service providers must be able to choose the best type of virtual machine, the best type of gpu processor, and the best number of this type of processor for each request. such a problem is called an optimization problem. the present article, while modeling the resource allocation problem as a linear optimization problem, presents a new method for distributing requests. the proposed method uses a central queue and then distributes requests among several local queues using a new request distribution method. then it schedules and executes the tasks in each local queue in parallel. scheduling in each local queue determines, for each request: (1) the best type of virtual machine, (2) the best type of gpu processor, and (3) the best number of gpu processors. the comparison of the proposed method with the latest available methods shows a decrease in execution time, a decrease in response time, and a significant decrease in the cost of using resources in the proposed method.
Keywords	task scheduling ,gpu-based servers ,request distribution ,local queuing