سامانه همایش‌ها TSTA | ثبت‌نام و ارسال مقاله کنفرانس‌ها و کنگره‌های ملی

فارسی

Home / سی و دومین کنفرانس ملی و دهمین کنفرانس بین المللی مهندسی زیست پزشکی ایران

Short-term gains vs. long-term Success: Reward strategy design for reinforcement learning in football

Authors :

Mohammad Pashaei¹ Amirhossein Tayebi² Hadi Amiri³ Ali Fahim⁴

1- Department of Engineering Science, University of Tehran, Tehran, Iran 2- Department of Engineering Science, University of Tehran, Tehran, Iran 3- Department of Engineering Science, University of Tehran, Tehran, Iran 4- Department of Engineering Science, University of Tehran, Tehran, Iran

Keywords :

Reinforcement Learning،Multi-agent systems،Soccer Simulation

Abstract :

Reinforcement learning in complex games like soccer relies heavily on how you define your reward function and environment. In this work, we developed a custom 3v3 soccer environment and implemented two RL-based teams with distinct learning trends: one with a fast convergence but limited long-term adaptation, and another with a slower yet more robust learning trajectory. Simulation shows that despite performing better at the start, the short-term agents fall short of the performance of the long-term agents in the long run, and after passing 50% of the episodes, the win rate of long-term agents rises from 30% in the beginning to 50%.

List of archived papers

Using Advanced Ensemble Machine Learning Models to Predict Traffic in SDN-Based Networks: A Comparative Study of Bagging, Boosting, and Stacking Approaches

Raha Pakzad - Sasan GharaPasha - Nasrin Firouz - Ramin Habibzadehsharif

Functionally Graded Material Vertebroplasty Screws: A Finite Element Biomechanical Study

Maryam Rahimi - Mohammad Hosein Zadeh-Posti - َAisan Rafiei - Nima Jamshidi

Coronary Full artery segmentation using U-Net neural network architecture

Rezvan Monjezi - Mahdieh Ghasemi - Mahdi Salehi - Alireza Rowhanimanesh - Samaneh Tabaee

Investigating the effect of alpha/theta neurofeedback on Emotional Intelligence

Saeed Yarmohammadi - Amirreza Ahmadi

کاربرد هوش مصنوعی در ایجاد و توسعه شبکه های صنعتی

بهاره رضاپور - حسین بوداقی خواجه نوبر

نقش کلیدی نانولوله های کربنی در بهبود همزمان خواص مکانیکی، ضدباکتریایی و زیست سازگاری پوشش های HA-Ta2O5 بر روی آلیاژهای حافظه دار NiTi

نازیلا هوراندقدیم - جعفر خلیل علافی

تأثیر محافظه‌کاری حسابداری بر ارزش شرکت با تأکید بر نقش متنوع سازی شرکتی

ابراهیم نویدی عباسپور - فاطمه منافی

Injectability Enhancement and Optimization of a Biphasic Calcium Phosphate Bone Cement

Sepehr Larijani - Mitra Asadi-Eydivand - Nabiollah Abolfathi - Mehran Solati-Hashjin

ارزیابی ساختار بازار حسابرسی در ایجاد ارزش افزوده اقتصادی در صنعت فولاد

کریم ستاری - محمدرضا عباسی استمال

گام بلند هوش مصنوعی در توسعه ارتباطات انسانی

کامیار لاوه ای

more

Samin Hamayesh - Version 44.5.0