Reinforcement Learning for Multi-Agent Systems with Temporal Logic Specifications

Terashima, Keita; Kobayashi, Koichi; Yamashita, Yuh

doi:10.1587/transfun.2023KEP0016


Hokkaido University \| Library \| HUSCAP	Advanced Search		言語

	Home
	About HUSCAP
	Open Access Policy

	Browse by Author

Browse
	Communities & Collections

	Scholarly Journals
	Theses
	Doctoral Dissertations Listed by Graduate Schools
	Conference Procs.
	Events

	HUSCAP Senior (in Japanese)

	Societies

	Downloads (country)

For university staff
	How to post your papers to HUSCAP
	Publication of theses
	Helpline about theses publication

Open Archives Compliant

You can search our collection also at:
	Google
	Google Scholar
	CiNii
	IRDB
	OAIster
	NDLTD

Hokkaido University Collection of Scholarly and Academic Papers >
Graduate School of Information Science and Technology / Faculty of Information Science and Technology >
Peer-reviewed Journal Articles, etc >

Reinforcement Learning for Multi-Agent Systems with Temporal Logic Specifications

Files in This Item:

E107.A_2023KEP0016.pdf

1.83 MB

PDF

View/Open

Please use this identifier to cite or link to this item:http://hdl.handle.net/2115/92468

Title:	Reinforcement Learning for Multi-Agent Systems with Temporal Logic Specifications
Authors:	Terashima, Keita Browse this author
	Kobayashi, Koichi Browse this author →KAKEN DB
	Yamashita, Yuh Browse this author →KAKEN DB
Keywords:	multi-agent systems
	reinforcement learning
	linear temporal logic
	aggregator
	surveillance
Issue Date:	1-Jan-2024
Publisher:	IEICE - Institute of the Electronics, Information and Communication Engineers
Journal Title:	IEICE transactions on fundamentals of electronics communications and computer sciences
Volume:	E107A
Issue:	1
Start Page:	31
End Page:	37
Publisher DOI:	10.1587/transfun.2023KEP0016
Abstract:	In a multi-agent system, it is important to consider a design method of cooperative actions in order to achieve a common goal. In this paper, we propose two novel multi-agent reinforcement learning methods, where the control specification is described by linear temporal logic formulas, which represent a common goal. First, we propose a simple solution method, which is directly extended from the single-agent case. In this method, there are some technical issues caused by the increase in the number of agents. Next, to overcome these technical issues, we propose a new method in which an aggregator is introduced. Finally, these two methods are compared by numerical simulations, with a surveillance problem as an example.
Rights:	copyright©2024 IEICE
Type:	article
URI:	http://hdl.handle.net/2115/92468
Appears in Collections:	情報科学院・情報科学研究院 (Graduate School of Information Science and Technology / Faculty of Information Science and Technology) > 雑誌発表論文等 (Peer-reviewed Journal Articles, etc)

Submitter: 小林孝一

OAI-PMH ( junii2 , jpcoar_1.0 )

- Hokkaido University