【 title 】A Collaboration of Multi-agent Model using an Interactive Interface
【 The author team 】Jingchen Li, Fan Wu, Haobin Shi, Kao-Shing Hwang
【 Date of publication 】2022.7.13
【 Thesis link 】https://www.sciencedirect.com/science/article/pii/S0020025522007411#!
【 Recommended reasons 】 Multi-Agent Reinforcement Learning Algorithms pay little attention to noisy environments , In this environment , Agents cannot achieve the best strategy training and make correct decisions . This paper studies the influence of noise in multi-agent environment , And propose a multi-agent actor - Critic collaboration (MACC) Model . The model uses lightweight communication to overcome the interference of noise .MACC Each agent in has two strategies : Collaboration strategy and behavior strategy . The behavior of an agent depends not only on its own state , And it is also affected by other agents through a scalar collaboration value . The collaboration value is generated by the collaboration strategy of each agent , It ensures a concise consensus on the environment . This paper elaborates on the training of cooperative strategies , It also explains in detail how it coordinates behavior strategies in the way of time abstraction mechanism , At the same time, the observation sequence is considered to obtain more accurate perception . Several experiments on the multi-agent collaborative simulation platform show that ,MACC Better performance than baseline in noisy environments , Especially in a partially observable environment .
当前位置:网站首页>Northwestern Polytechnical University | multi-agent model collaboration using interactive interfaces
Northwestern Polytechnical University | multi-agent model collaboration using interactive interfaces
2022-07-20 22:03:00 【Zhiyuan community】
边栏推荐
猜你喜欢
Y71. Chapter IV Prometheus large factory monitoring system and practice -- Prometheus server installation (II)
走进创客教育课程实践的真实情境
模拟实现库函数strcat--将源字符串的副本追加到目标字符串(理解内存重叠问题)
Analyzing the innovative thinking in the curriculum of maker Education
記錄一下十三届藍橋杯嵌入式省賽題目
VMware solves the problem of not recognizing USB
Dest0g3 520 orientation -web easyphp
索引下推的基本原理
考完PMP,免费获取PDU的方式图解
机器人时代发展大趋势对民众的影响
随机推荐
Simple examples of pointer arrays and array pointers
Essays of this week (sorted out on weekends)
OpenMMLAB系列框架解读(基于PyTorch)
Redis distributed lock implemented by annotation
请问Redis 如何实现库存扣减操作和防止被超卖?
大屏可视化适配文件
【IEEE出版】2022年自然语言处理与信息检索国际会议(ECNLPIR 2022)
nacos注册中心之服务地址的查询
2022 Henan Mengxin League game 2: Henan University of technology L - HPU
MySQL Authentication ‘root‘ ‘mysql_native_password‘ failed: Reading from the stream has failed
在公司解决的问题
2022河南萌新联赛第(二)场:河南理工大学 C - 斩龙
Solution to the fourth weekly test of ACM intensive training of Hunan Institute of technology in 2022
西北工业大学|使用交互式界面的多智能体模型协作
VS2017 30天试用结束后无法使用,登录界面卡主问题
English语法_物主代词
dbeaver连接Oracle用户显示错误及用户不存在?
Analog implementation library function --strcmp (character binary comparison)
CnosDB 涅槃重生:弃用Go, 全面拥抱Rust
Hbuilderx eslint configuration