【二】最新多智能体强化学习文章如何查阅｛顶会：AAAI、 ICML ｝

世博小虎 · 發表於 2022-10-27 21:33:00

【三】多智能体强化学习（MARL）近年研究概览｛Analysis of emergent behaviors（行为分析)_、Learning communication（通信学习）｝

【四】多智能体强化学习（MARL）近年研究概览｛Learning cooperation（协作学习）、Agents modeling agents（智能体建模）｝

1.中国计算机学会(CCF)推荐国际学术会议和期刊目录

CCF官方网站

CCF推荐国际学术会议（参考链接：链接点击查阅具体分类）

类别如下计算机系统与高性能计算，计算机网络，网络与信息安全，软件工程，系统软件与程序设计语言，数据库、数据挖掘与内容检索，计算机科学理论，计算机图形学与多媒体，人工智能与模式识别，人机交互与普适计算，前沿、交叉与综合

2021 ICML 多智能体强化学习论文整理汇总

类别名称	数量
投稿量	5513
接收量	1184
强化学习方向文章	163
其中多智能体强化学习文章	15

ICML地位：

1.1 中国计算机学会推荐国际学术会议
（人工智能与模式识别）

1.1.1 A类

序号	会议简称	会议全称	出版社	网址
1	AAAI	AAAI Conference on Artificial Intelligence	AAAI	http://www.aaai.org
2	CVPR	IEEE Conference on Computer Vision and Pattern Recognition	IEEE	http://www.pamitc.org/cvpr13/
3	ICCV	International Conference on Computer Vision	IEEE	http://www.iccv2013.org/
4	ICML	International Conference on Machine Learning	ACM	http://icml.cc/2013/
5	IJCAI	International Joint Conference on Artificial Intelligence	Morgan Kaufmann	http://www.ijcai.org

1.1.2 B类

序号	会议简称	会议全称	出版社	网址
1	COLT	Annual Conference on Computational Learning Theory	Springer	http://orfe.princeton.edu/conferences/colt2013/
2	NIPS	Annual Conference on Neural Information Processing Systems	MIT Press	http://www.nips.cc

1.1.3 B、C类更多见附录

2.推荐深度强化学习实验室及链接

2.1 arXiv

arXiv是一个免费的分发服务和开放存取的档案，收录了物理、数学、计算机科学、定量生物学、定量金融、统计学、电气工程和系统科学以及经济学等领域的1,917,177篇学术文章。本网站上的材料没有经过arXiv的同行评审。

链接：https://arxiv.org/

2.2 深度强化学习实验室

DeepRL——github：https://github.com/neurondance

微信公众号：Deep-RL

官网：http://www.neurondance.com/

论坛：http://deeprl.neurondance.com/

2.3 AI 会议Deadlines

: https://aideadlin.es

2.4 ICML官网：

https://icml.cc/

3.最新多智能体强化学习方向论文

3.1 ICML International Conference on Machine Learning

[1]. Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning

作者: Shariq Iqbal (University of Southern California) · Christian Schroeder (University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford) · Fei Sha (Google Research)

[2]. UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning

作者: Tarun Gupta (University of Oxford) · Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford)

[3]. Emergent Social Learning via Multi-agent Reinforcement Learning

作者: Kamal Ndousse (OpenAI) · Douglas Eck (Google Brain) · Sergey Levine (UC Berkeley) · Natasha Jaques (Google Brain, UC Berkeley)

[4]. DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning

作者: Wei-Fang Sun (National Tsing Hua University) · Cheng-Kuang Lee (NVIDIA Corporation) · Chun-Yi Lee (National Tsing Hua University)

[5]. Cooperative Exploration for Multi-Agent Deep Reinforcement Learning

作者: Iou-Jen Liu (University of Illinois at Urbana-Champaign) · Unnat Jain (UIUC) · Raymond Yeh (University of Illinois at Urbana–Champaign) · Alexander Schwing (UIUC)

[6]. Large-Scale Multi-Agent Deep FBSDEs

作者: Tianrong Chen (Georgia Institute of Technology) · Ziyi Wang (Georgia Institute of Technology) · Ioannis Exarchos (Stanford University) · Evangelos Theodorou (Georgia Tech)

[7]. Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning

作者: Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Mikayel Samvelyan (University College London) · Lei Mao (NVIDIA) · Viktor Makoviychuk (NVIDIA) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Jean Kossaifi (NVIDIA) · Shimon Whiteson (University of Oxford) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (Caltech and NVIDIA)

[8]. Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

作者: Filippos Christianos (University of Edinburgh) · Georgios Papoudakis (The University of Edinburgh) · Muhammad Arrasy Rahman (The University of Edinburgh) · Stefano Albrecht (University of Edinburgh)

[9]. Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning

作者: Tung-Che Liang (Duke University) · Jin Zhou (Duke University) · Yun-Sheng Chan (National Chiao Tung University) · Tsung-Yi Ho (National Tsing Hua University) · Krishnendu Chakrabarty (Duke University) · Cy Lee (National Chiao Tung University)

[10]. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning

作者: Dong Ki Kim (MIT) · Miao Liu (IBM) · Matthew Riemer (IBM Research) · Chuangchuang Sun (MIT) · Marwa Abdulhai (MIT) · Golnaz Habibi (MIT) · Sebastian Lopez-Cot (MIT) · Gerald Tesauro (IBM Research) · Jonathan How (MIT)

[11]. Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot

作者: Joel Z Leibo (DeepMind) · Edgar Duenez-Guzman (DeepMind) · Alexander Vezhnevets (DeepMind) · John Agapiou (DeepMind) · Peter Sunehag () · Raphael Koster (DeepMind) · Jayd Matyas (DeepMind) · Charles Beattie (DeepMind Technologies Limited) · Igor Mordatch (Google Brain) · Thore Graepel (DeepMind)

[12]. Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers

作者: Luke Marris (DeepMind) · Paul Muller (DeepMind) · Marc Lanctot (DeepMind) · Karl Tuyls (DeepMind) · Thore Graepel (DeepMind)

[13]. Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition

作者: Bo Liu (University of Texas, Austin) · Qiang Liu (UT Austin) · Peter Stone (University of Texas at Austin) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (California Institute of Technology)

[14]. Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning

作者: Matthieu Zimmer (Shanghai Jiao Tong University) · Claire Glanois (Shanghai Jiao Tong University) · Umer Siddique (Shanghai Jiao Tong University) · Paul Weng (Shanghai Jiao Tong University)

[15]. FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning

作者: Tianhao Zhang (Peking University) · yueheng li (Peking university) · Chen Wang (Peking University) · Zongqing Lu (Peking University) · Guangming Xie (1. State Key Laboratory for Turbulence and Complex Systems, College of Engineering, Peking University; 2. Institute of Ocean Research, Peking University)

3.2 AAAI Conference on Artificial Intelligence

会议时间节点

August 15 – August 30, 2020: Authors register on the AAAI web site
September 1, 2020: Electronic abstracts due at 11:59 PM UTC-12 (anywhere on earth)
September 9, 2020: Electronic papers due at 11:59 PM UTC-12 (anywhere on earth)
September 29, 2020: Abstracts AND full papers due for revisions of rejected NeurIPS/EMNLP submissions by 11:59 PM UTC-12 (anywhere on earth)
AAAI-21 Reviewing Process: Two-Phase Reviewing and NeurIPS/EMNLP Fast Track Submissions
November 3-5, 2020: Author Feedback Window (anywhere on earth)
December 1, 2020: Notification of acceptance or rejection

具体论文见链接：http://deeprl.neurondance.com/d/191-82aaai2021

接收论文列表(共84篇)

4.附录

4.1 B类

序号	会议简称	会议全称	出版社	网址
1	COLT	Annual Conference on Computational Learning Theory	Springer	http://orfe.princeton.edu/conferences/colt2013/
2	NIPS	Annual Conference on Neural Information Processing Systems	MIT Press	http://www.nips.cc
3	ACL	Annual Meeting of the Association for Computational Linguistics	ACL	http://acl2013.org/site/index.html
4	EMNLP	Conference on Empirical Methods in Natural Language Processing	ACL	http://www.sigdat.org/
5	ECAI	European Conference on Artificial Intelligence	IOS Press	http://www.ecai2013.upit.ro/?i=2542
6	ECCV	European Conference on Computer Vision	Springer	http://eccv2012.unifi.it/
7	ICRA	IEEE International Conference on Robotics and Automation	IEEE	http://www.icra2013.org/
8	ICAPS	International Conference on Automated Planning and Scheduling	AAAI	http://www.icaps-conference.org/
9	ICCBR	International Conference on Case-Based Reasoning	Springer	http://www.iccbr.org/
10	COLING	International Conference on Computational Linguistics	ACM	http://www.coling2012-iitb.org/
11	KR	International Conference on Principles of Knowledge Representation and Reasoning	Morgan Kaufmann	http://www.kr.org/
12	UAI	International Conference on Uncertainty in Artificial Intelligence	AUAI	http://auai.org/
13	AAMAS	International Joint Conference on Autonomous Agents and Multi-agent Systems	Springer	http://www.aamas-conference.org/

4.2 C类

序号	会议简称	会议全称	出版社	网址
1	ACCV	Asian Conference on Computer Vision	Springer	http://www.accv2012.org/
2	CoNLL	Conference on Natural Language Learning	CoNLL	http://www.clips.ua.ac.be/conll/
3	GECCO	Genetic and Evolutionary Computation Conference	ACM	http://www.sigevo.org/gecco-2013/
4	ICTAI	IEEE International Conference on Tools with Artificial Intelligence	IEEE	http://ictai12.unipi.gr/
5	ALT	International Conference on Algorithmic Learning Theory	Springer	http://www-alg.ist.hokudai.ac.jp/~thomas/ALT13/
6	ICANN	International Conference on Artificial Neural Networks	Springer	https://www.waset.org/conferences/2013/ amsterdam/icann/
7	FGR	International Conference on Automatic Face and Gesture Recognition	IEEE	http://fg2013.cse.sc.edu/
8	ICDAR	International Conference on Document Analysis and Recognition	IEEE	http://www.icdar2013.org/
9	ILP	International Conference on Inductive Logic Programming	Springer	http://ilp13.cos.ufrj.br/
10	KSEM	International conference on Knowledge Science,Engineering and Management	Springer	http://ksem.dlut.edu.cn/
11	ICONIP	International Conference on Neural Information Processing	Springer	http://iconip2013.org/
12	ICPR	International Conference on Pattern Recognition	IEEE	http://www.icpr2014.org/
13	ICB	International Joint Conference on Biometrics	IEEE	http://atvs.ii.uam.es/icb2013/
14	IJCNN	International Joint Conference on Neural Networks	IEEE	http://www.ijcnn2013.org/
15	PRICAI	Pacific Rim International Conference on Artificial Intelligence	Springer	http://ktw.mimos.my/pricai2012/
16	NAACL	The Annual Conference of the North American Chapter of the Association for Computational Linguistics	NAACL	http://naacl2013.naacl.org/
17	BMVC	British Machine Vision Conference	British Machine Vision Association	http://bmvc2013.bristol.ac.uk/

来源：https://www.cnblogs.com/ting1/p/16833964.html

【二】最新多智能体强化学习文章如何查阅｛顶会：AAAI、 ICML ｝

1.中国计算机学会(CCF)推荐国际学术会议和期刊目录

1.1 中国计算机学会推荐国际学术会议 （人工智能与模式识别）

1.1.1 A类

1.1.2 B类

1.1.3 B、C类更多见附录

2.推荐深度强化学习实验室及链接

2.1 arXiv

2.2 深度强化学习实验室

2.3 AI 会议Deadlines

2.4 ICML官网：

3.最新多智能体强化学习方向论文

3.1 ICML International Conference on Machine Learning

3.2 AAAI Conference on Artificial Intelligence

4.附录

4.1 B类

4.2 C类

瀏覽過的版塊

1.1 中国计算机学会推荐国际学术会议
（人工智能与模式识别）