[Part 2] How to Find the Latest Multi-Agent Reinforcement Learning Papers {Top Conferences: AAAI, ICML}

Posted by 世博小虎 on 2022-10-27 21:33:00


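Related articles:
[Part 1] Latest Multi-Agent Reinforcement Learning Methods [Summary]
[Part 2] How to Find the Latest Multi-Agent Reinforcement Learning Papers {Top Conferences: AAAI, ICML}
[Part 3] Overview of Recent Multi-Agent Reinforcement Learning (MARL) Research {Analysis of emergent behaviors, Learning communication}
[Part 4] Overview of Recent Multi-Agent Reinforcement Learning (MARL) Research {Learning cooperation, Agents modeling agents}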
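<h1 id="articleContentId">1. China Computer Federation (CCF) Recommended International Academic Conferences and Journals</h1>
CCF official website
CCF recommended international academic conferences (reference link: follow it to see the specific categories)
The categories are: computer systems and high-performance computing; computer networks; network and information security; software engineering; system software and programming languages; databases, data mining, and content retrieval; theoretical computer science; computer graphics and multimedia; artificial intelligence and pattern recognition; human-computer interaction and ubiquitous computing; and frontier, interdisciplinary, and comprehensive topics.
Summary of multi-agent reinforcement learning papers at ICML 2021
<table><tbody><tr><th>Category</th><th>Count</th></tr><tr><td>Submissions</td><td>5513</td></tr><tr><td>Accepted papers</td><td>1184</td></tr><tr><td>Reinforcement learning papers</td><td>163</td></tr><tr><td>Of which, multi-agent reinforcement learning papers</td><td>15</td></tr></tbody></table>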
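The counts in the table can be turned into rough percentages. The short Python sketch below does that arithmetic; it assumes (the table does not say so explicitly) that the 163 reinforcement learning papers are counted among the 1184 accepted papers, and the helper name share is purely illustrative.

```python
# Figures copied from the ICML 2021 table above.
submissions = 5513
accepted = 1184
rl_papers = 163      # assumed to be a subset of the accepted papers
marl_papers = 15     # subset of the reinforcement learning papers


def share(part, whole):
    """Return part/whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


acceptance_rate = share(accepted, submissions)   # accepted / submitted
rl_share = share(rl_papers, accepted)            # RL / accepted
marl_share = share(marl_papers, rl_papers)       # MARL / RL

print(f"Acceptance rate: {acceptance_rate}%")
print(f"RL share of accepted papers: {rl_share}%")
print(f"MARL share of RL papers: {marl_share}%")
```

Under that assumption, roughly one submission in five was accepted, and multi-agent work made up a bit under a tenth of the reinforcement learning papers.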
ICML's standing:
<h2>1.1 CCF Recommended International Academic Conferences (Artificial Intelligence and Pattern Recognition)</h2>
<h3>1.1.1 Class A</h3>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> No. </td><td> Abbreviation </td><td> Full name </td><td> Publisher </td><td> Website </td></tr><tr><td> 1 </td><td> AAAI </td><td> AAAI Conference on Artificial Intelligence </td><td> AAAI </td><td> http://www.aaai.org </td></tr><tr><td> 2 </td><td> CVPR </td><td> IEEE Conference on Computer Vision and Pattern Recognition </td><td> IEEE </td><td> http://www.pamitc.org/cvpr13/ </td></tr><tr><td> 3 </td><td> ICCV </td><td> International Conference on Computer Vision </td><td> IEEE </td><td> http://www.iccv2013.org/ </td></tr><tr><td> 4 </td><td> ICML </td><td> International Conference on Machine Learning </td><td> ACM </td><td> http://icml.cc/2013/ </td></tr><tr><td> 5 </td><td> IJCAI </td><td> International Joint Conference on Artificial Intelligence </td><td> Morgan Kaufmann </td><td> http://www.ijcai.org </td></tr></tbody></table>
<h3>1.1.2 Class B</h3>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> No. </td><td> Abbreviation </td><td> Full name </td><td> Publisher </td><td> Website </td></tr><tr><td> 1 </td><td> COLT </td><td> Annual Conference on Computational Learning Theory </td><td> Springer </td><td> http://orfe.princeton.edu/conferences/colt2013/ </td></tr><tr><td> 2 </td><td> NIPS </td><td> Annual Conference on Neural Information Processing Systems </td><td> MIT Press </td><td> http://www.nips.cc </td></tr></tbody></table>
<h3>1.1.3 More Class B and Class C conferences are listed in the appendix</h3>
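<h1>2. Recommended Deep Reinforcement Learning Lab and Links</h1>
<h2>2.1 arXiv</h2>
arXiv is a free distribution service and open-access archive that, at the time of writing, held 1,917,177 scholarly articles in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Materials on the site are not peer-reviewed by arXiv.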
<blockquote>
Link: https://arxiv.org/
</blockquote>
<img src="https://img-blog.csdnimg.cn/20210721164737582.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70">
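Besides browsing the site, arXiv can also be queried programmatically through its public API at http://export.arxiv.org/api/query, which returns an Atom feed. The sketch below is one possible way to list recent multi-agent reinforcement learning titles with the Python standard library only. The function names, the search phrase, and the choice of the cs.MA (Multiagent Systems) category are illustrative assumptions, not something prescribed by the original post.

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM = "{http://www.w3.org/2005/Atom}"  # XML namespace of the Atom feed


def build_query_url(phrase, category="cs.MA", max_results=5):
    """Build an arXiv API URL for a phrase search, newest submissions first."""
    search = f'all:"{phrase}" AND cat:{category}'
    params = {
        "search_query": search,
        "start": 0,
        "max_results": max_results,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
    }
    return ARXIV_API + "?" + urllib.parse.urlencode(params)


def parse_titles(atom_xml):
    """Extract entry titles from an Atom feed, collapsing internal whitespace."""
    root = ET.fromstring(atom_xml)
    return [
        " ".join(entry.findtext(ATOM + "title", default="").split())
        for entry in root.iter(ATOM + "entry")
    ]


def fetch_latest(phrase, **kwargs):
    """Download the feed for a query and return its titles (needs network access)."""
    with urllib.request.urlopen(build_query_url(phrase, **kwargs), timeout=30) as resp:
        return parse_titles(resp.read())
```

For example, fetch_latest("multi-agent reinforcement learning", max_results=10) would request the ten most recently submitted matches. Keeping the URL construction and the feed parsing in separate functions means both can be checked without a network connection.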
<h2>2.2 Deep Reinforcement Learning Lab</h2>
DeepRL on GitHub: https://github.com/neurondance
WeChat official account: Deep-RL
Official website: http://www.neurondance.com/
<img src="https://img-blog.csdnimg.cn/20210721165233812.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70">
Forum: http://deeprl.neurondance.com/
<img src="https://img-blog.csdnimg.cn/20210721165038651.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70">
<h2>2.3 AI Conference Deadlines</h2>
Link: https://aideadlin.es
<img src="https://img-blog.csdnimg.cn/20210721165712113.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70">
<h2>2.4 ICML Official Website</h2>
https://icml.cc/
<img src="https://img-blog.csdnimg.cn/20210721165705742.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70">
<h1>3. Latest Multi-Agent Reinforcement Learning Papers</h1>
<h2>3.1 ICML (International Conference on Machine Learning)</h2>
<blockquote>
1. Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning
Authors: Shariq Iqbal (University of Southern California) · Christian Schroeder (University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford) · Fei Sha (Google Research)
2. UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning
Authors: Tarun Gupta (University of Oxford) · Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford)
3. Emergent Social Learning via Multi-agent Reinforcement Learning
Authors: Kamal Ndousse (OpenAI) · Douglas Eck (Google Brain) · Sergey Levine (UC Berkeley) · Natasha Jaques (Google Brain, UC Berkeley)
4. DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning
Authors: Wei-Fang Sun (National Tsing Hua University) · Cheng-Kuang Lee (NVIDIA Corporation) · Chun-Yi Lee (National Tsing Hua University)
5. Cooperative Exploration for Multi-Agent Deep Reinforcement Learning
Authors: Iou-Jen Liu (University of Illinois at Urbana-Champaign) · Unnat Jain (UIUC) · Raymond Yeh (University of Illinois at Urbana–Champaign) · Alexander Schwing (UIUC)
6. Large-Scale Multi-Agent Deep FBSDEs
Authors: Tianrong Chen (Georgia Institute of Technology) · Ziyi Wang (Georgia Institute of Technology) · Ioannis Exarchos (Stanford University) · Evangelos Theodorou (Georgia Tech)
7. Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning
Authors: Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Mikayel Samvelyan (University College London) · Lei Mao (NVIDIA) · Viktor Makoviychuk (NVIDIA) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Jean Kossaifi (NVIDIA) · Shimon Whiteson (University of Oxford) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (Caltech and NVIDIA)
8. Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing
Authors: Filippos Christianos (University of Edinburgh) · Georgios Papoudakis (The University of Edinburgh) · Muhammad Arrasy Rahman (The University of Edinburgh) · Stefano Albrecht (University of Edinburgh)
9. Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning
Authors: Tung-Che Liang (Duke University) · Jin Zhou (Duke University) · Yun-Sheng Chan (National Chiao Tung University) · Tsung-Yi Ho (National Tsing Hua University) · Krishnendu Chakrabarty (Duke University) · Cy Lee (National Chiao Tung University)
10. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning
Authors: Dong Ki Kim (MIT) · Miao Liu (IBM) · Matthew Riemer (IBM Research) · Chuangchuang Sun (MIT) · Marwa Abdulhai (MIT) · Golnaz Habibi (MIT) · Sebastian Lopez-Cot (MIT) · Gerald Tesauro (IBM Research) · Jonathan How (MIT)
11. Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot
Authors: Joel Z Leibo (DeepMind) · Edgar Duenez-Guzman (DeepMind) · Alexander Vezhnevets (DeepMind) · John Agapiou (DeepMind) · Peter Sunehag () · Raphael Koster (DeepMind) · Jayd Matyas (DeepMind) · Charles Beattie (DeepMind Technologies Limited) · Igor Mordatch (Google Brain) · Thore Graepel (DeepMind)
12. Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers
Authors: Luke Marris (DeepMind) · Paul Muller (DeepMind) · Marc Lanctot (DeepMind) · Karl Tuyls (DeepMind) · Thore Graepel (DeepMind)
13. Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition
Authors: Bo Liu (University of Texas, Austin) · Qiang Liu (UT Austin) · Peter Stone (University of Texas at Austin) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (California Institute of Technology)
14. Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning
Authors: Matthieu Zimmer (Shanghai Jiao Tong University) · Claire Glanois (Shanghai Jiao Tong University) · Umer Siddique (Shanghai Jiao Tong University) · Paul Weng (Shanghai Jiao Tong University)
15. FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning
Authors: Tianhao Zhang (Peking University) · Yueheng Li (Peking University) · Chen Wang (Peking University) · Zongqing Lu (Peking University) · Guangming Xie (1. State Key Laboratory for Turbulence and Complex Systems, College of Engineering, Peking University; 2. Institute of Ocean Research, Peking University)
</blockquote>
<h2>3.2 AAAI Conference on Artificial Intelligence</h2>
<blockquote>
Key conference dates (AAAI-21)
<ul><li>August 15 – August 30, 2020: Authors register on the AAAI web site</li><li>September 1, 2020: Electronic abstracts due at 11:59 PM UTC-12 (anywhere on earth)</li><li>September 9, 2020: Electronic papers due at 11:59 PM UTC-12 (anywhere on earth)</li><li>September 29, 2020: Abstracts AND full papers due for revisions of rejected NeurIPS/EMNLP submissions by 11:59 PM UTC-12 (anywhere on earth)</li><li>AAAI-21 Reviewing Process: Two-Phase Reviewing and NeurIPS/EMNLP Fast Track Submissions</li><li>November 3-5, 2020: Author Feedback Window (anywhere on earth)</li><li>December 1, 2020: Notification of acceptance or rejection</li></ul>
</blockquote>
For the individual papers, see: http://deeprl.neurondance.com/d/191-82aaai2021
Accepted paper list (84 papers in total)
<h1>4. Appendix</h1>
<h2>4.1 Class B</h2>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> No. </td><td> Abbreviation </td><td> Full name </td><td> Publisher </td><td> Website </td></tr><tr><td> 1 </td><td> COLT </td><td> Annual Conference on Computational Learning Theory </td><td> Springer </td><td> http://orfe.princeton.edu/conferences/colt2013/ </td></tr><tr><td> 2 </td><td> NIPS </td><td> Annual Conference on Neural Information Processing Systems </td><td> MIT Press </td><td> http://www.nips.cc </td></tr><tr><td> 3 </td><td> ACL </td><td> Annual Meeting of the Association for Computational Linguistics </td><td> ACL </td><td> http://acl2013.org/site/index.html </td></tr><tr><td> 4 </td><td> EMNLP </td><td> Conference on Empirical Methods in Natural Language Processing </td><td> ACL </td><td> http://www.sigdat.org/ </td></tr><tr><td> 5 </td><td> ECAI </td><td> European Conference on Artificial Intelligence </td><td> IOS Press </td><td> http://www.ecai2013.upit.ro/?i=2542 </td></tr><tr><td> 6 </td><td> ECCV </td><td> European Conference on Computer Vision </td><td> Springer </td><td> http://eccv2012.unifi.it/ </td></tr><tr><td> 7 </td><td> ICRA </td><td> IEEE International Conference on Robotics and Automation </td><td> IEEE </td><td> http://www.icra2013.org/ </td></tr><tr><td> 8 </td><td> ICAPS </td><td> International Conference on Automated Planning and Scheduling </td><td> AAAI </td><td> http://www.icaps-conference.org/ </td></tr><tr><td> 9 </td><td> ICCBR </td><td> International Conference on Case-Based Reasoning </td><td> Springer </td><td> http://www.iccbr.org/ </td></tr><tr><td> 10 </td><td> COLING </td><td> International Conference on Computational Linguistics </td><td> ACM </td><td> http://www.coling2012-iitb.org/ </td></tr><tr><td> 11 </td><td> KR </td><td> International Conference on Principles of Knowledge Representation and Reasoning </td><td> Morgan Kaufmann </td><td> http://www.kr.org/ </td></tr><tr><td> 12 </td><td> UAI </td><td> International Conference on Uncertainty in Artificial Intelligence </td><td> AUAI </td><td> http://auai.org/ </td></tr><tr><td> 13 </td><td> AAMAS </td><td> International Joint Conference on Autonomous Agents and Multi-agent Systems </td><td> Springer </td><td> http://www.aamas-conference.org/ </td></tr></tbody></table>
<h2>4.2 Class C</h2>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> No. </td><td> Abbreviation </td><td> Full name </td><td> Publisher </td><td> Website </td></tr><tr><td> 1 </td><td> ACCV </td><td> Asian Conference on Computer Vision </td><td> Springer </td><td> http://www.accv2012.org/ </td></tr><tr><td> 2 </td><td> CoNLL </td><td> Conference on Natural Language Learning </td><td> CoNLL </td><td> http://www.clips.ua.ac.be/conll/ </td></tr><tr><td> 3 </td><td> GECCO </td><td> Genetic and Evolutionary Computation Conference </td><td> ACM </td><td> http://www.sigevo.org/gecco-2013/ </td></tr><tr><td> 4 </td><td> ICTAI </td><td> IEEE International Conference on Tools with Artificial Intelligence </td><td> IEEE </td><td> http://ictai12.unipi.gr/ </td></tr><tr><td> 5 </td><td> ALT </td><td> International Conference on Algorithmic Learning Theory </td><td> Springer </td><td> http://www-alg.ist.hokudai.ac.jp/~thomas/ALT13/ </td></tr><tr><td> 6 </td><td> ICANN </td><td> International Conference on Artificial Neural Networks </td><td> Springer </td><td> https://www.waset.org/conferences/2013/amsterdam/icann/ </td></tr><tr><td> 7 </td><td> FGR </td><td> International Conference on Automatic Face and Gesture Recognition </td><td> IEEE </td><td> http://fg2013.cse.sc.edu/ </td></tr><tr><td> 8 </td><td> ICDAR </td><td> International Conference on Document Analysis and Recognition </td><td> IEEE </td><td> http://www.icdar2013.org/ </td></tr><tr><td> 9 </td><td> ILP </td><td> International Conference on Inductive Logic Programming </td><td> Springer </td><td> http://ilp13.cos.ufrj.br/ </td></tr><tr><td> 10 </td><td> KSEM </td><td> International Conference on Knowledge Science, Engineering and Management </td><td> Springer </td><td> http://ksem.dlut.edu.cn/ </td></tr><tr><td> 11 </td><td> ICONIP </td><td> International Conference on Neural Information Processing </td><td> Springer </td><td> http://iconip2013.org/ </td></tr><tr><td> 12 </td><td> ICPR </td><td> International Conference on Pattern Recognition </td><td> IEEE </td><td> http://www.icpr2014.org/ </td></tr><tr><td> 13 </td><td> ICB </td><td> International Joint Conference on Biometrics </td><td> IEEE </td><td> http://atvs.ii.uam.es/icb2013/ </td></tr><tr><td> 14 </td><td> IJCNN </td><td> International Joint Conference on Neural Networks </td><td> IEEE </td><td> http://www.ijcnn2013.org/ </td></tr><tr><td> 15 </td><td> PRICAI </td><td> Pacific Rim International Conference on Artificial Intelligence </td><td> Springer </td><td> http://ktw.mimos.my/pricai2012/ </td></tr><tr><td> 16 </td><td> NAACL </td><td> The Annual Conference of the North American Chapter of the Association for Computational Linguistics </td><td> NAACL </td><td> http://naacl2013.naacl.org/ </td></tr><tr><td> 17 </td><td> BMVC </td><td> British Machine Vision Conference </td><td> British Machine Vision Association </td><td> http://bmvc2013.bristol.ac.uk/ </td></tr></tbody></table>
Source: https://www.cnblogs.com/ting1/p/16833964.html
