世博小虎 posted on 2022-10-27 21:33:00

[2] How to Find the Latest Multi-Agent Reinforcement Learning Papers {Top Conferences: AAAI, ICML}

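<p>Related articles:</p>
<p>[1] Latest Multi-Agent Reinforcement Learning Methods [Summary]</p>
<p>[2] How to Find the Latest Multi-Agent Reinforcement Learning Papers {Top Conferences: AAAI, ICML}</p>
<p>[3] Overview of Recent Multi-Agent Reinforcement Learning (MARL) Research {Analysis of emergent behaviors, Learning communication}</p>
<p>[4] Overview of Recent Multi-Agent Reinforcement Learning (MARL) Research {Learning cooperation, Agents modeling agents}</p>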
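<h1 id="articleContentId">1. China Computer Federation (CCF) Recommended International Academic Conferences and Journals</h1>
<p>CCF official website</p>
<p>CCF recommended international academic conferences (<strong>reference link:</strong> follow the link to see the detailed categories)</p>
<p>The categories are: computer systems and high-performance computing; computer networks; network and information security; software engineering; system software and programming languages; databases, data mining, and content retrieval; theoretical computer science; computer graphics and multimedia; <span style="color: rgba(254, 44, 36, 1)">artificial intelligence and pattern recognition</span>; human-computer interaction and ubiquitous computing; and frontier, interdisciplinary, and comprehensive topics.</p>
<p><strong>ICML 2021: summary of multi-agent reinforcement learning papers</strong></p>
<table><tbody><tr><th>Category</th><th>Count</th></tr><tr><td>Submissions</td><td>5513</td></tr><tr><td>Accepted papers</td><td>1184</td></tr><tr><td>Reinforcement learning papers</td><td>163</td></tr><tr><td>Of which multi-agent reinforcement learning papers</td><td>15</td></tr></tbody></table>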
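<p>The counts in the table translate directly into ratios. The short sketch below only does that arithmetic; the variable and function names are illustrative and not part of the original post.</p>

```python
# Figures copied from the ICML 2021 table above.
submissions = 5513
accepted = 1184
rl_papers = 163
marl_papers = 15


def share(part: int, whole: int) -> float:
    """Return part / whole as a percentage, rounded to one decimal place."""
    return round(100 * part / whole, 1)


acceptance_rate = share(accepted, submissions)  # accepted / submitted
rl_share = share(rl_papers, accepted)           # RL papers among accepted papers
marl_share = share(marl_papers, rl_papers)      # MARL papers among RL papers

print(f"Acceptance rate: {acceptance_rate}%")
print(f"RL share of accepted papers: {rl_share}%")
print(f"MARL share of RL papers: {marl_share}%")
```

<p>With these numbers, roughly one submission in five was accepted, and multi-agent work makes up a little under a tenth of the reinforcement learning papers.</p>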
<p><strong>Where ICML ranks:</strong></p>
<h2><strong>1.1 CCF Recommended International Academic Conferences<br> (Artificial Intelligence and Pattern Recognition)</strong></h2>
<h3><strong>1.1.1 Class A</strong></h3>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> <p><strong>No.</strong></p> </td><td> <p><strong>Abbreviation</strong></p> </td><td> <p><strong>Full name</strong></p> </td><td> <p><strong>Publisher</strong></p> </td><td> <p><strong>URL</strong></p> </td></tr><tr><td> <p>1</p> </td><td> <p><span style="color: rgba(254, 44, 36, 1)">AAAI</span></p> </td><td> <p>AAAI Conference on Artificial Intelligence</p> </td><td> <p>AAAI</p> </td><td> <p>http://www.aaai.org</p> </td></tr><tr><td> <p>2</p> </td><td> <p>CVPR</p> </td><td> <p>IEEE Conference on Computer Vision and&nbsp;<br> Pattern Recognition</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.pamitc.org/cvpr13/</p> </td></tr><tr><td> <p>3</p> </td><td> <p>ICCV</p> </td><td> <p>International Conference on Computer<br> Vision</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.iccv2013.org/</p> </td></tr><tr><td> <p>4</p> </td><td> <p><span style="color: rgba(254, 44, 36, 1)">ICML</span></p> </td><td> <p>International Conference on Machine&nbsp;<br> Learning</p> </td><td> <p>ACM</p> </td><td> <p>http://icml.cc/2013/</p> </td></tr><tr><td> <p>5</p> </td><td> <p>IJCAI</p> </td><td> <p>International Joint Conference on Artificial<br> Intelligence</p> </td><td> <p>Morgan Kaufmann</p> </td><td> <p>http://www.ijcai.org</p> </td></tr></tbody></table>
<h3><strong>1.1.2 Class B</strong></h3>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> <p><strong>No.</strong></p> </td><td> <p><strong>Abbreviation</strong></p> </td><td> <p><strong>Full name</strong></p> </td><td> <p><strong>Publisher</strong></p> </td><td> <p><strong>URL</strong></p> </td></tr><tr><td> <p>1</p> </td><td> <p>COLT</p> </td><td> <p>Annual Conference on Computational<br> Learning Theory</p> </td><td> <p>Springer</p> </td><td> <p>http://orfe.princeton.edu/conferences/colt2013/</p> </td></tr><tr><td> <p>2</p> </td><td> <p><span style="color: rgba(254, 44, 36, 1)">NIPS</span></p> </td><td> <p>Annual Conference on Neural Information<br> Processing Systems</p> </td><td> <p>MIT Press</p> </td><td> <p>http://www.nips.cc</p> </td></tr></tbody></table>
<h3><span style="color: rgba(254, 44, 36, 1)"><strong>1.1.3 More Class B and Class C conferences are listed in the appendix</strong></span></h3>
<h1>2. Recommended Deep Reinforcement Learning Labs and Links</h1>
<h2>2.1&nbsp;arXiv</h2>
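<p>arXiv is a free distribution service and open-access archive holding 1,917,177 scholarly articles in physics, mathematics, computer science, quantitative biology, quantitative finance, statistics, electrical engineering and systems science, and economics. Material on the site is not peer-reviewed by arXiv.</p>
<blockquote>
<p>Link: https://arxiv.org/</p>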
</blockquote>
<p><img src="https://img-blog.csdnimg.cn/20210721164737582.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70"></p>
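<p>Besides browsing the site, recent papers can be listed through arXiv's public query API at <code>http://export.arxiv.org/api/query</code>, which returns an Atom feed. The following is a minimal sketch and not something the original post describes: the function names are hypothetical, the search phrase is only an example, and error handling and rate limiting are left out.</p>

```python
import urllib.parse
import urllib.request
import xml.etree.ElementTree as ET

ARXIV_API = "http://export.arxiv.org/api/query"
ATOM_NS = {"atom": "http://www.w3.org/2005/Atom"}


def build_query_url(phrase: str, max_results: int = 10) -> str:
    """Build an arXiv API URL that searches all fields for an exact phrase,
    newest submissions first."""
    params = {
        "search_query": f'all:"{phrase}"',
        "start": 0,
        "max_results": max_results,
        "sortBy": "submittedDate",
        "sortOrder": "descending",
    }
    return f"{ARXIV_API}?{urllib.parse.urlencode(params)}"


def parse_titles(atom_xml: str) -> list[str]:
    """Extract entry titles from an Atom feed, collapsing internal whitespace."""
    root = ET.fromstring(atom_xml)
    titles = []
    for entry in root.findall("atom:entry", ATOM_NS):
        raw = entry.findtext("atom:title", default="", namespaces=ATOM_NS)
        titles.append(" ".join(raw.split()))
    return titles


def fetch_latest_titles(phrase: str, max_results: int = 10) -> list[str]:
    """Query arXiv over the network and return the matching titles."""
    url = build_query_url(phrase, max_results)
    with urllib.request.urlopen(url, timeout=30) as response:
        return parse_titles(response.read().decode("utf-8"))
```

<p>Calling <code>fetch_latest_titles("multi-agent reinforcement learning")</code> would then return the titles of the most recently submitted matches. Building the URL and parsing the feed are kept as separate functions so each can be checked without network access.</p>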
<h2>2.2&nbsp;<strong>Deep Reinforcement Learning Lab</strong></h2>
<p>DeepRL on GitHub: https://github.com/neurondance</p>
<p>WeChat official account: Deep-RL</p>
<p><strong>Official website</strong><strong>: http://www.neurondance.com/</strong></p>
<p><img src="https://img-blog.csdnimg.cn/20210721165233812.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70"></p>
<p><strong>Forum</strong><strong>: </strong><strong>http://deeprl.neurondance.com/</strong></p>
<p><img src="https://img-blog.csdnimg.cn/20210721165038651.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70"></p>
<h2>2.3 AI Conference Deadlines</h2>
<p>Link:&nbsp;https://aideadlin.es</p>
<p><img src="https://img-blog.csdnimg.cn/20210721165712113.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70"></p>
<h2>2.4 ICML Official Website</h2>
<p>https://icml.cc/</p>
<p><img src="https://img-blog.csdnimg.cn/20210721165705742.png?x-oss-process=image/watermark,type_ZmFuZ3poZW5naGVpdGk,shadow_10,text_aHR0cHM6Ly9ibG9nLmNzZG4ubmV0L3NpbmF0XzM5NjIwMjE3,size_16,color_FFFFFF,t_70"></p>
<h1>3. Latest Multi-Agent Reinforcement Learning Papers</h1>
<h2><strong>3.1 ICML</strong>&nbsp;International Conference on Machine&nbsp;Learning</h2>
<blockquote>
<p><strong>1. Randomized Entity-wise Factorization for Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Shariq Iqbal (University of Southern California) · Christian Schroeder (University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford) · Fei Sha (Google Research)</p>
<p><strong>2. UneVEn: Universal Value Exploration for Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Tarun Gupta (University of Oxford) · Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Bei Peng (University of Oxford) · Wendelin Boehmer (Delft University of Technology) · Shimon Whiteson (University of Oxford)</p>
<p><strong>3. Emergent Social Learning via Multi-agent Reinforcement Learning</strong></p>
<p>Authors: Kamal Ndousse (OpenAI) · Douglas Eck (Google Brain) · Sergey Levine (UC Berkeley) · Natasha Jaques (Google Brain, UC Berkeley)</p>
<p><strong>4. DFAC Framework: Factorizing the Value Function via Quantile Mixture for Multi-Agent Distributional Q-Learning</strong></p>
<p>Authors: Wei-Fang Sun (National Tsing Hua University) · Cheng-Kuang Lee (NVIDIA Corporation) · Chun-Yi Lee (National Tsing Hua University)</p>
<p><strong>5. Cooperative Exploration for Multi-Agent Deep Reinforcement Learning</strong></p>
<p>Authors: Iou-Jen Liu (University of Illinois at Urbana-Champaign) · Unnat Jain (UIUC) · Raymond Yeh (University of Illinois at Urbana–Champaign) · Alexander Schwing (UIUC)</p>
<p><strong>6. Large-Scale Multi-Agent Deep FBSDEs</strong></p>
<p>Authors: Tianrong Chen (Georgia Institute of Technology) · Ziyi Wang (Georgia Institute of Technology) · Ioannis Exarchos (Stanford University) · Evangelos Theodorou (Georgia Tech)</p>
<p><strong>7. Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Anuj Mahajan (Dept. of Computer Science, University of Oxford) · Mikayel Samvelyan (University College London) · Lei Mao (NVIDIA) · Viktor Makoviychuk (NVIDIA) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Jean Kossaifi (NVIDIA) · Shimon Whiteson (University of Oxford) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (Caltech and NVIDIA)</p>
<p><strong>8. Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing</strong></p>
<p>Authors: Filippos Christianos (University of Edinburgh) · Georgios Papoudakis (The University of Edinburgh) · Muhammad Arrasy Rahman (The University of Edinburgh) · Stefano Albrecht (University of Edinburgh)</p>
<p><strong>9. Parallel Droplet Control in MEDA Biochips using Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Tung-Che Liang (Duke University) · Jin Zhou (Duke University) · Yun-Sheng Chan (National Chiao Tung University) · Tsung-Yi Ho (National Tsing Hua University) · Krishnendu Chakrabarty (Duke University) · Cy Lee (National Chiao Tung University)</p>
<p><strong>10. A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning</strong></p>
<p>Authors: Dong Ki Kim (MIT) · Miao Liu (IBM) · Matthew Riemer (IBM Research) · Chuangchuang Sun (MIT) · Marwa Abdulhai (MIT) · Golnaz Habibi (MIT) · Sebastian Lopez-Cot (MIT) · Gerald Tesauro (IBM Research) · Jonathan How (MIT)</p>
<p><strong>11. Scalable Evaluation of Multi-Agent Reinforcement Learning with Melting Pot</strong></p>
<p>Authors: Joel Z Leibo (DeepMind) · Edgar Duenez-Guzman (DeepMind) · Alexander Vezhnevets (DeepMind) · John Agapiou (DeepMind) · Peter Sunehag () · Raphael Koster (DeepMind) · Jayd Matyas (DeepMind) · Charles Beattie (DeepMind Technologies Limited) · Igor Mordatch (Google Brain) · Thore Graepel (DeepMind)</p>
<p><strong>12. Multi-Agent Training beyond Zero-Sum with Correlated Equilibrium Meta-Solvers</strong></p>
<p>Authors: Luke Marris (DeepMind) · Paul Muller (DeepMind) · Marc Lanctot (DeepMind) · Karl Tuyls (DeepMind) · Thore Graepel (DeepMind)</p>
<p><strong>13. Coach-Player Multi-agent Reinforcement Learning for Dynamic Team Composition</strong></p>
<p>Authors: Bo Liu (University of Texas, Austin) · Qiang Liu (UT Austin) · Peter Stone (University of Texas at Austin) · Animesh Garg (University of Toronto, Vector Institute, Nvidia) · Yuke Zhu (University of Texas - Austin) · Anima Anandkumar (California Institute of Technology)</p>
<p><strong>14. Learning Fair Policies in Decentralized Cooperative Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Matthieu Zimmer (Shanghai Jiao Tong University) · Claire Glanois (Shanghai Jiao Tong University) · Umer Siddique (Shanghai Jiao Tong University) · Paul Weng (Shanghai Jiao Tong University)</p>
<p><strong>15. FOP: Factorizing Optimal Joint Policy of Maximum-Entropy Multi-Agent Reinforcement Learning</strong></p>
<p>Authors: Tianhao Zhang (Peking University) · Yueheng Li (Peking University) · Chen Wang (Peking University) · Zongqing Lu (Peking University) · Guangming Xie (1. State Key Laboratory for Turbulence and Complex Systems, College of Engineering, Peking University; 2. Institute of Ocean Research, Peking University)</p>
</blockquote>
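<p>One simple way to pull multi-agent papers out of a longer accepted-paper list is to match spelling variants of "multi-agent" in the titles. The sketch below assumes this title-based approach, which the original post does not specify. The pattern and function name are illustrative, and a title match can miss relevant papers that do not use the term.</p>

```python
import re

# Matches "Multi-Agent", "Multi-agent", "Multiagent", and "multi agent".
MARL_PATTERN = re.compile(r"multi[-\s]?agent", re.IGNORECASE)


def is_marl_title(title: str) -> bool:
    """Return True if the title mentions multi-agent in any common spelling."""
    return MARL_PATTERN.search(title) is not None


titles = [
    "Large-Scale Multi-Agent Deep FBSDEs",
    "A Policy Gradient Algorithm for Learning to Learn in Multiagent Reinforcement Learning",
    "Emergent Social Learning via Multi-agent Reinforcement Learning",
    "An Unrelated Single-Agent Paper",  # hypothetical title, added as a negative example
]

marl_titles = [t for t in titles if is_marl_title(t)]
for t in marl_titles:
    print(t)
```

<p>The first three titles come from the list above and cover the three spellings that appear in it; the last one is made up to show a title that is filtered out.</p>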
<h2>3.2&nbsp;AAAI Conference on Artificial Intelligence</h2>
<blockquote>
<p>Key conference dates</p>
<ul><li>August 15 – August 30, 2020: Authors register on the AAAI web site</li><li>September 1, 2020: Electronic abstracts due at 11:59 PM UTC-12 (anywhere on earth)</li><li>September 9, 2020: Electronic papers due at 11:59 PM UTC-12 (anywhere on earth)</li><li>September 29, 2020: Abstracts AND full papers due for revisions of rejected NeurIPS/EMNLP submissions by 11:59 PM UTC-12 (anywhere on earth)</li><li>AAAI-21 Reviewing Process: Two-Phase Reviewing and NeurIPS/EMNLP Fast Track Submissions</li><li>November 3-5, 2020: Author Feedback Window (anywhere on earth)</li><li>December 1, 2020: Notification of acceptance or rejection</li></ul>
</blockquote>
<p>For the individual papers, see: http://deeprl.neurondance.com/d/191-82aaai2021</p>
<p>Accepted paper list (84 papers in total)</p>
<h1>4. Appendix</h1>
<h2>4.1 Class B</h2>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> <p><strong>No.</strong></p> </td><td> <p><strong>Abbreviation</strong></p> </td><td> <p><strong>Full name</strong></p> </td><td> <p><strong>Publisher</strong></p> </td><td> <p><strong>URL</strong></p> </td></tr><tr><td> <p>1</p> </td><td> <p>COLT</p> </td><td> <p>Annual Conference on Computational<br> Learning Theory</p> </td><td> <p>Springer</p> </td><td> <p>http://orfe.princeton.edu/conferences/colt2013/</p> </td></tr><tr><td> <p>2</p> </td><td> <p><span style="color: rgba(254, 44, 36, 1)">NIPS</span></p> </td><td> <p>Annual Conference on Neural Information<br> Processing Systems</p> </td><td> <p>MIT Press</p> </td><td> <p>http://www.nips.cc</p> </td></tr><tr><td> <p>3</p> </td><td> <p>ACL</p> </td><td> <p>Annual Meeting of the Association for&nbsp;<br> Computational Linguistics</p> </td><td> <p>ACL</p> </td><td> <p>http://acl2013.org/site/index.html</p> </td></tr><tr><td> <p>4</p> </td><td> <p>EMNLP</p> </td><td> <p>Conference on Empirical Methods in Natural<br> Language Processing</p> </td><td> <p>ACL</p> </td><td> <p>http://www.sigdat.org/</p> </td></tr><tr><td> <p>5</p> </td><td> <p>ECAI</p> </td><td> <p>European Conference on Artificial&nbsp;<br> Intelligence</p> </td><td> <p>IOS Press</p> </td><td> <p>http://www.ecai2013.upit.ro/?i=2542</p> </td></tr><tr><td> <p>6</p> </td><td> <p>ECCV</p> </td><td> <p>European Conference on Computer Vision</p> </td><td> <p>Springer</p> </td><td> <p>http://eccv2012.unifi.it/</p> </td></tr><tr><td> <p>7</p> </td><td> <p>ICRA</p> </td><td> <p>IEEE International Conference on Robotics<br> and Automation</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.icra2013.org/</p> </td></tr><tr><td> <p>8</p> </td><td> <p>ICAPS</p> </td><td> <p>International Conference on Automated<br> Planning and Scheduling</p> </td><td> <p>AAAI</p> </td><td> <p>http://www.icaps-conference.org/</p> </td></tr><tr><td> <p>9</p> </td><td> <p>ICCBR</p> </td><td> <p>International Conference on Case-Based<br> Reasoning</p> </td><td> <p>Springer</p> </td><td> <p>http://www.iccbr.org/</p> </td></tr><tr><td> <p>10</p> </td><td> <p>COLING</p> </td><td> <p>International Conference on Computational<br> Linguistics</p> </td><td> <p>ACM</p> </td><td> <p>http://www.coling2012-iitb.org/</p> </td></tr><tr><td> <p>11</p> </td><td> <p>KR</p> </td><td> <p>International Conference on Principles of<br> Knowledge Representation and Reasoning</p> </td><td> <p>Morgan Kaufmann</p> </td><td> <p>http://www.kr.org/</p> </td></tr><tr><td> <p>12</p> </td><td> <p>UAI</p> </td><td> <p>International Conference on Uncertainty<br> in Artificial Intelligence</p> </td><td> <p>AUAI</p> </td><td> <p>http://auai.org/</p> </td></tr><tr><td> <p>13</p> </td><td> <p>AAMAS</p> </td><td> <p>International Joint Conference<br> on Autonomous Agents and Multi-agent<br> Systems</p> </td><td> <p>Springer</p> </td><td> <p>http://www.aamas-conference.org/</p> </td></tr></tbody></table>
<h2><strong>4.2 Class C</strong></h2>
<table border="1" cellpadding="0" cellspacing="0"><tbody><tr><td> <p><strong>No.</strong></p> </td><td> <p><strong>Abbreviation</strong></p> </td><td> <p><strong>Full name</strong></p> </td><td> <p><strong>Publisher</strong></p> </td><td> <p><strong>URL</strong></p> </td></tr><tr><td> <p>1</p> </td><td> <p>ACCV</p> </td><td> <p>Asian Conference on Computer Vision</p> </td><td> <p>Springer</p> </td><td> <p>http://www.accv2012.org/</p> </td></tr><tr><td> <p>2</p> </td><td> <p>CoNLL</p> </td><td> <p>Conference on Natural Language Learning</p> </td><td> <p>CoNLL</p> </td><td> <p>http://www.clips.ua.ac.be/conll/</p> </td></tr><tr><td> <p>3</p> </td><td> <p>GECCO</p> </td><td> <p>Genetic and Evolutionary Computation<br> Conference</p> </td><td> <p>ACM</p> </td><td> <p>http://www.sigevo.org/gecco-2013/</p> </td></tr><tr><td> <p>4</p> </td><td> <p>ICTAI</p> </td><td> <p>IEEE International Conference on Tools with<br> Artificial Intelligence</p> </td><td> <p>IEEE</p> </td><td> <p>http://ictai12.unipi.gr/</p> </td></tr><tr><td> <p>5</p> </td><td> <p>ALT</p> </td><td> <p>International Conference on Algorithmic<br> Learning Theory</p> </td><td> <p>Springer</p> </td><td> <p>http://www-alg.ist.hokudai.ac.jp/~thomas/ALT13/</p> </td></tr><tr><td> <p>6</p> </td><td> <p>ICANN</p> </td><td> <p>International Conference on Artificial Neural<br> Networks</p> </td><td> <p>Springer</p> </td><td> <p>https://www.waset.org/conferences/2013/<br> amsterdam/icann/</p> </td></tr><tr><td> <p>7</p> </td><td> <p>FGR</p> </td><td> <p>International Conference on Automatic Face<br> and Gesture Recognition</p> </td><td> <p>IEEE</p> </td><td> <p>http://fg2013.cse.sc.edu/</p> </td></tr><tr><td> <p>8</p> </td><td> <p>ICDAR</p> </td><td> <p>International Conference on Document<br> Analysis and Recognition</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.icdar2013.org/</p> </td></tr><tr><td> <p>9</p> </td><td> <p>ILP</p> </td><td> <p>International Conference on Inductive Logic<br> Programming</p> </td><td> <p>Springer</p> </td><td> <p>http://ilp13.cos.ufrj.br/</p> </td></tr><tr><td> <p>10</p> </td><td> <p>KSEM</p> </td><td> <p>International Conference on Knowledge<br> Science, Engineering and Management</p> </td><td> <p>Springer</p> </td><td> <p>http://ksem.dlut.edu.cn/</p> </td></tr><tr><td> <p>11</p> </td><td> <p>ICONIP</p> </td><td> <p>International Conference on Neural&nbsp;<br> Information Processing</p> </td><td> <p>Springer</p> </td><td> <p>http://iconip2013.org/</p> </td></tr><tr><td> <p>12</p> </td><td> <p>ICPR</p> </td><td> <p>International Conference on Pattern&nbsp;<br> Recognition</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.icpr2014.org/</p> </td></tr><tr><td> <p>13</p> </td><td> <p>ICB</p> </td><td> <p>International Joint Conference on Biometrics</p> </td><td> <p>IEEE</p> </td><td> <p>http://atvs.ii.uam.es/icb2013/</p> </td></tr><tr><td> <p>14</p> </td><td> <p>IJCNN</p> </td><td> <p>International Joint Conference on Neural<br> Networks</p> </td><td> <p>IEEE</p> </td><td> <p>http://www.ijcnn2013.org/</p> </td></tr><tr><td> <p>15</p> </td><td> <p>PRICAI</p> </td><td> <p>Pacific Rim International Conference on&nbsp;<br> Artificial Intelligence</p> </td><td> <p>Springer</p> </td><td> <p>http://ktw.mimos.my/pricai2012/</p> </td></tr><tr><td> <p>16</p> </td><td> <p>NAACL</p> </td><td> <p>The Annual Conference of the North<br> American Chapter of the Association&nbsp;<br> for Computational Linguistics</p> </td><td> <p>NAACL</p> </td><td> <p>http://naacl2013.naacl.org/</p> </td></tr><tr><td> <p>17</p> </td><td> <p>BMVC</p> </td><td> <p>British Machine Vision Conference</p> </td><td> <p>British Machine<br> Vision&nbsp;<br> Association</p> </td><td> <p>http://bmvc2013.bristol.ac.uk/</p> </td></tr></tbody></table><br><br>
Source: https://www.cnblogs.com/ting1/p/16833964.html