iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://dblp.uni-trier.de/pid/84/998.ris

Provider: Schloss Dagstuhl - Leibniz Center for Informatics Database: dblp computer science bibliography Content:text/plain; charset="utf-8" TY - JOUR ID - DBLP:journals/tciaig/ZhouZSDLHWWLLH24 AU - Zhou, Tianze AU - Zhang, Fubiao AU - Shao, Kun AU - Dai, Zipeng AU - Li, Kai AU - Huang, Wenhan AU - Wang, Weixun AU - Wang, Bin AU - Li, Dong AU - Liu, Wulong AU - Hao, Jianye TI - Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition. JO - IEEE Trans. Games VL - 16 IS - 2 SP - 352 EP - 364 PY - 2024/06/ DO - 10.1109/TG.2023.3272386 UR - https://doi.org/10.1109/TG.2023.3272386 ER - TY - CPAPER ID - DBLP:conf/aaai/WuHYHZWT24 AU - Wu, Jizhou AU - Hao, Jianye AU - Yang, Tianpei AU - Hao, Xiaotian AU - Zheng, Yan AU - Wang, Weixun AU - Taylor, Matthew E. TI - PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. BT - Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada SP - 15934 EP - 15942 PY - 2024// DO - 10.1609/AAAI.V38I14.29524 UR - https://doi.org/10.1609/aaai.v38i14.29524 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2403-17031 AU - Huang, Shengyi AU - Noukhovitch, Michael AU - Hosseini, Arian AU - Rasul, Kashif AU - Wang, Weixun AU - Tunstall, Lewis TI - The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization. JO - CoRR VL - abs/2403.17031 PY - 2024// DO - 10.48550/ARXIV.2403.17031 UR - https://doi.org/10.48550/arXiv.2403.17031 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2405-11143 AU - Hu, Jian AU - Wu, Xibin AU - Wang, Weixun AU - Xianyu AU - Zhang, Dehao AU - Cao, Yu TI - OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework. JO - CoRR VL - abs/2405.11143 PY - 2024// DO - 10.48550/ARXIV.2405.11143 UR - https://doi.org/10.48550/arXiv.2405.11143 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2410-19720 AU - Li, Shilong AU - He, Yancheng AU - Huang, Hui AU - Bu, Xingyuan AU - Liu, Jiaheng AU - Guo, Hangyu AU - Wang, Weixun AU - Gu, Jihao AU - Su, Wenbo AU - Zheng, Bo TI - 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision. JO - CoRR VL - abs/2410.19720 PY - 2024// DO - 10.48550/ARXIV.2410.19720 UR - https://doi.org/10.48550/arXiv.2410.19720 ER - TY - JOUR ID - DBLP:journals/aamas/YangWHTLHHCFRHZG23 AU - Yang, Tianpei AU - Wang, Weixun AU - Hao, Jianye AU - Taylor, Matthew E. AU - Liu, Yong AU - Hao, Xiaotian AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Ren, Chunxu AU - Huang, Ye AU - Zhu, Jiangcheng AU - Gao, Yang TI - ASN: action semantics network for multiagent reinforcement learning. JO - Auton. Agents Multi Agent Syst. VL - 37 IS - 2 SP - 45 PY - 2023/10/ DO - 10.1007/S10458-023-09628-3 UR - https://doi.org/10.1007/s10458-023-09628-3 ER - TY - JOUR ID - DBLP:journals/jmlr/HuZGW0L0C023 AU - Hu, Siyi AU - Zhong, Yifan AU - Gao, Minquan AU - Wang, Weixun AU - Dong, Hao AU - Liang, Xiaodan AU - Li, Zhihui AU - Chang, Xiaojun AU - Yang, Yaodong TI - MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. JO - J. Mach. Learn. Res. VL - 24 SP - 315:1 EP - 315:23 PY - 2023// UR - http://jmlr.org/papers/v24/23-0378.html ER - TY - CPAPER ID - DBLP:conf/atal/0001WW0HORHCF23 AU - Qiu, Wei AU - Wang, Weixun AU - Wang, Rundong AU - An, Bo AU - Hu, Yujing AU - Obraztsova, Svetlana AU - Rabinovich, Zinovi AU - Hao, Jianye AU - Chen, Yingfeng AU - Fan, Changjie TI - Off-Beat Multi-Agent Reinforcement Learning. BT - Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, London, United Kingdom, 29 May 2023 - 2 June 2023 SP - 2424 EP - 2426 PY - 2023// DO - 10.5555/3545946.3598955 UR - https://dl.acm.org/doi/10.5555/3545946.3598955 ER - TY - CPAPER ID - DBLP:conf/atal/WuYHHZWT23 AU - Wu, Jizhou AU - Yang, Tianpei AU - Hao, Xiaotian AU - Hao, Jianye AU - Zheng, Yan AU - Wang, Weixun AU - Taylor, Matthew E. TI - PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning. BT - Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, London, United Kingdom, 29 May 2023 - 2 June 2023 SP - 2460 EP - 2462 PY - 2023// DO - 10.5555/3545946.3598967 UR - https://dl.acm.org/doi/10.5555/3545946.3598967 ER - TY - CPAPER ID - DBLP:conf/iclr/HaoHMW00ZW23 AU - Hao, Jianye AU - Hao, Xiaotian AU - Mao, Hangyu AU - Wang, Weixun AU - Yang, Yaodong AU - Li, Dong AU - Zheng, Yan AU - Wang, Zhen TI - Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks. BT - The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023 PY - 2023// UR - https://openreview.net/forum?id=OxNQXyZK-K8 ER - TY - JOUR ID - DBLP:journals/jzusc/ZhaoZWYHZHL22 AU - Zhao, Jian AU - Zhao, Youpeng AU - Wang, Weixun AU - Yang, Mingyu AU - Hu, Xunhan AU - Zhou, Wengang AU - Hao, Jianye AU - Li, Houqiang TI - Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents. JO - Frontiers Inf. Technol. Electron. Eng. VL - 23 IS - 7 SP - 1032 EP - 1042 PY - 2022// DO - 10.1631/FITEE.2100594 UR - https://doi.org/10.1631/FITEE.2100594 ER - TY - CPAPER ID - DBLP:conf/icml/WangZHWZGHLF22 AU - Wang, Li AU - Zhang, Yupeng AU - Hu, Yujing AU - Wang, Weixun AU - Zhang, Chongjie AU - Gao, Yang AU - Hao, Jianye AU - Lv, Tangjie AU - Fan, Changjie TI - Individual Reward Assisted Multi-Agent Reinforcement Learning. BT - International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA. SP - 23417 EP - 23432 PY - 2022// UR - https://proceedings.mlr.press/v162/wang22ao.html ER - TY - CPAPER ID - DBLP:conf/nips/0001CWHHH22 AU - Yang, Yaodong AU - Chen, Guangyong AU - Wang, Weixun AU - Hao, Xiaotian AU - Hao, Jianye AU - Heng, Pheng-Ann TI - Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing. BT - Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022. PY - 2022// UR - http://papers.nips.cc/paper_files/paper/2022/hash/e1cf57f1e104c6c05e31894c15a65e99-Abstract-Conference.html ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2202-04427 AU - Zhao, Jian AU - Zhang, Yue AU - Hu, Xunhan AU - Wang, Weixun AU - Zhou, Wengang AU - Hao, Jianye AU - Zhu, Jiangcheng AU - Li, Houqiang TI - Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization. JO - CoRR VL - abs/2202.04427 PY - 2022// UR - https://arxiv.org/abs/2202.04427 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2203-05285 AU - Hao, Xiaotian AU - Wang, Weixun AU - Mao, Hangyu AU - Yang, Yaodong AU - Li, Dong AU - Zheng, Yan AU - Wang, Zhen AU - Hao, Jianye TI - API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks. JO - CoRR VL - abs/2203.05285 PY - 2022// DO - 10.48550/ARXIV.2203.05285 UR - https://doi.org/10.48550/arXiv.2203.05285 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2203-08454 AU - Zhao, Jian AU - Zhao, Youpeng AU - Wang, Weixun AU - Yang, Mingyu AU - Hu, Xunhan AU - Zhou, Wengang AU - Hao, Jianye AU - Li, Houqiang TI - Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents. JO - CoRR VL - abs/2203.08454 PY - 2022// DO - 10.48550/ARXIV.2203.08454 UR - https://doi.org/10.48550/arXiv.2203.08454 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2205-09123 AU - Huang, Shengyi AU - Kanervisto, Anssi AU - Raffin, Antonin AU - Wang, Weixun AU - Ontañón, Santiago AU - Dossa, Rousslan Fernand Julien TI - A2C is a special case of PPO. JO - CoRR VL - abs/2205.09123 PY - 2022// DO - 10.48550/ARXIV.2205.09123 UR - https://doi.org/10.48550/arXiv.2205.09123 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2205-13718 AU - Qiu, Wei AU - Wang, Weixun AU - Wang, Rundong AU - An, Bo AU - Hu, Yujing AU - Obraztsova, Svetlana AU - Rabinovich, Zinovi AU - Hao, Jianye AU - Chen, Yingfeng AU - Fan, Changjie TI - Off-Beat Multi-Agent Reinforcement Learning. JO - CoRR VL - abs/2205.13718 PY - 2022// DO - 10.48550/ARXIV.2205.13718 UR - https://doi.org/10.48550/arXiv.2205.13718 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2210-13708 AU - Hu, Siyi AU - Zhong, Yifan AU - Gao, Minquan AU - Wang, Weixun AU - Dong, Hao AU - Li, Zhihui AU - Liang, Xiaodan AU - Chang, Xiaojun AU - Yang, Yaodong TI - MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. JO - CoRR VL - abs/2210.13708 PY - 2022// DO - 10.48550/ARXIV.2210.13708 UR - https://doi.org/10.48550/arXiv.2210.13708 ER - TY - CPAPER ID - DBLP:conf/nips/YangWTHMMLLCHFZ21 AU - Yang, Tianpei AU - Wang, Weixun AU - Tang, Hongyao AU - Hao, Jianye AU - Meng, Zhaopeng AU - Mao, Hangyu AU - Li, Dong AU - Liu, Wulong AU - Chen, Yingfeng AU - Hu, Yujing AU - Fan, Changjie AU - Zhang, Chengwei TI - An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning. BT - Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual. SP - 17037 EP - 17048 PY - 2021// UR - https://proceedings.neurips.cc/paper/2021/hash/8d9a6e908ed2b731fb96151d9bb94d49-Abstract.html ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2106-00517 AU - Zhou, Tianze AU - Zhang, Fubiao AU - Shao, Kun AU - Li, Kai AU - Huang, Wenhan AU - Luo, Jun AU - Wang, Weixun AU - Yang, Yaodong AU - Mao, Hangyu AU - Wang, Bin AU - Li, Dong AU - Liu, Wulong AU - Hao, Jianye TI - Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment. JO - CoRR VL - abs/2106.00517 PY - 2021// UR - https://arxiv.org/abs/2106.00517 ER - TY - CPAPER ID - DBLP:conf/aaai/LiuWHHC020 AU - Liu, Yong AU - Wang, Weixun AU - Hu, Yujing AU - Hao, Jianye AU - Chen, Xingguo AU - Gao, Yang TI - Multi-Agent Game Abstraction via Graph Attention Neural Network. BT - The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. SP - 7211 EP - 7218 PY - 2020// DO - 10.1609/AAAI.V34I05.6211 UR - https://doi.org/10.1609/aaai.v34i05.6211 ER - TY - CPAPER ID - DBLP:conf/aaai/WangYLHHHCFG20 AU - Wang, Weixun AU - Yang, Tianpei AU - Liu, Yong AU - Hao, Jianye AU - Hao, Xiaotian AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Gao, Yang TI - From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning. BT - The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020. SP - 7293 EP - 7300 PY - 2020// DO - 10.1609/AAAI.V34I05.6221 UR - https://doi.org/10.1609/aaai.v34i05.6221 ER - TY - CPAPER ID - DBLP:conf/atal/YangHMZHCFWWP20 AU - Yang, Tianpei AU - Hao, Jianye AU - Meng, Zhaopeng AU - Zhang, Zongzhang AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Wang, Weixun AU - Wang, Zhaodong AU - Peng, Jiajie TI - Efficient Deep Reinforcement Learning through Policy Transfer. BT - Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020 SP - 2053 EP - 2055 PY - 2020// DO - 10.5555/3398761.3399072 UR - https://dl.acm.org/doi/10.5555/3398761.3399072 UR - https://www.ifaamas.org/Proceedings/aamas2020/pdfs/p2053.pdf ER - TY - CPAPER ID - DBLP:conf/iclr/WangYLHHHCFG20 AU - Wang, Weixun AU - Yang, Tianpei AU - Liu, Yong AU - Hao, Jianye AU - Hao, Xiaotian AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Gao, Yang TI - Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. BT - 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020 PY - 2020// UR - https://openreview.net/forum?id=ryg48p4tPH ER - TY - CPAPER ID - DBLP:conf/ijcai/ZhangHWTMDZ20 AU - Zhang, Peng AU - Hao, Jianye AU - Wang, Weixun AU - Tang, Hongyao AU - Ma, Yi AU - Duan, Yihai AU - Zheng, Yan TI - KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020 SP - 2291 EP - 2297 PY - 2020// DO - 10.24963/IJCAI.2020/317 UR - https://doi.org/10.24963/ijcai.2020/317 ER - TY - CPAPER ID - DBLP:conf/ijcai/YangHMZHCFWLWP20 AU - Yang, Tianpei AU - Hao, Jianye AU - Meng, Zhaopeng AU - Zhang, Zongzhang AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Wang, Weixun AU - Liu, Wulong AU - Wang, Zhaodong AU - Peng, Jiajie TI - Efficient Deep Reinforcement Learning via Adaptive Policy Transfer. BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020 SP - 3094 EP - 3100 PY - 2020// DO - 10.24963/IJCAI.2020/428 UR - https://doi.org/10.24963/ijcai.2020/428 ER - TY - CPAPER ID - DBLP:conf/ijcai/HaoJHLWMZLXG20 AU - Hao, Xiaotian AU - Jin, Junqi AU - Hao, Jianye AU - Li, Jin AU - Wang, Weixun AU - Ma, Yi AU - Zheng, Zhenzhe AU - Li, Han AU - Xu, Jian AU - Gai, Kun TI - Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020 SP - 3437 EP - 3443 PY - 2020// DO - 10.24963/IJCAI.2020/475 UR - https://doi.org/10.24963/ijcai.2020/475 ER - TY - CPAPER ID - DBLP:conf/nips/HuWJWCH0F20 AU - Hu, Yujing AU - Wang, Weixun AU - Jia, Hangtian AU - Wang, Yixiang AU - Chen, Yingfeng AU - Hao, Jianye AU - Wu, Feng AU - Fan, Changjie TI - Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. BT - Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual. PY - 2020// UR - https://proceedings.neurips.cc/paper/2020/hash/b710915795b9e9c02cf10d6d2bdb688c-Abstract.html ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2002-07418 AU - Zhang, Peng AU - Hao, Jianye AU - Wang, Weixun AU - Tang, Hongyao AU - Ma, Yi AU - Duan, Yihai AU - Zheng, Yan TI - KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge. JO - CoRR VL - abs/2002.07418 PY - 2020// UR - https://arxiv.org/abs/2002.07418 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2002-08030 AU - Yang, Tianpei AU - Wang, Weixun AU - Tang, Hongyao AU - Hao, Jianye AU - Meng, Zhaopeng AU - Liu, Wulong AU - Hu, Yujing AU - Chen, Yingfeng TI - Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework. JO - CoRR VL - abs/2002.08030 PY - 2020// UR - https://arxiv.org/abs/2002.08030 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2002-08037 AU - Yang, Tianpei AU - Hao, Jianye AU - Meng, Zhaopeng AU - Zhang, Zongzhang AU - Wang, Weixun AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Wang, Zhaodong AU - Peng, Jiajie TI - Efficient Deep Reinforcement Learning through Policy Transfer. JO - CoRR VL - abs/2002.08037 PY - 2020// UR - https://arxiv.org/abs/2002.08037 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2005-04355 AU - Hao, Xiaotian AU - Jin, Junqi AU - Hao, Jianye AU - Li, Jin AU - Wang, Weixun AU - Ma, Yi AU - Zheng, Zhenzhe AU - Li, Han AU - Xu, Jian AU - Gai, Kun TI - Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising. JO - CoRR VL - abs/2005.04355 PY - 2020// UR - https://arxiv.org/abs/2005.04355 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-2011-02669 AU - Hu, Yujing AU - Wang, Weixun AU - Jia, Hangtian AU - Wang, Yixiang AU - Chen, Yingfeng AU - Hao, Jianye AU - Wu, Feng AU - Fan, Changjie TI - Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping. JO - CoRR VL - abs/2011.02669 PY - 2020// UR - https://arxiv.org/abs/2011.02669 ER - TY - CPAPER ID - DBLP:conf/atal/HaoWHY19 AU - Hao, Xiaotian AU - Wang, Weixun AU - Hao, Jianye AU - Yang, Yaodong TI - Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. BT - Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS '19, Montreal, QC, Canada, May 13-17, 2019 SP - 1315 EP - 1323 PY - 2019// UR - http://dl.acm.org/citation.cfm?id=3331837 ER - TY - CPAPER ID - DBLP:conf/cikm/WangJH0YZWHWLXG19 AU - Wang, Weixun AU - Jin, Junqi AU - Hao, Jianye AU - Chen, Chunjie AU - Yu, Chuan AU - Zhang, Weinan AU - Wang, Jun AU - Hao, Xiaotian AU - Wang, Yixi AU - Li, Han AU - Xu, Jian AU - Gai, Kun TI - Learning Adaptive Display Exposure for Real-Time Advertising. BT - Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, Beijing, China, November 3-7, 2019. SP - 2595 EP - 2603 PY - 2019// DO - 10.1145/3357384.3357806 UR - https://doi.org/10.1145/3357384.3357806 ER - TY - CPAPER ID - DBLP:conf/dai2/WangHWT19 AU - Wang, Weixun AU - Hao, Jianye AU - Wang, Yixi AU - Taylor, Matthew E. TI - Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas. BT - Proceedings of the First International Conference on Distributed Artificial Intelligence, DAI 2019, Beijing, China, October 13-15, 2019 SP - 11:1 EP - 11:7 PY - 2019// DO - 10.1145/3356464.3357712 UR - https://doi.org/10.1145/3356464.3357712 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1907-11461 AU - Wang, Weixun AU - Yang, Tianpei AU - Liu, Yong AU - Hao, Jianye AU - Hao, Xiaotian AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Gao, Yang TI - Action Semantics Network: Considering the Effects of Actions in Multiagent Systems. JO - CoRR VL - abs/1907.11461 PY - 2019// UR - http://arxiv.org/abs/1907.11461 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1909-02790 AU - Wang, Weixun AU - Yang, Tianpei AU - Liu, Yong AU - Hao, Jianye AU - Hao, Xiaotian AU - Hu, Yujing AU - Chen, Yingfeng AU - Fan, Changjie AU - Gao, Yang TI - From Few to More: Large-scale Dynamic Multiagent Curriculum Learning. JO - CoRR VL - abs/1909.02790 PY - 2019// UR - http://arxiv.org/abs/1909.02790 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1909-11468 AU - Hao, Xiaotian AU - Wang, Weixun AU - Hao, Jianye AU - Yang, Yaodong TI - Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems. JO - CoRR VL - abs/1909.11468 PY - 2019// UR - http://arxiv.org/abs/1909.11468 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1911-10715 AU - Liu, Yong AU - Wang, Weixun AU - Hu, Yujing AU - Hao, Jianye AU - Chen, Xingguo AU - Gao, Yang TI - Multi-Agent Game Abstraction via Graph Attention Neural Network. JO - CoRR VL - abs/1911.10715 PY - 2019// UR - http://arxiv.org/abs/1911.10715 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1803-00162 AU - Wang, Weixun AU - Hao, Jianye AU - Wang, Yixi AU - Taylor, Matthew E. TI - Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach. JO - CoRR VL - abs/1803.00162 PY - 2018// UR - http://arxiv.org/abs/1803.00162 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1809-03149 AU - Wang, Weixun AU - Jin, Junqi AU - Hao, Jianye AU - Chen, Chunjie AU - Yu, Chuan AU - Zhang, Weinan AU - Wang, Jun AU - Wang, Yixi AU - Li, Han AU - Xu, Jian AU - Gai, Kun TI - Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning. JO - CoRR VL - abs/1809.03149 PY - 2018// UR - http://arxiv.org/abs/1809.03149 ER - TY - JOUR ID - DBLP:journals/suscom/WangRM12 AU - Wang, Weixun AU - Ranka, Sanjay AU - Mishra, Prabhat TI - Energy-aware dynamic slack allocation for real-time multitasking systems. JO - Sustain. Comput. Informatics Syst. VL - 2 IS - 3 SP - 128 EP - 137 PY - 2012// DO - 10.1016/J.SUSCOM.2012.04.001 UR - https://doi.org/10.1016/j.suscom.2012.04.001 ER - TY - JOUR ID - DBLP:journals/tcad/QinWM12 AU - Qin, Xiaoke AU - Wang, Weixun AU - Mishra, Prabhat TI - TCEC: Temperature and Energy-Constrained Scheduling in Real-Time Multitasking Systems. JO - IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. VL - 31 IS - 8 SP - 1159 EP - 1168 PY - 2012// DO - 10.1109/TCAD.2012.2190824 UR - https://doi.org/10.1109/TCAD.2012.2190824 ER - TY - JOUR ID - DBLP:journals/tecs/WangMG12 AU - Wang, Weixun AU - Mishra, Prabhat AU - Gordon-Ross, Ann TI - Dynamic Cache Reconfiguration for Soft Real-Time Systems. JO - ACM Trans. Embed. Comput. Syst. VL - 11 IS - 2 SP - 28:1 EP - 28:31 PY - 2012// DO - 10.1145/2220336.2220340 UR - https://doi.org/10.1145/2220336.2220340 ER - TY - JOUR ID - DBLP:journals/tvlsi/WangM12 AU - Wang, Weixun AU - Mishra, Prabhat TI - System-Wide Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Multitasking Systems. JO - IEEE Trans. Very Large Scale Integr. Syst. VL - 20 IS - 5 SP - 902 EP - 910 PY - 2012// DO - 10.1109/TVLSI.2011.2116814 UR - https://doi.org/10.1109/TVLSI.2011.2116814 ER - TY - ENCYC ID - DBLP:reference/crc/WangQM12 AU - Wang, Weixun AU - Qin, Xiaoke AU - Mishra, Prabhat TI - Energy-Aware Scheduling and Dynamic Reconfiguration in Real-Time Systems. BT - Handbook of Energy-Aware and Green Computing - Two Volume Set. SP - 543 EP - 572 PY - 2012// DO - 10.1201/B16631-30 UR - http://www.crcnetbase.com/doi/abs/10.1201/b16631-30 ER - TY - Informal or Other Publication ID - DBLP:journals/corr/abs-1211-1736 AU - Basu, Kanad AU - Mitra, Subrata AU - Mukherjee, Srishti AU - Wang, Weixun TI - A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking JO - CoRR VL - abs/1211.1736 PY - 2012// UR - http://arxiv.org/abs/1211.1736 ER - TY - JOUR ID - DBLP:journals/jolpe/WangM11 AU - Wang, Weixun AU - Mishra, Prabhat TI - Dynamic Reconfiguration of Two-Level Cache Hierarchy in Real-Time Embedded Systems. JO - J. Low Power Electron. VL - 7 IS - 1 SP - 17 EP - 28 PY - 2011// DO - 10.1166/JOLPE.2011.1113 UR - https://doi.org/10.1166/jolpe.2011.1113 ER - TY - JOUR ID - DBLP:journals/suscom/WangRM11 AU - Wang, Weixun AU - Ranka, Sanjay AU - Mishra, Prabhat TI - Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems. JO - Sustain. Comput. Informatics Syst. VL - 1 IS - 1 SP - 35 EP - 45 PY - 2011// DO - 10.1016/J.SUSCOM.2010.10.006 UR - https://doi.org/10.1016/j.suscom.2010.10.006 ER - TY - CPAPER ID - DBLP:conf/dac/WangMR11 AU - Wang, Weixun AU - Mishra, Prabhat AU - Ranka, Sanjay TI - Dynamic cache reconfiguration and partitioning for energy optimization in real-time multi-core systems. BT - Proceedings of the 48th Design Automation Conference, DAC 2011, San Diego, California, USA, June 5-10, 2011 SP - 948 EP - 953 PY - 2011// DO - 10.1145/2024724.2024935 UR - https://doi.org/10.1145/2024724.2024935 ER - TY - CPAPER ID - DBLP:conf/vlsid/WangRM11 AU - Wang, Weixun AU - Ranka, Sanjay AU - Mishra, Prabhat TI - A General Algorithm for Energy-Aware Dynamic Reconfiguration in Multitasking Systems. BT - VLSI Design 2011: 24th International Conference on VLSI Design, IIT Madras, Chennai, India, 2-7 January 2011 SP - 334 EP - 339 PY - 2011// DO - 10.1109/VLSID.2011.17 UR - https://doi.org/10.1109/VLSID.2011.17 UR - https://doi.ieeecomputersociety.org/10.1109/VLSID.2011.17 ER - TY - CPAPER ID - DBLP:conf/dac/WangM10 AU - Wang, Weixun AU - Mishra, Prabhat TI - PreDVS: preemptive dynamic voltage scaling for real-time systems using approximation scheme. BT - Proceedings of the 47th Design Automation Conference, DAC 2010, Anaheim, California, USA, July 13-18, 2010 SP - 705 EP - 710 PY - 2010// DO - 10.1145/1837274.1837452 UR - https://doi.org/10.1145/1837274.1837452 ER - TY - CPAPER ID - DBLP:conf/islped/WangQM10 AU - Wang, Weixun AU - Qin, Xiaoke AU - Mishra, Prabhat TI - Temperature- and energy-constrained scheduling in multitasking systems: a model checking approach. BT - Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010, Austin, Texas, USA, August 18-20, 2010 SP - 85 EP - 90 PY - 2010// DO - 10.1145/1840845.1840863 UR - https://doi.org/10.1145/1840845.1840863 ER - TY - CPAPER ID - DBLP:conf/vlsid/WangM10 AU - Wang, Weixun AU - Mishra, Prabhat TI - Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Real-Time Systems. BT - VLSI Design 2010: 23rd International Conference on VLSI Design, 9th International Conference on Embedded Systems, Bangalore, India, 3-7 January 2010 SP - 357 EP - 362 PY - 2010// DO - 10.1109/VLSI.DESIGN.2010.22 UR - https://doi.org/10.1109/VLSI.Design.2010.22 UR - https://doi.ieeecomputersociety.org/10.1109/VLSI.Design.2010.22 ER - TY - CPAPER ID - DBLP:conf/isvlsi/WangM09 AU - Wang, Weixun AU - Mishra, Prabhat TI - Dynamic Reconfiguration of Two-Level Caches in Soft Real-Time Embedded Systems. BT - IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2009, 13-15 May 2009, Tampa, Florida, USA SP - 145 EP - 150 PY - 2009// DO - 10.1109/ISVLSI.2009.22 UR - https://doi.org/10.1109/ISVLSI.2009.22 UR - https://doi.ieeecomputersociety.org/10.1109/ISVLSI.2009.22 ER - TY - CPAPER ID - DBLP:conf/vlsid/WangMG09 AU - Wang, Weixun AU - Mishra, Prabhat AU - Gordon-Ross, Ann TI - SACR: Scheduling-Aware Cache Reconfiguration for Real-Time Embedded Systems. BT - VLSI Design 2009: Improving Productivity through Higher Abstraction, The 22nd International Conference on VLSI Design, New Delhi, India, 5-9 January 2009 SP - 547 EP - 552 PY - 2009// DO - 10.1109/VLSI.DESIGN.2009.66 UR - https://doi.org/10.1109/VLSI.Design.2009.66 UR - https://doi.ieeecomputersociety.org/10.1109/VLSI.Design.2009.66 ER -