Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"
TY - JOUR
ID - DBLP:journals/tciaig/ZhouZSDLHWWLLH24
AU - Zhou, Tianze
AU - Zhang, Fubiao
AU - Shao, Kun
AU - Dai, Zipeng
AU - Li, Kai
AU - Huang, Wenhan
AU - Wang, Weixun
AU - Wang, Bin
AU - Li, Dong
AU - Liu, Wulong
AU - Hao, Jianye
TI - Cooperative Multiagent Transfer Learning With Coalition Pattern Decomposition.
JO - IEEE Trans. Games
VL - 16
IS - 2
SP - 352
EP - 364
PY - 2024/06/
DO - 10.1109/TG.2023.3272386
UR - https://doi.org/10.1109/TG.2023.3272386
ER -
TY - CPAPER
ID - DBLP:conf/aaai/WuHYHZWT24
AU - Wu, Jizhou
AU - Hao, Jianye
AU - Yang, Tianpei
AU - Hao, Xiaotian
AU - Zheng, Yan
AU - Wang, Weixun
AU - Taylor, Matthew E.
TI - PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
BT - Thirty-Eighth AAAI Conference on Artificial Intelligence, AAAI 2024, Thirty-Sixth Conference on Innovative Applications of Artificial Intelligence, IAAI 2024, Fourteenth Symposium on Educational Advances in Artificial Intelligence, EAAI 2014, February 20-27, 2024, Vancouver, Canada
SP - 15934
EP - 15942
PY - 2024//
DO - 10.1609/AAAI.V38I14.29524
UR - https://doi.org/10.1609/aaai.v38i14.29524
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2403-17031
AU - Huang, Shengyi
AU - Noukhovitch, Michael
AU - Hosseini, Arian
AU - Rasul, Kashif
AU - Wang, Weixun
AU - Tunstall, Lewis
TI - The N+ Implementation Details of RLHF with PPO: A Case Study on TL;DR Summarization.
JO - CoRR
VL - abs/2403.17031
PY - 2024//
DO - 10.48550/ARXIV.2403.17031
UR - https://doi.org/10.48550/arXiv.2403.17031
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2405-11143
AU - Hu, Jian
AU - Wu, Xibin
AU - Wang, Weixun
AU - Xianyu
AU - Zhang, Dehao
AU - Cao, Yu
TI - OpenRLHF: An Easy-to-use, Scalable and High-performance RLHF Framework.
JO - CoRR
VL - abs/2405.11143
PY - 2024//
DO - 10.48550/ARXIV.2405.11143
UR - https://doi.org/10.48550/arXiv.2405.11143
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2410-19720
AU - Li, Shilong
AU - He, Yancheng
AU - Huang, Hui
AU - Bu, Xingyuan
AU - Liu, Jiaheng
AU - Guo, Hangyu
AU - Wang, Weixun
AU - Gu, Jihao
AU - Su, Wenbo
AU - Zheng, Bo
TI - 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.
JO - CoRR
VL - abs/2410.19720
PY - 2024//
DO - 10.48550/ARXIV.2410.19720
UR - https://doi.org/10.48550/arXiv.2410.19720
ER -
TY - JOUR
ID - DBLP:journals/aamas/YangWHTLHHCFRHZG23
AU - Yang, Tianpei
AU - Wang, Weixun
AU - Hao, Jianye
AU - Taylor, Matthew E.
AU - Liu, Yong
AU - Hao, Xiaotian
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Ren, Chunxu
AU - Huang, Ye
AU - Zhu, Jiangcheng
AU - Gao, Yang
TI - ASN: action semantics network for multiagent reinforcement learning.
JO - Auton. Agents Multi Agent Syst.
VL - 37
IS - 2
SP - 45
PY - 2023/10/
DO - 10.1007/S10458-023-09628-3
UR - https://doi.org/10.1007/s10458-023-09628-3
ER -
TY - JOUR
ID - DBLP:journals/jmlr/HuZGW0L0C023
AU - Hu, Siyi
AU - Zhong, Yifan
AU - Gao, Minquan
AU - Wang, Weixun
AU - Dong, Hao
AU - Liang, Xiaodan
AU - Li, Zhihui
AU - Chang, Xiaojun
AU - Yang, Yaodong
TI - MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library.
JO - J. Mach. Learn. Res.
VL - 24
SP - 315:1
EP - 315:23
PY - 2023//
UR - http://jmlr.org/papers/v24/23-0378.html
ER -
TY - CPAPER
ID - DBLP:conf/atal/0001WW0HORHCF23
AU - Qiu, Wei
AU - Wang, Weixun
AU - Wang, Rundong
AU - An, Bo
AU - Hu, Yujing
AU - Obraztsova, Svetlana
AU - Rabinovich, Zinovi
AU - Hao, Jianye
AU - Chen, Yingfeng
AU - Fan, Changjie
TI - Off-Beat Multi-Agent Reinforcement Learning.
BT - Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, London, United Kingdom, 29 May 2023 - 2 June 2023
SP - 2424
EP - 2426
PY - 2023//
DO - 10.5555/3545946.3598955
UR - https://dl.acm.org/doi/10.5555/3545946.3598955
ER -
TY - CPAPER
ID - DBLP:conf/atal/WuYHHZWT23
AU - Wu, Jizhou
AU - Yang, Tianpei
AU - Hao, Xiaotian
AU - Hao, Jianye
AU - Zheng, Yan
AU - Wang, Weixun
AU - Taylor, Matthew E.
TI - PORTAL: Automatic Curricula Generation for Multiagent Reinforcement Learning.
BT - Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, AAMAS 2023, London, United Kingdom, 29 May 2023 - 2 June 2023
SP - 2460
EP - 2462
PY - 2023//
DO - 10.5555/3545946.3598967
UR - https://dl.acm.org/doi/10.5555/3545946.3598967
ER -
TY - CPAPER
ID - DBLP:conf/iclr/HaoHMW00ZW23
AU - Hao, Jianye
AU - Hao, Xiaotian
AU - Mao, Hangyu
AU - Wang, Weixun
AU - Yang, Yaodong
AU - Li, Dong
AU - Zheng, Yan
AU - Wang, Zhen
TI - Boosting Multiagent Reinforcement Learning via Permutation Invariant and Permutation Equivariant Networks.
BT - The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023
PY - 2023//
UR - https://openreview.net/forum?id=OxNQXyZK-K8
ER -
TY - JOUR
ID - DBLP:journals/jzusc/ZhaoZWYHZHL22
AU - Zhao, Jian
AU - Zhao, Youpeng
AU - Wang, Weixun
AU - Yang, Mingyu
AU - Hu, Xunhan
AU - Zhou, Wengang
AU - Hao, Jianye
AU - Li, Houqiang
TI - Coach-assisted multi-agent reinforcement learning framework for unexpected crashed agents.
JO - Frontiers Inf. Technol. Electron. Eng.
VL - 23
IS - 7
SP - 1032
EP - 1042
PY - 2022//
DO - 10.1631/FITEE.2100594
UR - https://doi.org/10.1631/FITEE.2100594
ER -
TY - CPAPER
ID - DBLP:conf/icml/WangZHWZGHLF22
AU - Wang, Li
AU - Zhang, Yupeng
AU - Hu, Yujing
AU - Wang, Weixun
AU - Zhang, Chongjie
AU - Gao, Yang
AU - Hao, Jianye
AU - Lv, Tangjie
AU - Fan, Changjie
TI - Individual Reward Assisted Multi-Agent Reinforcement Learning.
BT - International Conference on Machine Learning, ICML 2022, 17-23 July 2022, Baltimore, Maryland, USA.
SP - 23417
EP - 23432
PY - 2022//
UR - https://proceedings.mlr.press/v162/wang22ao.html
ER -
TY - CPAPER
ID - DBLP:conf/nips/0001CWHHH22
AU - Yang, Yaodong
AU - Chen, Guangyong
AU - Wang, Weixun
AU - Hao, Xiaotian
AU - Hao, Jianye
AU - Heng, Pheng-Ann
TI - Transformer-based Working Memory for Multiagent Reinforcement Learning with Action Parsing.
BT - Advances in Neural Information Processing Systems 35: Annual Conference on Neural Information Processing Systems 2022, NeurIPS 2022, New Orleans, LA, USA, November 28 - December 9, 2022.
PY - 2022//
UR - http://papers.nips.cc/paper_files/paper/2022/hash/e1cf57f1e104c6c05e31894c15a65e99-Abstract-Conference.html
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2202-04427
AU - Zhao, Jian
AU - Zhang, Yue
AU - Hu, Xunhan
AU - Wang, Weixun
AU - Zhou, Wengang
AU - Hao, Jianye
AU - Zhu, Jiangcheng
AU - Li, Houqiang
TI - Revisiting QMIX: Discriminative Credit Assignment by Gradient Entropy Regularization.
JO - CoRR
VL - abs/2202.04427
PY - 2022//
UR - https://arxiv.org/abs/2202.04427
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2203-05285
AU - Hao, Xiaotian
AU - Wang, Weixun
AU - Mao, Hangyu
AU - Yang, Yaodong
AU - Li, Dong
AU - Zheng, Yan
AU - Wang, Zhen
AU - Hao, Jianye
TI - API: Boosting Multi-Agent Reinforcement Learning via Agent-Permutation-Invariant Networks.
JO - CoRR
VL - abs/2203.05285
PY - 2022//
DO - 10.48550/ARXIV.2203.05285
UR - https://doi.org/10.48550/arXiv.2203.05285
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2203-08454
AU - Zhao, Jian
AU - Zhao, Youpeng
AU - Wang, Weixun
AU - Yang, Mingyu
AU - Hu, Xunhan
AU - Zhou, Wengang
AU - Hao, Jianye
AU - Li, Houqiang
TI - Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents.
JO - CoRR
VL - abs/2203.08454
PY - 2022//
DO - 10.48550/ARXIV.2203.08454
UR - https://doi.org/10.48550/arXiv.2203.08454
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2205-09123
AU - Huang, Shengyi
AU - Kanervisto, Anssi
AU - Raffin, Antonin
AU - Wang, Weixun
AU - Ontañón, Santiago
AU - Dossa, Rousslan Fernand Julien
TI - A2C is a special case of PPO.
JO - CoRR
VL - abs/2205.09123
PY - 2022//
DO - 10.48550/ARXIV.2205.09123
UR - https://doi.org/10.48550/arXiv.2205.09123
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2205-13718
AU - Qiu, Wei
AU - Wang, Weixun
AU - Wang, Rundong
AU - An, Bo
AU - Hu, Yujing
AU - Obraztsova, Svetlana
AU - Rabinovich, Zinovi
AU - Hao, Jianye
AU - Chen, Yingfeng
AU - Fan, Changjie
TI - Off-Beat Multi-Agent Reinforcement Learning.
JO - CoRR
VL - abs/2205.13718
PY - 2022//
DO - 10.48550/ARXIV.2205.13718
UR - https://doi.org/10.48550/arXiv.2205.13718
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2210-13708
AU - Hu, Siyi
AU - Zhong, Yifan
AU - Gao, Minquan
AU - Wang, Weixun
AU - Dong, Hao
AU - Li, Zhihui
AU - Liang, Xiaodan
AU - Chang, Xiaojun
AU - Yang, Yaodong
TI - MARLlib: Extending RLlib for Multi-agent Reinforcement Learning.
JO - CoRR
VL - abs/2210.13708
PY - 2022//
DO - 10.48550/ARXIV.2210.13708
UR - https://doi.org/10.48550/arXiv.2210.13708
ER -
TY - CPAPER
ID - DBLP:conf/nips/YangWTHMMLLCHFZ21
AU - Yang, Tianpei
AU - Wang, Weixun
AU - Tang, Hongyao
AU - Hao, Jianye
AU - Meng, Zhaopeng
AU - Mao, Hangyu
AU - Li, Dong
AU - Liu, Wulong
AU - Chen, Yingfeng
AU - Hu, Yujing
AU - Fan, Changjie
AU - Zhang, Chengwei
TI - An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning.
BT - Advances in Neural Information Processing Systems 34: Annual Conference on Neural Information Processing Systems 2021, NeurIPS 2021, December 6-14, 2021, virtual.
SP - 17037
EP - 17048
PY - 2021//
UR - https://proceedings.neurips.cc/paper/2021/hash/8d9a6e908ed2b731fb96151d9bb94d49-Abstract.html
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2106-00517
AU - Zhou, Tianze
AU - Zhang, Fubiao
AU - Shao, Kun
AU - Li, Kai
AU - Huang, Wenhan
AU - Luo, Jun
AU - Wang, Weixun
AU - Yang, Yaodong
AU - Mao, Hangyu
AU - Wang, Bin
AU - Li, Dong
AU - Liu, Wulong
AU - Hao, Jianye
TI - Cooperative Multi-Agent Transfer Learning with Level-Adaptive Credit Assignment.
JO - CoRR
VL - abs/2106.00517
PY - 2021//
UR - https://arxiv.org/abs/2106.00517
ER -
TY - CPAPER
ID - DBLP:conf/aaai/LiuWHHC020
AU - Liu, Yong
AU - Wang, Weixun
AU - Hu, Yujing
AU - Hao, Jianye
AU - Chen, Xingguo
AU - Gao, Yang
TI - Multi-Agent Game Abstraction via Graph Attention Neural Network.
BT - The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020.
SP - 7211
EP - 7218
PY - 2020//
DO - 10.1609/AAAI.V34I05.6211
UR - https://doi.org/10.1609/aaai.v34i05.6211
ER -
TY - CPAPER
ID - DBLP:conf/aaai/WangYLHHHCFG20
AU - Wang, Weixun
AU - Yang, Tianpei
AU - Liu, Yong
AU - Hao, Jianye
AU - Hao, Xiaotian
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Gao, Yang
TI - From Few to More: Large-Scale Dynamic Multiagent Curriculum Learning.
BT - The Thirty-Fourth AAAI Conference on Artificial Intelligence, AAAI 2020, The Thirty-Second Innovative Applications of Artificial Intelligence Conference, IAAI 2020, The Tenth AAAI Symposium on Educational Advances in Artificial Intelligence, EAAI 2020, New York, NY, USA, February 7-12, 2020.
SP - 7293
EP - 7300
PY - 2020//
DO - 10.1609/AAAI.V34I05.6221
UR - https://doi.org/10.1609/aaai.v34i05.6221
ER -
TY - CPAPER
ID - DBLP:conf/atal/YangHMZHCFWWP20
AU - Yang, Tianpei
AU - Hao, Jianye
AU - Meng, Zhaopeng
AU - Zhang, Zongzhang
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Wang, Weixun
AU - Wang, Zhaodong
AU - Peng, Jiajie
TI - Efficient Deep Reinforcement Learning through Policy Transfer.
BT - Proceedings of the 19th International Conference on Autonomous Agents and Multiagent Systems, AAMAS '20, Auckland, New Zealand, May 9-13, 2020
SP - 2053
EP - 2055
PY - 2020//
DO - 10.5555/3398761.3399072
UR - https://dl.acm.org/doi/10.5555/3398761.3399072
UR - https://www.ifaamas.org/Proceedings/aamas2020/pdfs/p2053.pdf
ER -
TY - CPAPER
ID - DBLP:conf/iclr/WangYLHHHCFG20
AU - Wang, Weixun
AU - Yang, Tianpei
AU - Liu, Yong
AU - Hao, Jianye
AU - Hao, Xiaotian
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Gao, Yang
TI - Action Semantics Network: Considering the Effects of Actions in Multiagent Systems.
BT - 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020
PY - 2020//
UR - https://openreview.net/forum?id=ryg48p4tPH
ER -
TY - CPAPER
ID - DBLP:conf/ijcai/ZhangHWTMDZ20
AU - Zhang, Peng
AU - Hao, Jianye
AU - Wang, Weixun
AU - Tang, Hongyao
AU - Ma, Yi
AU - Duan, Yihai
AU - Zheng, Yan
TI - KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020
SP - 2291
EP - 2297
PY - 2020//
DO - 10.24963/IJCAI.2020/317
UR - https://doi.org/10.24963/ijcai.2020/317
ER -
TY - CPAPER
ID - DBLP:conf/ijcai/YangHMZHCFWLWP20
AU - Yang, Tianpei
AU - Hao, Jianye
AU - Meng, Zhaopeng
AU - Zhang, Zongzhang
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Wang, Weixun
AU - Liu, Wulong
AU - Wang, Zhaodong
AU - Peng, Jiajie
TI - Efficient Deep Reinforcement Learning via Adaptive Policy Transfer.
BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020
SP - 3094
EP - 3100
PY - 2020//
DO - 10.24963/IJCAI.2020/428
UR - https://doi.org/10.24963/ijcai.2020/428
ER -
TY - CPAPER
ID - DBLP:conf/ijcai/HaoJHLWMZLXG20
AU - Hao, Xiaotian
AU - Jin, Junqi
AU - Hao, Jianye
AU - Li, Jin
AU - Wang, Weixun
AU - Ma, Yi
AU - Zheng, Zhenzhe
AU - Li, Han
AU - Xu, Jian
AU - Gai, Kun
TI - Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
BT - Proceedings of the Twenty-Ninth International Joint Conference on Artificial Intelligence, IJCAI 2020
SP - 3437
EP - 3443
PY - 2020//
DO - 10.24963/IJCAI.2020/475
UR - https://doi.org/10.24963/ijcai.2020/475
ER -
TY - CPAPER
ID - DBLP:conf/nips/HuWJWCH0F20
AU - Hu, Yujing
AU - Wang, Weixun
AU - Jia, Hangtian
AU - Wang, Yixiang
AU - Chen, Yingfeng
AU - Hao, Jianye
AU - Wu, Feng
AU - Fan, Changjie
TI - Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping.
BT - Advances in Neural Information Processing Systems 33: Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6-12, 2020, virtual.
PY - 2020//
UR - https://proceedings.neurips.cc/paper/2020/hash/b710915795b9e9c02cf10d6d2bdb688c-Abstract.html
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2002-07418
AU - Zhang, Peng
AU - Hao, Jianye
AU - Wang, Weixun
AU - Tang, Hongyao
AU - Ma, Yi
AU - Duan, Yihai
AU - Zheng, Yan
TI - KoGuN: Accelerating Deep Reinforcement Learning via Integrating Human Suboptimal Knowledge.
JO - CoRR
VL - abs/2002.07418
PY - 2020//
UR - https://arxiv.org/abs/2002.07418
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2002-08030
AU - Yang, Tianpei
AU - Wang, Weixun
AU - Tang, Hongyao
AU - Hao, Jianye
AU - Meng, Zhaopeng
AU - Liu, Wulong
AU - Hu, Yujing
AU - Chen, Yingfeng
TI - Learning When to Transfer among Agents: An Efficient Multiagent Transfer Learning Framework.
JO - CoRR
VL - abs/2002.08030
PY - 2020//
UR - https://arxiv.org/abs/2002.08030
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2002-08037
AU - Yang, Tianpei
AU - Hao, Jianye
AU - Meng, Zhaopeng
AU - Zhang, Zongzhang
AU - Wang, Weixun
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Wang, Zhaodong
AU - Peng, Jiajie
TI - Efficient Deep Reinforcement Learning through Policy Transfer.
JO - CoRR
VL - abs/2002.08037
PY - 2020//
UR - https://arxiv.org/abs/2002.08037
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2005-04355
AU - Hao, Xiaotian
AU - Jin, Junqi
AU - Hao, Jianye
AU - Li, Jin
AU - Wang, Weixun
AU - Ma, Yi
AU - Zheng, Zhenzhe
AU - Li, Han
AU - Xu, Jian
AU - Gai, Kun
TI - Learning to Accelerate Heuristic Searching for Large-Scale Maximum Weighted b-Matching Problems in Online Advertising.
JO - CoRR
VL - abs/2005.04355
PY - 2020//
UR - https://arxiv.org/abs/2005.04355
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2011-02669
AU - Hu, Yujing
AU - Wang, Weixun
AU - Jia, Hangtian
AU - Wang, Yixiang
AU - Chen, Yingfeng
AU - Hao, Jianye
AU - Wu, Feng
AU - Fan, Changjie
TI - Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping.
JO - CoRR
VL - abs/2011.02669
PY - 2020//
UR - https://arxiv.org/abs/2011.02669
ER -
TY - CPAPER
ID - DBLP:conf/atal/HaoWHY19
AU - Hao, Xiaotian
AU - Wang, Weixun
AU - Hao, Jianye
AU - Yang, Yaodong
TI - Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
BT - Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, AAMAS '19, Montreal, QC, Canada, May 13-17, 2019
SP - 1315
EP - 1323
PY - 2019//
UR - http://dl.acm.org/citation.cfm?id=3331837
ER -
TY - CPAPER
ID - DBLP:conf/cikm/WangJH0YZWHWLXG19
AU - Wang, Weixun
AU - Jin, Junqi
AU - Hao, Jianye
AU - Chen, Chunjie
AU - Yu, Chuan
AU - Zhang, Weinan
AU - Wang, Jun
AU - Hao, Xiaotian
AU - Wang, Yixi
AU - Li, Han
AU - Xu, Jian
AU - Gai, Kun
TI - Learning Adaptive Display Exposure for Real-Time Advertising.
BT - Proceedings of the 28th ACM International Conference on Information and Knowledge Management, CIKM 2019, Beijing, China, November 3-7, 2019.
SP - 2595
EP - 2603
PY - 2019//
DO - 10.1145/3357384.3357806
UR - https://doi.org/10.1145/3357384.3357806
ER -
TY - CPAPER
ID - DBLP:conf/dai2/WangHWT19
AU - Wang, Weixun
AU - Hao, Jianye
AU - Wang, Yixi
AU - Taylor, Matthew E.
TI - Achieving cooperation through deep multiagent reinforcement learning in sequential prisoner's dilemmas.
BT - Proceedings of the First International Conference on Distributed Artificial Intelligence, DAI 2019, Beijing, China, October 13-15, 2019
SP - 11:1
EP - 11:7
PY - 2019//
DO - 10.1145/3356464.3357712
UR - https://doi.org/10.1145/3356464.3357712
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1907-11461
AU - Wang, Weixun
AU - Yang, Tianpei
AU - Liu, Yong
AU - Hao, Jianye
AU - Hao, Xiaotian
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Gao, Yang
TI - Action Semantics Network: Considering the Effects of Actions in Multiagent Systems.
JO - CoRR
VL - abs/1907.11461
PY - 2019//
UR - http://arxiv.org/abs/1907.11461
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1909-02790
AU - Wang, Weixun
AU - Yang, Tianpei
AU - Liu, Yong
AU - Hao, Jianye
AU - Hao, Xiaotian
AU - Hu, Yujing
AU - Chen, Yingfeng
AU - Fan, Changjie
AU - Gao, Yang
TI - From Few to More: Large-scale Dynamic Multiagent Curriculum Learning.
JO - CoRR
VL - abs/1909.02790
PY - 2019//
UR - http://arxiv.org/abs/1909.02790
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1909-11468
AU - Hao, Xiaotian
AU - Wang, Weixun
AU - Hao, Jianye
AU - Yang, Yaodong
TI - Independent Generative Adversarial Self-Imitation Learning in Cooperative Multiagent Systems.
JO - CoRR
VL - abs/1909.11468
PY - 2019//
UR - http://arxiv.org/abs/1909.11468
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1911-10715
AU - Liu, Yong
AU - Wang, Weixun
AU - Hu, Yujing
AU - Hao, Jianye
AU - Chen, Xingguo
AU - Gao, Yang
TI - Multi-Agent Game Abstraction via Graph Attention Neural Network.
JO - CoRR
VL - abs/1911.10715
PY - 2019//
UR - http://arxiv.org/abs/1911.10715
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1803-00162
AU - Wang, Weixun
AU - Hao, Jianye
AU - Wang, Yixi
AU - Taylor, Matthew E.
TI - Towards Cooperation in Sequential Prisoner's Dilemmas: a Deep Multiagent Reinforcement Learning Approach.
JO - CoRR
VL - abs/1803.00162
PY - 2018//
UR - http://arxiv.org/abs/1803.00162
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1809-03149
AU - Wang, Weixun
AU - Jin, Junqi
AU - Hao, Jianye
AU - Chen, Chunjie
AU - Yu, Chuan
AU - Zhang, Weinan
AU - Wang, Jun
AU - Wang, Yixi
AU - Li, Han
AU - Xu, Jian
AU - Gai, Kun
TI - Learning to Advertise with Adaptive Exposure via Constrained Two-Level Reinforcement Learning.
JO - CoRR
VL - abs/1809.03149
PY - 2018//
UR - http://arxiv.org/abs/1809.03149
ER -
TY - JOUR
ID - DBLP:journals/suscom/WangRM12
AU - Wang, Weixun
AU - Ranka, Sanjay
AU - Mishra, Prabhat
TI - Energy-aware dynamic slack allocation for real-time multitasking systems.
JO - Sustain. Comput. Informatics Syst.
VL - 2
IS - 3
SP - 128
EP - 137
PY - 2012//
DO - 10.1016/J.SUSCOM.2012.04.001
UR - https://doi.org/10.1016/j.suscom.2012.04.001
ER -
TY - JOUR
ID - DBLP:journals/tcad/QinWM12
AU - Qin, Xiaoke
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - TCEC: Temperature and Energy-Constrained Scheduling in Real-Time Multitasking Systems.
JO - IEEE Trans. Comput. Aided Des. Integr. Circuits Syst.
VL - 31
IS - 8
SP - 1159
EP - 1168
PY - 2012//
DO - 10.1109/TCAD.2012.2190824
UR - https://doi.org/10.1109/TCAD.2012.2190824
ER -
TY - JOUR
ID - DBLP:journals/tecs/WangMG12
AU - Wang, Weixun
AU - Mishra, Prabhat
AU - Gordon-Ross, Ann
TI - Dynamic Cache Reconfiguration for Soft Real-Time Systems.
JO - ACM Trans. Embed. Comput. Syst.
VL - 11
IS - 2
SP - 28:1
EP - 28:31
PY - 2012//
DO - 10.1145/2220336.2220340
UR - https://doi.org/10.1145/2220336.2220340
ER -
TY - JOUR
ID - DBLP:journals/tvlsi/WangM12
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - System-Wide Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Multitasking Systems.
JO - IEEE Trans. Very Large Scale Integr. Syst.
VL - 20
IS - 5
SP - 902
EP - 910
PY - 2012//
DO - 10.1109/TVLSI.2011.2116814
UR - https://doi.org/10.1109/TVLSI.2011.2116814
ER -
TY - ENCYC
ID - DBLP:reference/crc/WangQM12
AU - Wang, Weixun
AU - Qin, Xiaoke
AU - Mishra, Prabhat
TI - Energy-Aware Scheduling and Dynamic Reconfiguration in Real-Time Systems.
BT - Handbook of Energy-Aware and Green Computing - Two Volume Set.
SP - 543
EP - 572
PY - 2012//
DO - 10.1201/B16631-30
UR - http://www.crcnetbase.com/doi/abs/10.1201/b16631-30
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-1211-1736
AU - Basu, Kanad
AU - Mitra, Subrata
AU - Mukherjee, Srishti
AU - Wang, Weixun
TI - A Novel Approach for Handling Misbehaving Nodes in Behavior-Aware Mobile Networking
JO - CoRR
VL - abs/1211.1736
PY - 2012//
UR - http://arxiv.org/abs/1211.1736
ER -
TY - JOUR
ID - DBLP:journals/jolpe/WangM11
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - Dynamic Reconfiguration of Two-Level Cache Hierarchy in Real-Time Embedded Systems.
JO - J. Low Power Electron.
VL - 7
IS - 1
SP - 17
EP - 28
PY - 2011//
DO - 10.1166/JOLPE.2011.1113
UR - https://doi.org/10.1166/jolpe.2011.1113
ER -
TY - JOUR
ID - DBLP:journals/suscom/WangRM11
AU - Wang, Weixun
AU - Ranka, Sanjay
AU - Mishra, Prabhat
TI - Energy-aware dynamic reconfiguration algorithms for real-time multitasking systems.
JO - Sustain. Comput. Informatics Syst.
VL - 1
IS - 1
SP - 35
EP - 45
PY - 2011//
DO - 10.1016/J.SUSCOM.2010.10.006
UR - https://doi.org/10.1016/j.suscom.2010.10.006
ER -
TY - CPAPER
ID - DBLP:conf/dac/WangMR11
AU - Wang, Weixun
AU - Mishra, Prabhat
AU - Ranka, Sanjay
TI - Dynamic cache reconfiguration and partitioning for energy optimization in real-time multi-core systems.
BT - Proceedings of the 48th Design Automation Conference, DAC 2011, San Diego, California, USA, June 5-10, 2011
SP - 948
EP - 953
PY - 2011//
DO - 10.1145/2024724.2024935
UR - https://doi.org/10.1145/2024724.2024935
ER -
TY - CPAPER
ID - DBLP:conf/vlsid/WangRM11
AU - Wang, Weixun
AU - Ranka, Sanjay
AU - Mishra, Prabhat
TI - A General Algorithm for Energy-Aware Dynamic Reconfiguration in Multitasking Systems.
BT - VLSI Design 2011: 24th International Conference on VLSI Design, IIT Madras, Chennai, India, 2-7 January 2011
SP - 334
EP - 339
PY - 2011//
DO - 10.1109/VLSID.2011.17
UR - https://doi.org/10.1109/VLSID.2011.17
UR - https://doi.ieeecomputersociety.org/10.1109/VLSID.2011.17
ER -
TY - CPAPER
ID - DBLP:conf/dac/WangM10
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - PreDVS: preemptive dynamic voltage scaling for real-time systems using approximation scheme.
BT - Proceedings of the 47th Design Automation Conference, DAC 2010, Anaheim, California, USA, July 13-18, 2010
SP - 705
EP - 710
PY - 2010//
DO - 10.1145/1837274.1837452
UR - https://doi.org/10.1145/1837274.1837452
ER -
TY - CPAPER
ID - DBLP:conf/islped/WangQM10
AU - Wang, Weixun
AU - Qin, Xiaoke
AU - Mishra, Prabhat
TI - Temperature- and energy-constrained scheduling in multitasking systems: a model checking approach.
BT - Proceedings of the 2010 International Symposium on Low Power Electronics and Design, 2010, Austin, Texas, USA, August 18-20, 2010
SP - 85
EP - 90
PY - 2010//
DO - 10.1145/1840845.1840863
UR - https://doi.org/10.1145/1840845.1840863
ER -
TY - CPAPER
ID - DBLP:conf/vlsid/WangM10
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - Leakage-Aware Energy Minimization Using Dynamic Voltage Scaling and Cache Reconfiguration in Real-Time Systems.
BT - VLSI Design 2010: 23rd International Conference on VLSI Design, 9th International Conference on Embedded Systems, Bangalore, India, 3-7 January 2010
SP - 357
EP - 362
PY - 2010//
DO - 10.1109/VLSI.DESIGN.2010.22
UR - https://doi.org/10.1109/VLSI.Design.2010.22
UR - https://doi.ieeecomputersociety.org/10.1109/VLSI.Design.2010.22
ER -
TY - CPAPER
ID - DBLP:conf/isvlsi/WangM09
AU - Wang, Weixun
AU - Mishra, Prabhat
TI - Dynamic Reconfiguration of Two-Level Caches in Soft Real-Time Embedded Systems.
BT - IEEE Computer Society Annual Symposium on VLSI, ISVLSI 2009, 13-15 May 2009, Tampa, Florida, USA
SP - 145
EP - 150
PY - 2009//
DO - 10.1109/ISVLSI.2009.22
UR - https://doi.org/10.1109/ISVLSI.2009.22
UR - https://doi.ieeecomputersociety.org/10.1109/ISVLSI.2009.22
ER -
TY - CPAPER
ID - DBLP:conf/vlsid/WangMG09
AU - Wang, Weixun
AU - Mishra, Prabhat
AU - Gordon-Ross, Ann
TI - SACR: Scheduling-Aware Cache Reconfiguration for Real-Time Embedded Systems.
BT - VLSI Design 2009: Improving Productivity through Higher Abstraction, The 22nd International Conference on VLSI Design, New Delhi, India, 5-9 January 2009
SP - 547
EP - 552
PY - 2009//
DO - 10.1109/VLSI.DESIGN.2009.66
UR - https://doi.org/10.1109/VLSI.Design.2009.66
UR - https://doi.ieeecomputersociety.org/10.1109/VLSI.Design.2009.66
ER -