default search action
Chuheng Zhang
Person information
SPARQL queries
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
2020 – today
- 2024
- [c20]Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim:
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL. ACL (1) 2024: 8252-8271 - [c19]Chuheng Zhang, Xiangsen Wang, Wei Jiang, Xianliang Yang, Siwei Wang, Lei Song, Jiang Bian:
Whittle Index with Multiple Actions and State Constraint for Inventory Management. ICLR 2024 - [c18]Yunseon Choi, Li Zhao, Chuheng Zhang, Lei Song, Jiang Bian, Kee-Eung Kim:
Diversification of Adaptive Policy for Effective Offline Reinforcement Learning. IJCAI 2024: 3863-3871 - [i21]Yiwen Chen, Yuyao Ye, Ziyi Chen, Chuheng Zhang, Marcelo H. Ang:
ARO: Large Language Model Supervised Robotics Text2Skill Autonomous Learning. CoRR abs/2403.15834 (2024) - [i20]Guangran Cheng, Chuheng Zhang, Wenzhe Cai, Li Zhao, Changyin Sun, Jiang Bian:
Empowering Large Language Models on Robotic Manipulation with Affordance Prompting. CoRR abs/2404.11027 (2024) - [i19]Yunseon Choi, Sangmin Bae, Seonghyun Ban, Minchan Jeong, Chuheng Zhang, Lei Song, Li Zhao, Jiang Bian, Kee-Eung Kim:
Hard Prompts Made Interpretable: Sparse Entropy Regularization for Prompt Tuning with RL. CoRR abs/2407.14733 (2024) - [i18]Wei Shen, Chuheng Zhang:
Policy Filtration in RLHF to Fine-Tune LLM for Code Generation. CoRR abs/2409.06957 (2024) - 2023
- [c17]Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang:
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. AAAI 2023: 6879-6887 - [c16]Yuanying Cai, Chuheng Zhang, Hanye Zhao, Li Zhao, Jiang Bian:
Curriculum Offline Reinforcement Learning. AAMAS 2023: 1221-1229 - [c15]Jinpeng Zhang, Yufeng Zheng, Chuheng Zhang, Li Zhao, Lei Song, Yuan Zhou, Jiang Bian:
Robust Situational Reinforcement Learning in Face of Context Disturbances. ICML 2023: 41973-41989 - [c14]Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao:
Towards Generalizable Reinforcement Learning for Trade Execution. IJCAI 2023: 4975-4983 - [i17]Yuanying Cai, Chuheng Zhang, Wei Shen, Xuyun Zhang, Wenjie Ruan, Longbo Huang:
RePreM: Representation Pre-training with Masked Model for Reinforcement Learning. CoRR abs/2303.01668 (2023) - [i16]Xianliang Yang, Zhihao Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Jiang Bian:
A Versatile Multi-Agent Reinforcement Learning Benchmark for Inventory Management. CoRR abs/2306.07542 (2023) - [i15]Chuheng Zhang, Yitong Duan, Xiaoyu Chen, Jianyu Chen, Jian Li, Li Zhao:
Towards Generalizable Reinforcement Learning for Trade Execution. CoRR abs/2307.11685 (2023) - [i14]Lei Song, Chuheng Zhang, Li Zhao, Jiang Bian:
Pre-Trained Large Language Models for Industrial Control. CoRR abs/2308.03028 (2023) - 2022
- [c13]Yuanying Cai, Chuheng Zhang, Wei Shen, Xiaonan He, Xuyun Zhang, Longbo Huang:
Imitation Learning to Outperform Demonstrators by Directly Extrapolating Demonstrations. CIKM 2022: 128-137 - [c12]Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Jian Xie:
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS. CIKM 2022: 1777-1786 - [c11]Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks. CIKM 2022: 3555-3564 - [c10]Ze Wang, Guogang Liao, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang:
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation. CIKM 2022: 4560-4564 - [c9]Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu:
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets. ICDM 2022: 21-30 - [c8]Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation. SIGIR 2022: 2292-2296 - [c7]Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed. WWW 2022: 401-409 - [i13]Guogang Liao, Xiaowen Shi, Ze Wang, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Deep Page-Level Interest Network in Reinforcement Learning for Ads Allocation. CoRR abs/2204.00377 (2022) - [i12]Guogang Liao, Ze Wang, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Learning List-wise Representation in Reinforcement Learning for Ads Allocation with Multiple Auxiliary Tasks. CoRR abs/2204.00888 (2022) - [i11]Guogang Liao, Ze Wang, Xiaowen Shi, Xiaoxu Wu, Chuheng Zhang, Bingqi Zhu, Yongkang Wang, Xingxing Wang, Dong Wang:
Hybrid Transfer in Deep Reinforcement Learning for Ads Allocation. CoRR abs/2204.11589 (2022) - [i10]Yuanying Cai, Chuheng Zhang, Li Zhao, Wei Shen, Xuyun Zhang, Lei Song, Jiang Bian, Tao Qin, Tieyan Liu:
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets. CoRR abs/2212.02125 (2022) - [i9]Wei Shen, Xiaonan He, Chuheng Zhang, Xuyun Zhang, Jian Xie:
A Transformer-Based User Satisfaction Prediction for Proactive Interaction Mechanism in DuerOS. CoRR abs/2212.03817 (2022) - [i8]Yuandong Ding, Mingxiao Feng, Guozi Liu, Wei Jiang, Chuheng Zhang, Li Zhao, Lei Song, Houqiang Li, Yan Jin, Jiang Bian:
Multi-Agent Reinforcement Learning with Shared Resources for Inventory Management. CoRR abs/2212.07684 (2022) - 2021
- [c6]Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li:
Exploration by Maximizing Renyi Entropy for Reward-Free RL Framework. AAAI 2021: 10859-10867 - [c5]Wei Shen, Chuheng Zhang, Yun Tian, Liang Zeng, Xiaonan He, Wanchun Dou, Xiaolong Xu:
Inductive Matrix Completion Using Graph Autoencoder. CIKM 2021: 1609-1618 - [c4]Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu:
Return-Based Contrastive Representation Learning for Reinforcement Learning. ICLR 2021 - [i7]Guoqing Liu, Chuheng Zhang, Li Zhao, Tao Qin, Jinhua Zhu, Jian Li, Nenghai Yu, Tie-Yan Liu:
Return-Based Contrastive Representation Learning for Reinforcement Learning. CoRR abs/2102.10960 (2021) - [i6]Wei Shen, Chuheng Zhang, Yun Tian, Liang Zeng, Xiaonan He, Wanchun Dou, Xiaolong Xu:
Inductive Matrix Completion Using Graph Autoencoder. CoRR abs/2108.11124 (2021) - [i5]Guogang Liao, Ze Wang, Xiaoxu Wu, Xiaowen Shi, Chuheng Zhang, Yongkang Wang, Xingxing Wang, Dong Wang:
Cross DQN: Cross Deep Q Network for Ads Allocation in Feed. CoRR abs/2109.04353 (2021) - 2020
- [c3]Chuheng Zhang, Yuanqi Li, Jian Li:
Policy Search by Target Distribution Learning for Continuous Control. AAAI 2020: 6770-6777 - [c2]Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang:
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing. CIKM 2020: 1355-1364 - [c1]Chuheng Zhang, Yuanqi Li, Xi Chen, Yifei Jin, Pingzhong Tang, Jian Li:
DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis. ICDM 2020: 781-790 - [i4]Chuheng Zhang, Yuanying Cai, Longbo Huang, Jian Li:
Exploration by Maximizing Rényi Entropy for Zero-Shot Meta RL. CoRR abs/2006.06193 (2020) - [i3]Wei Shen, Xiaonan He, Chuheng Zhang, Qiang Ni, Wanchun Dou, Yan Wang:
Auxiliary-task Based Deep Reinforcement Learning for Participant Selection Problem in Mobile Crowdsourcing. CoRR abs/2008.11087 (2020) - [i2]Chuheng Zhang, Yuanqi Li, Xi Chen, Yifei Jin, Pingzhong Tang, Jian Li:
DoubleEnsemble: A New Ensemble Method Based on Sample Reweighting and Feature Selection for Financial Data Analysis. CoRR abs/2010.01265 (2020)
2010 – 2019
- 2019
- [i1]Chuheng Zhang, Yuanqi Li, Jian Li:
Policy Search by Target Distribution Learning for Continuous Control. CoRR abs/1905.11041 (2019)
Coauthor Index
manage site settings
To protect your privacy, all features that rely on external API calls from your browser are turned off by default. You need to opt-in for them to become active. All settings here will be stored as cookies with your web browser. For more information see our F.A.Q.
Unpaywalled article links
Add open access links from to the list of external document links (if available).
Privacy notice: By enabling the option above, your browser will contact the API of unpaywall.org to load hyperlinks to open access articles. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Unpaywall privacy policy.
Archived links via Wayback Machine
For web page which are no longer available, try to retrieve content from the of the Internet Archive (if available).
Privacy notice: By enabling the option above, your browser will contact the API of archive.org to check for archived content of web pages that are no longer available. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Internet Archive privacy policy.
Reference lists
Add a list of references from , , and to record detail pages.
load references from crossref.org and opencitations.net
Privacy notice: By enabling the option above, your browser will contact the APIs of crossref.org, opencitations.net, and semanticscholar.org to load article reference information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the Crossref privacy policy and the OpenCitations privacy policy, as well as the AI2 Privacy Policy covering Semantic Scholar.
Citation data
Add a list of citing articles from and to record detail pages.
load citations from opencitations.net
Privacy notice: By enabling the option above, your browser will contact the API of opencitations.net and semanticscholar.org to load citation information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the OpenCitations privacy policy as well as the AI2 Privacy Policy covering Semantic Scholar.
OpenAlex data
Load additional information about publications from .
Privacy notice: By enabling the option above, your browser will contact the API of openalex.org to load additional information. Although we do not have any reason to believe that your call will be tracked, we do not have any control over how the remote server uses your data. So please proceed with care and consider checking the information given by OpenAlex.
last updated on 2024-10-21 20:32 CEST by the dblp team
all metadata released as open data under CC0 1.0 license
see also: Terms of Use | Privacy Policy | Imprint