Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"
TY - CPAPER
ID - DBLP:conf/iclr/DongYHNM0HLFH24
AU - Dong, Zibin
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Mu, Yao
AU - Zheng, Yan
AU - Hu, Yujing
AU - Lv, Tangjie
AU - Fan, Changjie
AU - Hu, Zhipeng
TI - AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.
BT - The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024
PY - 2024//
UR - https://openreview.net/forum?id=bxfKIYfHyx
ER -
TY - CPAPER
ID - DBLP:conf/iclr/YuanHMDL0FZ024
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ma, Yi
AU - Dong, Zibin
AU - Liang, Hebin
AU - Liu, Jinyi
AU - Feng, Zhixin
AU - Zhao, Kai
AU - Zheng, Yan
TI - Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
BT - The Twelfth International Conference on Learning Representations, ICLR 2024, Vienna, Austria, May 7-11, 2024
PY - 2024//
UR - https://openreview.net/forum?id=WesY0H9ghM
ER -
TY - CPAPER
ID - DBLP:conf/icml/KouN00YDH24
AU - Kou, Longxin
AU - Ni, Fei
AU - Zheng, Yan
AU - Liu, Jinyi
AU - Yuan, Yifu
AU - Dong, Zibin
AU - Hao, Jianye
TI - KISA: A Unified Keyframe Identifier and Skill Annotator for Long-Horizon Robotics Demonstrations.
BT - Forty-first International Conference on Machine Learning, ICML 2024, Vienna, Austria, July 21-27, 2024
PY - 2024//
UR - https://openreview.net/forum?id=oCI9gHocws
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2401-15443
AU - Dong, Zibin
AU - Hao, Jianye
AU - Yuan, Yifu
AU - Ni, Fei
AU - Wang, Yitian
AU - Li, Pengyi
AU - Zheng, Yan
TI - DiffuserLite: Towards Real-time Diffusion Planning.
JO - CoRR
VL - abs/2401.15443
PY - 2024//
DO - 10.48550/ARXIV.2401.15443
UR - https://doi.org/10.48550/arXiv.2401.15443
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2402-02423
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ma, Yi
AU - Dong, Zibin
AU - Liang, Hebin
AU - Liu, Jinyi
AU - Feng, Zhixin
AU - Zhao, Kai
AU - Zheng, Yan
TI - Uni-RLHF: Universal Platform and Benchmark Suite for Reinforcement Learning with Diverse Human Feedback.
JO - CoRR
VL - abs/2402.02423
PY - 2024//
DO - 10.48550/ARXIV.2402.02423
UR - https://doi.org/10.48550/arXiv.2402.02423
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2402-14244
AU - Zhou, Xinglin
AU - Yuan, Yifu
AU - Yang, Shaofu
AU - Hao, Jianye
TI - MENTOR: Guiding Hierarchical Reinforcement Learning with Human Feedback and Dynamic Distance Constraint.
JO - CoRR
VL - abs/2402.14244
PY - 2024//
DO - 10.48550/ARXIV.2402.14244
UR - https://doi.org/10.48550/arXiv.2402.14244
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2402-14245
AU - Liu, Jinyi
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Fu, Lingzhi
AU - Chen, Yibin
AU - Zheng, Yan
TI - Enhancing Robotic Manipulation with AI Feedback from Multimodal Large Language Models.
JO - CoRR
VL - abs/2402.14245
PY - 2024//
DO - 10.48550/ARXIV.2402.14245
UR - https://doi.org/10.48550/arXiv.2402.14245
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2403-03636
AU - Chen, Yibin
AU - Yuan, Yifu
AU - Zhang, Zeyu
AU - Zheng, Yan
AU - Liu, Jinyi
AU - Ni, Fei
AU - Hao, Jianye
TI - SheetAgent: A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models.
JO - CoRR
VL - abs/2403.03636
PY - 2024//
DO - 10.48550/ARXIV.2403.03636
UR - https://doi.org/10.48550/arXiv.2403.03636
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2405-12954
AU - Sun, Haoyuan
AU - Wu, Zihao
AU - Xia, Bo
AU - Chang, Pu
AU - Dong, Zibin
AU - Yuan, Yifu
AU - Chang, Yongzhe
AU - Wang, Xueqian
TI - A Method on Searching Better Activation Functions.
JO - CoRR
VL - abs/2405.12954
PY - 2024//
DO - 10.48550/ARXIV.2405.12954
UR - https://doi.org/10.48550/arXiv.2405.12954
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2406-09509
AU - Dong, Zibin
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Ma, Yi
AU - Li, Pengyi
AU - Zheng, Yan
TI - CleanDiffuser: An Easy-to-use Modularized Library for Diffusion Models in Decision Making.
JO - CoRR
VL - abs/2406.09509
PY - 2024//
DO - 10.48550/ARXIV.2406.09509
UR - https://doi.org/10.48550/arXiv.2406.09509
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2408-15501
AU - Yuan, Yifu
AU - Zheng, Zhenrui
AU - Dong, Zibin
AU - Hao, Jianye
TI - MODULI: Unlocking Preference Generalization via Diffusion Models for Offline Multi-Objective Reinforcement Learning.
JO - CoRR
VL - abs/2408.15501
PY - 2024//
DO - 10.48550/ARXIV.2408.15501
UR - https://doi.org/10.48550/arXiv.2408.15501
ER -
TY - CPAPER
ID - DBLP:conf/iclr/YuanHNMZHLCF23
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Mu, Yao
AU - Zheng, Yan
AU - Hu, Yujing
AU - Liu, Jinyi
AU - Chen, Yingfeng
AU - Fan, Changjie
TI - EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
BT - The Eleventh International Conference on Learning Representations, ICLR 2023, Kigali, Rwanda, May 1-5, 2023
PY - 2023//
UR - https://openreview.net/forum?id=xQAjSr64PTc
ER -
TY - CPAPER
ID - DBLP:conf/icml/NiHMYZWL23
AU - Ni, Fei
AU - Hao, Jianye
AU - Mu, Yao
AU - Yuan, Yifu
AU - Zheng, Yan
AU - Wang, Bin
AU - Liang, Zhixuan
TI - MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
BT - International Conference on Machine Learning, ICML 2023, 23-29 July 2023, Honolulu, Hawaii, USA.
SP - 26087
EP - 26105
PY - 2023//
UR - https://proceedings.mlr.press/v202/ni23a.html
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2305-19923
AU - Ni, Fei
AU - Hao, Jianye
AU - Mu, Yao
AU - Yuan, Yifu
AU - Zheng, Yan
AU - Wang, Bin
AU - Liang, Zhixuan
TI - MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL.
JO - CoRR
VL - abs/2305.19923
PY - 2023//
DO - 10.48550/ARXIV.2305.19923
UR - https://doi.org/10.48550/arXiv.2305.19923
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2310-02054
AU - Dong, Zibin
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Mu, Yao
AU - Zheng, Yan
AU - Hu, Yujing
AU - Lv, Tangjie
AU - Fan, Changjie
AU - Hu, Zhipeng
TI - AlignDiff: Aligning Diverse Human Preferences via Behavior-Customisable Diffusion Model.
JO - CoRR
VL - abs/2310.02054
PY - 2023//
DO - 10.48550/ARXIV.2310.02054
UR - https://doi.org/10.48550/arXiv.2310.02054
ER -
TY - JOUR
ID - DBLP:journals/itiis/SiTYPL22
AU - Si, Huaiwei
AU - Tan, Guozhen
AU - Yuan, Yifu
AU - Peng, Yanfei
AU - Li, Jianping
TI - Explicit Dynamic Coordination Reinforcement Learning Based on Utility.
JO - KSII Trans. Internet Inf. Syst.
VL - 16
IS - 3
SP - 792
EP - 812
PY - 2022//
DO - 10.3837/TIIS.2022.03.003
UR - https://doi.org/10.3837/tiis.2022.03.003
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2210-00498
AU - Yuan, Yifu
AU - Hao, Jianye
AU - Ni, Fei
AU - Mu, Yao
AU - Zheng, Yan
AU - Hu, Yujing
AU - Liu, Jinyi
AU - Chen, Yingfeng
AU - Fan, Changjie
TI - EUCLID: Towards Efficient Unsupervised Reinforcement Learning with Multi-choice Dynamics Model.
JO - CoRR
VL - abs/2210.00498
PY - 2022//
DO - 10.48550/ARXIV.2210.00498
UR - https://doi.org/10.48550/arXiv.2210.00498
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2212-01968
AU - Ren, Zhicheng
AU - Yuan, Yifu
AU - Wu, Yuxin
AU - Gao, Xiaxuan
AU - Wang, Yewen
AU - Sun, Yizhou
TI - Dissimilar Nodes Improve Graph Active Learning.
JO - CoRR
VL - abs/2212.01968
PY - 2022//
DO - 10.48550/ARXIV.2212.01968
UR - https://doi.org/10.48550/arXiv.2212.01968
ER -
TY - CPAPER
ID - DBLP:conf/cvpr/XiangQMXZLLJYWY20
AU - Xiang, Fanbo
AU - Qin, Yuzhe
AU - Mo, Kaichun
AU - Xia, Yikuan
AU - Zhu, Hao
AU - Liu, Fangchen
AU - Liu, Minghua
AU - Jiang, Hanxiao
AU - Yuan, Yifu
AU - Wang, He
AU - Yi, Li
AU - Chang, Angel X.
AU - Guibas, Leonidas J.
AU - Su, Hao
TI - SAPIEN: A SimulAted Part-Based Interactive ENvironment.
BT - 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition, CVPR 2020, Seattle, WA, USA, June 13-19, 2020
SP - 11094
EP - 11104
PY - 2020//
UR - https://openaccess.thecvf.com/content_CVPR_2020/html/Xiang_SAPIEN_A_SimulAted_Part-Based_Interactive_ENvironment_CVPR_2020_paper.html
UR - https://doi.org/10.1109/CVPR42600.2020.01111
ER -
TY - Informal or Other Publication
ID - DBLP:journals/corr/abs-2003-08515
AU - Xiang, Fanbo
AU - Qin, Yuzhe
AU - Mo, Kaichun
AU - Xia, Yikuan
AU - Zhu, Hao
AU - Liu, Fangchen
AU - Liu, Minghua
AU - Jiang, Hanxiao
AU - Yuan, Yifu
AU - Wang, He
AU - Yi, Li
AU - Chang, Angel X.
AU - Guibas, Leonidas J.
AU - Su, Hao
TI - SAPIEN: A SimulAted Part-based Interactive ENvironment.
JO - CoRR
VL - abs/2003.08515
PY - 2020//
UR - https://arxiv.org/abs/2003.08515
ER -