iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://dblp.uni-trier.de/rec/journals/corr/abs-2410-19720.ris
Provider: Schloss Dagstuhl - Leibniz Center for Informatics
Database: dblp computer science bibliography
Content:text/plain; charset="utf-8"

TY  - Informal or Other Publication
ID  - DBLP:journals/corr/abs-2410-19720
AU  - Li, Shilong
AU  - He, Yancheng
AU  - Huang, Hui
AU  - Bu, Xingyuan
AU  - Liu, Jiaheng
AU  - Guo, Hangyu
AU  - Wang, Weixun
AU  - Gu, Jihao
AU  - Su, Wenbo
AU  - Zheng, Bo
TI  - 2D-DPO: Scaling Direct Preference Optimization with 2-Dimensional Supervision.
JO  - CoRR
VL  - abs/2410.19720
PY  - 2024//
DO  - 10.48550/ARXIV.2410.19720
UR  - https://doi.org/10.48550/arXiv.2410.19720
ER  -