Tianxiang Sun Xiaotian Zhang Zhengfu He Peng Li Qinyuan Cheng Xiangyang Liu Hang Yan 0001 Yunfan Shao Qiong Tang Shiduo Zhang Xingjian Zhao Ke Chen Yining Zheng Zhejian Zhou Ruixiao Li Jun Zhan Yunhua Zhou Linyang Li Xiaogui Yang Lingling Wu Zhangyue Yin Xuanjing Huang 0001 Yu-Gang Jiang Xipeng Qiu MOSS: An Open Conversational Large Language Model. 888-905 2024 October 21 Mach. Intell. Res. 5 https://doi.org/10.1007/s11633-024-1502-8 db/journals/ijautcomp/ijautcomp21.html#SunZHLCLYSTZZCZZLZZLYWY24 streams/journals/ijautcomp
Shimin Li Tianxiang Sun Xipeng Qiu Agent Alignment in Evolving Social Norms. 2024 abs/2401.04620 CoRR https://doi.org/10.48550/arXiv.2401.04620 db/journals/corr/corr2401.html#abs-2401-04620
Qinyuan Cheng Tianxiang Sun Xiangyang Liu Wenwei Zhang Zhangyue Yin Shimin Li Linyang Li Zhengfu He Kai Chen 0026 Xipeng Qiu Can AI Assistants Know What They Don't Know? 2024 abs/2401.13275 CoRR https://doi.org/10.48550/arXiv.2401.13275 db/journals/corr/corr2401.html#abs-2401-13275
Xinghao Wang Junliang He Pengyu Wang Yunhua Zhou Tianxiang Sun Xipeng Qiu DenoSent: A Denoising Objective for Self-Supervised Sentence Representation Learning. 2024 abs/2401.13621 CoRR https://doi.org/10.48550/arXiv.2401.13621 db/journals/corr/corr2401.html#abs-2401-13621
Siyin Wang Shimin Li Tianxiang Sun Jinlan Fu Qinyuan Cheng Jiasheng Ye Junjie Ye Xipeng Qiu Xuanjing Huang 0001 LLM can Achieve Self-Regulation via Hyperparameter Aware Generation. 2024 abs/2402.11251 CoRR https://doi.org/10.48550/arXiv.2402.11251 db/journals/corr/corr2402.html#abs-2402-11251
Zhengfu He Xuyang Ge Qiong Tang Tianxiang Sun Qinyuan Cheng Xipeng Qiu Dictionary Learning Improves Patch-Free Circuit Discovery in Mechanistic Interpretability: A Case Study on Othello-GPT. 2024 abs/2402.12201 CoRR https://doi.org/10.48550/arXiv.2402.12201 db/journals/corr/corr2402.html#abs-2402-12201
Jun Zhan Junqi Dai Jiasheng Ye Yunhua Zhou Dong Zhang Zhigeng Liu Xin Zhang Ruibin Yuan Ge Zhang Linyang Li Hang Yan 0001 Jie Fu 0001 Tao Gui Tianxiang Sun Yugang Jiang Xipeng Qiu AnyGPT: Unified Multimodal LLM with Discrete Sequence Modeling. 2024 abs/2402.12226 CoRR https://doi.org/10.48550/arXiv.2402.12226 db/journals/corr/corr2402.html#abs-2402-12226
Zhiyuan Zeng Qipeng Guo Zhaoye Fei Zhangyue Yin Yunhua Zhou Linyang Li Tianxiang Sun Hang Yan 0001 Dahua Lin Xipeng Qiu Turn Waste into Worth: Rectifying Top-k Router of MoE. 2024 abs/2402.12399 CoRR https://doi.org/10.48550/arXiv.2402.12399 db/journals/corr/corr2402.html#abs-2402-12399
Bo Wang Tianxiang Sun Hang Yan 0001 Siyin Wang Qingyuan Cheng Xipeng Qiu In-Memory Learning: A Declarative Learning Framework for Large Language Models. 2024 abs/2403.02757 CoRR https://doi.org/10.48550/arXiv.2403.02757 db/journals/corr/corr2403.html#abs-2403-02757
Jiasheng Ye Peiju Liu Tianxiang Sun Yunhua Zhou Jun Zhan Xipeng Qiu Data Mixing Laws: Optimizing Data Mixtures by Predicting Language Modeling Performance. 2024 abs/2403.16952 CoRR https://doi.org/10.48550/arXiv.2403.16952 db/journals/corr/corr2403.html#abs-2403-16952
Zhangyue Yin Qiushi Sun Qipeng Guo Zhiyuan Zeng Xiaonan Li Tianxiang Sun Cheng Chang Qinyuan Cheng Ding Wang Xiaofeng Mou Xipeng Qiu Xuanjing Huang 0001 Aggregation of Reasoning: A Hierarchical Framework for Enhancing Answer Selection in Large Language Models. 2024 abs/2405.12939 CoRR https://doi.org/10.48550/arXiv.2405.12939 db/journals/corr/corr2405.html#abs-2405-12939
Qinyuan Cheng Xiaonan Li Shimin Li Qin Zhu Zhangyue Yin Yunfan Shao Linyang Li Tianxiang Sun Hang Yan 0001 Xipeng Qiu Unified Active Retrieval for Retrieval Augmented Generation. 2024 abs/2406.12534 CoRR https://doi.org/10.48550/arXiv.2406.12534 db/journals/corr/corr2406.html#abs-2406-12534
Zhe Xu Jiasheng Ye Xiangyang Liu Tianxiang Sun Xiaoran Liu Qipeng Guo Linlin Li 0001 Qun Liu 0001 Xuanjing Huang 0001 Xipeng Qiu DetectiveQA: Evaluating Long-Context Reasoning on Detective Novels. 2024 abs/2409.02465 CoRR https://doi.org/10.48550/arXiv.2409.02465 db/journals/corr/corr2409.html#abs-2409-02465 streams/journals/corr
Linyang Li Pengyu Wang Ke Ren Tianxiang Sun Xipeng Qiu Origin Tracing and Detecting of LLMs. 2023 abs/2304.14072 CoRR https://doi.org/10.48550/arXiv.2304.14072 db/journals/corr/corr2304.html#abs-2304-14072
Qinyuan Cheng Xiaogui Yang Tianxiang Sun Linyang Li Xipeng Qiu Improving Contrastive Learning of Sentence Embeddings from AI Feedback. 2023 abs/2305.01918 CoRR https://doi.org/10.48550/arXiv.2305.01918 db/journals/corr/corr2305.html#abs-2305-01918
Peng Li Tianxiang Sun Qiong Tang Hang Yan 0001 Yuanbin Wu Xuanjing Huang 0001 Xipeng Qiu CodeIE: Large Code Generation Models are Better Few-Shot Information Extractors. 2023 abs/2305.05711 CoRR https://doi.org/10.48550/arXiv.2305.05711 db/journals/corr/corr2305.html#abs-2305-05711
Rui Zheng Shihan Dou Songyang Gao Yuan Hua Wei Shen Binghai Wang Yan Liu 0002 Senjie Jin Qin Liu Yuhao Zhou Limao Xiong Lu Chen Zhiheng Xi Nuo Xu Wenbin Lai Minghao Zhu Cheng Chang Zhangyue Yin Rongxiang Weng Wensen Cheng Haoran Huang Tianxiang Sun Hang Yan 0001 Tao Gui Qi Zhang 0001 Xipeng Qiu Xuanjing Huang 0001 Secrets of RLHF in Large Language Models Part I: PPO. 2023 abs/2307.04964 CoRR https://doi.org/10.48550/arXiv.2307.04964 db/journals/corr/corr2307.html#abs-2307-04964
Qinyuan Cheng Tianxiang Sun Wenwei Zhang Siyin Wang Xiangyang Liu Mozhi Zhang Junliang He Mianqiu Huang Zhangyue Yin Kai Chen 0026 Xipeng Qiu Evaluating Hallucinations in Chinese Large Language Models. 2023 abs/2310.03368 CoRR https://doi.org/10.48550/arXiv.2310.03368 db/journals/corr/corr2310.html#abs-2310-03368
Kexin Huang Xiangyang Liu Qianyu Guo Tianxiang Sun Jiawei Sun Yaru Wang Zeyang Zhou Yixu Wang Yan Teng Xipeng Qiu Yingchun Wang Dahua Lin Flames: Benchmarking Value Alignment of Chinese Large Language Models. 2023 abs/2311.06899 CoRR https://doi.org/10.48550/arXiv.2311.06899 db/journals/corr/corr2311.html#abs-2311-06899
Xiaonan Li Changtai Zhu Linyang Li Zhangyue Yin Tianxiang Sun Xipeng Qiu LLatrieval: LLM-Verified Retrieval for Verifiable Generation. 2023 abs/2311.07838 CoRR https://doi.org/10.48550/arXiv.2311.07838 db/journals/corr/corr2311.html#abs-2311-07838
Tianxiang Sun Xiangyang Liu Xipeng Qiu Xuanjing Huang 0001 Paradigm Shift in Natural Language Processing. 169-183 2022 19 Int. J. Autom. Comput. 3 https://doi.org/10.1007/s11633-022-1331-6 db/journals/ijautcomp/ijautcomp19.html#SunLQH22
Tianxiang Sun Yunfan Shao Hong Qian Xuanjing Huang 0001 Xipeng Qiu Black-Box Tuning for Language-Model-as-a-Service. 2022 abs/2201.03514 CoRR https://arxiv.org/abs/2201.03514 db/journals/corr/corr2201.html#abs-2201-03514
Tianxiang Sun Xiangyang Liu Wei Zhu 0016 Zhichao Geng Lingling Wu Yilong He Yuan Ni Guotong Xie Xuanjing Huang 0001 Xipeng Qiu A Simple Hash-Based Early Exiting Approach For Language Understanding and Generation. 2022 abs/2203.01670 CoRR https://doi.org/10.48550/arXiv.2203.01670 db/journals/corr/corr2203.html#abs-2203-01670
Tianxiang Sun Zhengfu He Hong Qian Xuanjing Huang 0001 Xipeng Qiu BBTv2: Pure Black-Box Optimization Can Be Comparable to Gradient Descent for Few-Shot Learning. 2022 abs/2205.11200 CoRR https://doi.org/10.48550/arXiv.2205.11200 db/journals/corr/corr2205.html#abs-2205-11200
Tianxiang Sun Zhengfu He Qin Zhu Xipeng Qiu Xuanjing Huang 0001 Multi-Task Pre-Training of Modular Prompt for Few-Shot Learning. 2022 abs/2210.07565 CoRR https://doi.org/10.48550/arXiv.2210.07565 db/journals/corr/corr2210.html#abs-2210-07565
Tianxiang Sun Junliang He Xipeng Qiu Xuanjing Huang 0001 BERTScore is Unfair: On Social Bias in Language Model-Based Metrics for Text Generation. 2022 abs/2210.07626 CoRR https://doi.org/10.48550/arXiv.2210.07626 db/journals/corr/corr2210.html#abs-2210-07626
Xiangyang Liu Tianxiang Sun Xuanjing Huang 0001 Xipeng Qiu Late Prompt Tuning: A Late Prompt Could Be Better Than Many Prompts. 2022 abs/2210.11292 CoRR https://doi.org/10.48550/arXiv.2210.11292 db/journals/corr/corr2210.html#abs-2210-11292
Zhengfu He Tianxiang Sun Kuanning Wang Xuanjing Huang 0001 Xipeng Qiu DiffusionBERT: Improving Generative Masked Language Models with Diffusion Models. 2022 abs/2211.15029 CoRR https://doi.org/10.48550/arXiv.2211.15029 db/journals/corr/corr2211.html#abs-2211-15029
Junqi Dai Hang Yan 0001 Tianxiang Sun Pengfei Liu 0003 Xipeng Qiu Does syntax matter? A strong baseline for Aspect-based Sentiment Analysis with RoBERTa. 2021 abs/2104.04986 CoRR https://arxiv.org/abs/2104.04986 db/journals/corr/corr2104.html#abs-2104-04986
Tianxiang Sun Yunhua Zhou Xiangyang Liu Xinyu Zhang 0018 Hao Jiang Zhao Cao Xuanjing Huang 0001 Xipeng Qiu Early Exiting with Ensemble Internal Classifiers. 2021 abs/2105.13792 CoRR https://arxiv.org/abs/2105.13792 db/journals/corr/corr2105.html#abs-2105-13792
Xiaonan Li Yunfan Shao Tianxiang Sun Hang Yan 0001 Xipeng Qiu Xuanjing Huang 0001 Accelerating BERT Inference for Sequence Labeling via Early-Exit. 2021 abs/2105.13878 CoRR https://arxiv.org/abs/2105.13878 db/journals/corr/corr2105.html#abs-2105-13878
Yitao Liu Tianxiang Sun Xipeng Qiu Xuanjing Huang 0001 Learning to Teach with Student Feedback. 2021 abs/2109.04641 CoRR https://arxiv.org/abs/2109.04641 db/journals/corr/corr2109.html#abs-2109-04641
Tianxiang Sun Xiangyang Liu Xipeng Qiu Xuanjing Huang 0001 Paradigm Shift in Natural Language Processing. 2021 abs/2109.12575 CoRR https://arxiv.org/abs/2109.12575 db/journals/corr/corr2109.html#abs-2109-12575
Xiangyang Liu Tianxiang Sun Junliang He Lingling Wu Xinyu Zhang 0018 Hao Jiang Zhao Cao Xuanjing Huang 0001 Xipeng Qiu Towards Efficient NLP: A Standard Evaluation and A Strong Baseline. 2021 abs/2110.07038 CoRR https://arxiv.org/abs/2110.07038 db/journals/corr/corr2110.html#abs-2110-07038
Xipeng Qiu Tianxiang Sun Yige Xu 0001 Yunfan Shao Ning Dai Xuanjing Huang 0001 Pre-trained Models for Natural Language Processing: A Survey. 2020 abs/2003.08271 CoRR https://arxiv.org/abs/2003.08271 db/journals/corr/corr2003.html#abs-2003-08271
Tianxiang Sun Yunfan Shao Xipeng Qiu Qipeng Guo Yaru Hu Xuanjing Huang 0001 Zheng Zhang 0001 CoLAKE: Contextualized Language and Knowledge Embedding. 2020 abs/2010.00309 CoRR https://arxiv.org/abs/2010.00309 db/journals/corr/corr2010.html#abs-2010-00309
Tianxiang Sun Yunfan Shao Xiaonan Li Pengfei Liu 0003 Hang Yan 0001 Xipeng Qiu Xuanjing Huang 0001 Learning Sparse Sharing Architectures for Multiple Tasks. 2019 abs/1911.05034 CoRR http://arxiv.org/abs/1911.05034 db/journals/corr/corr1911.html#abs-1911-05034