Xiaodan Liang
This is just a disambiguation page, and is not intended to be the bibliography of an actual person. Any publication listed on this page has not been assigned to an actual author yet. If you know the true author of one of the publications listed below, you are welcome to contact us.
2020 – today
- 2024
- [j67]Xiaojun Wang, Zichen Lou, Xiaodan Liang:
Optimal operation of integrated electricity and gas networks with risk analysis using downside risk constraints method. Comput. Chem. Eng. 184: 108641 (2024) - [j66]Linfeng Li, Weixing Su, Fang Liu, Maowei He, Xiaodan Liang:
Multi-scale adaptive networks for efficient inference. Int. J. Mach. Learn. Cybern. 15(2): 267-282 (2024) - [j65]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS With Block-Wise Supervisions. IEEE Trans. Pattern Anal. Mach. Intell. 46(5): 2722-2740 (2024) - [j64]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 46(12): 8534-8548 (2024) - [j63]Hanlin Zhang, Shuai Lin, Weiyang Liu, Pan Zhou, Jian Tang, Xiaodan Liang, Eric P. Xing:
Iterative Graph Self-Distillation. IEEE Trans. Knowl. Data Eng. 36(3): 1161-1169 (2024) - [j62]Shuai Lin, Chen Liu, Pan Zhou, Zi-Yuan Hu, Shuojia Wang, Ruihui Zhao, Yefeng Zheng, Liang Lin, Eric P. Xing, Xiaodan Liang:
Prototypical Graph Contrastive Learning. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2747-2758 (2024) - [j61]Jinghui Qin, Zhicheng Yang, Jiaqi Chen, Xiaodan Liang, Liang Lin:
Template-Based Contrastive Distillation Pretraining for Math Word Problem Solving. IEEE Trans. Neural Networks Learn. Syst. 35(9): 12823-12835 (2024) - [j60]Yanxin Long, Jianhua Han, Runhui Huang, Hang Xu, Yi Zhu, Chunjing Xu, Xiaodan Liang:
Fine-Grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. IEEE Trans. Neural Networks Learn. Syst. 35(11): 16277-16287 (2024) - [c238]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-Aware Generalizable Neural Radiance Fields for Interacting Hands. AAAI 2024: 2400-2408 - [c237]Hanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation. AAAI 2024: 3046-3054 - [c236]Luoyang Lin, Zutao Jiang, Xiaodan Liang, Liqian Ma, Michael C. Kampffmeyer, Xiaochun Cao:
PTUS: Photo-Realistic Talking Upper-Body Synthesis via 3D-Aware Motion Decomposition Warping. AAAI 2024: 3441-3449 - [c235]Zhenyu Xie, Yang Wu, Xuehao Gao, Zhongqian Sun, Wei Yang, Xiaodan Liang:
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model. AAAI 2024: 6252-6260 - [c234]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. ACL (Findings) 2024: 7160-7174 - [c233]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee Kenneth Wong:
MapGPT: Map-Guided Prompting with Adaptive Path Planning for Vision-and-Language Navigation. ACL (1) 2024: 9796-9810 - [c232]Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song:
CLOMO: Counterfactual Logical Modification with Large Language Models. ACL (1) 2024: 11012-11034 - [c231]Qingxing Cao, Junhao Cheng, Xiaodan Liang, Liang Lin:
VisDiaHalBench: A Visual Dialogue Benchmark For Diagnosing Hallucination in Large Vision-Language Models. ACL (1) 2024: 12161-12176 - [c230]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
CorNav: Autonomous Agent with Self-Corrected Planning for Zero-Shot Vision-and-Language Navigation. ACL (Findings) 2024: 12538-12559 - [c229]Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu:
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation. CIKM 2024: 900-909 - [c228]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-Vocabulary Object Detection. CVPR 2024: 5610-5619 - [c227]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CVPR 2024: 13668-13677 - [c226]Sihao Lin, Pumeng Lyu, Dongrui Liu, Tao Tang, Xiaodan Liang, Andy Song, Xiaojun Chang:
MLP Can Be a Good Transformer Learner. CVPR 2024: 19489-19498 - [c225]Tang Tao, Guangrun Wang, Yixing Lao, Peng Chen, Jie Liu, Liang Lin, Kaicheng Yu, Xiaodan Liang:
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis. CVPR 2024: 21230-21240 - [c224]Zhijian Huang, Tao Tang, Shaoxiang Chen, Sihao Lin, Zequn Jie, Lin Ma, Guangrun Wang, Xiaodan Liang:
Making Large Language Models Better Planners with Reasoning-Decision Alignment. ECCV (36) 2024: 73-90 - [c223]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-Guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. ECCV (76) 2024: 144-160 - [c222]Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
GarmentAligner: Text-to-Garment Generation via Retrieval-Augmented Multi-level Corrections. ECCV (25) 2024: 148-164 - [c221]Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. ECCV (43) 2024: 162-180 - [c220]Zhicheng Yang, Yinya Huang, Jing Xiong, Liang Feng, Xiaodan Liang, Yiwei Wang, Jing Tang:
AlignedCoT: Prompting Large Language Models via Native-Speaking Demonstrations. EMNLP (Findings) 2024: 2857-2896 - [c219]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. ICLR 2024 - [c218]Renjie Pi, Lewei Yao, Jianhua Han, Xiaodan Liang, Wei Zhang, Hang Xu:
Ins-DetCLIP: Aligning Detection Model to Follow Human-Language Instruction. ICLR 2024 - [c217]Haiming Wang, Huajian Xin, Chuanyang Zheng, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. ICLR 2024 - [c216]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. ICLR 2024 - [c215]Tang Tao, Longfei Gao, Guangrun Wang, Yixing Lao, Peng Chen, Hengshuang Zhao, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu:
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields. ACM Multimedia 2024: 390-398 - [c214]Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. ACM Multimedia 2024: 10784-10793 - [c213]Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang:
ATG: Benchmarking Automated Theorem Generation for Generative Language Models. NAACL-HLT (Findings) 2024: 4465-4480 - [i276]Xuan Huang, Hanhui Li, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
3D Visibility-aware Generalizable Neural Radiance Fields for Interacting Hands. CoRR abs/2401.00979 (2024) - [i275]Xinpeng Ding, Jianhua Han, Hang Xu, Xiaodan Liang, Wei Zhang, Xiaomeng Li:
Holistic Autonomous Driving Understanding by Bird's-Eye-View Injected Multi-Modal Large Models. CoRR abs/2401.00988 (2024) - [i274]Jiaqi Chen, Bingqian Lin, Ran Xu, Zhenhua Chai, Xiaodan Liang, Kwan-Yee K. Wong:
MapGPT: Map-Guided Prompting for Unified Vision-and-Language Navigation. CoRR abs/2401.07314 (2024) - [i273]Yinya Huang, Xiaohan Lin, Zhengying Liu, Qingxing Cao, Huajian Xin, Haiming Wang, Zhenguo Li, Linqi Song, Xiaodan Liang:
MUSTARD: Mastering Uniform Synthesis of Theorem and Proof Data. CoRR abs/2402.08957 (2024) - [i272]Tao Tang, Guangrun Wang, Yixing Lao, Peng Chen, Jie Liu, Liang Lin, Kaicheng Yu, Xiaodan Liang:
AlignMiF: Geometry-Aligned Multimodal Implicit Field for LiDAR-Camera Joint Synthesis. CoRR abs/2402.17483 (2024) - [i271]Guangrun Wang, Changlin Li, Liuchun Yuan, Jiefeng Peng, Xiaoyu Xian, Xiaodan Liang, Xiaojun Chang, Liang Lin:
DNA Family: Boosting Weight-Sharing NAS with Block-Wise Supervisions. CoRR abs/2403.01326 (2024) - [i270]Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Lin:
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning. CoRR abs/2403.05770 (2024) - [i269]Bingqian Lin, Yunshuang Nie, Ziming Wei, Jiaqi Chen, Shikui Ma, Jianhua Han, Hang Xu, Xiaojun Chang, Xiaodan Liang:
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning. CoRR abs/2403.07376 (2024) - [i268]Zicheng Zhang, Tong Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Qixiang Ye, Wei Ke:
Language-Driven Visual Consensus for Zero-Shot Semantic Segmentation. CoRR abs/2403.08426 (2024) - [i267]Minbin Huang, Yanxin Long, Xinchi Deng, Ruihang Chu, Jiangfeng Xiong, Xiaodan Liang, Hong Cheng, Qinglin Lu, Wei Liu:
DialogGen: Multi-modal Interactive Dialogue System for Multi-turn Text-to-Image Generation. CoRR abs/2403.08857 (2024) - [i266]Runhui Huang, Kaixin Cai, Jianhua Han, Xiaodan Liang, Renjing Pei, Guansong Lu, Songcen Xu, Wei Zhang, Hang Xu:
LayerDiff: Exploring Text-guided Multi-layered Composable Image Synthesis via Layer-Collaborative Diffusion Model. CoRR abs/2403.11929 (2024) - [i265]Sihao Lin, Pumeng Lyu, Dongrui Liu, Tao Tang, Xiaodan Liang, Andy Song, Xiaojun Chang:
MLP Can Be A Good Transformer Learner. CoRR abs/2404.05657 (2024) - [i264]Lewei Yao, Renjie Pi, Jianhua Han, Xiaodan Liang, Hang Xu, Wei Zhang, Zhenguo Li, Dan Xu:
DetCLIPv3: Towards Versatile Generative Open-vocabulary Object Detection. CoRR abs/2404.09216 (2024) - [i263]Jiehui Huang, Xiao Dong, Wenhui Song, Hanhui Li, Jun Zhou, Yuhao Cheng, Shutao Liao, Long Chen, Yiqiang Yan, Shengcai Liao, Xiaodan Liang:
ConsistentID: Portrait Generation with Multimodal Fine-Grained Identity Preserving. CoRR abs/2404.16771 (2024) - [i262]Junhao Cheng, Baiqiao Yin, Kaixin Cai, Minbin Huang, Hanhui Li, Yuxin He, Xi Lu, Yue Li, Yifei Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
TheaterGen: Character Management with LLM for Consistent Multi-turn Image Generation. CoRR abs/2404.18919 (2024) - [i261]Xujie Zhang, Ente Lin, Xiu Li, Yuxuan Luo, Michael Kampffmeyer, Xin Dong, Xiaodan Liang:
MMTryon: Multi-Modal Multi-Reference Control for High-Quality Fashion Generation. CoRR abs/2405.00448 (2024) - [i260]Xiaohan Lin, Qingxing Cao, Yinya Huang, Zhicheng Yang, Zhengying Liu, Zhenguo Li, Xiaodan Liang:
ATG: Benchmarking Automated Theorem Generation for Generative Language Models. CoRR abs/2405.06677 (2024) - [i259]Siyu Lou, Yuntian Chen, Xiaodan Liang, Liang Lin, Quanshi Zhang:
Quantifying In-Context Reasoning Effects and Memorization Effects in LLMs. CoRR abs/2405.11880 (2024) - [i258]Huajian Xin, Daya Guo, Zhihong Shao, Zhizhou Ren, Qihao Zhu, Bo Liu, Chong Ruan, Wenda Li, Xiaodan Liang:
DeepSeek-Prover: Advancing Theorem Proving in LLMs through Large-Scale Synthetic Data. CoRR abs/2405.14333 (2024) - [i257]Haiming Wang, Huajian Xin, Zhengying Liu, Wenda Li, Yinya Huang, Jianqiao Lu, Zhicheng Yang, Jing Tang, Jian Yin, Zhenguo Li, Xiaodan Liang:
Proving Theorems Recursively. CoRR abs/2405.14414 (2024) - [i256]Jian Zhao, Lei Jin, Jianshu Li, Zheng Zhu, Yinglei Teng, Jiaojiao Zhao, Sadaf Gulshad, Zheng Wang, Bo Zhao, Xiangbo Shu, Yunchao Wei, Xuecheng Nie, Xiaojie Jin, Xiaodan Liang, Shin'ichi Satoh, Yandong Guo, Cewu Lu, Junliang Xing, Jane Shengmei Shen:
The SkatingVerse Workshop & Challenge: Methods and Results. CoRR abs/2405.17188 (2024) - [i255]Jun Zheng, Fuwei Zhao, Youjiang Xu, Xin Dong, Xiaodan Liang:
VITON-DiT: Learning In-the-Wild Video Try-On from Human Dance Videos via Diffusion Transformers. CoRR abs/2405.18326 (2024) - [i254]Bingqian Lin, Yunshuang Nie, Ziming Wei, Yi Zhu, Hang Xu, Shikui Ma, Jianzhuang Liu, Xiaodan Liang:
Correctable Landmark Discovery via Large Models for Vision-Language Navigation. CoRR abs/2405.18721 (2024) - [i253]Meng Cao, Haoran Tang, Jinfa Huang, Peng Jin, Can Zhang, Ruyang Liu, Long Chen, Xiaodan Liang, Li Yuan, Ge Li:
RAP: Efficient Text-Video Retrieval with Sparse-and-Correlated Adapter. CoRR abs/2405.19465 (2024) - [i252]Junhao Cheng, Xi Lu, Hanhui Li, Khun Loun Zai, Baiqiao Yin, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation. CoRR abs/2406.01388 (2024) - [i251]Lijun Zhou, Tao Tang, Pengkun Hao, Zihang He, Kalok Ho, Shuo Gu, Wenbo Hou, Zhihui Hao, Haiyang Sun, Kun Zhan, Peng Jia, Xianpeng Lang, Xiaodan Liang:
UA-Track: Uncertainty-Aware End-to-End 3D Multi-Object Tracking. CoRR abs/2406.02147 (2024) - [i250]Gexin Huang, Chenfei Wu, Mingjie Li, Xiaojun Chang, Ling Chen, Ying Sun, Shen Zhao, Xiaodan Liang, Liang Lin:
Predicting Genetic Mutation from Whole Slide Images via Biomedical-Linguistic Knowledge Enhanced Multi-label Classification. CoRR abs/2406.02990 (2024) - [i249]Xiaohan Lin, Qingxing Cao, Yinya Huang, Haiming Wang, Jianqiao Lu, Zhengying Liu, Linqi Song, Xiaodan Liang:
FVEL: Interactive Formal Verification Environment with Large Language Models via Theorem Proving. CoRR abs/2406.14408 (2024) - [i248]Sukmin Yun, Haokun Lin, Rusiru Thushara, Mohammad Qazim Bhat, Yongxin Wang, Zutao Jiang, Mingkai Deng, Jinhong Wang, Tianhua Tao, Junbo Li, Haonan Li, Preslav Nakov, Timothy Baldwin, Zhengzhong Liu, Eric P. Xing, Xiaodan Liang, Zhiqiang Shen:
Web2Code: A Large-scale Webpage-to-Code Dataset and Evaluation Framework for Multimodal LLMs. CoRR abs/2406.20098 (2024) - [i247]Jiaqi Chen, Bingqian Lin, Xinmin Liu, Xiaodan Liang, Kwan-Yee K. Wong:
Affordances-Oriented Planning using Foundation Models for Continuous Vision-Language Navigation. CoRR abs/2407.05890 (2024) - [i246]Guian Fang, Wenbiao Yan, Yuanfan Guo, Jianhua Han, Zutao Jiang, Hang Xu, Shengcai Liao, Xiaodan Liang:
HumanRefiner: Benchmarking Abnormal Human Generation and Refining with Coarse-to-fine Pose-Reversible Guidance. CoRR abs/2407.06937 (2024) - [i245]Hao Wang, Pengzhen Ren, Zequn Jie, Xiao Dong, Chengjian Feng, Yinlong Qian, Lin Ma, Dongmei Jiang, Yaowei Wang, Xiangyuan Lan, Xiaodan Liang:
OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion. CoRR abs/2407.07844 (2024) - [i244]Runhui Huang, Xinpeng Ding, Chunwei Wang, Jianhua Han, Yulong Liu, Hengshuang Zhao, Hang Xu, Lu Hou, Wei Zhang, Xiaodan Liang:
HiRes-LLaVA: Restoring Fragmentation Input in High-Resolution Large Vision-Language Models. CoRR abs/2407.08706 (2024) - [i243]Zhicheng Yang, Yinya Huang, Wei Shi, Liang Feng, Linqi Song, Yiwei Wang, Xiaodan Liang, Jing Tang:
Benchmarking LLMs for Optimization Modeling and Enhancing Reasoning via Reverse Socratic Synthesis. CoRR abs/2407.09887 (2024) - [i242]Mingjie Li, Haokun Lin, Liang Qiu, Xiaodan Liang, Ling Chen, Abdulmotaleb Elsaddik, Xiaojun Chang:
Contrastive Learning with Counterfactual Explanations for Radiology Report Generation. CoRR abs/2407.14474 (2024) - [i241]Zheng Chong, Xiao Dong, Haoxiang Li, Shiyue Zhang, Wenqing Zhang, Xujie Zhang, Hanqing Zhao, Xiaodan Liang:
CatVTON: Concatenation Is All You Need for Virtual Try-On with Diffusion Models. CoRR abs/2407.15886 (2024) - [i240]Zhenyu Xie, Haoye Dong, Yufei Gao, Zehua Ma, Xiaodan Liang:
DreamVTON: Customizing 3D Virtual Try-on with Personalized Diffusion Models. CoRR abs/2407.16511 (2024) - [i239]Yuxuan Hu, Minghuan Tan, Chenwei Zhang, Zixuan Li, Xiaodan Liang, Min Yang, Chengming Li, Xiping Hu:
APTNESS: Incorporating Appraisal Theory and Emotion Support Strategies for Empathetic Response Generation. CoRR abs/2407.21048 (2024) - [i238]Jiasong Feng, Ao Ma, Jing Wang, Bo Cheng, Xiaodan Liang, Dawei Leng, Yuhui Yin:
FancyVideo: Towards Dynamic and Consistent Video Generation via Cross-frame Textual Guidance. CoRR abs/2408.08189 (2024) - [i237]Haoran Tang, Meng Cao, Jinfa Huang, Ruyang Liu, Peng Jin, Ge Li, Xiaodan Liang:
MUSE: Mamba is Efficient Multi-scale Learner for Text-video Retrieval. CoRR abs/2408.10575 (2024) - [i236]Zhiqiang Wang, Hao Zheng, Yunshuang Nie, Wenjun Xu, Qingwei Wang, Hua Ye, Zhe Li, Kaidong Zhang, Xuewen Cheng, Wanxi Dong, Chang Cai, Liang Lin, Feng Zheng, Xiaodan Liang:
All Robots in One: A New Standard and Unified Dataset for Versatile, General-Purpose Embodied Agents. CoRR abs/2408.10899 (2024) - [i235]Shiyue Zhang, Zheng Chong, Xujie Zhang, Hanhui Li, Yuhao Cheng, Yiqiang Yan, Xiaodan Liang:
GarmentAligner: Text-to-Garment Generation via Retrieval-augmented Multi-level Corrections. CoRR abs/2408.12352 (2024) - [i234]Cong Wang, Jiaxi Gu, Panwen Hu, Haoyu Zhao, Yuanfan Guo, Jianhua Han, Hang Xu, Xiaodan Liang:
EasyControl: Transfer ControlNet to Video Diffusion for Controllable Generation and Interpolation. CoRR abs/2408.13005 (2024) - [i233]Zhijian Huang, Tao Tang, Shaoxiang Chen, Sihao Lin, Zequn Jie, Lin Ma, Guangrun Wang, Xiaodan Liang:
Making Large Language Models Better Planners with Reasoning-Decision Alignment. CoRR abs/2408.13890 (2024) - [i232]Jing Wang, Ao Ma, Jiasong Feng, Dawei Leng, Yuhui Yin, Xiaodan Liang:
Qihoo-T2X: An Efficiency-Focused Diffusion Transformer via Proxy Tokens for Text-to-Any-Task. CoRR abs/2409.04005 (2024) - [i231]Sanoojan Baliah, Qinliang Lin, Shengcai Liao, Xiaodan Liang, Muhammad Haris Khan:
Realistic and Efficient Face Swapping: A Unified Approach with Diffusion Models. CoRR abs/2409.07269 (2024) - [i230]Yuxuan Hu, Chenwei Zhang, Min Yang, Xiaodan Liang, Chengming Li, Xiping Hu:
Learning to Generalize Unseen Domains via Multi-Source Meta Learning for Text Classification. CoRR abs/2409.13787 (2024) - [i229]Changlin Li, Jiawei Zhang, Sihao Lin, Zongxin Yang, Junwei Liang, Xiaodan Liang, Xiaojun Chang:
Efficient Training of Large Vision Models via Advanced Automated Progressive Learning. CoRR abs/2410.00350 (2024) - [i228]Zixuan Li, Jing Xiong, Fanghua Ye, Chuanyang Zheng, Xun Wu, Jianqiao Lu, Zhongwei Wan, Xiaodan Liang, Chengming Li, Zhenan Sun, Lingpeng Kong, Ngai Wong:
UncertaintyRAG: Span-Level Uncertainty Enhanced Long-Context Modeling for Retrieval-Augmented Generation. CoRR abs/2410.02719 (2024) - [i227]Xuan Huang, Hanhui Li, Wanquan Liu, Xiaodan Liang, Yiqiang Yan, Yuhao Cheng, Chengqiang Gao:
Learning Interaction-aware 3D Gaussian Splatting for One-shot Hand Avatars. CoRR abs/2410.08840 (2024) - [i226]Kaidong Zhang, Pengzhen Ren, Bingqian Lin, Junfan Lin, Shikui Ma, Hang Xu, Xiaodan Liang:
PIVOT-R: Primitive-Driven Waypoint-Aware World Model for Robotic Manipulation. CoRR abs/2410.10394 (2024) - [i225]Jianqi Chen, Panwen Hu, Xiaojun Chang, Zhenwei Shi, Michael Christian Kampffmeyer, Xiaodan Liang:
Sitcom-Crafter: A Plot-Driven Human Motion Generation System in 3D Scenes. CoRR abs/2410.10790 (2024)
- 2023
- [j59]Qiuyan Wang, Xiaodan Liang, Rize Jin, Yang Yan:
Applications of Strongly Regular Cayley Graphs to Codebooks. IEEE Access 11: 106980-106986 (2023) - [j58]Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Xiaodan Liang, Zhihui Li, Xiaojun Chang, Yaodong Yang:
MARLlib: A Scalable and Efficient Multi-agent Reinforcement Learning Library. J. Mach. Learn. Res. 24: 315:1-315:23 (2023) - [j57]Hang Chen, Bowei Cao, Jiangcun Yang, He Ren, Xingqiu Xia, Xiaowen Zhang, Wei Yan, Xiaodan Liang, Chen Li:
Construction and effect evaluation of prediction model for red blood cell transfusion requirement in cesarean section based on artificial intelligence. BMC Medical Informatics Decis. Mak. 23(1): 213 (2023) - [j56]Linfeng Li, Weixing Su, Fang Liu, Maowei He, Xiaodan Liang:
Knowledge Fusion Distillation: Improving Distillation with Multi-scale Attention Mechanisms. Neural Process. Lett. 55(5): 6165-6180 (2023) - [j55]Boyu Yang, Mingbao Lin, Yunxiao Zhang, Binghao Liu, Xiaodan Liang, Rongrong Ji, Qixiang Ye:
Dynamic Support Network for Few-Shot Class Incremental Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(3): 2945-2951 (2023) - [j54]Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang:
DS-Net++: Dynamic Weight Slicing for Efficient Inference in CNNs and Vision Transformers. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 4430-4446 (2023) - [j53]Yinya Huang, Lemao Liu, Kun Xu, Meng Fang, Liang Lin, Xiaodan Liang:
Discourse-Aware Graph Networks for Textual Logical Reasoning. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 11668-11688 (2023) - [j52]Bingqian Lin, Yanxin Long, Yi Zhu, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Liang Lin:
Towards Deviation-Robust Agent Navigation via Perturbation-Aware Contrastive Learning. IEEE Trans. Pattern Anal. Mach. Intell. 45(10): 12535-12549 (2023) - [j51]Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan Liang:
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-Level Product Retrieval. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13117-13133 (2023) - [j50]Junfan Lin, Keze Wang, Ziliang Chen, Xiaodan Liang, Liang Lin:
Towards Causality-Aware Inferring: A Sequential Discriminative Approach for Medical Diagnosis. IEEE Trans. Pattern Anal. Mach. Intell. 45(11): 13363-13375 (2023) - [j49]Dapeng Feng, Songfang Han, Hang Xu, Xiaodan Liang, Xiaojun Tan:
Point-Guided Contrastive Learning for Monocular 3-D Object Detection. IEEE Trans. Cybern. 53(2): 954-966 (2023) - [j48]Xiao Dong, Gengwei Zhang, Xunlin Zhan, Yi Ding, Yunchao Wei, Minlong Lu, Xiaodan Liang:
Caption-Aided Product Detection via Collaborative Pseudo-Label Harmonization. IEEE Trans. Multim. 25: 1916-1927 (2023) - [j47]Mingjie Li, Rui Liu, Fuyu Wang, Xiaojun Chang, Xiaodan Liang:
Auxiliary signal-guided knowledge encoder-decoder for medical report generation. World Wide Web (WWW) 26(1): 253-270 (2023) - [c212]Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang:
NLIP: Noise-Robust Language-Image Pre-training. AAAI 2023: 926-934 - [c211]Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu:
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation. AAAI 2023: 1051-1059 - [c210]Bingqian Lin, Yi Zhu, Xiaodan Liang, Liang Lin, Jianzhuang Liu:
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation. AAAI 2023: 1568-1576 - [c209]Haiming Wang, Ye Yuan, Zhengying Liu, Jianhao Shen, Yichun Yin, Jing Xiong, Enze Xie, Han Shi, Yujun Li, Lin Li, Jian Yin, Zhenguo Li, Xiaodan Liang:
DT-Solver: Automated Theorem Proving with Dynamic-Tree Sampling Guided by Proof-level Value Function. ACL (1) 2023: 12632-12646 - [c208]Shida Chen, Xiaodan Liang, Pan Zhao:
Application of Intelligent Mobile Terminal in Virtual Building Construction Training Teaching. ADHIP (2) 2023: 345-360 - [c207]Mengxue Qu, Yu Wu, Yunchao Wei, Wu Liu, Xiaodan Liang, Yao Zhao:
Learning to Segment Every Referring Object Point by Point. CVPR 2023: 3021-3030 - [c206]Kaicheng Yu, Tang Tao, Hongwei Xie, Zhiwei Lin, Tingting Liang, Bing Wang, Peng Chen, Dayang Hao, Yongtao Wang, Xiaodan Liang:
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection. CVPR Workshops 2023: 3188-3198 - [c205]Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang:
Dynamic Graph Enhanced Contrastive Learning for Chest X-Ray Report Generation. CVPR 2023: 3334-3343 - [c204]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CVPR 2023: 9611-9621 - [c203]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CVPR 2023: 15233-15243 - [c202]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CVPR 2023: 15244-15253 - [c201]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CVPR 2023: 23497-23506 - [c200]Zhenyu Xie, Zaiyu Huang, Xin Dong, Fuwei Zhao, Haoye Dong, Xijin Zhang, Feida Zhu, Xiaodan Liang:
GP-VTON: Towards General Purpose Virtual Try-On via Collaborative Local-Flow Global-Parsing Learning. CVPR 2023: 23550-23559 - [c199]Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu:
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models. EMNLP 2023: 11594-11632 - [c198]Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu:
Composable Text Controls in Latent Space with ODEs. EMNLP 2023: 16543-16570 - [c197]Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang:
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation. ICCV 2023: 1196-1205 - [c196]Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang:
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration. ICCV 2023: 3479-3488 - [c195]Haoyuan Li, Haoye Dong, Hanchao Jia, Dong Huang, Michael C. Kampffmeyer, Liang Lin, Xiaodan Liang:
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos. ICCV 2023: 8710-8719 - [c194]Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu:
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images. ICCV 2023: 15280-15291 - [c193]Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu:
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability. ICCV 2023: 15667-15677 - [c192]Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James T. Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang:
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training. ICCV 2023: 22121-22132 - [c191]Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao:
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation. ICCV 2023: 22200-22210 - [c190]Binbin Yang, Yi Luo, Ziliang Chen, Guangrun Wang, Xiaodan Liang, Liang Lin:
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts. ICCV 2023: 22612-22622 - [c189]Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang:
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment. ICCV 2023: 23097-23106 - [c188]Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Weizhong Zhang, Xiaodan Liang, Zhenguo Li, Lingpeng Kong:
Self-Guided Noise-Free Data Generation for Efficient Zero-Shot Learning. ICLR 2023 - [c187]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. ICLR 2023 - [c186]Fengda Zhu, Vincent CS Lee, Xiaojun Chang, Xiaodan Liang:
Vision Language Navigation with Knowledge-driven Environmental Dreamer. IJCAI 2023: 1840-1848 - [c185]Mengxue Qu, Yu Wu, Wu Liu, Xiaodan Liang, Jingkuan Song, Yao Zhao, Yunchao Wei:
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments. NeurIPS 2023 - [c184]Liucun Lu, Jinghui Qin, Zequn Jie, Lin Ma, Liang Lin, Xiaodan Liang:
RecFormer: Recurrent Multi-modal Transformer with History-Aware Contrastive Learning for Visual Dialog. PRCV (1) 2023: 159-171 - [i224]Bingqian Lin, Yi Zhu, Xiaodan Liang, Liang Lin, Jianzhuang Liu:
Actional Atomic-Concept Learning for Demystifying Vision-Language Navigation. CoRR abs/2302.06072 (2023) - [i223]Pengzhen Ren, Changlin Li, Hang Xu, Yi Zhu, Guangrun Wang, Jianzhuang Liu, Xiaojun Chang, Xiaodan Liang:
ViewCo: Discovering Text-Supervised Segmentation Masks via Multi-View Semantic Consistency. CoRR abs/2302.10307 (2023) - [i222]Xiwen Liang, Minzhe Niu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Visual Exemplar Driven Task-Prompting for Unified Perception in Autonomous Driving. CoRR abs/2303.01788 (2023) - [i221]Yanxin Long, Youpeng Wen, Jianhua Han, Hang Xu, Pengzhen Ren, Wei Zhang, Shen Zhao, Xiaodan Liang:
CapDet: Unifying Dense Captioning and Open-World Detection Pretraining. CoRR abs/2303.02489 (2023) - [i220]Mingjie Li, Bingqian Lin, Zicong Chen, Haokun Lin, Xiaodan Liang, Xiaojun Chang:
Dynamic Graph Enhanced Contrastive Learning for Chest X-ray Report Generation. CoRR abs/2303.10323 (2023) - [i219]Yihan Zeng, Chenhan Jiang, Jiageng Mao, Jianhua Han, Chaoqiang Ye, Qingqiu Huang, Dit-Yan Yeung, Zhen Yang, Xiaodan Liang, Hang Xu:
CLIP2: Contrastive Language-Image-Point Pretraining from Real-World Point Cloud Data. CoRR abs/2303.12417 (2023) - [i218]Zhenyu Xie, Zaiyu Huang, Xin Dong, Fuwei Zhao, Haoye Dong, Xijin Zhang, Feida Zhu, Xiaodan Liang:
GP-VTON: Towards General Purpose Virtual Try-on via Collaborative Local-Flow Global-Parsing Learning. CoRR abs/2303.13756 (2023) - [i217]Lewei Yao, Jianhua Han, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Hang Xu:
DetCLIPv2: Scalable Open-Vocabulary Object Detection Pre-training via Word-Region Alignment. CoRR abs/2304.04514 (2023) - [i216]Tang Tao, Longfei Gao, Guangrun Wang, Peng Chen, Dayang Hao, Xiaodan Liang, Mathieu Salzmann, Kaicheng Yu:
LiDAR-NeRF: Novel LiDAR View Synthesis via Neural Radiance Fields. CoRR abs/2304.10406 (2023) - [i215]Bingqian Lin, Zicong Chen, Mingjie Li, Haokun Lin, Hang Xu, Yi Zhu, Jianzhuang Liu, Wenjia Cai, Lei Yang, Shen Zhao, Chenfei Wu, Ling Chen, Xiaojun Chang, Yi Yang, Lei Xing, Xiaodan Liang:
Towards Medical Artificial General Intelligence via Knowledge-Enhanced Multimodal Pretraining. CoRR abs/2304.14204 (2023) - [i214]Haonan Wang, Minbin Huang, Runhui Huang, Lanqing Hong, Hang Xu, Tianyang Hu, Xiaodan Liang, Zhenguo Li:
Boosting Visual-Language Models by Exploiting Hard Samples. CoRR abs/2305.05208 (2023) - [i213]Guian Fang, Zutao Jiang, Jianhua Han, Guansong Lu, Hang Xu, Xiaodan Liang:
Boosting Text-to-Image Diffusion Models with Fine-Grained Semantic Rewards. CoRR abs/2305.19599 (2023) - [i212]Xiao Dong, Runhui Huang, Xiaoyong Wei, Zequn Jie, Jianxing Yu, Jian Yin, Xiaodan Liang:
UniDiff: Advancing Vision-Language Models with Generative and Discriminative Learning. CoRR abs/2306.00813 (2023) - [i211]Xiwen Liang, Liang Ma, Shanshan Guo, Jianhua Han, Hang Xu, Shikui Ma, Xiaodan Liang:
MO-VLN: A Multi-Task Benchmark for Open-set Zero-Shot Vision-and-Language Navigation. CoRR abs/2306.10322 (2023) - [i210]Pengzhen Ren, Kaidong Zhang, Hetao Zheng, Zixuan Li, Yuhang Wen, Fengda Zhu, Mas Ma, Xiaodan Liang:
RM-PRT: Realistic Robotic Manipulation Simulator and Benchmark with Progressive Reasoning Tasks. CoRR abs/2306.11335 (2023) - [i209]Zheng Chong, Xujie Zhang, Fuwei Zhao, Zhenyu Xie, Xiaodan Liang:
Fashion Matrix: Editing Photos by Just Talking. CoRR abs/2307.13240 (2023) - [i208]Zhijian Huang, Sihao Lin, Guiyu Liu, Mukun Luo, Chaoqiang Ye, Hang Xu, Xiaojun Chang, Xiaodan Liang:
FULLER: Unified Multi-modality Multi-task 3D Perception via Multi-level Gradient Calibration. CoRR abs/2307.16617 (2023) - [i207]Kaixin Cai, Pengzhen Ren, Yi Zhu, Hang Xu, Jianzhuang Liu, Changlin Li, Guangrun Wang, Xiaodan Liang:
MixReorg: Cross-Modal Mixed Patch Reorganization is a Good Mask Learner for Open-World Semantic Segmentation. CoRR abs/2308.04829 (2023) - [i206]Binbin Yang, Yi Luo, Ziliang Chen, Guangrun Wang, Xiaodan Liang, Liang Lin:
LAW-Diffusion: Complex Scene Generation by Diffusion with Layouts. CoRR abs/2308.06713 (2023) - [i205]Hongguang Zhu, Yunchao Wei, Xiaodan Liang, Chunjie Zhang, Yao Zhao:
CTP: Towards Vision-Language Continual Pretraining via Compatible Momentum Contrast and Topology Preservation. CoRR abs/2308.07146 (2023) - [i204]Runhui Huang, Jianhua Han, Guansong Lu, Xiaodan Liang, Yihan Zeng, Wei Zhang, Hang Xu:
DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability. CoRR abs/2308.09306 (2023) - [i203]Haoyuan Li, Haoye Dong, Hanchao Jia, Dong Huang, Michael C. Kampffmeyer, Liang Lin, Xiaodan Liang:
Coordinate Transformer: Achieving Single-stage Multi-person Mesh Recovery from Videos. CoRR abs/2308.10334 (2023) - [i202]Xujie Zhang, Binbin Yang, Michael C. Kampffmeyer, Wenqing Zhang, Shiyue Zhang, Guansong Lu, Liang Lin, Hang Xu, Xiaodan Liang:
DiffCloth: Diffusion Based Garment Synthesis and Manipulation via Structural Cross-modal Semantic Alignment. CoRR abs/2308.11206 (2023) - [i201]Xinchi Deng, Han Shi, Runhui Huang, Changlin Li, Hang Xu, Jianhua Han, James T. Kwok, Shen Zhao, Wei Zhang, Xiaodan Liang:
GrowCLIP: Data-aware Automatic Model Growing for Large-scale Contrastive Language-Image Pre-training. CoRR abs/2308.11331 (2023) - [i200]Cuican Yu, Guansong Lu, Yihan Zeng, Jian Sun, Xiaodan Liang, Huibin Li, Zongben Xu, Songcen Xu, Wei Zhang, Hang Xu:
Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images. CoRR abs/2308.16758 (2023) - [i199]Haiming Wang, Huajian Xin, Chuanyang Zheng, Lin Li, Zhengying Liu, Qingxing Cao, Yinya Huang, Jing Xiong, Han Shi, Enze Xie, Jian Yin, Zhenguo Li, Heng Liao, Xiaodan Liang:
LEGO-Prover: Neural Theorem Proving with Growing Libraries. CoRR abs/2310.00656 (2023) - [i198]Jing Xiong, Zixuan Li, Chuanyang Zheng, Zhijiang Guo, Yichun Yin, Enze Xie, Zhicheng Yang, Qingxing Cao, Haiming Wang, Xiongwei Han, Jing Tang, Chengming Li, Xiaodan Liang:
DQ-LoRe: Dual Queries with Low Rank Approximation Re-ranking for In-Context Learning. CoRR abs/2310.02954 (2023) - [i197]Jing Xiong, Jianhao Shen, Ye Yuan, Haiming Wang, Yichun Yin, Zhengying Liu, Lin Li, Zhijiang Guo, Qingxing Cao, Yinya Huang, Chuanyang Zheng, Xiaodan Liang, Ming Zhang, Qun Liu:
TRIGO: Benchmarking Formal Mathematical Proof Reduction for Generative Language Models. CoRR abs/2310.10180 (2023) - [i196]Mengxue Qu, Yu Wu, Wu Liu, Xiaodan Liang, Jingkuan Song, Yao Zhao, Yunchao Wei:
RIO: A Benchmark for Reasoning Intention-Oriented Objects in Open Environments. CoRR abs/2310.17290 (2023) - [i195]Zhicheng Yang, Yiwei Wang, Yinya Huang, Jing Xiong, Xiaodan Liang, Jing Tang:
Speak Like a Native: Prompting Large Language Models in a Native Style. CoRR abs/2311.13538 (2023) - [i194]Yinya Huang, Ruixin Hong, Hongming Zhang, Wei Shao, Zhicheng Yang, Dong Yu, Changshui Zhang, Xiaodan Liang, Linqi Song:
CLOMO: Counterfactual Logical Modification with Large Language Models. CoRR abs/2311.17438 (2023) - [i193]Cong Wang, Jiaxi Gu, Panwen Hu, Songcen Xu, Hang Xu, Xiaodan Liang:
DreamVideo: High-Fidelity Image-to-Video Generation with Image Retention and Text Guidance. CoRR abs/2312.03018 (2023) - [i192]Xujie Zhang, Xiu Li, Michael Kampffmeyer, Xin Dong, Zhenyu Xie, Feida Zhu, Haoye Dong, Xiaodan Liang:
WarpDiffusion: Efficient Diffusion Model for High-Fidelity Virtual Try-on. CoRR abs/2312.03667 (2023) - [i191]Zhenyu Xie, Yang Wu, Xuehao Gao, Zhongqian Sun, Wei Yang, Xiaodan Liang:
Towards Detailed Text-to-Motion Synthesis via Basic-to-Advanced Hierarchical Diffusion Model. CoRR abs/2312.10960 (2023) - [i190]Hanhui Li, Xiaojian Lin, Xuan Huang, Zejun Yang, Zhisheng Wang, Xiaodan Liang:
Monocular 3D Hand Mesh Recovery via Dual Noise Estimation. CoRR abs/2312.15916 (2023)
- 2022
- [j46]Weixing Su, Linfeng Li, Fang Liu, Maowei He, Xiaodan Liang:
AI on the edge: a comprehensive review. Artif. Intell. Rev. 55(8): 6125-6183 (2022) - [j45]Nanqing Dong, Michael Kampffmeyer, Xiaodan Liang, Min Xu, Irina Voiculescu, Eric P. Xing:
Towards robust partially supervised multi-structure medical image segmentation on small-scale data. Appl. Soft Comput. 114: 108074 (2022) - [j44]Xunlin Zhan, Yinya Huang, Xiao Dong, Qingxing Cao, Xiaodan Liang:
PathReasoner: Explainable reasoning paths for commonsense question answering. Knowl. Based Syst. 235: 107612 (2022) - [j43]Xunlin Zhan, Yuan Li, Xiao Dong, Xiaodan Liang, Zhiting Hu, Lawrence Carin:
elBERto: Self-supervised commonsense learning for question answering. Knowl. Based Syst. 258: 109964 (2022) - [j42]Xiaodan Liang, Siwen Xu, Yang Liu, Liling Sun:
A Modified Whale Optimization Algorithm and Its Application in Seismic Inversion Problem. Mob. Inf. Syst. 2022: 9159130:1-9159130:18 (2022) - [j41]Liang Lin, Yiming Gao, Ke Gong, Meng Wang, Xiaodan Liang:
Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer. IEEE Trans. Pattern Anal. Mach. Intell. 44(5): 2504-2518 (2022) - [j40]Bingqian Lin, Yi Zhu, Yanxin Long, Xiaodan Liang, Qixiang Ye, Liang Lin:
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 7175-7189 (2022) - [j39]Bingqian Lin, Yi Zhu, Xiaodan Liang:
Atom correlation based graph propagation for scene graph generation. Pattern Recognit. 122: 108300 (2022) - [j38]Fuyu Wang, Xiaodan Liang, Lin Xu, Liang Lin:
Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition. IEEE Trans. Cybern. 52(6): 5015-5025 (2022) - [j37]Yi Zhu, Xiwen Liang, Bingqian Lin, Qixiang Ye, Jianbin Jiao, Liang Lin, Xiaodan Liang:
Configurable Graph Reasoning for Visual Relationship Detection. IEEE Trans. Neural Networks Learn. Syst. 33(1): 117-129 (2022) - [j36]Qingxing Cao, Bailin Li, Xiaodan Liang, Keze Wang, Liang Lin:
Knowledge-Routed Visual Question Reasoning: Challenges for Deep Representation Embedding. IEEE Trans. Neural Networks Learn. Syst. 33(7): 2758-2767 (2022) - [c183]Jianhua Han, Xiajun Deng, Xinyue Cai, Zhen Yang, Hang Xu, Chunjing Xu, Xiaodan Liang:
Laneformer: Object-Aware Row-Column Transformers for Lane Detection. AAAI 2022: 799-807 - [c182]Xiwen Liang, Fengda Zhu, Yi Zhu, Bingqian Lin, Bing Wang, Xiaodan Liang:
Contrastive Instruction-Trajectory Learning for Vision-Language Navigation. AAAI 2022: 1592-1600 - [c181]Jiahui Gao, Hang Xu, Han Shi, Xiaozhe Ren, Philip L. H. Yu, Xiaodan Liang, Xin Jiang, Zhenguo Li:
AutoBERT-Zero: Evolving BERT Backbone from Scratch. AAAI 2022: 10663-10671 - [c180]Xiwen Liang, Fengda Zhu, Lingling Li, Hang Xu, Xiaodan Liang:
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration. ACL (1) 2022: 4837-4851 - [c179]Xin Dong, Fuwei Zhao, Zhenyu Xie, Xijin Zhang, Daniel K. Du, Min Zheng, Xiang Long, Xiaodan Liang, Jianchao Yang:
Dressing in the Wild by Watching Dance Videos. CVPR 2022: 3470-3479 - [c178]Chaojie Yang, Hanhui Li, Shengjie Wu, Shengkai Zhang, Haonan Yan, Nianhong Jiao, Jie Tang, Runnan Zhou, Xiaodan Liang, Tianxiang Zheng:
BodyGAN: General-purpose Controllable Neural Human Body Generation. CVPR 2022: 7723-7732 - [c177]Binbin Yang, Xinchi Deng, Han Shi, Changlin Li, Gengwei Zhang, Hang Xu, Shen Zhao, Liang Lin, Xiaodan Liang:
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism. CVPR 2022: 9245-9254 - [c176]Sihao Lin, Hongwei Xie, Bing Wang, Kaicheng Yu, Xiaojun Chang, Xiaodan Liang, Gang Wang:
Knowledge Distillation via the Target-aware Transformer. CVPR 2022: 10905-10914 - [c175]Minbin Huang, Zhijian Huang, Changlin Li, Xin Chen, Hang Xu, Zhenguo Li, Xiaodan Liang:
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search. CVPR 2022: 11871-11881 - [c174]Pengzhen Ren, Changlin Li, Guangrun Wang, Yun Xiao, Qing Du, Xiaodan Liang, Xiaojun Chang:
Beyond Fixation: Dynamic Window Visual Transformer. CVPR 2022: 11977-11987 - [c173]Changlin Li, Bohan Zhuang, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang:
Automated Progressive Learning for Efficient Training of Vision Transformers. CVPR 2022: 12476-12486 - [c172]Bingqian Lin, Yi Zhu, Zicong Chen, Xiwen Liang, Jianzhuang Liu, Xiaodan Liang:
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts. CVPR 2022: 15375-15385 - [c171]Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang:
Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation. CVPR 2022: 20624-20633 - [c170]Xiao Dong, Xunlin Zhan, Yangxin Wu, Yunchao Wei, Michael C. Kampffmeyer, Xiaoyong Wei, Minlong Lu, Yaowei Wang, Xiaodan Liang:
M5Product: Self-harmonized Contrastive Learning for E-commercial Multi-modal Pretraining. CVPR 2022: 21220-21230 - [c169]Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang:
Open-World Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding. ECCV (20) 2022: 275-292 - [c168]Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu:
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving. ECCV (38) 2022: 406-423 - [c167]Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei:
SiRi: A Simple Selective Retraining Mechanism for Transformer-Based Visual Grounding. ECCV (35) 2022: 546-562 - [c166]Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Liang Lin, Xiaodan Liang:
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning. EMNLP (Findings) 2022: 1-13 - [c165]Yi Cheng, Wenge Liu, Wenjie Li, Jiashuo Wang, Ruihui Zhao, Bang Liu, Xiaodan Liang, Yefeng Zheng:
Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning. EMNLP 2022: 3014-3026 - [c164]Jiaqi Chen, Tong Li, Jinghui Qin, Pan Lu, Liang Lin, Chongyu Chen, Xiaodan Liang:
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression. EMNLP 2022: 3313-3323 - [c163]Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu:
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure. EMNLP 2022: 4698-4724 - [c162]Yi Zhu, Zhaoqing Zhu, Bingqian Lin, Xiaodan Liang, Feng Zhao, Jianzhuang Liu:
RelCLIP: Adapting Language-Image Pretraining for Visual Relationship Detection via Relational Contrastive Learning. EMNLP 2022: 4800-4810 - [c161]Han Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok:
Revisiting Over-smoothing in BERT from the Perspective of Graph. ICLR 2022 - [c160]Lewei Yao, Runhui Huang, Lu Hou, Guansong Lu, Minzhe Niu, Hang Xu, Xiaodan Liang, Zhenguo Li, Xin Jiang, Chunjing Xu:
FILIP: Fine-grained Interactive Language-Image Pre-Training. ICLR 2022 - [c159]Siyi Hu, Chuanlong Xie, Xiaodan Liang, Xiaojun Chang:
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL. ICML 2022: 9041-9071 - [c158]Wenge Liu, Yi Cheng, Hao Wang, Jianheng Tang, Yafei Liu, Ruihui Zhao, Wenjie Li, Yefeng Zheng, Xiaodan Liang:
"My nose is running." "Are you also coughing?": Building A Medical Diagnosis Agent with Interpretable Inquiry Logics. IJCAI 2022: 4266-4272 - [c157]Xujie Zhang, Yu Sha, Michael C. Kampffmeyer, Zhenyu Xie, Zequn Jie, Chengwen Huang, Jianqing Peng, Xiaodan Liang:
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design. ACM Multimedia 2022: 4525-4535 - [c156]Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang:
Unbiased Math Word Problems Benchmark for Mitigating Solving Bias. NAACL-HLT (Findings) 2022: 1401-1408 - [c155]Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Junwei Bao, Zhen Li, Xiaodong He, Shuguang Cui, Zhiting Hu:
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation. NAACL-HLT 2022: 2055-2078 - [c154]Xipeng Chen, Guangrun Wang, Dizhong Zhu, Xiaodan Liang, Philip H. S. Torr, Liang Lin:
Structure-Preserving 3D Garment Modeling with Neural Sewing Machines. NeurIPS 2022 - [c153]Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Niu Minzhe, Xiaodan Liang, Lewei Yao, Runhui Huang, Wei Zhang, Xin Jiang, Chunjing Xu, Hang Xu:
Wukong: A 100 Million Large-scale Chinese Cross-modal Pre-training Benchmark. NeurIPS 2022 - [c152]Zaiyu Huang, Hanhui Li, Zhenyu Xie, Michael Kampffmeyer, Qingling Cai, Xiaodan Liang:
Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning. NeurIPS 2022 - [c151]Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving. NeurIPS 2022 - [c150]Lewei Yao, Jianhua Han, Youpeng Wen, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Chunjing Xu, Hang Xu:
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection. NeurIPS 2022 - [c149]Zicheng Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Wei Ke:
CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation. NeurIPS 2022 - [c148]Wenge Liu, Jianheng Tang, Yi Cheng, Wenjie Li, Yefeng Zheng, Xiaodan Liang:
MedDG: An Entity-Centric Medical Consultation Dataset for Entity-Aware Medical Dialogue Generation. NLPCC (1) 2022: 447-459 - [i189]Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang:
Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation. CoRR abs/2202.03680 (2022) - [i188]Jiaxi Gu, Xiaojun Meng, Guansong Lu, Lu Hou, Minzhe Niu, Hang Xu, Xiaodan Liang, Wei Zhang, Xin Jiang, Chunjing Xu:
Wukong: 100 Million Large-scale Chinese Cross-modal Pre-training Dataset and A Foundation Framework. CoRR abs/2202.06767 (2022) - [i187]Han Shi, Jiahui Gao, Hang Xu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong, Stephen M. S. Lee, James T. Kwok:
Revisiting Over-smoothing in BERT from the Perspective of Graph. CoRR abs/2202.08625 (2022) - [i186]Shervin Minaee, Xiaodan Liang, Shuicheng Yan:
Modern Augmented Reality: Applications, Trends, and Future Directions. CoRR abs/2202.09450 (2022) - [i185]Xiwen Liang, Fengda Zhu, Lingling Li, Hang Xu, Xiaodan Liang:
Visual-Language Navigation Pretraining via Prompt-based Environmental Self-exploration. CoRR abs/2203.04006 (2022) - [i184]Kaican Li, Kai Chen, Haoyu Wang, Lanqing Hong, Chaoqiang Ye, Jianhua Han, Yukuai Chen, Wei Zhang, Chunjing Xu, Dit-Yan Yeung, Xiaodan Liang, Zhenguo Li, Hang Xu:
CODA: A Real-World Road Corner Case Dataset for Object Detection in Autonomous Driving. CoRR abs/2203.07724 (2022) - [i183]Xunlin Zhan, Yuan Li, Xiao Dong, Xiaodan Liang, Zhiting Hu, Lawrence Carin:
elBERto: Self-supervised Commonsense Learning for Question Answering. CoRR abs/2203.09424 (2022) - [i182]Jianhua Han, Xiajun Deng, Xinyue Cai, Zhen Yang, Hang Xu, Chunjing Xu, Xiaodan Liang:
Laneformer: Object-aware Row-Column Transformers for Lane Detection. CoRR abs/2203.09830 (2022) - [i181]Pengzhen Ren, Changlin Li, Guangrun Wang, Yun Xiao, Qing Du, Xiaodan Liang, Xiaojun Chang:
Beyond Fixation: Dynamic Window Visual Transformer. CoRR abs/2203.12856 (2022) - [i180]Changlin Li, Bohan Zhuang, Guangrun Wang, Xiaodan Liang, Xiaojun Chang, Yi Yang:
Automated Progressive Learning for Efficient Training of Vision Transformers. CoRR abs/2203.14509 (2022) - [i179]Xin Dong, Fuwei Zhao, Zhenyu Xie, Xijin Zhang, Daniel K. Du, Min Zheng, Xiang Long, Xiaodan Liang, Jianchao Yang:
Dressing in the Wild by Watching Dance Videos. CoRR abs/2203.15320 (2022) - [i178]Minbin Huang, Zhijian Huang, Changlin Li, Xin Chen, Hang Xu, Zhenguo Li, Xiaodan Liang:
Arch-Graph: Acyclic Architecture Relation Predictor for Task-Transferable Neural Architecture Search. CoRR abs/2204.05941 (2022) - [i177]Wenge Liu, Yi Cheng, Hao Wang, Jianheng Tang, Yafei Liu, Ruihui Zhao, Wenjie Li, Yefeng Zheng, Xiaodan Liang:
"My nose is running." "Are you also coughing?": Building A Medical Diagnosis Agent with Interpretable Inquiry Logics. CoRR abs/2204.13953 (2022) - [i176]Binbin Yang, Xinchi Deng, Han Shi, Changlin Li, Gengwei Zhang, Hang Xu, Shen Zhao, Liang Lin, Xiaodan Liang:
Continual Object Detection via Prototypical Task Correlation Guided Gating Mechanism. CoRR abs/2205.03055 (2022) - [i175]Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Xiaodan Liang:
Unbiased Math Word Problems Benchmark for Mitigating Solving Bias. CoRR abs/2205.08108 (2022) - [i174]Zhicheng Yang, Jinghui Qin, Jiaqi Chen, Liang Lin, Xiaodan Liang:
LogicSolver: Towards Interpretable Math Word Problem Solving with Logical Prompt-enhanced Learning. CoRR abs/2205.08232 (2022) - [i173]Sihao Lin, Hongwei Xie, Bing Wang, Kaicheng Yu, Xiaojun Chang, Xiaodan Liang, Gang Wang:
Knowledge Distillation via the Target-aware Transformer. CoRR abs/2205.10793 (2022) - [i172]Jiahui Gao, Renjie Pi, Yong Lin, Hang Xu, Jiacheng Ye, Zhiyong Wu, Xiaodan Liang, Zhenguo Li, Lingpeng Kong:
ZeroGen+: Self-Guided High-Quality Data Generation in Efficient Zero-Shot Learning. CoRR abs/2205.12679 (2022) - [i171]Kaicheng Yu, Tao Tang, Hongwei Xie, Zhiwei Lin, Zhongwei Wu, Zhongyu Xia, Tingting Liang, Haiyang Sun, Jiong Deng, Dayang Hao, Yongtao Wang, Xiaodan Liang, Bing Wang:
Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection. CoRR abs/2205.14951 (2022) - [i170]Bingqian Lin, Yi Zhu, Zicong Chen, Xiwen Liang, Jianzhuang Liu, Xiaodan Liang:
ADAPT: Vision-Language Navigation with Modality-Aligned Action Prompts. CoRR abs/2205.15509 (2022) - [i169]Mingjie Li, Wenjia Cai, Karin Verspoor, Shirui Pan, Xiaodan Liang, Xiaojun Chang:
Cross-modal Clinical Graph Transformer for Ophthalmic Report Generation. CoRR abs/2206.01988 (2022) - [i168]Xiao Dong, Xunlin Zhan, Yunchao Wei, Xiaoyong Wei, Yaowei Wang, Minlong Lu, Xiaochun Cao, Xiaodan Liang:
Entity-Graph Enhanced Cross-Modal Pretraining for Instance-level Product Retrieval. CoRR abs/2206.08842 (2022) - [i167]Yinya Huang, Lemao Liu, Kun Xu, Meng Fang, Liang Lin, Xiaodan Liang:
Discourse-Aware Graph Networks for Textual Logical Reasoning. CoRR abs/2207.01450 (2022) - [i166]Siyi Hu, Chuanlong Xie, Xiaodan Liang, Xiaojun Chang:
Policy Diagnosis via Measuring Role Diversity in Cooperative Multi-agent RL. CoRR abs/2207.05683 (2022) - [i165]Quande Liu, Youpeng Wen, Jianhua Han, Chunjing Xu, Hang Xu, Xiaodan Liang:
Open-world Semantic Segmentation via Contrasting and Clustering Vision-Language Embedding. CoRR abs/2207.08455 (2022) - [i164]Mengxue Qu, Yu Wu, Wu Liu, Qiqi Gong, Xiaodan Liang, Olga Russakovsky, Yao Zhao, Yunchao Wei:
SiRi: A Simple Selective Retraining Mechanism for Transformer-based Visual Grounding. CoRR abs/2207.13325 (2022) - [i163]Zhenyu Xie, Zaiyu Huang, Fuwei Zhao, Haoye Dong, Michael Kampffmeyer, Xin Dong, Feida Zhu, Xiaodan Liang:
PASTA-GAN++: A Versatile Framework for High-Resolution Unpaired Virtual Try-on. CoRR abs/2207.13475 (2022) - [i162]Guangyi Liu, Zeyu Feng, Yuan Gao, Zichao Yang, Xiaodan Liang, Junwei Bao, Xiaodong He, Shuguang Cui, Zhen Li, Zhiting Hu:
Composable Text Control Operations in Latent Space with Ordinary Differential Equations. CoRR abs/2208.00638 (2022) - [i161]Xujie Zhang, Yu Sha, Michael C. Kampffmeyer, Zhenyu Xie, Zequn Jie, Chengwen Huang, Jianqing Peng, Xiaodan Liang:
ARMANI: Part-level Garment-Text Alignment for Unified Cross-Modal Fashion Design. CoRR abs/2208.05621 (2022) - [i160]Xiwen Liang, Yangxin Wu, Jianhua Han, Hang Xu, Chunjing Xu, Xiaodan Liang:
Effective Adaptation in Multi-Task Co-Training for Unified Autonomous Driving. CoRR abs/2209.08953 (2022) - [i159]Lewei Yao, Jianhua Han, Youpeng Wen, Xiaodan Liang, Dan Xu, Wei Zhang, Zhenguo Li, Chunjing Xu, Hang Xu:
DetCLIP: Dictionary-Enriched Visual-Concept Paralleled Pre-training for Open-world Detection. CoRR abs/2209.09407 (2022) - [i158]Yi Cheng, Wenge Liu, Wenjie Li, Jiashuo Wang, Ruihui Zhao, Bang Liu, Xiaodan Liang, Yefeng Zheng:
Improving Multi-turn Emotional Support Dialogue Generation with Lookahead Strategy Planning. CoRR abs/2210.04242 (2022) - [i157]Tao Tang, Changlin Li, Guangrun Wang, Kaicheng Yu, Xiaojun Chang, Xiaodan Liang:
Learning Self-Regularized Adversarial Views for Self-Supervised Vision Transformers. CoRR abs/2210.08458 (2022) - [i156]Yinya Huang, Hongming Zhang, Ruixin Hong, Xiaodan Liang, Changshui Zhang, Dong Yu:
MetaLogic: Logical Reasoning Explanations with Fine-Grained Structure. CoRR abs/2210.12487 (2022) - [i155]Siyi Hu, Yifan Zhong, Minquan Gao, Weixun Wang, Hao Dong, Zhihui Li, Xiaodan Liang, Xiaojun Chang, Yaodong Yang:
MARLlib: Extending RLlib for Multi-agent Reinforcement Learning. CoRR abs/2210.13708 (2022) - [i154]Yanxin Long, Jianhua Han, Runhui Huang, Xu Hang, Yi Zhu, Chunjing Xu, Xiaodan Liang:
P3OVD: Fine-grained Visual-Text Prompt-Driven Self-Training for Open-Vocabulary Object Detection. CoRR abs/2211.00849 (2022) - [i153]Xipeng Chen, Guangrun Wang, Dizhong Zhu, Xiaodan Liang, Philip H. S. Torr, Liang Lin:
Structure-Preserving 3D Garment Modeling with Neural Sewing Machines. CoRR abs/2211.06701 (2022) - [i152]Zaiyu Huang, Hanhui Li, Zhenyu Xie, Michael Kampffmeyer, Qingling Cai, Xiaodan Liang:
Towards Hard-pose Virtual Try-on via 3D-aware Global Correspondence Learning. CoRR abs/2211.14052 (2022) - [i151]Zutao Jiang, Guansong Lu, Xiaodan Liang, Jihua Zhu, Wei Zhang, Xiaojun Chang, Hang Xu:
3D-TOGO: Towards Text-Guided Cross-Category 3D Object Generation. CoRR abs/2212.01103 (2022) - [i150]Zicheng Zhang, Yi Zhu, Jianzhuang Liu, Xiaodan Liang, Wei Ke:
CoupAlign: Coupling Word-Pixel with Sentence-Mask Alignments for Referring Image Segmentation. CoRR abs/2212.01769 (2022) - [i149]Jiaqi Chen, Tong Li, Jinghui Qin, Pan Lu, Liang Lin, Chongyu Chen, Xiaodan Liang:
UniGeo: Unifying Geometry Logical Reasoning via Reformulating Mathematical Expression. CoRR abs/2212.02746 (2022) - [i148]Runhui Huang, Yanxin Long, Jianhua Han, Hang Xu, Xiwen Liang, Chunjing Xu, Xiaodan Liang:
NLIP: Noise-robust Language-Image Pre-training. CoRR abs/2212.07086 (2022)
- 2021
- [j35]Wenge Liu, Jianheng Tang, Xiaodan Liang, Qingling Cai:
Heterogeneous graph reasoning for knowledge-grounded medical dialogue system. Neurocomputing 442: 260-268 (2021) - [j34]Bingqian Lin, Yi Zhu, Xiaodan Liang:
Heterogeneous Excitation-and-Squeeze Network for visual dialog. Neurocomputing 449: 399-410 (2021) - [j33]Qingxing Cao, Xiaodan Liang, Bailin Li, Liang Lin:
Interpretable Visual Question Answering by Reasoning on Dependency Trees. IEEE Trans. Pattern Anal. Mach. Intell. 43(3): 887-901 (2021) - [j32]Bowen Wu, Zhenyu Xie, Xiaodan Liang, Yubei Xiao, Haoye Dong, Liang Lin:
Image Comes Dancing With Collaborative Parsing-Flow Video Synthesis. IEEE Trans. Image Process. 30: 9259-9269 (2021) - [j31]Yukai Shi, Sen Zhang, Chenxing Zhou, Xiaodan Liang, Xiaojun Yang, Liang Lin:
GTAE: Graph Transformer-Based Auto-Encoders for Linguistic-Constrained Text Style Transfer. ACM Trans. Intell. Syst. Technol. 12(3): 32:1-32:16 (2021) - [j30]Guangyi Liu, Yinghong Liao, Fuyu Wang, Bin Zhang, Lu Zhang, Xiaodan Liang, Xiang Wan, Shaolin Li, Zhen Li, Shuixing Zhang, Shuguang Cui:
Medical-VLBERT: Medical Visual Language BERT for COVID-19 CT Report Generation With Alternate Learning. IEEE Trans. Neural Networks Learn. Syst. 32(9): 3786-3797 (2021) - [c147]Gengwei Zhang, Yiming Gao, Hang Xu, Hao Zhang, Zhenguo Li, Xiaodan Liang:
Ada-Segment: Automated Multi-loss Adaptation for Panoptic Segmentation. AAAI 2021: 3333-3341 - [c146]Yinya Huang, Meng Fang, Xunlin Zhan, Qingxing Cao, Xiaodan Liang:
REM-Net: Recursive Erasure Memory Network for Commonsense Evidence Refinement. AAAI 2021: 6375-6383 - [c145]Shuai Lin, Pan Zhou, Xiaodan Liang, Jianheng Tang, Ruihui Zhao, Ziliang Chen, Liang Lin:
Graph-Evolving Meta-Learning for Low-Resource Medical Dialogue Generation. AAAI 2021: 13362-13370 - [c144]Yubei Xiao, Ke Gong, Pan Zhou, Guolin Zheng, Xiaodan Liang, Liang Lin:
Adversarial Meta Sampling for Multilingual Low-Resource Speech Recognition. AAAI 2021: 14112-14120 - [c143]Jiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric P. Xing, Liang Lin:
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning. ACL/IJCNLP (Findings) 2021: 513-523 - [c142]Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin, Xiaodan Liang:
Towards Quantifiable Dialogue Coherence Evaluation. ACL/IJCNLP (1) 2021: 2718-2729 - [c141]Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang, Liang Lin:
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks. ACL/IJCNLP (1) 2021: 5870-5881 - [c140]Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang, Song-Chun Zhu:
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning. ACL/IJCNLP (1) 2021: 6774-6786 - [c139]Ziyue Xu, Xiaodan Liang, Maowei He, Hanning Chen:
Multiple Adaptive Strategies-based Rat Swarm Optimizer. CCIS 2021: 159-163 - [c138]Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. CVPR 2021: 5251-5260 - [c137]Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang:
Dynamic Slimmable Network. CVPR 2021: 8607-8617 - [c136]Fengda Zhu, Xiwen Liang, Yi Zhu, Qizhi Yu, Xiaojun Chang, Xiaodan Liang:
SOON: Scenario Oriented Object Navigation With Graph-Based Exploration. CVPR 2021: 12689-12699 - [c135]Chenhe Dong, Guangrun Wang, Hang Xu, Jiefeng Peng, Xiaozhe Ren, Xiaodan Liang:
EfficientBERT: Progressively Searching Multilayer Perceptron via Warm-up Knowledge Distillation. EMNLP (Findings) 2021: 1424-1437 - [c134]Guolin Zheng, Yubei Xiao, Ke Gong, Pan Zhou, Xiaodan Liang, Liang Lin:
Wav-BERT: Cooperative Acoustic and Linguistic Representation Learning for Low-Resource Speech Recognition. EMNLP (Findings) 2021: 2765-2777 - [c133]Yi Zhu, Yue Weng, Fengda Zhu, Xiaodan Liang, Qixiang Ye, Yutong Lu, Jianbin Jiao:
Self-Motivated Communication Agent for Real-World Vision-Dialog Navigation. ICCV 2021: 1574-1583 - [c132]Qingxing Cao, Wentao Wan, Keze Wang, Xiaodan Liang, Liang Lin:
Linguistically Routing Capsule Network for Out-of-distribution Visual Question Answering. ICCV 2021: 1594-1603 - [c131]Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, Zongyuan Ge, Yi-Dong Shen:
Vision-Language Navigation with Random Environmental Mixup. ICCV 2021: 1624-1634 - [c130]Jiageng Mao, Minzhe Niu, Haoyue Bai, Xiaodan Liang, Hang Xu, Chunjing Xu:
Pyramid R-CNN: Towards Better Performance and Adaptability for 3D Object Detection. ICCV 2021: 2703-2712 - [c129]Jiageng Mao, Yujing Xue, Minzhe Niu, Haoyue Bai, Jiashi Feng, Xiaodan Liang, Hang Xu, Chunjing Xu:
Voxel Transformer for 3D Object Detection. ICCV 2021: 3144-3153 - [c128]Hanxue Liang, Chenhan Jiang, Dapeng Feng, Xin Chen, Hang Xu, Xiaodan Liang, Wei Zhang, Zhenguo Li, Luc Van Gool:
Exploring Geometry-aware Contrast and Clustering Harmonization for Self-supervised 3D Object Detection. ICCV 2021: 3273-3282 - [c127]Hang Xu, Ning Kang, Gengwei Zhang, Chuanlong Xie, Xiaodan Liang, Zhenguo Li:
NASOA: Towards Faster Task-oriented Online Fine-tuning with a Zoo of Models. ICCV 2021: 5077-5086 - [c126]Li Liu, Qingle Huang, Sihao Lin, Hongwei Xie, Bing Wang, Xiaojun Chang, Xiaodan Liang:
Exploring Inter-Channel Correlation for Diversity-preserved Knowledge Distillation. ICCV 2021: 8251-8260 - [c125]Haonan Yan, Jiaqi Chen, Xujie Zhang, Shengkai Zhang, Nianhong Jiao, Xiaodan Liang, Tianxiang Zheng:
UltraPose: Synthesizing Dense Pose with 1 Billion Points by Human-body Decoupling 3D Model. ICCV 2021: 10871-10880 - [c124]Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang:
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-Modal Pretraining. ICCV 2021: 11762-11771 - [c123]Changlin Li, Tao Tang, Guangrun Wang, Jiefeng Peng, Bing Wang, Xiaodan Liang, Xiaojun Chang:
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search. ICCV 2021: 12261-12271 - [c122]Jiefeng Peng, Jiqi Zhang, Changlin Li, Guangrun Wang, Xiaodan Liang, Liang Lin:
Pi-NAS: Improving Neural Architecture Search by Reducing Supernet Training Consistency Shift. ICCV 2021: 12334-12344 - [c121]Fuwei Zhao, Zhenyu Xie, Michael Kampffmeyer, Haoye Dong, Songfang Han, Tianxiang Zheng, Tao Zhang, Xiaodan Liang:
M3D-VTON: A Monocular-to-3D Virtual Try-On Network. ICCV 2021: 13219-13229 - [c120]Siyi Hu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang:
UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers. ICLR 2021 - [c119]Peidong Liu, Gengwei Zhang, Bochao Wang, Hang Xu, Xiaodan Liang, Yong Jiang, Zhenguo Li:
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search. ICLR 2021 - [c118]Siyi Hu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang:
Transformer Based Multi-Agent Framework. ICME Workshops 2021: 1-2 - [c117]Binbin Yang, Xiaodan Liang, Junhao Zhong, Jiefeng Peng, Guangrun Wang, Liang Lin:
Unifying Dynamic Optimizer Search and Network Architecture Search. ICME 2021: 1-6 - [c116]Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu, Xiaodan Liang, Zhenguo Li, James Tin-Yau Kwok:
SparseBERT: Rethinking the Importance Analysis in Self-attention. ICML 2021: 9547-9557 - [c115]Junfan Lin, Zhongzhan Huang, Keze Wang, Xiaodan Liang, Weiwei Chen, Liang Lin:
Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp. ICRA 2021: 9490-9497
- [c114]Zhenyu Xie, Xujie Zhang, Fuwei Zhao, Haoye Dong, Michael C. Kampffmeyer, Haonan Yan, Xiaodan Liang:
WAS-VTON: Warping Architecture Search for Virtual Try-on Network. ACM Multimedia 2021: 3350-3359
- [c113]Yinya Huang, Meng Fang, Yu Cao, Liwei Wang, Xiaodan Liang:
DAGN: Discourse-Aware Graph Network for Logical Reasoning. NAACL-HLT 2021: 5848-5855
- [c112]Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Jiageng Mao, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu:
SODA10M: A Large-Scale 2D Self/Semi-Supervised Object Detection Dataset for Autonomous Driving. NeurIPS Datasets and Benchmarks 2021
- [c111]Mingjie Li, Wenjia Cai, Rui Liu, Yuetian Weng, Xiaoyun Zhao, Cong Wang, Xin Chen, Zhong Liu, Caineng Pan, Mengke Li, Yingfeng Zheng, Yizhi Liu, Flora D. Salim, Karin Verspoor, Xiaodan Liang, Xiaojun Chang:
FFA-IR: Towards an Explainable and Reliable Medical Report Generation Benchmark. NeurIPS Datasets and Benchmarks 2021
- [c110]Pan Lu, Liang Qiu, Jiaqi Chen, Tanglin Xia, Yizhou Zhao, Wei Zhang, Zhou Yu, Xiaodan Liang, Song-Chun Zhu:
IconQA: A New Benchmark for Abstract Diagram Understanding and Visual Language Reasoning. NeurIPS Datasets and Benchmarks 2021
- [c109]Jiageng Mao, Minzhe Niu, Chenhan Jiang, Hanxue Liang, Jingheng Chen, Xiaodan Liang, Yamin Li, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Jie Yu, Chunjing Xu, Hang Xu:
One Million Scenes for Autonomous Driving: ONCE Dataset. NeurIPS Datasets and Benchmarks 2021
- [c108]Zhenyu Xie, Zaiyu Huang, Fuwei Zhao, Haoye Dong, Michael Kampffmeyer, Xiaodan Liang:
Towards Scalable Unpaired Virtual Try-On via Patch-Routed Spatially-Adaptive GAN. NeurIPS 2021: 2598-2610
- [p1]Qingxing Cao, Wentao Wan, Xiaodan Liang, Liang Lin:
Graph Reasoning Networks and Applications. Neuro-Symbolic Artificial Intelligence 2021: 103-125
- [i147]Fuyu Wang, Xiaodan Liang, Lin Xu, Liang Lin:
Unifying Relational Sentence Generation and Retrieval for Medical Image Report Composition. CoRR abs/2101.03287 (2021)
- [i146]Siyi Hu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang:
UPDeT: Universal Multi-agent Reinforcement Learning via Policy Decoupling with Transformers. CoRR abs/2101.08001 (2021)
- [i145]Liang Lin, Yiming Gao, Ke Gong, Meng Wang, Xiaodan Liang:
Graphonomy: Universal Image Parsing via Graph Reasoning and Transfer. CoRR abs/2101.10620 (2021)
- [i144]Yukai Shi, Sen Zhang, Chenxing Zhou, Xiaodan Liang, Xiaojun Yang, Liang Lin:
GTAE: Graph-Transformer based Auto-Encoders for Linguistic-Constrained Text Style Transfer. CoRR abs/2102.00769 (2021)
- [i143]Peidong Liu, Gengwei Zhang, Bochao Wang, Hang Xu, Xiaodan Liang, Yong Jiang, Zhenguo Li:
Loss Function Discovery for Object Detection via Convergence-Simulation Driven Search. CoRR abs/2102.04700 (2021)
- [i142]Han Shi, Jiahui Gao, Xiaozhe Ren, Hang Xu, Xiaodan Liang, Zhenguo Li, James T. Kwok:
SparseBERT: Rethinking the Importance Analysis in Self-attention. CoRR abs/2102.12871 (2021)
- [i141]Zhengzhong Liu, Guanxiong Ding, Avinash Bukkittu, Mansi Gupta, Pengzhi Gao, Atif Ahmed, Shikun Zhang, Xin Gao, Swapnil Singhavi, Linwei Li, Wei Wei, Zecong Hu, Haoran Shi, Xiaodan Liang, Teruko Mitamura, Eric P. Xing, Zhiting Hu:
A Data-Centric Framework for Composable NLP Workflows. CoRR abs/2103.01834 (2021)
- [i140]Changlin Li, Tao Tang, Guangrun Wang, Jiefeng Peng, Bing Wang, Xiaodan Liang, Xiaojun Chang:
BossNAS: Exploring Hybrid CNN-transformers with Block-wisely Self-supervised Neural Architecture Search. CoRR abs/2103.12424 (2021)
- [i139]Changlin Li, Guangrun Wang, Bing Wang, Xiaodan Liang, Zhihui Li, Xiaojun Chang:
Dynamic Slimmable Network. CoRR abs/2103.13258 (2021)
- [i138]Yinya Huang, Meng Fang, Yu Cao, Liwei Wang, Xiaodan Liang:
DAGN: Discourse-Aware Graph Network for Logical Reasoning. CoRR abs/2103.14349 (2021)
- [i137]Fengda Zhu, Xiwen Liang, Yi Zhu, Xiaojun Chang, Xiaodan Liang:
SOON: Scenario Oriented Object Navigation with Graph-based Exploration. CoRR abs/2103.17138 (2021)
- [i136]Pan Lu, Ran Gong, Shibiao Jiang, Liang Qiu, Siyuan Huang, Xiaodan Liang, Song-Chun Zhu:
Inter-GPS: Interpretable Geometry Problem Solving with Formal Language and Symbolic Reasoning. CoRR abs/2105.04165 (2021)
- [i135]Yawen Duan, Xin Chen, Hang Xu, Zewei Chen, Xiaodan Liang, Tong Zhang, Zhenguo Li:
TransNAS-Bench-101: Improving Transferability and Generalizability of Cross-Task Neural Architecture Search. CoRR abs/2105.11871 (2021)
- [i134]Jiaqi Chen, Jianheng Tang, Jinghui Qin, Xiaodan Liang, Lingbo Liu, Eric P. Xing, Liang Lin:
GeoQA: A Geometric Question Answering Benchmark Towards Multimodal Numerical Reasoning. CoRR abs/2105.14517 (2021)
- [i133]Zheng Ye, Liucun Lu, Lishan Huang, Liang Lin, Xiaodan Liang:
Towards Quantifiable Dialogue Coherence Evaluation. CoRR abs/2106.00507 (2021)
- [i132]Chong Liu, Fengda Zhu, Xiaojun Chang, Xiaodan Liang, Yi-Dong Shen:
Vision-Language Navigation with Random Environmental Mixup. CoRR abs/2106.07876 (2021)
- [i131]Shuai Lin, Pan Zhou, Zi-Yuan Hu, Shuojia Wang, Ruihui Zhao, Yefeng Zheng, Liang Lin, Eric P. Xing, Xiaodan Liang:
Prototypical Graph Contrastive Learning. CoRR abs/2106.09645 (2021)
- [i130]Jiageng Mao, Minzhe Niu, Chenhan Jiang, Hanxue Liang, Xiaodan Liang, Yamin Li, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Jie Yu, Hang Xu, Chunjing Xu:
One Million Scenes for Autonomous Driving: ONCE Dataset. CoRR abs/2106.11037 (2021)
- [i129]Jianhua Han, Xiwen Liang, Hang Xu, Kai Chen, Lanqing Hong, Chaoqiang Ye, Wei Zhang, Zhenguo Li, Xiaodan Liang, Chunjing Xu:
SODA10M: Towards Large-Scale Object Detection Benchmark for Autonomous Driving. CoRR abs/2106.11118 (2021)
- [i128]Guangyi Liu, Zichao Yang, Tianhua Tao, Xiaodan Liang, Zhen Li, Bowen Zhou, Shuguang Cui, Zhiting Hu:
Don't Take It Literally: An Edit-Invariant Sequence Loss for Text Generation. CoRR abs/2106.15078 (2021)
- [i127]Jinghui Qin, Xiaodan Liang, Yining Hong, Jianheng Tang, Liang Lin:
Neural-Symbolic Solver for Math Word Problems with Auxiliary Tasks. CoRR abs/2107.01431 (2021)
- [i126]Jiahui Gao, Hang Xu, Han Shi, Xiaozhe Ren, Philip L. H. Yu, Xiaodan Liang, Xin Jiang, Zhenguo Li:
AutoBERT-Zero: Evolving BERT Backbone from Scratch. CoRR abs/2107.07445 (2021)
- [i125]Bingqian Lin, Yi Zhu, Yanxin Long, Xiaodan Liang, Qixiang Ye, Liang Lin:
Adversarial Reinforced Instruction Attacker for Robust Vision-Language Navigation. CoRR abs/2107.11252 (2021)
- [i124]Xunlin Zhan, Yangxin Wu, Xiao Dong, Yunchao Wei, Minlong Lu, Yichi Zhang, Hang Xu, Xiaodan Liang:
Product1M: Towards Weakly Supervised Instance-Level Product Retrieval via Cross-modal Pretraining. CoRR abs/2107.14572 (2021)
- [i123]