default search action
Ping Luo 0002
羅平
Person information
- unicode name: 羅平
- affiliation: University of Hong Kong, Department of Computer Science, Hong Kong
- affiliation (PhD 2014): Chinese University of Hong Kong, Department of Information Engineering, Hong Kong
- affiliation (former): Sun Yat-Sen University, School of Software, Guangzhou, China
- affiliation (former): Lotus Hill Insititue, China
Other persons with the same name
- Ping Luo — disambiguation page
- Ping Luo 0001 — Chinese Academy of Sciences, Institute of Computing Technology, Beijing, China (and 1 more)
- Ping Luo 0003 — University of Saskatchewan, Division of Biomedical Engineering, Saskatoon, SK, Canada
- Ping Luo 0004 — Tsinghua University, Key Laboratory for Information System Security, Beijing, China (and 1 more)
- Ping Luo 0005 — University of Electronic Science and Technology of China, Institute of Electronic and Information Engineering, State Key Laboratory of Electronic Thin Films and Integrated Devices, China
- Ping Luo 0006 — Guangzhou University, School of Economics and Statistics, China
Other persons with a similar name
- Ai-Ping Luo
- Chih-Ping Luo
- Shi-Ping Luo
- Xi-Ping Luo
- Xue-ping Luo
- Yiping Luo (aka: Yi-Ping Luo) — disambiguation page
- Yiping Luo 0001 (aka: Yi-Ping Luo 0001) — Hunan Institute of Engineering, Xiangtan, China
- Yiping Luo 0002 (aka: Yi-Ping Luo 0002) — Central South University, School of Traffic and Transportation Engineering, Changsha, China
- Zhengping Luo 0002 (aka: Zheng-ping Luo 0002) — Chinese Academy of Sciences, Institute of Plasma Physics, Hefei, China
- Luo Ping
SPARQL queries
🛈 Please note that only 65% of the records listed on this page have a DOI. Therefore, DOI-based queries can only provide partial results.
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [j35]Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang:
Context Autoencoder for Self-supervised Representation Learning. Int. J. Comput. Vis. 132(1): 208-223 (2024) - [j34]Weijia Wu, Yuanqiang Cai, Chunhua Shen, Debing Zhang, Ying Fu, Hong Zhou, Ping Luo:
End-to-End Video Text Spotting with Transformer. Int. J. Comput. Vis. 132(9): 4019-4035 (2024) - [j33]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Deeply Unsupervised Patch Re-Identification for Pre-Training Object Detectors. IEEE Trans. Pattern Anal. Mach. Intell. 46(3): 1348-1361 (2024) - [j32]Junjie Wang, Qichao Zhang, Yao Mu, Dong Li, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao:
Prototypical Context-Aware Dynamics for Generalization in Visual Control With Model-Based Reinforcement Learning. IEEE Trans. Ind. Informatics 20(9): 10717-10727 (2024) - [j31]Chongjian Ge, Yibing Song, Chao Ma, Yuankai Qi, Ping Luo:
Rethinking Attentive Object Detection via Neural Attention Learning. IEEE Trans. Image Process. 33: 1726-1739 (2024) - [j30]Zhouxia Wang, Jiawei Zhang, Xintao Wang, Tianshui Chen, Ying Shan, Wenping Wang, Ping Luo:
Analysis and Benchmarking of Extending Blind Face Image Restoration to Videos. IEEE Trans. Image Process. 33: 5676-5687 (2024) - [j29]Zeyu Gao, Yao Mu, Chen Chen, Jingliang Duan, Ping Luo, Yanfeng Lu, Shengbo Eben Li:
Enhance Sample Efficiency and Robustness of End-to-End Urban Autonomous Driving via Semantic Masked World Model. IEEE Trans. Intell. Transp. Syst. 25(10): 13067-13079 (2024) - [j28]Chaofan Tao, Rui Lin, Quan Chen, Zhaoyang Zhang, Ping Luo, Ngai Wong:
FAT: Frequency-Aware Transformation for Bridging Full-Precision and Low-Precision Deep Representations. IEEE Trans. Neural Networks Learn. Syst. 35(2): 2640-2654 (2024) - [j27]Ping Luo, Jieren Cheng, Neal Xiong, Zhenhao Liu, Jie Wu:
FedVeca: Federated Vectorized Averaging on Non-IID Data With Adaptive Bi-Directional Global Objective. IEEE Trans. Parallel Distributed Syst. 35(11): 2102-2113 (2024) - [c192]Tianqi Wang, Sukmin Kim, Wenxuan Ji, Enze Xie, Chongjian Ge, Junsong Chen, Zhenguo Li, Ping Luo:
DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving. AAAI 2024: 5599-5606 - [c191]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ying Shan, Ping Luo:
LLaMA Pro: Progressive LLaMA with Block Expansion. ACL (1) 2024: 6518-6537 - [c190]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssistant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. ACL (Findings) 2024: 7775-7803 - [c189]Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han, Dongmei Zhang:
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering. LREC/COLING 2024: 9705-9719 - [c188]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CVPR 2024: 6390-6399 - [c187]Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Pérez-Rúa:
GenTron: Diffusion Transformers for Image and Video Generation. CVPR 2024: 6441-6451 - [c186]Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li:
Generalized Predictive Model for Autonomous Driving. CVPR 2024: 14662-14672 - [c185]Zhixuan Liang, Yao Mu, Hengbo Ma, Masayoshi Tomizuka, Mingyu Ding, Ping Luo:
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution. CVPR 2024: 16467-16476 - [c184]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
Intern VL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CVPR 2024: 24185-24198 - [c183]Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu:
GKGNet: Group K-Nearest Neighbor Based Graph Convolutional Network for Multi-label Image Recognition. ECCV (18) 2024: 91-107 - [c182]Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo:
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-person Multi-task Human-Centric Perception. ECCV (18) 2024: 126-146 - [c181]Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo:
Align, Adapt and Inject: Audio-Guided Image Generation, Editing and Stylization. ICASSP 2024: 3475-3479 - [c180]Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Zhongdao Wang, James T. Kwok, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis. ICLR 2024 - [c179]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. ICLR 2024 - [c178]Yuanfeng Ji, Chongjian Ge, Weikai Kong, Enze Xie, Zhengying Liu, Zhenguo Li, Ping Luo:
Large Language Models as Automated Aligners for benchmarking Vision-Language Models. ICLR 2024 - [c177]Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding:
VDT: General-purpose Video Diffusion Transformers via Mask Modeling. ICLR 2024 - [c176]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. ICLR 2024 - [c175]Haopeng Sun, Lumin Xu, Sheng Jin, Ping Luo, Chen Qian, Wentao Liu:
PROGRAM: PROtotype GRAph Model based Pseudo-Label Learning for Test-Time Adaptation. ICLR 2024 - [c174]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinhao Li, Guo Chen, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. ICLR 2024 - [c173]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. ICLR 2024 - [c172]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. ICML 2024 - [c171]Yue Yang, Yuqi Lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo:
Position: Towards Implicit Prompt For Text-To-Image Models. ICML 2024 - [c170]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. ICML 2024 - [c169]Anran Liu, Cheng Lin, Yuan Liu, Xiaoxiao Long, Zhiyang Dou, Hao-Xiang Guo, Ping Luo, Wenping Wang:
Part123: Part-aware 3D Reconstruction from a Single-view Image. SIGGRAPH (Conference Paper Track) 2024: 24 - [c168]Zhouxia Wang, Ziyang Yuan, Xintao Wang, Yaowei Li, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan:
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. SIGGRAPH (Conference Paper Track) 2024: 114 - [i258]Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning. CoRR abs/2401.02384 (2024) - [i257]Chengyue Wu, Yukang Gan, Yixiao Ge, Zeyu Lu, Jiahao Wang, Ye Feng, Ping Luo, Ying Shan:
LLaMA Pro: Progressive LLaMA with Block Expansion. CoRR abs/2401.02415 (2024) - [i256]Junsong Chen, Yue Wu, Simian Luo, Enze Xie, Sayak Paul, Ping Luo, Hang Zhao, Zhenguo Li:
PIXART-δ: Fast and Controllable Image Generation with Latent Consistency Models. CoRR abs/2401.05252 (2024) - [i255]Yutao Hu, Tianbin Li, Quanfeng Lu, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo:
OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM. CoRR abs/2402.09181 (2024) - [i254]Junting Chen, Yao Mu, Qiaojun Yu, Tianming Wei, Silang Wu, Zhecheng Yuan, Zhixuan Liang, Chao Yang, Kaipeng Zhang, Wenqi Shao, Yu Qiao, Huazhe Xu, Mingyu Ding, Ping Luo:
RoboScript: Code Generation for Free-Form Manipulation Tasks across Real and Simulation. CoRR abs/2402.14623 (2024) - [i253]Zekang Yang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu:
AutoMMLab: Automatically Generating Deployable Models from Language Instructions for Computer Vision Tasks. CoRR abs/2402.15351 (2024) - [i252]Yao Mu, Junting Chen, Qinglong Zhang, Shoufa Chen, Qiaojun Yu, Chongjian Ge, Runjian Chen, Zhixuan Liang, Mengkang Hu, Chaofan Tao, Peize Sun, Haibao Yu, Chao Yang, Wenqi Shao, Wenhai Wang, Jifeng Dai, Yu Qiao, Mingyu Ding, Ping Luo:
RoboCodeX: Multimodal Code Generation for Robotic Behavior Synthesis. CoRR abs/2402.16117 (2024) - [i251]Peng Xu, Wenqi Shao, Mengzhao Chen, Shitao Tang, Kaipeng Zhang, Peng Gao, Fengwei An, Yu Qiao, Ping Luo:
BESA: Pruning Large Language Models with Blockwise Parameter-Efficient Sparsity Allocation. CoRR abs/2402.16880 (2024) - [i250]Yue Yang, Yuqi lin, Hong Liu, Wenqi Shao, Runjian Chen, Hailong Shang, Yu Wang, Yu Qiao, Kaipeng Zhang, Ping Luo:
Towards Implicit Prompt For Text-To-Image Models. CoRR abs/2403.02118 (2024) - [i249]Junsong Chen, Chongjian Ge, Enze Xie, Yue Wu, Lewei Yao, Xiaozhe Ren, Zhongdao Wang, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation. CoRR abs/2403.04692 (2024) - [i248]Hao Zhang, Wenqi Shao, Hong Liu, Yongqiang Ma, Ping Luo, Yu Qiao, Kaipeng Zhang:
AVIBench: Towards Evaluating the Robustness of Large Vision-Language Model on Adversarial Visual-Instructions. CoRR abs/2403.09346 (2024) - [i247]Jiazhi Yang, Shenyuan Gao, Yihang Qiu, Li Chen, Tianyu Li, Bo Dai, Kashyap Chitta, Penghao Wu, Jia Zeng, Ping Luo, Jun Zhang, Andreas Geiger, Yu Qiao, Hongyang Li:
Generalized Predictive Model for Autonomous Driving. CoRR abs/2403.09630 (2024) - [i246]Tianqi Wang, Enze Xie, Ruihang Chu, Zhenguo Li, Ping Luo:
DriveCoT: Integrating Chain-of-Thought Reasoning with End-to-End Driving. CoRR abs/2403.16996 (2024) - [i245]Shilong Zhang, Lianghua Huang, Xi Chen, Yifei Zhang, Zhi-Fan Wu, Yutong Feng, Wei Wang, Yujun Shen, Yu Liu, Ping Luo:
FlashFace: Human Image Personalization with High-fidelity Identity Preservation. CoRR abs/2403.17008 (2024) - [i244]Shuo Liu, Kaining Ying, Hao Zhang, Yue Yang, Yuqi Lin, Tianle Zhang, Chuanhao Li, Yu Qiao, Ping Luo, Wenqi Shao, Kaipeng Zhang:
ConvBench: A Multi-Turn Conversation Evaluation Benchmark with Hierarchical Capability for Large Vision-Language Models. CoRR abs/2403.20194 (2024) - [i243]Haibao Yu, Wenxian Yang, Jiaru Zhong, Zhenwei Yang, Siqi Fan, Ping Luo, Zaiqing Nie:
End-to-End Autonomous Driving through V2X Cooperation. CoRR abs/2404.00717 (2024) - [i242]Lirui Zhao, Yue Yang, Kaipeng Zhang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Rongrong Ji:
DiffAgent: Fast and Accurate Text-to-Image API Selection with Large Language Model. CoRR abs/2404.01342 (2024) - [i241]Jiahao Wang, Wenqi Shao, Mengzhao Chen, Chengyue Wu, Yong Liu, Kaipeng Zhang, Songyang Zhang, Kai Chen, Ping Luo:
Adapting LLaMA Decoder to Vision Transformer. CoRR abs/2404.06773 (2024) - [i240]Kaining Ying, Fanqing Meng, Jin Wang, Zhiqian Li, Han Lin, Yue Yang, Hao Zhang, Wenbo Zhang, Yuqi Lin, Shuo Liu, Jiayi Lei, Quanfeng Lu, Runjian Chen, Peng Xu, Renrui Zhang, Haozhe Zhang, Peng Gao, Yali Wang, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMT-Bench: A Comprehensive Multimodal Benchmark for Evaluating Large Vision-Language Models Towards Multitask AGI. CoRR abs/2404.16006 (2024) - [i239]Sheng Jin, Ruijie Yao, Lumin Xu, Wentao Liu, Chen Qian, Ji Wu, Ping Luo:
UniFS: Universal Few-shot Instance Perception with Point Representations. CoRR abs/2404.19401 (2024) - [i238]Yao Lai, Jinxin Liu, David Z. Pan, Ping Luo:
Scalable and Effective Arithmetic Tree Generation for Adder and Multiplier Designs. CoRR abs/2405.06758 (2024) - [i237]Chengyue Wu, Yixiao Ge, Qiushan Guo, Jiahao Wang, Zhixuan Liang, Zeyu Lu, Ying Shan, Ping Luo:
Plot2Code: A Comprehensive Benchmark for Evaluating Multi-modal Large Language Models in Code Generation from Scientific Plots. CoRR abs/2405.07990 (2024) - [i236]Mengkang Hu, Haoyu Dong, Ping Luo, Shi Han, Dongmei Zhang:
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering. CoRR abs/2405.08099 (2024) - [i235]Chuanhao Li, Zhen Li, Chenchen Jing, Shuo Liu, Wenqi Shao, Yuwei Wu, Ping Luo, Yu Qiao, Kaipeng Zhang:
UDKAG: Augmenting Large Vision-Language Models with Up-to-Date Knowledge. CoRR abs/2405.14554 (2024) - [i234]Yao Lai, Sungyoung Lee, Guojin Chen, Souradip Poddar, Mengkang Hu, David Z. Pan, Ping Luo:
AnalogCoder: Analog Circuit Design via Training-Free Code Generation. CoRR abs/2405.14918 (2024) - [i233]Anran Liu, Cheng Lin, Yuan Liu, Xiaoxiao Long, Zhiyang Dou, Hao-Xiang Guo, Ping Luo, Wenping Wang:
Part123: Part-aware 3D Reconstruction from a Single-view Image. CoRR abs/2405.16888 (2024) - [i232]Jia Zeng, Qingwen Bu, Bangjun Wang, Wenke Xia, Li Chen, Hao Dong, Haoming Song, Dong Wang, Di Hu, Ping Luo, Heming Cui, Bin Zhao, Xuelong Li, Yu Qiao, Hongyang Li:
Learning Manipulation by Predicting Interaction. CoRR abs/2406.00439 (2024) - [i231]Peize Sun, Yi Jiang, Shoufa Chen, Shilong Zhang, Bingyue Peng, Ping Luo, Zehuan Yuan:
Autoregressive Model Beats Diffusion: Llama for Scalable Image Generation. CoRR abs/2406.06525 (2024) - [i230]Weiyun Wang, Shuibo Zhang, Yiming Ren, Yuchen Duan, Tiantong Li, Shuo Liu, Mengkang Hu, Zhe Chen, Kaipeng Zhang, Lewei Lu, Xizhou Zhu, Ping Luo, Yu Qiao, Jifeng Dai, Wenqi Shao, Wenhai Wang:
Needle In A Multimodal Haystack. CoRR abs/2406.07230 (2024) - [i229]Jiannan Wu, Muyan Zhong, Sen Xing, Zeqiang Lai, Zhaoyang Liu, Wenhai Wang, Zhe Chen, Xizhou Zhu, Lewei Lu, Tong Lu, Ping Luo, Yu Qiao, Jifeng Dai:
VisionLLM v2: An End-to-End Generalist Multimodal Large Language Model for Hundreds of Vision-Language Tasks. CoRR abs/2406.08394 (2024) - [i228]Quanfeng Lu, Wenqi Shao, Zitao Liu, Fanqing Meng, Boxuan Li, Botong Chen, Siyuan Huang, Kaipeng Zhang, Yu Qiao, Ping Luo:
GUI Odyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices. CoRR abs/2406.08451 (2024) - [i227]Tianle Zhang, Langtian Ma, Yuchen Yan, Yuchen Zhang, Kai Wang, Yue Yang, Ziyao Guo, Wenqi Shao, Yang You, Yu Qiao, Ping Luo, Kaipeng Zhang:
Rethinking Human Evaluation Protocol for Text-to-Video Models: Enhancing Reliability, Reproducibility, and Practicality. CoRR abs/2406.08845 (2024) - [i226]Zeyu Gao, Yao Mu, Jinye Qu, Mengkang Hu, Lingyue Guo, Ping Luo, Yanfeng Lu:
DAG-Plan: Generating Directed Acyclic Dependency Graphs for Dual-Arm Cooperative Planning. CoRR abs/2406.09953 (2024) - [i225]Fanqing Meng, Wenqi Shao, Lixin Luo, Yahong Wang, Yiran Chen, Quanfeng Lu, Yue Yang, Tianshuo Yang, Kaipeng Zhang, Yu Qiao, Ping Luo:
PhyBench: A Physical Commonsense Benchmark for Evaluating Text-to-Image Models. CoRR abs/2406.11802 (2024) - [i224]Yatai Ji, Shilong Zhang, Jie Wu, Peize Sun, Weifeng Chen, Xuefeng Xiao, Sidi Yang, Yujiu Yang, Ping Luo:
IDA-VLM: Towards Movie Understanding via ID-Aware Large Vision-Language Model. CoRR abs/2407.07577 (2024) - [i223]Yi Zhang, Wang Zeng, Sheng Jin, Chen Qian, Ping Luo, Wentao Liu:
When Pedestrian Detection Meets Multi-Modal Learning: Generalist Model and Benchmark Dataset. CoRR abs/2407.10125 (2024) - [i222]Mengzhao Chen, Wenqi Shao, Peng Xu, Jiahao Wang, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo:
EfficientQAT: Efficient Quantization-Aware Training for Large Language Models. CoRR abs/2407.11062 (2024) - [i221]Wang Zeng, Sheng Jin, Lumin Xu, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang:
TCFormer: Visual Recognition via Token Clustering Transformer. CoRR abs/2407.11321 (2024) - [i220]Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, Ping Luo:
Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts. CoRR abs/2407.11382 (2024) - [i219]Chaofan Tao, Qian Liu, Longxu Dou, Niklas Muennighoff, Zhongwei Wan, Ping Luo, Min Lin, Ngai Wong:
Scaling Laws with Vocabulary: Larger Models Deserve Larger Vocabularies. CoRR abs/2407.13623 (2024) - [i218]Lirui Zhao, Tianshuo Yang, Wenqi Shao, Yuxin Zhang, Yu Qiao, Ping Luo, Kaipeng Zhang, Rongrong Ji:
Diffree: Text-Guided Shape Free Object Inpainting with Diffusion Model. CoRR abs/2407.16982 (2024) - [i217]Fanqing Meng, Jin Wang, Chuanhao Li, Quanfeng Lu, Hao Tian, Jiaqi Liao, Xizhou Zhu, Jifeng Dai, Yu Qiao, Ping Luo, Kaipeng Zhang, Wenqi Shao:
MMIU: Multimodal Multi-image Understanding for Evaluating Large Vision-Language Models. CoRR abs/2408.02718 (2024) - [i216]Mengkang Hu, Tianxing Chen, Qiguang Chen, Yao Mu, Wenqi Shao, Ping Luo:
HiAgent: Hierarchical Working Memory Management for Solving Long-Horizon Agent Tasks with Large Language Model. CoRR abs/2408.09559 (2024) - [i215]Yangyang Xu, Wenqi Shao, Yong Du, Haiming Zhu, Yang Zhou, Ping Luo, Shengfeng He:
Task-Oriented Diffusion Inversion for High-Fidelity Text-based Editing. CoRR abs/2408.13395 (2024) - [i214]Yao Mu, Tianxing Chen, Shijia Peng, Zanxin Chen, Zeyu Gao, Yude Zou, Lunkai Lin, Zhiqiang Xie, Ping Luo:
RoboTwin: Dual-Arm Robot Benchmark with Generative Digital Twins (early version). CoRR abs/2409.02920 (2024) - [i213]Qingwen Bu, Jia Zeng, Li Chen, Yanchao Yang, Guyue Zhou, Junchi Yan, Ping Luo, Heming Cui, Yi Ma, Hongyang Li:
Closed-Loop Visuomotor Control with Generative Expectation for Robotic Manipulation. CoRR abs/2409.09016 (2024) - [i212]Xi Wang, Tianxing Chen, Qiaojun Yu, Tianling Xu, Zanxin Chen, Yiting Fu, Cewu Lu, Yao Mu, Ping Luo:
Articulated Object Manipulation using Online Axis Estimation with SAM2-Based Tracking. CoRR abs/2409.16287 (2024) - 2023
- [j26]Lumin Xu, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang:
ZoomNAS: Searching for Whole-Body Human Pose Estimation in the Wild. IEEE Trans. Pattern Anal. Mach. Intell. 45(4): 5296-5313 (2023) - [j25]Yiming Gao, Zhanghui Kuang, Guanbin Li, Ping Luo, Yimin Chen, Liang Lin, Wayne Zhang:
Fashion Retrieval via Graph Reasoning Networks on a Similarity Pyramid. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7019-7034 (2023) - [j24]Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo:
CycleMLP: A MLP-Like Architecture for Dense Visual Predictions. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 14284-14300 (2023) - [j23]Zhouxia Wang, Jiawei Zhang, Tianshui Chen, Wenping Wang, Ping Luo:
RestoreFormer++: Towards Real-World Blind Face Restoration From Undegraded Key-Value Pairs. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15462-15476 (2023) - [j22]Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Zehuan Yuan, Ping Luo:
Sparse R-CNN: An End-to-End Framework for Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 45(12): 15650-15664 (2023) - [j21]Qiang Zhai, Xin Li, Fan Yang, Zhicheng Jiao, Ping Luo, Hong Cheng, Zicheng Liu:
MGL: Mutual Graph Learning for Camouflaged Object Detection. IEEE Trans. Image Process. 32: 1897-1910 (2023) - [j20]Jie Zhu, Jiyang Qi, Mingyu Ding, Xiaokang Chen, Ping Luo, Xinggang Wang, Wenyu Liu, Leye Wang, Jingdong Wang:
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning. Trans. Mach. Learn. Res. 2023 (2023) - [j19]Hao Tan, Ran Cheng, Shihua Huang, Cheng He, Changxiao Qiu, Fan Yang, Ping Luo:
RelativeNAS: Relative Neural Architecture Search via Slow-Fast Learning. IEEE Trans. Neural Networks Learn. Syst. 34(1): 475-489 (2023) - [c167]Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Lanqing Li, Long-Kai Huang, Tingyang Xu, Yu Rong, Jie Ren, Ding Xue, Houtim Lai, Wei Liu, Junzhou Huang, Shuigeng Zhou, Ping Luo, Peilin Zhao, Yatao Bian:
DrugOOD: Out-of-Distribution Dataset Curator and Benchmark for AI-Aided Drug Discovery - a Focus on Affinity Prediction Problems with Noise Annotations. AAAI 2023: 8023-8031 - [c166]Chaofan Tao, Lu Hou, Haoli Bai, Jiansheng Wei, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong:
Structured Pruning for Efficient Generative Pre-trained Language Models. ACL (Findings) 2023: 10880-10895 - [c165]Haibao Yu, Wenxian Yang, Hongzhi Ruan, Zhenwei Yang, Yingjuan Tang, Xu Gao, Xin Hao, Yifeng Shi, Yifeng Pan, Ning Sun, Juan Song, Jirui Yuan, Ping Luo, Zaiqing Nie:
V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting. CVPR 2023: 5486-5495 - [c164]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC2: Emergent Communication for Embodied Control. CVPR 2023: 6704-6714 - [c163]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CVPR 2023: 7329-7338 - [c162]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective But Removing Token Mixer. CVPR 2023: 14443-14452 - [c161]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CVPR 2023: 14528-14539 - [c160]Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu:
Universal Instance Perception as Object Discovery and Retrieval. CVPR 2023: 15325-15336 - [c159]Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang:
Policy Adaptation from Foundation Model Feedback. CVPR 2023: 19059-19069 - [c158]Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge:
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge. CVPR 2023: 23079-23089 - [c157]Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo:
Accelerating Vision-Language Pretraining with Free Language Modeling. CVPR 2023: 23161-23170 - [c156]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Segment Every Reference Object in Spatial and Temporal Spaces. ICCV 2023: 2538-2550 - [c155]Yutao Hu, Qixiong Wang, Wenqi Shao, Enze Xie, Zhenguo Li, Jungong Han, Ping Luo:
Beyond One-to-One: Rethinking the Referring Image Segmentation. ICCV 2023: 4044-4054 - [c154]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Exploring Transformers for Open-world Instance Segmentation. ICCV 2023: 6588-6598 - [c153]Wenwen Tong, Chonghao Sima, Tai Wang, Li Chen, Silei Wu, Hanming Deng, Yi Gu, Lewei Lu, Ping Luo, Dahua Lin, Hongyang Li:
Scene as Occupancy. ICCV 2023: 8372-8381 - [c152]Chongjian Ge, Junsong Chen, Enze Xie, Zhongdao Wang, Lanqing Hong, Huchuan Lu, Zhenguo Li, Ping Luo:
MetaBEV: Solving Sensor Failures for 3D Detection and Map Segmentation. ICCV 2023: 8687-8697 - [c151]Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, Ping Luo:
RIGID: Recurrent GAN Inversion and Editing of Real Face Videos. ICCV 2023: 13645-13655 - [c150]Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan:
Going Denser with Open-Vocabulary Part Segmentation. ICCV 2023: 15407-15419 - [c149]Mengzhao Chen, Wenqi Shao, Peng Xu, Mingbao Lin, Kaipeng Zhang, Fei Chao, Rongrong Ji, Yu Qiao, Ping Luo:
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers. ICCV 2023: 17118-17128 - [c148]Shoufa Chen, Peize Sun, Yibing Song, Ping Luo:
DiffusionDet: Diffusion Model for Object Detection. ICCV 2023: 19773-19786 - [c147]Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo:
DDP: Diffusion Model for Dense Visual Prediction. ICCV 2023: 21684-21695 - [c146]Matej Kristan, Jirí Matas, Martin Danelljan, Michael Felsberg, Hyung Jin Chang, Luka Cehovin Zajc, Alan Lukezic, Ondrej Drbohlav, Zhongqun Zhang, Khanh-Tung Tran, Xuan-Son Vu, Johanna Björklund, Christoph Mayer, Yushan Zhang, Lei Ke, Jie Zhao, Gustavo Fernández, Noor Al-Shakarji, Dong An, Michael Arens, Stefan Becker, Goutam Bhat, Sebastian Bullinger, Antoni B. Chan, Shijie Chang, Hanyuan Chen, Xin Chen, Yan Chen, Zhenyu Chen, Yangming Cheng, Yutao Cui, Chunyuan Deng, Jiahua Dong, Matteo Dunnhofer, Wei Feng, Jianlong Fu, Jie Gao, Ruize Han, Zeqi Hao, Jun-Yan He, Keji He, Zhenyu He, Xiantao Hu, Kaer Huang, Yuqing Huang, Yi Jiang, Ben Kang, Jin-Peng Lan, Hyungjun Lee, Chenyang Li, Jiahao Li, Ning Li, Wangkai Li, Xiaodi Li, Xin Li, Pengyu Liu, Yue Liu, Huchuan Lu, Bin Luo, Ping Luo, Yinchao Ma, Deshui Miao, Christian Micheloni, Kannappan Palaniappan, Hancheol Park, Matthieu Paul, Houwen Peng, Zekun Qian, Gani Rahmon, Norbert Scherer-Negenborn, Pengcheng Shao, Wooksu Shin, Elham Soltani Kazemi, Tianhui Song, Rainer Stiefelhagen, Rui Sun, Chuanming Tang, Zhangyong Tang, Imad Eddine Toubal, Jack Valmadre, Joost van de Weijer, Luc Van Gool, Jash Vira, Stéphane Vujasinovic, Cheng Wan, Jia Wan, Dong Wang, Fei Wang, Feifan Wang, He Wang, Limin Wang, Song Wang, Yaowei Wang, Zhepeng Wang, Gangshan Wu, Jiannan Wu, Qiangqiang Wu, Xiaojun Wu, Anqi Xiao, Jinxia Xie, Chenlong Xu, Min Xu, Tianyang Xu, Yuanyou Xu, Bin Yan, Dawei Yang, Ming-Hsuan Yang, Tianyu Yang, Yi Yang, Zongxin Yang, Xuanwu Yin, Fisher Yu, Hongyuan Yu, Qianjin Yu, Weichen Yu, Yongsheng Yuan, Zehuan Yuan, Jianlin Zhang, Lu Zhang, Tianzhu Zhang, Guodongfang Zhao, Shaochuan Zhao, Yaozong Zheng, Bineng Zhong, Jiawen Zhu, Xuefeng Zhu, Yueting Zhuang, ChengAo Zong, Kunlong Zuo:
The First Visual Object Tracking Segmentation VOTS2023 Challenge Results. ICCV (Workshops) 2023: 1788-1810 - [c145]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Yu Qiao, Zhenguo Li, Ping Luo:
CO3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. ICLR 2023 - [c144]Chongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, Ping Luo:
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning. ICLR 2023 - [c143]Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai:
Learning Object-Language Alignments for Open-Vocabulary Object Detection. ICLR 2023 - [c142]Yao Lai, Jinxin Liu, Zhentao Tang, Bin Wang, Jianye Hao, Ping Luo:
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer. ICML 2023: 18346-18364 - [c141]Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo:
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners. ICML 2023: 20725-20745 - [c140]Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo:
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation. ICML 2023: 37713-37727 - [c139]Yao Mu, Zhiqian Lan, Chen Chen, Chang Liu, Ping Luo, Shengbo Eben Li:
Neural MPC-Based Decision-Making Framework for Autonomous Driving in Multi-Lane Roundabout. ITSC 2023: 5403-5409 - [c138]Fanqing Meng, Wenqi Shao, Zhanglin Peng, Chonghe Jiang, Kaipeng Zhang, Yu Qiao, Ping Luo:
Foundation Model is Efficient Multimodal Multitask Model Selector. NeurIPS 2023 - [c137]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. NeurIPS 2023 - [c136]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. NeurIPS 2023 - [c135]Huijie Wang, Tianyu Li, Yang Li, Li Chen, Chonghao Sima, Zhenbo Liu, Bangjun Wang, Peijin Jia, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Hongyang Li:
OpenLane-V2: A Topology Reasoning Benchmark for Unified 3D HD Mapping. NeurIPS 2023 - [c134]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Ping Luo, Zaiqing Nie:
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection. NeurIPS 2023 - [i211]Bin Huang, Yangguang Li, Enze Xie, Feng Liang, Luya Wang, Mingzhu Shen, Fenggang Liu, Tianqi Wang, Ping Luo, Jing Shao:
Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception. CoRR abs/2301.07870 (2023) - [i210]Jie Zhu, Jiyang Qi, Mingyu Ding, Xiaokang Chen, Ping Luo, Xinggang Wang, Wenyu Liu, Leye Wang, Jingdong Wang:
Understanding Self-Supervised Pretraining with Part-Aware Representation Learning. CoRR abs/2301.11915 (2023) - [i209]Zhixuan Liang, Yao Mu, Mingyu Ding, Fei Ni, Masayoshi Tomizuka, Ping Luo:
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners. CoRR abs/2302.01877 (2023) - [i208]Bin Yan, Yi Jiang, Jiannan Wu, Dong Wang, Ping Luo, Zehuan Yuan, Huchuan Lu:
Universal Instance Perception as Object Discovery and Retrieval. CoRR abs/2303.06674 (2023) - [i207]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Jirui Yuan, Ping Luo, Zaiqing Nie:
Vehicle-Infrastructure Cooperative 3D Object Detection via Feature Flow Prediction. CoRR abs/2303.10552 (2023) - [i206]Shilong Zhang, Xinjiang Wang, Jiaqi Wang, Jiangmiao Pang, Chengqi Lyu, Wenwei Zhang, Ping Luo, Kai Chen:
Dense Distinct Query for End-to-End Object Detection. CoRR abs/2303.12776 (2023) - [i205]Teng Wang, Yixiao Ge, Feng Zheng, Ran Cheng, Ying Shan, Xiaohu Qie, Ping Luo:
Accelerating Vision-Language Pretraining with Free Language Modeling. CoRR abs/2303.14038 (2023) - [i204]Chongjian Ge, Jiangliu Wang, Zhan Tong, Shoufa Chen, Yibing Song, Ping Luo:
Soft Neighbors are Positive Supporters in Contrastive Visual Representation Learning. CoRR abs/2303.17142 (2023) - [i203]Yuanfeng Ji, Zhe Chen, Enze Xie, Lanqing Hong, Xihui Liu, Zhaoqiang Liu, Tong Lu, Zhenguo Li, Ping Luo:
DDP: Diffusion Model for Dense Visual Prediction. CoRR abs/2303.17559 (2023) - [i202]Tianqi Wang, Sukmin Kim, Wenxuan Ji, Enze Xie, Chongjian Ge, Junsong Chen, Zhenguo Li, Ping Luo:
DeepAccident: A Motion and Accident Prediction Benchmark for V2X Autonomous Driving. CoRR abs/2304.01168 (2023) - [i201]Qiushan Guo, Yizhou Yu, Yi Jiang, Jiannan Wu, Zehuan Yuan, Ping Luo:
Multi-Level Contrastive Learning for Dense Prediction Task. CoRR abs/2304.02010 (2023) - [i200]Qiushan Guo, Chuofan Ma, Yi Jiang, Zehuan Yuan, Yizhou Yu, Ping Luo:
EGC: Image Generation and Classification via a Diffusion Energy-Based Model. CoRR abs/2304.02012 (2023) - [i199]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CoRR abs/2304.03282 (2023) - [i198]Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following. CoRR abs/2304.03767 (2023) - [i197]Tianyu Li, Li Chen, Xiangwei Geng, Huijie Wang, Yang Li, Zhenbo Liu, Shengyin Jiang, Yuting Wang, Hang Xu, Chunjing Xu, Feng Wen, Ping Luo, Junchi Yan, Wei Zhang, Xiaogang Wang, Yu Qiao, Hongyang Li:
Topology Reasoning for Driving Scenes. CoRR abs/2304.05277 (2023) - [i196]Jiahao Wang, Songyang Zhang, Yong Liu, Taiqiang Wu, Yujiu Yang, Xihui Liu, Kai Chen, Ping Luo, Dahua Lin:
RIFormer: Keep Your Vision Backbone Effective While Removing Token Mixer. CoRR abs/2304.05659 (2023) - [i195]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC^2: Emergent Communication for Embodied Control. CoRR abs/2304.09448 (2023) - [i194]Chongjian Ge, Junsong Chen, Enze Xie, Zhongdao Wang, Lanqing Hong, Huchuan Lu, Zhenguo Li, Ping Luo:
MetaBEV: Solving Sensor Failures for BEV Detection and Map Segmentation. CoRR abs/2304.09801 (2023) - [i193]Huijie Wang, Zhenbo Liu, Yang Li, Tianyu Li, Li Chen, Chonghao Sima, Yuting Wang, Shengyin Jiang, Feng Wen, Hang Xu, Ping Luo, Junchi Yan, Wei Zhang, Jun Yao, Yu Qiao, Hongyang Li:
Road Genome: A Topology Reasoning Benchmark for Scene Understanding in Autonomous Driving. CoRR abs/2304.10440 (2023) - [i192]Chengyue Wu, Teng Wang, Yixiao Ge, Zeyu Lu, Ruisong Zhou, Ying Shan, Ping Luo:
π-Tuning: Transferring Multimodal Foundation Models with Optimal Multi-task Interpolation. CoRR abs/2304.14381 (2023) - [i191]Tao Gong, Chengqi Lyu, Shilong Zhang, Yudong Wang, Miao Zheng, Qian Zhao, Kuikun Liu, Wenwei Zhang, Ping Luo, Kai Chen:
MultiModal-GPT: A Vision and Language Model for Dialogue with Humans. CoRR abs/2305.04790 (2023) - [i190]Zhaoyang Liu, Yinan He, Wenhai Wang, Weiyun Wang, Yi Wang, Shoufa Chen, Qinglong Zhang, Zeqiang Lai, Yang Yang, Qingyun Li, Jiashuo Yu, Kunchang Li, Zhe Chen, Xue Yang, Xizhou Zhu, Yali Wang, Limin Wang, Ping Luo, Jifeng Dai, Yu Qiao:
InternGPT: Solving Vision-Centric Tasks by Interacting with Chatbots Beyond Language. CoRR abs/2305.05662 (2023) - [i189]Haibao Yu, Wenxian Yang, Hongzhi Ruan, Zhenwei Yang, Yingjuan Tang, Xu Gao, Xin Hao, Yifeng Shi, Yifeng Pan, Ning Sun, Juan Song, Jirui Yuan, Ping Luo, Zaiqing Nie:
V2X-Seq: A Large-Scale Sequential Dataset for Vehicle-Infrastructure Cooperative Perception and Forecasting. CoRR abs/2305.05938 (2023) - [i188]Kunchang Li, Yinan He, Yi Wang, Yizhuo Li, Wenhai Wang, Ping Luo, Yali Wang, Limin Wang, Yu Qiao:
VideoChat: Chat-Centric Video Understanding. CoRR abs/2305.06355 (2023) - [i187]Peize Sun, Shoufa Chen, Chenchen Zhu, Fanyi Xiao, Ping Luo, Saining Xie, Zhicheng Yan:
Going Denser with Open-Vocabulary Part Segmentation. CoRR abs/2305.11173 (2023) - [i186]Wenhai Wang, Zhe Chen, Xiaokang Chen, Jiannan Wu, Xizhou Zhu, Gang Zeng, Ping Luo, Tong Lu, Jie Zhou, Yu Qiao, Jifeng Dai:
VisionLLM: Large Language Model is also an Open-Ended Decoder for Vision-Centric Tasks. CoRR abs/2305.11175 (2023) - [i185]Haoyu Lu, Guoxing Yang, Nanyi Fei, Yuqi Huo, Zhiwu Lu, Ping Luo, Mingyu Ding:
VDT: An Empirical Study on Video Diffusion with Transformers. CoRR abs/2305.13311 (2023) - [i184]Yao Mu, Qinglong Zhang, Mengkang Hu, Wenhai Wang, Mingyu Ding, Jun Jin, Bin Wang, Jifeng Dai, Yu Qiao, Ping Luo:
EmbodiedGPT: Vision-Language Pre-Training via Embodied Chain of Thought. CoRR abs/2305.15021 (2023) - [i183]Yuanfeng Ji, Yatao Bian, Guoji Fu, Peilin Zhao, Ping Luo:
SyNDock: N Rigid Protein Docking via Learnable Group Synchronization. CoRR abs/2305.15156 (2023) - [i182]Mengzhao Chen, Wenqi Shao, Peng Xu, Mingbao Lin, Kaipeng Zhang, Fei Chao, Rongrong Ji, Yu Qiao, Ping Luo:
DiffRate : Differentiable Compression Rate for Efficient Vision Transformers. CoRR abs/2305.17997 (2023) - [i181]Zeyue Xue, Guanglu Song, Qiushan Guo, Boxiao Liu, Zhuofan Zong, Yu Liu, Ping Luo:
RAPHAEL: Text-to-Image Generation via Large Mixture of Diffusion Paths. CoRR abs/2305.18295 (2023) - [i180]Chonghao Sima, Wenwen Tong, Tai Wang, Li Chen, Silei Wu, Hanming Deng, Yi Gu, Lewei Lu, Ping Luo, Dahua Lin, Hongyang Li:
Scene as Occupancy. CoRR abs/2306.02851 (2023) - [i179]Peng Xu, Wenqi Shao, Kaipeng Zhang, Peng Gao, Shuo Liu, Meng Lei, Fanqing Meng, Siyuan Huang, Yu Qiao, Ping Luo:
LVLM-eHub: A Comprehensive Evaluation Benchmark for Large Vision-Language Models. CoRR abs/2306.09265 (2023) - [i178]Yue Yang, Kaipeng Zhang, Yuying Ge, Wenqi Shao, Zeyue Xue, Yu Qiao, Ping Luo:
Align, Adapt and Inject: Sound-guided Unified Image Generation. CoRR abs/2306.11504 (2023) - [i177]Yao Lai, Jinxin Liu, Zhentao Tang, Bin Wang, Jianye Hao, Ping Luo:
ChiPFormer: Transferable Chip Placement via Offline Decision Transformer. CoRR abs/2306.14744 (2023) - [i176]Shilong Zhang, Peize Sun, Shoufa Chen, Min Xiao, Wenqi Shao, Wenwei Zhang, Kai Chen, Ping Luo:
GPT4RoI: Instruction Tuning Large Language Model on Region-of-Interest. CoRR abs/2307.03601 (2023) - [i175]Yi Wang, Yinan He, Yizhuo Li, Kunchang Li, Jiashuo Yu, Xin Ma, Xinyuan Chen, Yaohui Wang, Ping Luo, Ziwei Liu, Yali Wang, Limin Wang, Yu Qiao:
InternVid: A Large-scale Video-Text Dataset for Multimodal Understanding and Generation. CoRR abs/2307.06942 (2023) - [i174]Wenqi Shao, Yutao Hu, Peng Gao, Meng Lei, Kaipeng Zhang, Fanqing Meng, Peng Xu, Siyuan Huang, Hongsheng Li, Yu Qiao, Ping Luo:
Tiny LVLM-eHub: Early Multimodal Experiments with Bard. CoRR abs/2308.03729 (2023) - [i173]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
Exploring Transformers for Open-world Instance Segmentation. CoRR abs/2308.04206 (2023) - [i172]Yangyang Xu, Shengfeng He, Kwan-Yee K. Wong, Ping Luo:
RIGID: Recurrent GAN Inversion and Editing of Real Face Videos. CoRR abs/2308.06097 (2023) - [i171]Fanqing Meng, Wenqi Shao, Zhanglin Peng, Chonghe Jiang, Kaipeng Zhang, Yu Qiao, Ping Luo:
Foundation Model is Efficient Multimodal Multitask Model Selector. CoRR abs/2308.06262 (2023) - [i170]Zhouxia Wang, Jiawei Zhang, Tianshui Chen, Wenping Wang, Ping Luo:
RestoreFormer++: Towards Real-World Blind Face Restoration from Undegraded Key-Value Pairs. CoRR abs/2308.07228 (2023) - [i169]Wenqi Shao, Mengzhao Chen, Zhaoyang Zhang, Peng Xu, Lirui Zhao, Zhiqian Li, Kaipeng Zhang, Peng Gao, Yu Qiao, Ping Luo:
OmniQuant: Omnidirectionally Calibrated Quantization for Large Language Models. CoRR abs/2308.13137 (2023) - [i168]Yutao Hu, Qixiong Wang, Wenqi Shao, Enze Xie, Zhenguo Li, Jungong Han, Ping Luo:
Beyond One-to-One: Rethinking the Referring Image Segmentation. CoRR abs/2308.13853 (2023) - [i167]Ruijie Yao, Sheng Jin, Lumin Xu, Wang Zeng, Wentao Liu, Chen Qian, Ping Luo, Ji Wu:
GKGNet: Group K-Nearest Neighbor based Graph Convolutional Network for Multi-Label Image Recognition. CoRR abs/2308.14378 (2023) - [i166]Zhouxia Wang, Xintao Wang, Liangbin Xie, Zhongang Qi, Ying Shan, Wenping Wang, Ping Luo:
StyleAdapter: A Single-Pass LoRA-Free Model for Stylized Image Generation. CoRR abs/2309.01770 (2023) - [i165]Xiangchao Yan, Runjian Chen, Bo Zhang, Jiakang Yuan, Xinyu Cai, Botian Shi, Wenqi Shao, Junchi Yan, Ping Luo, Yu Qiao:
SPOT: Scalable 3D Pre-training via Occupancy Prediction for Autonomous Driving. CoRR abs/2309.10527 (2023) - [i164]Junsong Chen, Jincheng Yu, Chongjian Ge, Lewei Yao, Enze Xie, Yue Wu, Zhongdao Wang, James T. Kwok, Ping Luo, Huchuan Lu, Zhenguo Li:
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis. CoRR abs/2310.00426 (2023) - [i163]Hao Sha, Yao Mu, Yuxuan Jiang, Li Chen, Chenfeng Xu, Ping Luo, Shengbo Eben Li, Masayoshi Tomizuka, Wei Zhan, Mingyu Ding:
LanguageMPC: Large Language Models as Decision Makers for Autonomous Driving. CoRR abs/2310.03026 (2023) - [i162]Hao Zhang, Kaipeng Zhang, Lumin Xu, Shenqi Lai, Wenqi Shao, Nanning Zheng, Ping Luo, Yu Qiao:
Language-driven Open-Vocabulary Keypoint Detection for Animal Body and Face. CoRR abs/2310.05056 (2023) - [i161]Zhixuan Liang, Xingyu Zeng, Rui Zhao, Ping Luo:
MeanAP-Guided Reinforced Active Learning for Object Detection. CoRR abs/2310.08387 (2023) - [i160]Mengkang Hu, Yao Mu, Xinmiao Yu, Mingyu Ding, Shiguang Wu, Wenqi Shao, Qiguang Chen, Bin Wang, Yu Qiao, Ping Luo:
Tree-Planner: Efficient Close-loop Task Planning with Large Language Models. CoRR abs/2310.08582 (2023) - [i159]Yizhuo Li, Kunchang Li, Yinan He, Yi Wang, Yali Wang, Limin Wang, Yu Qiao, Ping Luo:
Harvest Video Foundation Models via Efficient Post-Pretraining. CoRR abs/2310.19554 (2023) - [i158]Haibao Yu, Yingjuan Tang, Enze Xie, Jilei Mao, Ping Luo, Zaiqing Nie:
Flow-Based Feature Fusion for Vehicle-Infrastructure Cooperative 3D Object Detection. CoRR abs/2311.01682 (2023) - [i157]Yangyang Xu, Shengfeng He, Wenqi Shao, Kwan-Yee K. Wong, Yu Qiao, Ping Luo:
DiffusionMat: Alpha Matting as Sequential Refinement Learning. CoRR abs/2311.13535 (2023) - [i156]Yuanfeng Ji, Chongjian Ge, Weikai Kong, Enze Xie, Zhengying Liu, Zhengguo Li, Ping Luo:
Large Language Models as Automated Aligners for benchmarking Vision-Language Models. CoRR abs/2311.14580 (2023) - [i155]Kunchang Li, Yali Wang, Yinan He, Yizhuo Li, Yi Wang, Yi Liu, Zun Wang, Jilan Xu, Guo Chen, Ping Luo, Limin Wang, Yu Qiao:
MVBench: A Comprehensive Multi-modal Video Understanding Benchmark. CoRR abs/2311.17005 (2023) - [i154]Yanqing Liu, Kai Wang, Wenqi Shao, Ping Luo, Yu Qiao, Mike Zheng Shou, Kaipeng Zhang, Yang You:
MLLMs-Augmented Visual-Language Representation Learning. CoRR abs/2311.18765 (2023) - [i153]Zhouxia Wang, Ziyang Yuan, Xintao Wang, Tianshui Chen, Menghan Xia, Ping Luo, Ying Shan:
MotionCtrl: A Unified and Flexible Motion Controller for Video Generation. CoRR abs/2312.03641 (2023) - [i152]Shoufa Chen, Mengmeng Xu, Jiawei Ren, Yuren Cong, Sen He, Yanping Xie, Animesh Sinha, Ping Luo, Tao Xiang, Juan-Manuel Pérez-Rúa:
GenTron: Delving Deep into Diffusion Transformers for Image and Video Generation. CoRR abs/2312.04557 (2023) - [i151]Sheng Jin, Shuhuai Li, Tong Li, Wentao Liu, Chen Qian, Ping Luo:
You Only Learn One Query: Learning Unified Human Query for Single-Stage Multi-Person Multi-Task Human-Centric Perception. CoRR abs/2312.05525 (2023) - [i150]Jiankai Sun, Chuanyang Zheng, Enze Xie, Zhengying Liu, Ruihang Chu, Jianing Qiu, Jiaqi Xu, Mingyu Ding, Hongyang Li, Mengzhe Geng, Yue Wu, Wenhai Wang, Junsong Chen, Zhangyue Yin, Xiaozhe Ren, Jie Fu, Junxian He, Wu Yuan, Qi Liu, Xihui Liu, Yu Li, Hao Dong, Yu Cheng, Ming Zhang, Pheng-Ann Heng, Jifeng Dai, Ping Luo, Jingdong Wang, Ji-Rong Wen, Xipeng Qiu, Yike Guo, Hui Xiong, Qun Liu, Zhenguo Li:
A Survey of Reasoning with Foundation Models. CoRR abs/2312.11562 (2023) - [i149]Zhixuan Liang, Yao Mu, Hengbo Ma, Masayoshi Tomizuka, Mingyu Ding, Ping Luo:
SkillDiffuser: Interpretable Hierarchical Planning via Skill Abstractions in Diffusion-Based Task Execution. CoRR abs/2312.11598 (2023) - [i148]Chonghao Sima, Katrin Renz, Kashyap Chitta, Li Chen, Hanxue Zhang, Chengen Xie, Ping Luo, Andreas Geiger, Hongyang Li:
DriveLM: Driving with Graph Visual Question Answering. CoRR abs/2312.14150 (2023) - [i147]Zhe Chen, Jiannan Wu, Wenhai Wang, Weijie Su, Guo Chen, Sen Xing, Muyan Zhong, Qinglong Zhang, Xizhou Zhu, Lewei Lu, Bin Li, Ping Luo, Tong Lu, Yu Qiao, Jifeng Dai:
InternVL: Scaling up Vision Foundation Models and Aligning for Generic Visual-Linguistic Tasks. CoRR abs/2312.14238 (2023) - [i146]Jiannan Wu, Yi Jiang, Bin Yan, Huchuan Lu, Zehuan Yuan, Ping Luo:
UniRef++: Segment Every Reference Object in Spatial and Temporal Spaces. CoRR abs/2312.15715 (2023) - 2022
- [j18]Jieren Cheng, Xinzhi Yao, Hui Li, Hao Lu, Naixue Xiong, Ping Luo, Le Liu, Hao Guo, Wen Feng:
Cooperative Detection Method for DDoS Attacks Based on Blockchain. Comput. Syst. Sci. Eng. 43(1): 103-117 (2022) - [j17]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
PVT v2: Improved baselines with Pyramid Vision Transformer. Comput. Vis. Media 8(3): 415-424 (2022) - [j16]Jieren Cheng, Ping Luo, Naixue Xiong, Jie Wu:
AAFL: Asynchronous-Adaptive Federated Learning in Edge-Based Wireless Communication Systems for Countering Communicable Infectious Diseasess. IEEE J. Sel. Areas Commun. 40(11): 3172-3190 (2022) - [j15]Enze Xie, Wenhai Wang, Mingyu Ding, Ruimao Zhang, Ping Luo:
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond. IEEE Trans. Pattern Anal. Mach. Intell. 44(9): 5385-5400 (2022) - [j14]Xingang Pan, Xiaohang Zhan, Bo Dai, Dahua Lin, Chen Change Loy, Ping Luo:
Exploiting Deep Generative Prior for Versatile Image Restoration and Manipulation. IEEE Trans. Pattern Anal. Mach. Intell. 44(11): 7474-7489 (2022) - [j13]Yuying Ge, Ruimao Zhang, Ping Luo:
MetaCloth: Learning Unseen Tasks of Dense Fashion Landmark Detection From a Few Samples. IEEE Trans. Image Process. 31: 1120-1133 (2022) - [j12]Shixiong Zhao, Fanxin Li, Xusheng Chen, Xiuxian Guan, Jianyu Jiang, Dong Huang, Yuhao Qing, Sen Wang, Peng Wang, Gong Zhang, Cheng Li, Ping Luo, Heming Cui:
vPipe: A Virtualized Acceleration System for Achieving Efficient and Scalable Pipeline Parallel DNN Training. IEEE Trans. Parallel Distributed Syst. 33(3): 489-506 (2022) - [c133]Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo:
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization. AAAI 2022: 393-400 - [c132]Chaofan Tao, Lu Hou, Wei Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong:
Compression of Generative Pre-trained Language Models via Quantization. ACL (1) 2022: 4821-4836 - [c131]Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following. CoRL 2022: 1743-1754 - [c130]Zhiqi Li, Wenhai Wang, Enze Xie, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo, Tong Lu:
Panoptic SegFormer: Delving Deeper into Panoptic Segmentation with Transformers. CVPR 2022: 1270-1279 - [c129]Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo:
Language as Queries for Referring Video Object Segmentation. CVPR 2022: 4964-4974 - [c128]Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang:
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer. CVPR 2022: 11091-11101 - [c127]Qiushan Guo, Yao Mu, Jianyu Chen, Tianqi Wang, Yizhou Yu, Ping Luo:
Scale-Equivalent Distillation for Semi-Supervised Object Detection. CVPR 2022: 14502-14511 - [c126]Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo:
Bridging Video-text Retrieval with Multiple Choice Questions. CVPR 2022: 16146-16155 - [c125]Zhouxia Wang, Jiawei Zhang, Runjian Chen, Wenping Wang, Ping Luo:
RestoreFormer: High-Quality Blind Face Restoration from Undegraded Key-Value Pairs. CVPR 2022: 17491-17500 - [c124]Peize Sun, Jinkun Cao, Yi Jiang, Zehuan Yuan, Song Bai, Kris Kitani, Ping Luo:
DanceTrack: Multi-Object Tracking in Uniform Appearance and Diverse Motion. CVPR 2022: 20961-20970 - [c123]Yifu Zhang, Peize Sun, Yi Jiang, Dongdong Yu, Fucheng Weng, Zehuan Yuan, Ping Luo, Wenyu Liu, Xinggang Wang:
ByteTrack: Multi-object Tracking by Associating Every Detection Box. ECCV (22) 2022: 1-21 - [c122]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. ECCV (24) 2022: 74-92 - [c121]Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo:
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space. ECCV (34) 2022: 286-302 - [c120]Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo:
3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal. ECCV (6) 2022: 380-397 - [c119]Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang:
Pose for Everything: Towards Category-Agnostic Pose Estimation. ECCV (6) 2022: 398-416 - [c118]Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu:
PoseTrans: A Simple yet Effective Pose Transformation Augmentation for Human Pose Estimation. ECCV (5) 2022: 643-659 - [c117]Yuying Ge, Yixiao Ge, Xihui Liu, Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo:
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-Text Retrieval. ECCV (35) 2022: 691-708 - [c116]Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu:
Towards Grand Unification of Object Tracking. ECCV (21) 2022: 733-751 - [c115]Weijia Wu, Enze Xie, Ruimao Zhang, Wenhai Wang, Ping Luo, Hong Zhou:
Polygon-Free: Unconstrained Scene Text Detection with Box Annotations. ICIP 2022: 1226-1230 - [c114]Shoufa Chen, Enze Xie, Chongjian Ge, Runjian Chen, Ding Liang, Ping Luo:
CycleMLP: A MLP-like Architecture for Dense Prediction. ICLR 2022 - [c113]Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo:
Learning Versatile Neural Architectures by Propagating Network Codes. ICLR 2022 - [c112]Wenqi Shao, Yixiao Ge, Zhaoyang Zhang, Xuyuan Xu, Xiaogang Wang, Ying Shan, Ping Luo:
Dynamic Token Normalization improves Vision Transformers. ICLR 2022 - [c111]Can Wang, Sheng Jin, Yingda Guan, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang:
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization. ICLR 2022 - [c110]Shuo Yang, Peize Sun, Yi Jiang, Xiaobo Xia, Ruiheng Zhang, Zehuan Yuan, Changhu Wang, Ping Luo, Min Xu:
Objects in Semantic Topology. ICLR 2022 - [c109]Xiaoyu Chen, Yao Mark Mu, Ping Luo, Shengbo Li, Jianyu Chen:
Flow-based Recurrent Belief State Learning for POMDPs. ICML 2022: 3444-3468 - [c108]Yao Mark Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo:
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer. ICML 2022: 16043-16061 - [c107]Teng Wang, Wenhao Jiang, Zhichao Lu, Feng Zheng, Ran Cheng, Chengguo Yin, Ping Luo:
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix. ICML 2022: 22680-22690 - [c106]Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang, Ping Luo, Huazhe Xu:
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning. IJCAI 2022: 3702-3708 - [c105]Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo:
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition. NeurIPS 2022 - [c104]Yuanfeng Ji, Haotian Bai, Chongjian Ge, Jie Yang, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping Luo:
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation. NeurIPS 2022 - [c103]Yao Lai, Yao Mu, Ping Luo:
MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning. NeurIPS 2022 - [c102]Chuofan Ma, Qiushan Guo, Yi Jiang, Ping Luo, Zehuan Yuan, Xiaojuan Qi:
Rethinking Resolution in the Context of Efficient Video Recognition. NeurIPS 2022 - [c101]Yao Mu, Yuzheng Zhuang, Fei Ni, Bin Wang, Jianyu Chen, Jianye Hao, Ping Luo:
DOMINO: Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning. NeurIPS 2022 - [c100]Zeyue Xue, Jianming Liang, Guanglu Song, Zhuofan Zong, Liang Chen, Yu Liu, Ping Luo:
Large-batch Optimization for Dense Visual Predictions: Training Faster R-CNN in 4.2 Minutes. NeurIPS 2022 - [i145]Jiannan Wu, Yi Jiang, Peize Sun, Zehuan Yuan, Ping Luo:
Language as Queries for Referring Video Object Segmentation. CoRR abs/2201.00487 (2022) - [i144]Yuying Ge, Yixiao Ge, Xihui Liu, Dian Li, Ying Shan, Xiaohu Qie, Ping Luo:
BridgeFormer: Bridging Video-text Retrieval with Multiple Choice Questions. CoRR abs/2201.04850 (2022) - [i143]Yuying Ge, Yibing Song, Ruimao Zhang, Ping Luo:
MetaDance: Few-shot Dancing Video Retargeting via Temporal-aware Meta-learning. CoRR abs/2201.04851 (2022) - [i142]Zhouxia Wang, Jiawei Zhang, Runjian Chen, Wenping Wang, Ping Luo:
RestoreFormer: High-Quality Blind Face Restoration From Undegraded Key-Value Pairs. CoRR abs/2201.06374 (2022) - [i141]Can Wang, Sheng Jin, Yingda Guan, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang:
Pseudo-Labeled Auto-Curriculum Learning for Semi-Supervised Keypoint Localization. CoRR abs/2201.08613 (2022) - [i140]Yuanfeng Ji, Lu Zhang, Jiaxiang Wu, Bingzhe Wu, Long-Kai Huang, Tingyang Xu, Yu Rong, Lanqing Li, Jie Ren, Ding Xue, Houtim Lai, Shaoyong Xu, Jing Feng, Wei Liu, Ping Luo, Shuigeng Zhou, Junzhou Huang, Peilin Zhao, Yatao Bian:
DrugOOD: Out-of-Distribution (OOD) Dataset Curator and Benchmark for AI-aided Drug Discovery - A Focus on Affinity Prediction Problems with Noise Annotations. CoRR abs/2201.09637 (2022) - [i139]Xiaokang Chen, Mingyu Ding, Xiaodi Wang, Ying Xin, Shentong Mo, Yunhao Wang, Shumin Han, Ping Luo, Gang Zeng, Jingdong Wang:
Context Autoencoder for Self-Supervised Representation Learning. CoRR abs/2202.03026 (2022) - [i138]Zhecheng Yuan, Guozheng Ma, Yao Mu, Bo Xia, Bo Yuan, Xueqian Wang, Ping Luo, Huazhe Xu:
Don't Touch What Matters: Task-Aware Lipschitz Data Augmentation for Visual Reinforcement Learning. CoRR abs/2202.09982 (2022) - [i137]Chunmeng Liu, Enze Xie, Wenjia Wang, Wenhai Wang, Guangyao Li, Ping Luo:
WegFormer: Transformers for Weakly Supervised Semantic Segmentation. CoRR abs/2203.08421 (2022) - [i136]Weijia Wu, Debing Zhang, Ying Fu, Chunhua Shen, Hong Zhou, Yuanqiang Cai, Ping Luo:
End-to-End Video Text Spotting with Transformer. CoRR abs/2203.10539 (2022) - [i135]Chaofan Tao, Lu Hou, Wei Zhang, Lifeng Shang, Xin Jiang, Qun Liu, Ping Luo, Ngai Wong:
Compression of Generative Pre-trained Language Models via Quantization. CoRR abs/2203.10705 (2022) - [i134]Qiushan Guo, Yao Mu, Jianyu Chen, Tianqi Wang, Yizhou Yu, Ping Luo:
Scale-Equivalent Distillation for Semi-Supervised Object Detection. CoRR abs/2203.12244 (2022) - [i133]Mingyu Ding, Bin Xiao, Noel Codella, Ping Luo, Jingdong Wang, Lu Yuan:
DaViT: Dual Attention Vision Transformers. CoRR abs/2204.03645 (2022) - [i132]Enze Xie, Zhiding Yu, Daquan Zhou, Jonah Philion, Anima Anandkumar, Sanja Fidler, Ping Luo, José M. Álvarez:
M2BEV: Multi-Camera Joint 3D Detection and Segmentation with Unified Birds-Eye View Representation. CoRR abs/2204.05088 (2022) - [i131]Teng Wang, Zhu Liu, Feng Zheng, Zhichao Lu, Ran Cheng, Ping Luo:
Semantic-Aware Pretraining for Dense Video Captioning. CoRR abs/2204.07449 (2022) - [i130]Wang Zeng, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang:
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer. CoRR abs/2204.08680 (2022) - [i129]Yuying Ge, Yixiao Ge, Xihui Liu, Alex Jinpeng Wang, Jianping Wu, Ying Shan, Xiaohu Qie, Ping Luo:
MILES: Visual BERT Pre-training with Injected Language Semantics for Video-text Retrieval. CoRR abs/2204.12408 (2022) - [i128]Xiaoyu Chen, Yao Mu, Ping Luo, Shengbo Li, Jianyu Chen:
Flow-based Recurrent Belief State Learning for POMDPs. CoRR abs/2205.11051 (2022) - [i127]Shoufa Chen, Chongjian Ge, Zhan Tong, Jiangliu Wang, Yibing Song, Jue Wang, Ping Luo:
AdaptFormer: Adapting Vision Transformers for Scalable Visual Recognition. CoRR abs/2205.13535 (2022) - [i126]Runjian Chen, Yao Mu, Runsen Xu, Wenqi Shao, Chenhan Jiang, Hang Xu, Zhenguo Li, Ping Luo:
CO^3: Cooperative Unsupervised 3D Representation Learning for Autonomous Driving. CoRR abs/2206.04028 (2022) - [i125]Yuanfeng Ji, Haotian Bai, Jie Yang, Chongjian Ge, Ye Zhu, Ruimao Zhang, Zhen Li, Lingyan Zhang, Wanling Ma, Xiang Wan, Ping Luo:
AMOS: A Large-Scale Abdominal Multi-Organ Benchmark for Versatile Medical Image Segmentation. CoRR abs/2206.08023 (2022) - [i124]Yao Mu, Shoufa Chen, Mingyu Ding, Jianyu Chen, Runjian Chen, Ping Luo:
CtrlFormer: Learning Transferable State Representation for Visual Control via Transformer. CoRR abs/2206.08883 (2022) - [i123]Teng Wang, Wenhao Jiang, Zhichao Lu, Feng Zheng, Ran Cheng, Chengguo Yin, Ping Luo:
VLMixer: Unpaired Vision-Language Pre-training via Cross-Modal CutMix. CoRR abs/2206.08919 (2022) - [i122]Jinrui Zhang, Teng Wang, Feng Zheng, Ran Cheng, Ping Luo:
Exploiting Context Information for Generic Event Boundary Captioning. CoRR abs/2207.01050 (2022) - [i121]Wenqi Shao, Xun Zhao, Yixiao Ge, Zhaoyang Zhang, Lei Yang, Xiaogang Wang, Ying Shan, Ping Luo:
Not All Models Are Equal: Predicting Model Transferability in a Self-challenging Fisher Space. CoRR abs/2207.03036 (2022) - [i120]Bin Yan, Yi Jiang, Peize Sun, Dong Wang, Zehuan Yuan, Ping Luo, Huchuan Lu:
Towards Grand Unification of Object Tracking. CoRR abs/2207.07078 (2022) - [i119]Lumin Xu, Sheng Jin, Wang Zeng, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang:
Pose for Everything: Towards Category-Agnostic Pose Estimation. CoRR abs/2207.10387 (2022) - [i118]Hao Meng, Sheng Jin, Wentao Liu, Chen Qian, Mengxiang Lin, Wanli Ouyang, Ping Luo:
3D Interacting Hand Pose Estimation by Hand De-occlusion and Removal. CoRR abs/2207.11061 (2022) - [i117]Wentao Jiang, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Si Liu:
PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation. CoRR abs/2208.07755 (2022) - [i116]Lumin Xu, Sheng Jin, Wentao Liu, Chen Qian, Wanli Ouyang, Ping Luo, Xiaogang Wang:
ZoomNAS: Searching for Whole-body Human Pose Estimation in the Wild. CoRR abs/2208.11547 (2022) - [i115]Hongyang Li, Chonghao Sima, Jifeng Dai, Wenhai Wang, Lewei Lu, Huijie Wang, Enze Xie, Zhiqi Li, Hanming Deng, Hao Tian, Xizhou Zhu, Li Chen, Yulu Gao, Xiangwei Geng, Jia Zeng, Yang Li, Jiazhi Yang, Xiaosong Jia, Bohan Yu, Yu Qiao, Dahua Lin, Si Liu, Junchi Yan, Jianping Shi, Ping Luo:
Delving into the Devils of Bird's-eye-view Perception: A Review, Evaluation and Recipe. CoRR abs/2209.05324 (2022) - [i114]Chuofan Ma, Qiushan Guo, Yi Jiang, Zehuan Yuan, Ping Luo, Xiaojuan Qi:
Rethinking Resolution in the Context of Efficient Video Recognition. CoRR abs/2209.12797 (2022) - [i113]Ping Luo, Jieren Cheng, Zhenhao Liu, Naixue Xiong, Jie Wu:
FedVeca: Federated Vectorized Averaging on Non-IID Data with Adaptive Bi-directional Global Objective. CoRR abs/2209.13803 (2022) - [i112]Ziyun Zeng, Yuying Ge, Xihui Liu, Bin Chen, Ping Luo, Shu-Tao Xia, Yixiao Ge:
Learning Transferable Spatiotemporal Representations from Natural Script Knowledge. CoRR abs/2209.15280 (2022) - [i111]Zeyu Gao, Yao Mu, Ruoyan Shen, Chen Chen, Yangang Ren, Jianyu Chen, Shengbo Eben Li, Ping Luo, Yanfeng Lu:
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model. CoRR abs/2210.04017 (2022) - [i110]Yao Mu, Yuzheng Zhuang, Fei Ni, Bin Wang, Jianyu Chen, Jianye Hao, Ping Luo:
Decomposed Mutual Information Optimization for Generalized Context in Meta-Reinforcement Learning. CoRR abs/2210.04209 (2022) - [i109]Zeyue Xue, Jianming Liang, Guanglu Song, Zhuofan Zong, Liang Chen, Yu Liu, Ping Luo:
Large-batch Optimization for Dense Visual Predictions. CoRR abs/2210.11078 (2022) - [i108]Shoufa Chen, Peize Sun, Yibing Song, Ping Luo:
DiffusionDet: Diffusion Model for Object Detection. CoRR abs/2211.09788 (2022) - [i107]Junjie Wang, Yao Mu, Dong Li, Qichao Zhang, Dongbin Zhao, Yuzheng Zhuang, Ping Luo, Bin Wang, Jianye Hao:
Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning. CoRR abs/2211.12774 (2022) - [i106]Yao Lai, Yao Mu, Ping Luo:
MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning. CoRR abs/2211.13382 (2022) - [i105]Chuang Lin, Peize Sun, Yi Jiang, Ping Luo, Lizhen Qu, Gholamreza Haffari, Zehuan Yuan, Jianfei Cai:
Learning Object-Language Alignments for Open-Vocabulary Object Detection. CoRR abs/2211.14843 (2022) - [i104]Yuying Ge, Annabella Macaluso, Li Erran Li, Ping Luo, Xiaolong Wang:
Self-Play and Self-Describe: Policy Adaptation with Vision-Language Foundation Models. CoRR abs/2212.07398 (2022) - 2021
- [j11]Ping Luo, Ruimao Zhang, Jiamin Ren, Zhanglin Peng, Jingyu Li:
Switchable Normalization for Learning-to-Normalize Deep Representation. IEEE Trans. Pattern Anal. Mach. Intell. 43(2): 712-728 (2021) - [c99]Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo:
HR-NAS: Searching Efficient High-Resolution Neural Architectures With Lightweight Transformers. CVPR 2021: 2982-2992 - [c98]Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo:
Parser-Free Virtual Try-On via Distilling Appearance Flows. CVPR 2021: 8485-8493 - [c97]Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo:
When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks. CVPR 2021: 11855-11864 - [c96]Peize Sun, Rufeng Zhang, Yi Jiang, Tao Kong, Chenfeng Xu, Wei Zhan, Masayoshi Tomizuka, Lei Li, Zehuan Yuan, Changhu Wang, Ping Luo:
Sparse R-CNN: End-to-End Object Detection With Learnable Proposals. CVPR 2021: 14454-14463 - [c95]Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang:
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search. CVPR 2021: 16072-16081 - [c94]Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo:
Disentangled Cycle Consistency for Highly-Realistic Virtual Try-On. CVPR 2021: 16928-16937 - [c93]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. ICCV 2021: 548-558 - [c92]Zhaoyang Zhang, Yitong Jiang, Jun Jiang, Xiaogang Wang, Ping Luo, Jinwei Gu:
STAR: A Structure-aware Lightweight Transformer for Real-time Image Enhancement. ICCV 2021: 4086-4095 - [c91]Wei Shang, Dongwei Ren, Dongqing Zou, Jimmy S. Ren, Ping Luo, Wangmeng Zuo:
Bringing Events into Video Deblurring with Non-consecutively Blurry Frames. ICCV 2021: 4511-4520 - [c90]Teng Wang, Ruimao Zhang, Zhichao Lu, Feng Zheng, Ran Cheng, Ping Luo:
End-to-End Dense Video Captioning with Parallel Decoding. ICCV 2021: 6827-6837 - [c89]Shoufa Chen, Peize Sun, Enze Xie, Chongjian Ge, Jiannan Wu, Lan Ma, Jiajun Shen, Ping Luo:
Watch Only Once: An End-to-End Video Action Detection Framework. ICCV 2021: 8158-8167 - [c88]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Peize Sun, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. ICCV 2021: 8372-8381 - [c87]Muhammad Awais, Fengwei Zhou, Hang Xu, Lanqing Hong, Ping Luo, Sung-Ho Bae, Zhenguo Li:
Adversarial Robustness for Unsupervised Domain Adaptation. ICCV 2021: 8548-8557 - [c86]Xingang Pan, Bo Dai, Ziwei Liu, Chen Change Loy, Ping Luo:
Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs. ICLR 2021 - [c85]Peize Sun, Yi Jiang, Enze Xie, Wenqi Shao, Zehuan Yuan, Changhu Wang, Ping Luo:
What Makes for End-to-End Object Detection? ICML 2021: 9934-9944 - [c84]Zhaoyang Zhang, Wenqi Shao, Jinwei Gu, Xiaogang Wang, Ping Luo:
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution. ICML 2021: 12546-12556 - [c83]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Segmenting Transparent Objects in the Wild with Transformer. IJCAI 2021: 1194-1200 - [c82]Lingyun Wu, Zhiqiang Hu, Yuanfeng Ji, Ping Luo, Shaoting Zhang:
Multi-frame Collaboration for Effective Endoscopic Video Polyp Detection via Spatial-Temporal Feature Transformation. MICCAI (5) 2021: 302-312 - [c81]Yuanfeng Ji, Ruimao Zhang, Huijie Wang, Zhen Li, Lingyun Wu, Shaoting Zhang, Ping Luo:
Multi-compound Transformer for Accurate Biomedical Image Segmentation. MICCAI (1) 2021: 326-336 - [c80]Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Josh Tenenbaum, Chuang Gan:
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language. NeurIPS 2021: 887-899 - [c79]Chongjian Ge, Youwei Liang, Yibing Song, Jianbo Jiao, Jue Wang, Ping Luo:
Revitalizing CNN Attention via Transformers in Self-Supervised Visual Representation Learning. NeurIPS 2021: 4193-4206 - [c78]Yao Mu, Yuzheng Zhuang, Bin Wang, Guangxiang Zhu, Wulong Liu, Jianyu Chen, Ping Luo, Shengbo Li, Chongjie Zhang, Jianye Hao:
Model-Based Reinforcement Learning via Imagination with Derived Memory. NeurIPS 2021: 9493-9505 - [c77]Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo:
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. NeurIPS 2021: 12077-12090 - [c76]Yuqi Huo, Mingyu Ding, Haoyu Lu, Nanyi Fei, Zhiwu Lu, Ji-Rong Wen, Ping Luo:
Compressed Video Contrastive Learning. NeurIPS 2021: 14176-14187 - [c75]Zhongzhan Huang, Wenqi Shao, Xinjiang Wang, Liang Lin, Ping Luo:
Rethinking the Pruning Criteria for Convolutional Neural Network. NeurIPS 2021: 16305-16318 - [i103]Enze Xie, Wenjia Wang, Wenhai Wang, Peize Sun, Hang Xu, Ding Liang, Ping Luo:
Trans2Seg: Transparent Object Segmentation with Transformer. CoRR abs/2101.08461 (2021) - [i102]Enze Xie, Jian Ding, Wenhai Wang, Xiaohang Zhan, Hang Xu, Zhenguo Li, Ping Luo:
DetCo: Unsupervised Contrastive Learning for Object Detection. CoRR abs/2102.04803 (2021) - [i101]Chaofan Tao, Rui Lin, Quan Chen, Zhaoyang Zhang, Ping Luo, Ngai Wong:
FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation. CoRR abs/2102.07444 (2021) - [i100]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
Pyramid Vision Transformer: A Versatile Backbone for Dense Prediction without Convolutions. CoRR abs/2102.12122 (2021) - [i99]Yuying Ge, Yibing Song, Ruimao Zhang, Chongjian Ge, Wei Liu, Ping Luo:
Parser-Free Virtual Try-on via Distilling Appearance Flows. CoRR abs/2103.04559 (2021) - [i98]Jian Ding, Enze Xie, Hang Xu, Chenhan Jiang, Zhenguo Li, Ping Luo, Gui-Song Xia:
Unsupervised Pretraining for Object Detection by Patch Reidentification. CoRR abs/2103.04814 (2021) - [i97]Chongjian Ge, Yibing Song, Yuying Ge, Han Yang, Wei Liu, Ping Luo:
Disentangled Cycle Consistency for Highly-realistic Virtual Try-On. CoRR abs/2103.09479 (2021) - [i96]Zhe Chen, Wenhai Wang, Enze Xie, Tong Lu, Ping Luo:
Towards Ultra-Resolution Neural Style Transfer via Thumbnail Instance Normalization. CoRR abs/2103.11784 (2021) - [i95]Mingyu Ding, Yuqi Huo, Haoyu Lu, Linjie Yang, Zhe Wang, Zhiwu Lu, Jingdong Wang, Ping Luo:
Learning Versatile Neural Architectures by Propagating Network Codes. CoRR abs/2103.13253 (2021) - [i94]Enze Xie, Wenhai Wang, Mingyu Ding, Ruimao Zhang, Ping Luo:
PolarMask++: Enhanced Polar Representation for Single-Shot Instance Segmentation and Beyond. CoRR abs/2105.02184 (2021) - [i93]Jiahang Wang, Sheng Jin, Wentao Liu, Weizhong Liu, Chen Qian, Ping Luo:
When Human Pose Estimation Meets Robustness: Adversarial Algorithms and Benchmarks. CoRR abs/2105.06152 (2021) - [i92]Wenqi Shao, Hang Yu, Zhaoyang Zhang, Hang Xu, Zhenguo Li, Ping Luo:
BWCP: Probabilistic Learning-to-Prune Channels for ConvNets via Batch Whitening. CoRR abs/2105.06423 (2021) - [i91]Lumin Xu, Yingda Guan, Sheng Jin, Wentao Liu, Chen Qian, Ping Luo, Wanli Ouyang, Xiaogang Wang:
ViPNAS: Efficient Video Pose Estimation via Neural Architecture Search. CoRR abs/2105.10154 (2021) - [i90]Enze Xie, Wenhai Wang, Zhiding Yu, Anima Anandkumar, José M. Álvarez, Ping Luo:
SegFormer: Simple and Efficient Design for Semantic Segmentation with Transformers. CoRR abs/2105.15203 (2021) - [i89]Zhaoyang Zhang, Wenqi Shao, Jinwei Gu, Xiaogang Wang, Ping Luo:
Differentiable Dynamic Quantization with Mixed Precision and Adaptive Resolution. CoRR abs/2106.02295 (2021) - [i88]Mingyu Ding, Xiaochen Lian, Linjie Yang, Peng Wang, Xiaojie Jin, Zhiwu Lu, Ping Luo:
HR-NAS: Searching Efficient High-Resolution Neural Architectures with Lightweight Transformers. CoRR abs/2106.06560 (2021) - [i87]Wenhai Wang, Enze Xie, Xiang Li, Deng-Ping Fan, Kaitao Song, Ding Liang, Tong Lu, Ping Luo, Ling Shao:
PVTv2: Improved Baselines with Pyramid Vision Transformer. CoRR abs/2106.13797 (2021) - [i86]Yuanfeng Ji, Ruimao Zhang, Huijie Wang, Zhen Li, Lingyun Wu, Shaoting Zhang, Ping Luo:
Multi-Compound Transformer for Accurate Biomedical Image Segmentation. CoRR abs/2106.14385 (2021) - [i85]Lingyun Wu, Zhiqiang Hu, Yuanfeng Ji, Ping Luo, Shaoting Zhang:
Multi-frame Collaboration for Effective Endoscopic Video Polyp Detection via Spatial-Temporal Feature Transformation. CoRR abs/2107.03609 (2021) - [i84]