default search action
Chuang Gan
Person information
Other persons with a similar name
SPARQL queries
🛈 Please note that only 57% of the records listed on this page have a DOI. Therefore, DOI-based queries can only provide partial results.
Refine list
refinements active!
zoomed in on ?? of ?? records
view refined list in
export refined list as
showing all ?? records
2020 – today
- 2024
- [c162]Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Zhiqing Sun, Dan Gutfreund, Chuang Gan:
Visual Chain-of-Thought Prompting for Knowledge-Based Visual Reasoning. AAAI 2024: 1254-1262 - [c161]Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liangyan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell:
Aligning Large Multimodal Models with Factually Augmented RLHF. ACL (Findings) 2024: 13088-13110 - [c160]Kefan Su, Siyuan Zhou, Jiechuan Jiang, Chuang Gan, Xiangjun Wang, Zongqing Lu:
Multi-Agent Alternate Q-Learning. AAMAS 2024: 1791-1799 - [c159]Phuc D. A. Nguyen, Tuan Duc Ngo, Evangelos Kalogerakis, Chuang Gan, Anh Tuan Tran, Cuong Pham, Khoi Nguyen:
Open3DIS: Open-Vocabulary 3D Instance Segmentation with 2D Mask Guidance. CVPR 2024: 4018-4028 - [c158]Andong Wang, Bo Wu, Sunli Chen, Zhenfang Chen, Haotian Guan, Wei-Ning Lee, Li Erran Li, Chuang Gan:
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge. CVPR 2024: 13384-13394 - [c157]Zeyuan Yang, Jiageng Lin, Peihao Chen, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan:
RILA: Reflective and Imaginative Language Agent for Zero-Shot Semantic Audio-Visual Navigation. CVPR 2024: 16251-16261 - [c156]Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CVPR 2024: 26396-26406 - [c155]Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan:
FlexAttention for Efficient High-Resolution Vision-Language Models. ECCV (25) 2024: 286-302 - [c154]Zhenfang Chen, Rui Sun, Wenjun Liu, Yining Hong, Chuang Gan:
GENOME: Generative Neuro-Symbolic Visual Reasoning by Growing and Reusing Modules. ICLR 2024 - [c153]Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. ICLR 2024 - [c152]Zilin Si, Gu Zhang, Qingwei Ben, Branden Romero, Zhou Xian, Chao Liu, Chuang Gan:
DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation. ICLR 2024 - [c151]Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David Daniel Cox, Yiming Yang, Chuang Gan:
SALMON: Self-Alignment with Instructable Reward Models. ICLR 2024 - [c150]Yian Wang, Juntian Zheng, Zhehuan Chen, Zhou Xian, Gu Zhang, Chao Liu, Chuang Gan:
Thin-Shell Object Manipulations With Differentiable Physics Simulations. ICLR 2024 - [c149]Hongxin Zhang, Weihua Du, Jiaming Shan, Qinhong Zhou, Yilun Du, Joshua B. Tenenbaum, Tianmin Shu, Chuang Gan:
Building Cooperative Embodied Agents Modularly with Large Language Models. ICLR 2024 - [c148]Qinhong Zhou, Sunli Chen, Yisong Wang, Haozhe Xu, Weihua Du, Hongxin Zhang, Yilun Du, Joshua B. Tenenbaum, Chuang Gan:
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments. ICLR 2024 - [c147]Heting Gao, Kaizhi Qian, Junrui Ni, Chuang Gan, Mark A. Hasegawa-Johnson, Shiyu Chang, Yang Zhang:
Speech Self-Supervised Learning Using Diffusion Model Synthetic Data. ICML 2024 - [c146]Pingchuan Ma, Tsun-Hsuan Wang, Minghao Guo, Zhiqing Sun, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan, Wojciech Matusik:
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery. ICML 2024 - [c145]Yufei Wang, Zhou Xian, Feng Chen, Tsun-Hsuan Wang, Yian Wang, Katerina Fragkiadaki, Zackory Erickson, David Held, Chuang Gan:
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation. ICML 2024 - [c144]Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. ICML 2024 - [c143]Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan:
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos. ICML 2024 - [c142]Siyuan Zhou, Yilun Du, Jiaben Chen, Yandong Li, Dit-Yan Yeung, Chuang Gan:
RoboDreamer: Learning Compositional World Models for Robot Imagination. ICML 2024 - [c141]Qiao Gu, Ali Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull:
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning. ICRA 2024: 5021-5028 - [c140]Ji Lin, Jiaming Tang, Haotian Tang, Shang Yang, Wei-Ming Chen, Wei-Chen Wang, Guangxuan Xiao, Xingyu Dang, Chuang Gan, Song Han:
AWQ: Activation-aware Weight Quantization for On-Device LLM Compression and Acceleration. MLSys 2024 - [i168]Yining Hong, Zishuo Zheng, Peihao Chen, Yian Wang, Junyan Li, Chuang Gan:
MultiPLY: A Multisensory Object-Centric Embodied Large Language Model in 3D World. CoRR abs/2401.08577 (2024) - [i167]Qinhong Zhou, Sunli Chen, Yisong Wang, Haozhe Xu, Weihua Du, Hongxin Zhang, Yilun Du, Joshua B. Tenenbaum, Chuang Gan:
HAZARD Challenge: Embodied Decision Making in Dynamically Changing Environments. CoRR abs/2401.12975 (2024) - [i166]Shun Zhang, Zhenfang Chen, Sunli Chen, Yikang Shen, Zhiqing Sun, Chuang Gan:
Improving Reinforcement Learning from Human Feedback with Efficient Reward Model Ensemble. CoRR abs/2401.16635 (2024) - [i165]Zhicheng Zheng, Xin Yan, Zhenfang Chen, Jingzhou Wang, Qin Zhi Eddie Lim, Joshua B. Tenenbaum, Chuang Gan:
ContPhy: Continuum Physical Concept Learning and Reasoning from Videos. CoRR abs/2402.06119 (2024) - [i164]Zilin Si, Gu Zhang, Qingwei Ben, Branden Romero, Zhou Xian, Chao Liu, Chuang Gan:
DIFFTACTILE: A Physics-based Differentiable Tactile Simulator for Contact-rich Robotic Manipulation. CoRR abs/2403.08716 (2024) - [i163]Zhiqing Sun, Longhui Yu, Yikang Shen, Weiyang Liu, Yiming Yang, Sean Welleck, Chuang Gan:
Easy-to-Hard Generalization: Scalable Alignment Beyond Human Supervision. CoRR abs/2403.09472 (2024) - [i162]Haoyu Zhen, Xiaowen Qiu, Peihao Chen, Jincheng Yang, Xin Yan, Yilun Du, Yining Hong, Chuang Gan:
3D-VLA: A 3D Vision-Language-Action Generative World Model. CoRR abs/2403.09631 (2024) - [i161]Yian Wang, Juntian Zheng, Zhehuan Chen, Zhou Xian, Gu Zhang, Chao Liu, Chuang Gan:
Thin-Shell Object Manipulations With Differentiable Physics Simulations. CoRR abs/2404.00451 (2024) - [i160]Hongxin Zhang, Zeyuan Wang, Qiushi Lyu, Zheyuan Zhang, Sunli Chen, Tianmin Shu, Yilun Du, Chuang Gan:
COMBO: Compositional World Models for Embodied Multi-Agent Cooperation. CoRR abs/2404.10775 (2024) - [i159]Rachel Chen, Juheon Lee, Chuang Gan, Zijiang Yang, Mohammad Amin Nabian, Jun Zeng:
Virtual Foundry Graphnet for Metal Sintering Deformation Prediction. CoRR abs/2404.11753 (2024) - [i158]Siyuan Zhou, Yilun Du, Jiaben Chen, Yandong Li, Dit-Yan Yeung, Chuang Gan:
RoboDreamer: Learning Compositional World Models for Robot Imagination. CoRR abs/2404.12377 (2024) - [i157]Yujun Lin, Haotian Tang, Shang Yang, Zhekai Zhang, Guangxuan Xiao, Chuang Gan, Song Han:
QServe: W4A8KV4 Quantization and System Co-design for Efficient LLM Serving. CoRR abs/2405.04532 (2024) - [i156]Bo Wu, Shoubin Yu, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan:
STAR: A Benchmark for Situated Reasoning in Real-World Videos. CoRR abs/2405.09711 (2024) - [i155]Andong Wang, Bo Wu, Sunli Chen, Zhenfang Chen, Haotian Guan, Wei-Ning Lee, Li Erran Li, Chuang Gan:
SOK-Bench: A Situated Video Reasoning Benchmark with Aligned Open-World Knowledge. CoRR abs/2405.09713 (2024) - [i154]Pingchuan Ma, Tsun-Hsuan Wang, Minghao Guo, Zhiqing Sun, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan, Wojciech Matusik:
LLM and Simulation as Bilevel Optimizers: A New Paradigm to Advance Physical Scientific Discovery. CoRR abs/2405.09783 (2024) - [i153]Jiaben Chen, Xin Yan, Yihang Chen, Siyuan Cen, Qinwei Ma, Haoyu Zhen, Kaizhi Qian, Lie Lu, Chuang Gan:
RapVerse: Coherent Vocals and Whole-Body Motions Generations from Text. CoRR abs/2405.20336 (2024) - [i152]Minghao Guo, Bohan Wang, Pingchuan Ma, Tianyuan Zhang, Crystal Elaine Owens, Chuang Gan, Joshua B. Tenenbaum, Kaiming He, Wojciech Matusik:
Physically Compatible 3D Object Modeling from a Single Image. CoRR abs/2405.20510 (2024) - [i151]Changhao Li, Xinyu Sun, Peihao Chen, Jugang Fan, Zixu Wang, Yanxia Liu, Jin-Hui Zhu, Chuang Gan, Mingkui Tan:
CoNav: A Benchmark for Human-Centered Collaborative Navigation. CoRR abs/2406.02425 (2024) - [i150]Irene Huang, Wei Lin, Muhammad Jehanzeb Mirza, Jacob A. Hansen, Sivan Doveh, Victor Ion Butoi, Roei Herzig, Assaf Arbelle, Hilde Kuehne, Trevor Darrell, Chuang Gan, Aude Oliva, Rogério Feris, Leonid Karlinsky:
ConMe: Rethinking Evaluation of Compositional Reasoning for Modern VLMs. CoRR abs/2406.08164 (2024) - [i149]Jie Yin, Andrew Luo, Yilun Du, Anoop Cherian, Tim K. Marks, Jonathan Le Roux, Chuang Gan:
Disentangled Acoustic Fields For Multimodal Physical Scene Understanding. CoRR abs/2407.11333 (2024) - [i148]Junyan Li, Delin Chen, Tianle Cai, Peihao Chen, Yining Hong, Zhenfang Chen, Yikang Shen, Chuang Gan:
FlexAttention for Efficient High-Resolution Vision-Language Models. CoRR abs/2407.20228 (2024) - [i147]Zhenfang Chen, Shilong Dong, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
Compositional Physical Reasoning of Objects and Events from Videos. CoRR abs/2408.02687 (2024) - 2023
- [j16]Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised audiovisual representation learning for remote sensing data. Int. J. Appl. Earth Obs. Geoinformation 116: 103130 (2023) - [j15]Hongchang Wang, Huaxiang Lu, Huimin Guo, Haifang Jian, Chuang Gan, Wu Liu:
Bird-Count: a multi-modality benchmark and system for bird population counting in the wild. Multim. Tools Appl. 82(29): 45293-45315 (2023) - [j14]Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda:
TransCenter: Transformers With Dense Representations for Multiple-Object Tracking. IEEE Trans. Pattern Anal. Mach. Intell. 45(6): 7820-7835 (2023) - [c139]Mo Yu, Yi Gu, Xiaoxiao Guo, Yufei Feng, Xiaodan Zhu, Michael A. Greenspan, Murray Campbell, Chuang Gan:
JECC: Commonsense Reasoning Tasks Derived from Interactive Fictions. ACL (Findings) 2023: 11226-11238 - [c138]Xinyu Sun, Peihao Chen, Liangwei Chen, Changhao Li, Thomas H. Li, Mingkui Tan, Chuang Gan:
Masked Motion Encoding for Self-Supervised Video Representation Learning. CVPR 2023: 2235-2245 - [c137]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC2: Emergent Communication for Embodied Control. CVPR 2023: 6704-6714 - [c136]Yining Hong, Chunru Lin, Yilun Du, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan:
3D Concept Learning and Reasoning from Multi-View Images. CVPR 2023: 9202-9212 - [c135]Kun Su, Kaizhi Qian, Eli Shlizerman, Antonio Torralba, Chuang Gan:
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos. CVPR 2023: 9749-9759 - [c134]Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan:
Mod-Squad: Designing Mixtures of Experts As Modular Multi-Task Learners. CVPR 2023: 11828-11837 - [c133]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CVPR 2023: 14528-14539 - [c132]Aisha Urooj Khan, Hilde Kuehne, Bo Wu, Kim Chheu, Walid Bousselham, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Learning Situation Hyper-Graphs for Video Question Answering. CVPR 2023: 14879-14889 - [c131]Shawn Tan, Yikang Shen, Zhenfang Chen, Aaron C. Courville, Chuang Gan:
Sparse Universal Transformer. EMNLP 2023: 169-179 - [c130]Chuang Gan, Yuchong Hu, Leyan Zhao, Xin Zhao, Pengyu Gong, Wenhao Zhang, Lin Wang, Dan Feng:
Enabling Encrypted Delta Compression for Outsourced Storage Systems via Preserving Similarity. ICCD 2023: 231-238 - [c129]Chengyang Zhao, Yikang Shen, Zhenfang Chen, Mingyu Ding, Chuang Gan:
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions. ICCV 2023: 2827-2838 - [c128]Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. ICCV 2023: 8283-8292 - [c127]Han Cai, Junyan Li, Muyan Hu, Chuang Gan, Song Han:
EfficientViT: Lightweight Multi-Scale Attention for High-Resolution Dense Prediction. ICCV 2023: 17256-17267 - [c126]Sizhe Li, Zhiao Huang, Tao Chen, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics. ICLR 2023 - [c125]Xuan Li, Yi-Ling Qiao, Peter Yichen Chen, Krishna Murthy Jatavallabhula, Ming C. Lin, Chenfanfu Jiang, Chuang Gan:
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification. ICLR 2023 - [c124]Tsun-Hsuan Wang, Pingchuan Ma, Andrew Everett Spielberg, Zhou Xian, Hao Zhang, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan:
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments. ICLR 2023 - [c123]Zhou Xian, Bo Zhu, Zhenjia Xu, Hsiao-Yu Tung, Antonio Torralba, Katerina Fragkiadaki, Chuang Gan:
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation. ICLR 2023 - [c122]Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan:
Hyper-Decision Transformer for Efficient Online Policy Adaptation. ICLR 2023 - [c121]Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan:
Planning with Large Language Models for Code Generation. ICLR 2023 - [c120]Zhiao Huang, Litian Liang, Zhan Ling, Xuanlin Li, Chuang Gan, Hao Su:
Reparameterized Policy Learning for Multimodal Trajectory Optimization. ICML 2023: 13957-13975 - [c119]Pingchuan Ma, Peter Yichen Chen, Bolei Deng, Joshua B. Tenenbaum, Tao Du, Chuang Gan, Wojciech Matusik:
Learning Neural Constitutive Laws from Motion Observations for Generalizable PDE Dynamics. ICML 2023: 23279-23300 - [c118]Wei Xiao, Tsun-Hsuan Wang, Ramin M. Hasani, Mathias Lechner, Yutong Ban, Chuang Gan, Daniela Rus:
On the Forward Invariance of Neural ODEs. ICML 2023: 38100-38124 - [c117]Peng Gao, Qingzhao Zhu, Hongsheng Lu, Chuang Gan, Hao Zhang:
Deep Masked Graph Matching for Correspondence Identification in Collaborative Perception. ICRA 2023: 6117-6123 - [c116]Ligeng Zhu, Lanxiang Hu, Ji Lin, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan, Song Han:
PockEngine: Sparse and Efficient Fine-tuning in a Pocket. MICRO 2023: 1381-1394 - [c115]Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. NeurIPS 2023 - [c114]Zhiao Huang, Feng Chen, Yewen Pu, Chunru Lin, Hao Su, Chuang Gan:
DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven Differentiable Physics. NeurIPS 2023 - [c113]Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David D. Cox, Yiming Yang, Chuang Gan:
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision. NeurIPS 2023 - [c112]Hsiao-Yu Tung, Mingyu Ding, Zhenfang Chen, Daniel Bear, Chuang Gan, Josh Tenenbaum, Dan Yamins, Judith E. Fan, Kevin A. Smith:
Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties. NeurIPS 2023 - [c111]Tsun-Hsuan Johnson Wang, Juntian Zheng, Pingchuan Ma, Yilun Du, Byungchul Kim, Andrew Spielberg, Joshua B. Tenenbaum, Chuang Gan, Daniela Rus:
DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models. NeurIPS 2023 - [c110]Siyuan Zhou, Yilun Du, Shun Zhang, Mengdi Xu, Yikang Shen, Wei Xiao, Dit-Yan Yeung, Chuang Gan:
Adaptive Online Replanning with Diffusion Models. NeurIPS 2023 - [c109]Zhenjia Xu, Zhou Xian, Xingyu Lin, Cheng Chi, Zhiao Huang, Chuang Gan, Shuran Song:
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects. Robotics: Science and Systems 2023 - [c108]Jinghan Jia, Shashank Srikant, Tamara Mitrovska, Chuang Gan, Shiyu Chang, Sijia Liu, Una-May O'Reilly:
ClawSAT: Towards Both Robust and Accurate Code Models. SANER 2023: 212-223 - [i146]Zhenfang Chen, Qinhong Zhou, Yikang Shen, Yining Hong, Hao Zhang, Chuang Gan:
See, Think, Confirm: Interactive Prompting Between Vision and Language Models for Knowledge-based Visual Reasoning. CoRR abs/2301.05226 (2023) - [i145]Zhenjia Xu, Zhou Xian, Xingyu Lin, Cheng Chi, Zhiao Huang, Chuang Gan, Shuran Song:
RoboNinja: Learning an Adaptive Cutting Policy for Multi-Material Objects. CoRR abs/2302.11553 (2023) - [i144]Zhou Xian, Bo Zhu, Zhenjia Xu, Hsiao-Yu Tung, Antonio Torralba, Katerina Fragkiadaki, Chuang Gan:
FluidLab: A Differentiable Environment for Benchmarking Complex Fluid Manipulation. CoRR abs/2303.02346 (2023) - [i143]Shun Zhang, Zhenfang Chen, Yikang Shen, Mingyu Ding, Joshua B. Tenenbaum, Chuang Gan:
Planning with Large Language Models for Code Generation. CoRR abs/2303.05510 (2023) - [i142]Xuan Li, Yi-Ling Qiao, Peter Yichen Chen, Krishna Murthy Jatavallabhula, Ming C. Lin, Chenfanfu Jiang, Chuang Gan:
PAC-NeRF: Physics Augmented Continuum Neural Radiance Fields for Geometry-Agnostic System Identification. CoRR abs/2303.05512 (2023) - [i141]Peng Gao, Qingzhao Zhu, Hongsheng Lu, Chuang Gan, Hao Zhang:
Deep Masked Graph Matching for Correspondence Identification in Collaborative Perception. CoRR abs/2303.07555 (2023) - [i140]Tsun-Hsuan Wang, Pingchuan Ma, Andrew Everett Spielberg, Zhou Xian, Hao Zhang, Joshua B. Tenenbaum, Daniela Rus, Chuang Gan:
SoftZoo: A Soft Robot Co-design Benchmark For Locomotion In Diverse Environments. CoRR abs/2303.09555 (2023) - [i139]Yining Hong, Chunru Lin, Yilun Du, Zhenfang Chen, Joshua B. Tenenbaum, Chuang Gan:
3D Concept Learning and Reasoning from Multi-View Images. CoRR abs/2303.11327 (2023) - [i138]Kun Su, Kaizhi Qian, Eli Shlizerman, Antonio Torralba, Chuang Gan:
Physics-Driven Diffusion Models for Impact Sound Synthesis from Videos. CoRR abs/2303.16897 (2023) - [i137]Sizhe Li, Zhiao Huang, Tao Chen, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
DexDeform: Dexterous Deformable Object Manipulation with Human Demonstrations and Differentiable Physics. CoRR abs/2304.03223 (2023) - [i136]Mingyu Ding, Yikang Shen, Lijie Fan, Zhenfang Chen, Zitian Chen, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Visual Dependency Transformers: Dependency Tree Emerges from Reversed Attention. CoRR abs/2304.03282 (2023) - [i135]Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following. CoRR abs/2304.03767 (2023) - [i134]Mengdi Xu, Yuchen Lu, Yikang Shen, Shun Zhang, Ding Zhao, Chuang Gan:
Hyper-Decision Transformer for Efficient Online Policy Adaptation. CoRR abs/2304.08487 (2023) - [i133]Aisha Urooj Khan, Hilde Kuehne, Bo Wu, Kim Chheu, Walid Bousselham, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Learning Situation Hyper-Graphs for Video Question Answering. CoRR abs/2304.08682 (2023) - [i132]Yao Mu, Shunyu Yao, Mingyu Ding, Ping Luo, Chuang Gan:
EC^2: Emergent Communication for Embodied Control. CoRR abs/2304.09448 (2023) - [i131]Pingchuan Ma, Peter Yichen Chen, Bolei Deng, Joshua B. Tenenbaum, Tao Du, Chuang Gan, Wojciech Matusik:
Learning Neural Constitutive Laws From Motion Observations for Generalizable PDE Dynamics. CoRR abs/2304.14369 (2023) - [i130]Zhiqing Sun, Yikang Shen, Qinhong Zhou, Hongxin Zhang, Zhenfang Chen, David D. Cox, Yiming Yang, Chuang Gan:
Principle-Driven Self-Alignment of Language Models from Scratch with Minimal Human Supervision. CoRR abs/2305.03047 (2023) - [i129]Wei Xiao, Tsun-Hsuan Wang, Chuang Gan, Daniela Rus:
SafeDiffuser: Safe Planning with Diffusion Probabilistic Models. CoRR abs/2306.00148 (2023) - [i128]Yikang Shen, Zheyu Zhang, Tianyou Cao, Shawn Tan, Zhenfang Chen, Chuang Gan:
ModuleFormer: Learning Modular Large Language Models From Uncurated Data. CoRR abs/2306.04640 (2023) - [i127]Hsiao-Yu Tung, Mingyu Ding, Zhenfang Chen, Daniel Bear, Chuang Gan, Joshua B. Tenenbaum, Daniel L. K. Yamins, Judith E. Fan, Kevin A. Smith:
Physion++: Evaluating Physical Scene Understanding that Requires Online Inference of Different Physical Properties. CoRR abs/2306.15668 (2023) - [i126]Zitian Chen, Mingyu Ding, Yikang Shen, Wei Zhan, Masayoshi Tomizuka, Erik G. Learned-Miller, Chuang Gan:
An Efficient General-Purpose Modular Vision Model via Multi-Task Heterogeneous Training. CoRR abs/2306.17165 (2023) - [i125]Hongxin Zhang, Weihua Du, Jiaming Shan, Qinhong Zhou, Yilun Du, Joshua B. Tenenbaum, Tianmin Shu, Chuang Gan:
Building Cooperative Embodied Agents Modularly with Large Language Models. CoRR abs/2307.02485 (2023) - [i124]Zhiao Huang, Litian Liang, Zhan Ling, Xuanlin Li, Chuang Gan, Hao Su:
Reparameterized Policy Learning for Multimodal Trajectory Optimization. CoRR abs/2307.10710 (2023) - [i123]Kunyang Lin, Peihao Chen, Diwei Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Vision-and-Language Navigation from YouTube Videos. CoRR abs/2307.11984 (2023) - [i122]Yining Hong, Haoyu Zhen, Peihao Chen, Shuhong Zheng, Yilun Du, Zhenfang Chen, Chuang Gan:
3D-LLM: Injecting the 3D World into Large Language Models. CoRR abs/2307.12981 (2023) - [i121]Peihao Chen, Xinyu Sun, Hongyan Zhi, Runhao Zeng, Thomas H. Li, Gaowen Liu, Mingkui Tan, Chuang Gan:
A2Nav: Action-Aware Zero-Shot Robot Navigation by Exploiting Vision-and-Language Ability of Foundation Models. CoRR abs/2308.07997 (2023) - [i120]Zhiqing Sun, Sheng Shen, Shengcao Cao, Haotian Liu, Chunyuan Li, Yikang Shen, Chuang Gan, Liang-Yan Gui, Yu-Xiong Wang, Yiming Yang, Kurt Keutzer, Trevor Darrell:
Aligning Large Multimodal Models with Factually Augmented RLHF. CoRR abs/2309.14525 (2023) - [i119]Qiao Gu, Alihusein Kuwajerwala, Sacha Morin, Krishna Murthy Jatavallabhula, Bipasha Sen, Aditya Agarwal, Corban Rivera, William Paul, Kirsty Ellis, Rama Chellappa, Chuang Gan, Celso Miguel de Melo, Joshua B. Tenenbaum, Antonio Torralba, Florian Shkurti, Liam Paull:
ConceptGraphs: Open-Vocabulary 3D Scene Graphs for Perception and Planning. CoRR abs/2309.16650 (2023) - [i118]Haoyu Zhou, Mingyu Ding, Weikun Peng, Masayoshi Tomizuka, Lin Shao, Chuang Gan:
Generalizable Long-Horizon Manipulations with Large Language Models. CoRR abs/2310.02264 (2023) - [i117]Zhiqing Sun, Yikang Shen, Hongxin Zhang, Qinhong Zhou, Zhenfang Chen, David D. Cox, Yiming Yang, Chuang Gan:
SALMON: Self-Alignment with Principle-Following Reward Models. CoRR abs/2310.05910 (2023) - [i116]Chengyang Zhao, Yikang Shen, Zhenfang Chen, Mingyu Ding, Chuang Gan:
TextPSG: Panoptic Scene Graph Generation from Textual Descriptions. CoRR abs/2310.07056 (2023) - [i115]Shawn Tan, Yikang Shen, Zhenfang Chen, Aaron C. Courville, Chuang Gan:
Sparse Universal Transformer. CoRR abs/2310.07096 (2023) - [i114]Siyuan Zhou, Yilun Du, Shun Zhang, Mengdi Xu, Yikang Shen, Wei Xiao, Dit-Yan Yeung, Chuang Gan:
Adaptive Online Replanning with Diffusion Models. CoRR abs/2310.09629 (2023) - [i113]Zheyu Zhang, Zhuorui Ye, Yikang Shen, Chuang Gan:
Autonomous Tree-search Ability of Large Language Models. CoRR abs/2310.10686 (2023) - [i112]Ligeng Zhu, Lanxiang Hu, Ji Lin, Wei-Chen Wang, Wei-Ming Chen, Chuang Gan, Song Han:
PockEngine: Sparse and Efficient Fine-tuning in a Pocket. CoRR abs/2310.17752 (2023) - [i111]Yufei Wang, Zhou Xian, Feng Chen, Tsun-Hsuan Wang, Yian Wang, Zackory Erickson, David Held, Chuang Gan:
RoboGen: Towards Unleashing Infinite Data for Automated Robot Learning via Generative Simulation. CoRR abs/2311.01455 (2023) - [i110]Junyan Li, Delin Chen, Yining Hong, Zhenfang Chen, Peihao Chen, Yikang Shen, Chuang Gan:
CoVLM: Composing Visual Entities and Relationships in Large Language Models Via Communicative Decoding. CoRR abs/2311.03354 (2023) - [i109]Zhenfang Chen, Rui Sun, Wenjun Liu, Yining Hong, Chuang Gan:
GENOME: GenerativE Neuro-symbOlic visual reasoning by growing and reusing ModulEs. CoRR abs/2311.04901 (2023) - [i108]Tsun-Hsuan Wang, Juntian Zheng, Pingchuan Ma, Yilun Du, Byungchul Kim, Andrew Spielberg, Joshua B. Tenenbaum, Chuang Gan, Daniela Rus:
DiffuseBot: Breeding Soft Robots With Physics-Augmented Generative Diffusion Models. CoRR abs/2311.17053 (2023) - [i107]Kunyang Lin, Yufeng Wang, Peihao Chen, Runhao Zeng, Siyuan Zhou, Mingkui Tan, Chuang Gan:
DCIR: Dynamic Consistency Intrinsic Reward for Multi-Agent Reinforcement Learning. CoRR abs/2312.05783 (2023) - [i106]Zhiao Huang, Feng Chen, Yewen Pu, Chunru Lin, Hao Su, Chuang Gan:
DiffVL: Scaling Up Soft Body Manipulation using Vision-Language Driven Differentiable Physics. CoRR abs/2312.06408 (2023) - [i105]Phuc D. A. Nguyen, Tuan Duc Ngo, Chuang Gan, Evangelos Kalogerakis, Anh Tran, Cuong Pham, Khoi Nguyen:
Open3DIS: Open-vocabulary 3D Instance Segmentation with 2D Mask Guidance. CoRR abs/2312.10671 (2023) - 2022
- [j13]Ao Liu, Xiaoyu Chen, Sijia Liu, Lirong Xia, Chuang Gan:
Certifiably robust interpretation via Rényi differential privacy. Artif. Intell. 313: 103787 (2022) - [j12]Xiang Long, Gerard de Melo, Dongliang He, Fu Li, Zhizhen Chi, Shilei Wen, Chuang Gan:
Purely Attention Based Local Feature Integration for Video Classification. IEEE Trans. Pattern Anal. Mach. Intell. 44(4): 2140-2154 (2022) - [j11]Ji Lin, Chuang Gan, Kuan Wang, Song Han:
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Devices. IEEE Trans. Pattern Anal. Mach. Intell. 44(5): 2760-2774 (2022) - [j10]Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan:
Graph Convolutional Module for Temporal Action Localization in Videos. IEEE Trans. Pattern Anal. Mach. Intell. 44(10): 6209-6223 (2022) - [j9]Xiangpeng Li, Bo Wu, Jingkuan Song, Lianli Gao, Pengpeng Zeng, Chuang Gan:
Text-instance graph: Exploring the relational semantics for text-based visual question answering. Pattern Recognit. 124: 108455 (2022) - [c107]Xingyu Lin, Carl Qi, Yunchu Zhang, Zhiao Huang, Katerina Fragkiadaki, Yunzhu Li, Chuang Gan, David Held:
Planning with Spatial-Temporal Abstraction from Point Clouds for Deformable Object Manipulation. CoRL 2022: 1640-1651 - [c106]Mingyu Ding, Yan Xu, Zhenfang Chen, David Daniel Cox, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Embodied Concept Learner: Self-supervised Learning of Concepts and Mapping through Instruction Following. CoRL 2022: 1743-1754 - [c105]Yining Hong, Kaichun Mo, Li Yi, Leonidas J. Guibas, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction. CVPR 2022: 1403-1413 - [c104]Chuang Gan, Yi Gu, Siyuan Zhou, Jeremy Schwartz, Seth Alter, James Traer, Dan Gutfreund, Joshua B. Tenenbaum, Josh H. McDermott, Antonio Torralba:
Finding Fallen Objects Via Asynchronous Audio-Visual Integration. CVPR 2022: 10513-10523 - [c103]Xueyi Liu, Xiaomeng Xu, Anyi Rao, Chuang Gan, Li Yi:
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation. CVPR 2022: 11614-11624 - [c102]Hongbin Lin, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Chuang Gan, Yanxia Liu, Mingkui Tan:
Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation. ECCV (33) 2022: 351-368 - [c101]Aisha Urooj Khan, Hilde Kuehne, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Weakly Supervised Grounding for VQA in Vision-Language Transformers. ECCV (35) 2022: 652-670 - [c100]Yi Gu, Shunyu Yao, Chuang Gan, Josh Tenenbaum, Mo Yu:
Revisiting the Roles of "Text" in Text Games. EMNLP (Findings) 2022: 6867-6876 - [c99]Pingchuan Ma, Tao Du, Joshua B. Tenenbaum, Wojciech Matusik, Chuang Gan:
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation. ICLR 2022 - [c98]Han Cai, Chuang Gan, Ji Lin, Song Han:
Network Augmentation for Tiny Deep Learning. ICLR 2022 - [c97]Zhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos. ICLR 2022 - [c96]Sizhe Li, Zhiao Huang, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics. ICLR 2022 - [c95]Xingyu Lin, Zhiao Huang, Yunzhu Li, Joshua B. Tenenbaum, David Held, Chuang Gan:
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools. ICLR 2022 - [c94]Lingjie Mei, Jiayuan Mao, Ziqi Wang, Chuang Gan, Joshua B. Tenenbaum:
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations. ICLR 2022 - [c93]Shunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan:
Linking Emergent and Natural Languages via Corpus Transfer. ICLR 2022 - [c92]Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan:
Prompting Decision Transformer for Few-Shot Policy Generalization. ICML 2022: 24631-24645 - [c91]Chuang Gan, Siyuan Zhou, Jeremy Schwartz, Seth Alter, Abhishek Bhandwaldar, Dan Gutfreund, Daniel L. K. Yamins, James J. DiCarlo, Josh H. McDermott, Antonio Torralba, Joshua B. Tenenbaum:
The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark Towards Physically Realistic Embodied AI. ICRA 2022: 8847-8854 - [c90]Chuang Gan, Xiaoyu Chen, Phillip Isola, Antonio Torralba, Joshua B. Tenenbaum:
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events. IROS 2022: 9259-9265 - [c89]Jinkai Zheng, Xinchen Liu, Xiaoyan Gu, Yaoqi Sun, Chuang Gan, Jiyong Zhang, Wu Liu, Chenggang Yan:
Gait Recognition in the Wild with Multi-hop Temporal Switch. ACM Multimedia 2022: 6136-6145 - [c88]Ji Lin, Ligeng Zhu, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan, Song Han:
On-Device Training Under 256KB Memory. NeurIPS 2022 - [c87]Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. NeurIPS 2022 - [c86]Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. NeurIPS 2022 - [c85]Jiaqi Han, Wenbing Huang, Hengbo Ma, Jiachen Li, Josh Tenenbaum, Chuang Gan:
Learning Physical Dynamics with Subequivariant Graph Neural Networks. NeurIPS 2022 - [c84]Yining Hong, Yilun Du, Chunru Lin, Josh Tenenbaum, Chuang Gan:
3D Concept Grounding on Neural Fields. NeurIPS 2022 - [c83]Andrew F. Luo, Yilun Du, Michael J. Tarr, Josh Tenenbaum, Antonio Torralba, Chuang Gan:
Learning Neural Acoustic Fields. NeurIPS 2022 - [c82]Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang:
SNAKE: Shape-aware Neural 3D Keypoint Field. NeurIPS 2022 - [i104]Xueyi Liu, Xiaomeng Xu, Anyi Rao, Chuang Gan, Li Yi:
AutoGPart: Intermediate Supervision Search for Generalizable 3D Part Segmentation. CoRR abs/2203.06558 (2022) - [i103]Shunyu Yao, Mo Yu, Yang Zhang, Karthik R. Narasimhan, Joshua B. Tenenbaum, Chuang Gan:
Linking Emergent and Natural Languages via Corpus Transfer. CoRR abs/2203.13344 (2022) - [i102]Lingjie Mei, Jiayuan Mao, Ziqi Wang, Chuang Gan, Joshua B. Tenenbaum:
FALCON: Fast Visual Concept Learning by Integrating Images, Linguistic descriptions, and Conceptual Relations. CoRR abs/2203.16639 (2022) - [i101]Xingyu Lin, Zhiao Huang, Yunzhu Li, Joshua B. Tenenbaum, David Held, Chuang Gan:
DiffSkill: Skill Abstraction from Differentiable Physics for Deformable Object Manipulations with Tools. CoRR abs/2203.17275 (2022) - [i100]Andrew F. Luo, Yilun Du, Michael J. Tarr, Joshua B. Tenenbaum, Antonio Torralba, Chuang Gan:
Learning Neural Acoustic Fields. CoRR abs/2204.00628 (2022) - [i99]Zhenfang Chen, Kexin Yi, Yunzhu Li, Mingyu Ding, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
ComPhy: Compositional Physical Reasoning of Objects and Events from Videos. CoRR abs/2205.01089 (2022) - [i98]Yining Hong, Kaichun Mo, Li Yi, Leonidas J. Guibas, Antonio Torralba, Joshua B. Tenenbaum, Chuang Gan:
Fixing Malfunctional Objects With Learned Physical Simulation and Functional Prediction. CoRR abs/2205.02834 (2022) - [i97]Sizhe Li, Zhiao Huang, Tao Du, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
Contact Points Discovery for Soft-Body Manipulations with Differentiable Physics. CoRR abs/2205.02835 (2022) - [i96]Pingchuan Ma, Tao Du, Joshua B. Tenenbaum, Wojciech Matusik, Chuang Gan:
RISP: Rendering-Invariant State Predictor with Differentiable Simulation and Rendering for Cross-Domain Parameter Estimation. CoRR abs/2205.05678 (2022) - [i95]Han Cai, Chuang Gan, Song Han:
EfficientViT: Enhanced Linear Attention for High-Resolution Low-Computation Visual Recognition. CoRR abs/2205.14756 (2022) - [i94]Chengliang Zhong, Peixing You, Xiaoxue Chen, Hao Zhao, Fuchun Sun, Guyue Zhou, Xiaodong Mu, Chuang Gan, Wenbing Huang:
SNAKE: Shape-aware Neural 3D Keypoint Field. CoRR abs/2206.01724 (2022) - [i93]Mengdi Xu, Yikang Shen, Shun Zhang, Yuchen Lu, Ding Zhao, Joshua B. Tenenbaum, Chuang Gan:
Prompting Decision Transformer for Few-Shot Policy Generalization. CoRR abs/2206.13499 (2022) - [i92]Ji Lin, Ligeng Zhu, Wei-Ming Chen, Wei-Chen Wang, Chuang Gan, Song Han:
On-Device Training Under 256KB Memory. CoRR abs/2206.15472 (2022) - [i91]Aisha Urooj Khan, Hilde Kuehne, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Weakly Supervised Grounding for VQA in Vision-Language Transformers. CoRR abs/2207.02334 (2022) - [i90]Chuang Gan, Yi Gu, Siyuan Zhou, Jeremy Schwartz, Seth Alter, James Traer, Dan Gutfreund, Joshua B. Tenenbaum, Josh H. McDermott, Antonio Torralba:
Finding Fallen Objects Via Asynchronous Audio-Visual Integration. CoRR abs/2207.03483 (2022) - [i89]Yining Hong, Yilun Du, Chunru Lin, Joshua B. Tenenbaum, Chuang Gan:
3D Concept Grounding on Neural Fields. CoRR abs/2207.06403 (2022) - [i88]Hongbin Lin, Yifan Zhang, Zhen Qiu, Shuaicheng Niu, Chuang Gan, Yanxia Liu, Mingkui Tan:
Prototype-Guided Continual Adaptation for Class-Incremental Unsupervised Domain Adaptation. CoRR abs/2207.10856 (2022) - [i87]Jinkai Zheng, Xinchen Liu, Xiaoyan Gu, Yaoqi Sun, Chuang Gan, Jiyong Zhang, Wu Liu, Chenggang Yan:
Gait Recognition in the Wild with Multi-hop Temporal Switch. CoRR abs/2209.00355 (2022) - [i86]Kefan Su, Siyuan Zhou, Chuang Gan, Xiangjun Wang, Zongqing Lu:
MA2QL: A Minimalist Approach to Fully Decentralized Multi-Agent Reinforcement Learning. CoRR abs/2209.08244 (2022) - [i85]Xinyu Sun, Peihao Chen, Liangwei Chen, Thomas H. Li, Mingkui Tan, Chuang Gan:
M3Video: Masked Motion Modeling for Self-Supervised Video Representation Learning. CoRR abs/2210.06096 (2022) - [i84]Matt Deitke, Dhruv Batra, Yonatan Bisk, Tommaso Campari, Angel X. Chang, Devendra Singh Chaplot, Changan Chen, Claudia Pérez-D'Arpino, Kiana Ehsani, Ali Farhadi, Li Fei-Fei, Anthony G. Francis, Chuang Gan, Kristen Grauman, David Hall, Winson Han, Unnat Jain, Aniruddha Kembhavi, Jacob Krantz, Stefan Lee, Chengshu Li, Sagnik Majumder, Oleksandr Maksymets, Roberto Martín-Martín, Roozbeh Mottaghi, Sonia Raychaudhuri, Mike Roberts, Silvio Savarese, Manolis Savva, Mohit Shridhar, Niko Sünderhauf, Andrew Szot, Ben Talbot, Joshua B. Tenenbaum, Jesse Thomason, Alexander Toshev, Joanne Truong, Luca Weihs, Jiajun Wu:
Retrospectives on the Embodied AI Workshop. CoRR abs/2210.06849 (2022) - [i83]Jiaqi Han, Wenbing Huang, Hengbo Ma, Jiachen Li, Joshua B. Tenenbaum, Chuang Gan:
Learning Physical Dynamics with Subequivariant Graph Neural Networks. CoRR abs/2210.06876 (2022) - [i82]Peihao Chen, Dongyu Ji, Kunyang Lin, Weiwen Hu, Wenbing Huang, Thomas H. Li, Mingkui Tan, Chuang Gan:
Learning Active Camera for Multi-Object Navigation. CoRR abs/2210.07505 (2022) - [i81]Peihao Chen, Dongyu Ji, Kunyang Lin, Runhao Zeng, Thomas H. Li, Mingkui Tan, Chuang Gan:
Weakly-Supervised Multi-Granularity Map Learning for Vision-and-Language Navigation. CoRR abs/2210.07506 (2022) - [i80]Yi Gu, Shunyu Yao, Chuang Gan, Joshua B. Tenenbaum, Mo Yu:
Revisiting the Roles of "Text" in Text Games. CoRR abs/2210.08384 (2022) - [i79]Mo Yu, Xiaoxiao Guo, Yufei Feng, Yi Gu, Xiaodan Zhu, Michael A. Greenspan, Murray Campbell, Chuang Gan:
JECC: Commonsense Reasoning Tasks Derived from Interactive Fictions. CoRR abs/2210.15456 (2022) - [i78]Xingyu Lin, Carl Qi, Yunchu Zhang, Zhiao Huang, Katerina Fragkiadaki, Yunzhu Li, Chuang Gan, David Held:
Planning with Spatial-Temporal Abstraction from Point Clouds for Deformable Object Manipulation. CoRR abs/2210.15751 (2022) - [i77]Jinghan Jia, Shashank Srikant, Tamara Mitrovska, Chuang Gan, Shiyu Chang, Sijia Liu, Una-May O'Reilly:
CLAWSAT: Towards Both Robust and Accurate Code Models. CoRR abs/2211.11711 (2022) - [i76]Zitian Chen, Yikang Shen, Mingyu Ding, Zhenfang Chen, Hengshuang Zhao, Erik G. Learned-Miller, Chuang Gan:
Mod-Squad: Designing Mixture of Experts As Modular Multi-Task Learners. CoRR abs/2212.08066 (2022) - 2021
- [j8]Zhengzheng Tu, Ajian Zhou, Chuang Gan, Bo Jiang, Amir Hussain, Bin Luo:
A novel domain activation mapping-guided network (DA-GNT) for visual tracking. Neurocomputing 449: 443-454 (2021) - [j7]Kun Liu, Wu Liu, Huadong Ma, Mingkui Tan, Chuang Gan:
A Real-Time Action Representation With Temporal Encoding and Deep Compression. IEEE Trans. Circuits Syst. Video Technol. 31(2): 647-660 (2021) - [c81]Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. AAAI 2021: 1045-1053 - [c80]Wenhao Wu, Dongliang He, Tianwei Lin, Fu Li, Chuang Gan, Errui Ding:
MVFNet: Multi-View Fusion Network for Efficient Video Recognition. AAAI 2021: 2943-2951 - [c79]Zelin Zhao, Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua B. Tenenbaum:
Augmenting Policy Learning with Routines Discovered from a Single Demonstration. AAAI 2021: 11024-11032 - [c78]Aisha Urooj Khan, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules. CVPR 2021: 8465-8474 - [c77]Yilun Du, Chuang Gan, Phillip Isola:
Curious Representation Learning for Embodied Intelligence. ICCV 2021: 10388-10397 - [c76]Ren Wang, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Chuang Gan, Meng Wang:
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning. ICLR 2021 - [c75]Zhenfang Chen, Jiayuan Mao, Jiajun Wu, Kwan-Yee Kenneth Wong, Joshua B. Tenenbaum, Chuang Gan:
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning. ICLR 2021 - [c74]Zhiao Huang, Yuanming Hu, Tao Du, Siyuan Zhou, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics. ICLR 2021 - [c73]Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron C. Courville, Joshua B. Tenenbaum, Chuang Gan:
Learning Task Decomposition with Ordered Memory Policy Network. ICLR 2021 - [c72]Mingxuan Jing, Wenbing Huang, Fuchun Sun, Xiaojian Ma, Tao Kong, Chuang Gan, Lei Li:
Adversarial Option-Aware Hierarchical Imitation Learning. ICML 2021: 5097-5106 - [c71]Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David D. Cox, Mark Hasegawa-Johnson:
Global Prosody Style Transfer Without Text Transcriptions. ICML 2021: 8650-8660 - [c70]Tianmin Shu, Abhishek Bhandwaldar, Chuang Gan, Kevin A. Smith, Shari Liu, Dan Gutfreund, Elizabeth S. Spelke, Joshua B. Tenenbaum, Tomer D. Ullman:
AGENT: A Benchmark for Core Psychological Reasoning. ICML 2021: 9614-9625 - [c69]Jiayuan Mao, Zhezheng Luo, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu, Leslie Pack Kaelbling, Tomer D. Ullman:
Temporal and Object Quantification Networks. IJCAI 2021: 2804-2811 - [c68]Qingyuan Zhan, Guixing Wu, Chuang Gan:
MAGCN: A Multi-Adaptive Graph Convolutional Network for Traffic Forecasting. IJCNN 2021: 1-8 - [c67]Chuang Gan, Abhishek Bhandwaldar, Antonio Torralba, Joshua B. Tenenbaum, Phillip Isola:
OPEn: An Open-ended Physics Environment for Learning Without a Task. IROS 2021: 5878-5885 - [c66]Pengzhan Sun, Bo Wu, Xunsong Li, Wen Li, Lixin Duan, Chuang Gan:
Counterfactual Debiasing Inference for Compositional Action Recognition. ACM Multimedia 2021: 3220-3228 - [c65]Yuhan Zhang, Bo Wu, Wen Li, Lixin Duan, Chuang Gan:
STST: Spatial-Temporal Specialized Transformer for Skeleton-based Action Recognition. ACM Multimedia 2021: 3229-3237 - [c64]Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Josh Tenenbaum, Chuang Gan:
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language. NeurIPS 2021: 887-899 - [c63]Chuang Gan, Jeremy Schwartz, Seth Alter, Damian Mrowca, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Michael Lingelbach, Aidan Curtis, Kevin T. Feigelis, Daniel Bear, Dan Gutfreund, David D. Cox, Antonio Torralba, James J. DiCarlo, Josh Tenenbaum, Josh H. McDermott, Dan Yamins:
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation. NeurIPS Datasets and Benchmarks 2021 - [c62]Ji Lin, Wei-Ming Chen, Han Cai, Chuang Gan, Song Han:
Memory-efficient Patch-based Inference for Tiny Deep Learning. NeurIPS 2021: 2346-2358 - [c61]Yining Hong, Li Yi, Josh Tenenbaum, Antonio Torralba, Chuang Gan:
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning. NeurIPS 2021: 17427-17440 - [c60]Lijie Fan, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Chuang Gan:
When does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning? NeurIPS 2021: 21480-21492 - [c59]Bo Wu, Shoubin Yu, Zhenfang Chen, Josh Tenenbaum, Chuang Gan:
STAR: A Benchmark for Situated Reasoning in Real-World Videos. NeurIPS Datasets and Benchmarks 2021 - [c58]Yongfeng Zhang, Min Zhang, Hanxiong Chen, Xu Chen, Xianjie Chen, Chuang Gan, Tong Sun, Xin Luna Dong:
The 1st International Workshop on Machine Reasoning: International Machine Reasoning Conference (MRC 2021). WSDM 2021: 1161-1162 - [i75]Ren Wang, Kaidi Xu, Sijia Liu, Pin-Yu Chen, Tsui-Wei Weng, Chuang Gan, Meng Wang:
On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning. CoRR abs/2102.10454 (2021) - [i74]Tianmin Shu, Abhishek Bhandwaldar, Chuang Gan, Kevin A. Smith, Shari Liu, Dan Gutfreund, Elizabeth S. Spelke, Joshua B. Tenenbaum, Tomer D. Ullman:
AGENT: A Benchmark for Core Psychological Reasoning. CoRR abs/2102.12321 (2021) - [i73]Yuchen Lu, Yikang Shen, Siyuan Zhou, Aaron C. Courville, Joshua B. Tenenbaum, Chuang Gan:
Learning Task Decomposition with Ordered Memory Policy Network. CoRR abs/2103.10972 (2021) - [i72]Chuang Gan, Siyuan Zhou, Jeremy Schwartz, Seth Alter, Abhishek Bhandwaldar, Dan Gutfreund, Daniel L. K. Yamins, James J. DiCarlo, Josh H. McDermott, Antonio Torralba, Joshua B. Tenenbaum:
The ThreeDWorld Transport Challenge: A Visually Guided Task-and-Motion Planning Benchmark for Physically Realistic Embodied AI. CoRR abs/2103.14025 (2021) - [i71]Yihong Xu, Yutong Ban, Guillaume Delorme, Chuang Gan, Daniela Rus, Xavier Alameda-Pineda:
TransCenter: Transformers with Dense Queries for Multiple-Object Tracking. CoRR abs/2103.15145 (2021) - [i70]Zhenfang Chen, Jiayuan Mao, Jiajun Wu, Kwan-Yee Kenneth Wong, Joshua B. Tenenbaum, Chuang Gan:
Grounding Physical Concepts of Objects and Events Through Dynamic Visual Reasoning. CoRR abs/2103.16564 (2021) - [i69]Zhiao Huang, Yuanming Hu, Tao Du, Siyuan Zhou, Hao Su, Joshua B. Tenenbaum, Chuang Gan:
PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics. CoRR abs/2104.03311 (2021) - [i68]Yilun Du, Chuang Gan, Phillip Isola:
Curious Representation Learning for Embodied Intelligence. CoRR abs/2105.01060 (2021) - [i67]Aisha Urooj Khan, Hilde Kuehne, Kevin Duarte, Chuang Gan, Niels da Vitoria Lobo, Mubarak Shah:
Found a Reason for me? Weakly-supervised Grounded Visual Question Answering using Capsules. CoRR abs/2105.04836 (2021) - [i66]Mingxuan Jing, Wenbing Huang, Fuchun Sun, Xiaojian Ma, Tao Kong, Chuang Gan, Lei Li:
Adversarial Option-Aware Hierarchical Imitation Learning. CoRR abs/2106.05530 (2021) - [i65]Jiayuan Mao, Zhezheng Luo, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu, Leslie Pack Kaelbling, Tomer D. Ullman:
Temporal and Object Quantification Networks. CoRR abs/2106.05891 (2021) - [i64]Shaobo Min, Qi Dai, Hongtao Xie, Chuang Gan, Yongdong Zhang, Jingdong Wang:
Cross-Modal Attention Consistency for Video-Audio Unsupervised Learning. CoRR abs/2106.06939 (2021) - [i63]Kaizhi Qian, Yang Zhang, Shiyu Chang, Jinjun Xiong, Chuang Gan, David D. Cox, Mark Hasegawa-Johnson:
Global Rhythm Style Transfer Without Text Transcriptions. CoRR abs/2106.08519 (2021) - [i62]Ao Liu, Xiaoyu Chen, Sijia Liu, Lirong Xia, Chuang Gan:
Certifiably Robust Interpretation via Renyi Differential Privacy. CoRR abs/2107.01561 (2021) - [i61]Konrad Heidler, Lichao Mou, Di Hu, Pu Jin, Guangyao Li, Chuang Gan, Ji-Rong Wen, Xiao Xiang Zhu:
Self-supervised Audiovisual Representation Learning for Remote Sensing Data. CoRR abs/2108.00688 (2021) - [i60]Ji Lin, Chuang Gan, Kuan Wang, Song Han:
TSM: Temporal Shift Module for Efficient and Scalable Video Understanding on Edge Device. CoRR abs/2109.13227 (2021) - [i59]Chuang Gan, Abhishek Bhandwaldar, Antonio Torralba, Joshua B. Tenenbaum, Phillip Isola:
OPEn: An Open-ended Physics Environment for Learning Without a Task. CoRR abs/2110.06912 (2021) - [i58]Han Cai, Chuang Gan, Ji Lin, Song Han:
Network Augmentation for Tiny Deep Learning. CoRR abs/2110.08890 (2021) - [i57]Ji Lin, Wei-Ming Chen, Han Cai, Chuang Gan, Song Han:
MCUNetV2: Memory-Efficient Patch-based Inference for Tiny Deep Learning. CoRR abs/2110.15352 (2021) - [i56]Mingyu Ding, Zhenfang Chen, Tao Du, Ping Luo, Joshua B. Tenenbaum, Chuang Gan:
Dynamic Visual Reasoning by Learning Differentiable Physics Models from Video and Language. CoRR abs/2110.15358 (2021) - [i55]Lijie Fan, Sijia Liu, Pin-Yu Chen, Gaoyuan Zhang, Chuang Gan:
When Does Contrastive Learning Preserve Adversarial Robustness from Pretraining to Finetuning? CoRR abs/2111.01124 (2021) - [i54]Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan:
Graph Convolutional Module for Temporal Action Localization in Videos. CoRR abs/2112.00302 (2021) - [i53]Yining Hong, Li Yi, Joshua B. Tenenbaum, Antonio Torralba, Chuang Gan:
PTR: A Benchmark for Part-based Conceptual, Relational, and Physical Reasoning. CoRR abs/2112.05136 (2021) - 2020
- [j6]Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan:
Generating Visually Aligned Sound From Videos. IEEE Trans. Image Process. 29: 8292-8302 (2020) - [j5]Peihao Chen, Chuang Gan, Guangyao Shen, Wenbing Huang, Runhao Zeng, Mingkui Tan:
Relation Attention for Temporal Action Localization. IEEE Trans. Multim. 22(10): 2723-2733 (2020) - [c57]Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-Aware Graph Convolutional Networks for Video Question Answering. AAAI 2020: 11021-11028 - [c56]Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan, Song Han:
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing. ACL 2020: 7675-7688 - [c55]Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CVPR 2020: 10284-10293 - [c54]Chuang Gan, Deng Huang, Hang Zhao, Joshua B. Tenenbaum, Antonio Torralba:
Music Gesture for Visual Sound Separation. CVPR 2020: 10475-10484 - [c53]Zhijian Liu, Zhanghao Wu, Chuang Gan, Ligeng Zhu, Song Han:
DataMix: Efficient Privacy-Preserving Edge-Cloud Inference. ECCV (11) 2020: 578-595 - [c52]Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. ECCV (11) 2020: 758-775 - [c51]Xiaoxiao Guo, Mo Yu, Yupeng Gao, Chuang Gan, Murray Campbell, Shiyu Chang:
Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning. EMNLP (1) 2020: 7755-7765 - [c50]Han Cai, Chuang Gan, Tianzhe Wang, Zhekai Zhang, Song Han:
Once-for-All: Train One Network and Specialize it for Efficient Deployment. ICLR 2020 - [c49]Kexin Yi, Chuang Gan, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum:
CLEVRER: Collision Events for Video Representation and Reasoning. ICLR 2020 - [c48]Zhoutong Zhang, Yunyun Wang, Chuang Gan, Jiajun Wu, Joshua B. Tenenbaum, Antonio Torralba, William T. Freeman:
Deep Audio Priors Emerge From Harmonic Convolutional Networks. ICLR 2020 - [c47]Chuang Gan, Yiwei Zhang, Jiajun Wu, Boqing Gong, Joshua B. Tenenbaum:
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation. ICRA 2020: 9701-9707 - [c46]Haoming Xu, Runhao Zeng, Qingyao Wu, Mingkui Tan, Chuang Gan:
Cross-Modal Relation-Aware Networks for Audio-Visual Event Localization. ACM Multimedia 2020: 3893-3901 - [c45]Xin Li, Tianwei Lin, Xiao Liu, Wangmeng Zuo, Chao Li, Xiang Long, Dongliang He, Fu Li, Shilei Wen, Chuang Gan:
Deep Concept-wise Temporal Convolutional Networks for Action Localization. ACM Multimedia 2020: 4004-4012 - [c44]Wu Liu, Chuang Gan, Jingkuan Song, Dingwen Zhang, Wenbing Huang, John Smith:
HUMA'20: 1st International Workshop on Human-Centric Multimedia Analysis. ACM Multimedia 2020: 4763-4764 - [c43]Han Cai, Chuang Gan, Ligeng Zhu, Song Han:
TinyTL: Reduce Memory, Not Parameters for Efficient On-Device Learning. NeurIPS 2020 - [c42]Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han:
MCUNet: Tiny Deep Learning on IoT Devices. NeurIPS 2020 - [e1]Wu Liu, Chuang Gan, John R. Smith, Jingkuan Song, Dingwen Zhang, Wenbing Huang:
HuMA'20: Proceedings of the 1st International Workshop on Human-centric Multimedia Analysis, Seattle, WA, USA, October12, 2020. ACM 2020, ISBN 978-1-4503-8151-2 [contents] - [i52]Chi Han, Jiayuan Mao, Chuang Gan, Joshua B. Tenenbaum, Jiajun Wu:
Visual Concept-Metaconcept Learning. CoRR abs/2002.01464 (2020) - [i51]Runhao Zeng, Haoming Xu, Wenbing Huang, Peihao Chen, Mingkui Tan, Chuang Gan:
Dense Regression Network for Video Grounding. CoRR abs/2004.03545 (2020) - [i50]Chuang Gan, Deng Huang, Hang Zhao, Joshua B. Tenenbaum, Antonio Torralba:
Music Gesture for Visual Sound Separation. CoRR abs/2004.09476 (2020) - [i49]Hanrui Wang, Zhanghao Wu, Zhijian Liu, Han Cai, Ligeng Zhu, Chuang Gan, Song Han:
HAT: Hardware-Aware Transformers for Efficient Natural Language Processing. CoRR abs/2005.14187 (2020) - [i48]Kun Liu, Wu Liu, Huadong Ma, Mingkui Tan, Chuang Gan:
A Real-time Action Representation with Temporal Encoding and Deep Compression. CoRR abs/2006.09675 (2020) - [i47]Kun Liu, Xun Yang, Tat-Seng Chua, Huadong Ma, Chuang Gan:
Language Guided Networks for Cross-modal Moment Retrieval. CoRR abs/2006.10457 (2020) - [i46]Chuang Gan, Jeremy Schwartz, Seth Alter, Martin Schrimpf, James Traer, Julian De Freitas, Jonas Kubilius, Abhishek Bhandwaldar, Nick Haber, Megumi Sano, Kuno Kim, Elias Wang, Damian Mrowca, Michael Lingelbach, Aidan Curtis, Kevin T. Feigelis, Daniel M. Bear, Dan Gutfreund, David D. Cox, James J. DiCarlo, Josh H. McDermott, Joshua B. Tenenbaum, Daniel L. K. Yamins:
ThreeDWorld: A Platform for Interactive Multi-Modal Physical Simulation. CoRR abs/2007.04954 (2020) - [i45]Ji Lin, Wei-Ming Chen, Yujun Lin, John Cohn, Chuang Gan, Song Han:
MCUNet: Tiny Deep Learning on IoT Devices. CoRR abs/2007.10319 (2020) - [i44]Chuang Gan, Deng Huang, Peihao Chen, Joshua B. Tenenbaum, Antonio Torralba:
Foley Music: Learning to Generate Music from Videos. CoRR abs/2007.10984 (2020) - [i43]Han Cai, Chuang Gan, Ligeng Zhu, Song Han:
Tiny Transfer Learning: Towards Memory-Efficient On-Device Learning. CoRR abs/2007.11622 (2020) - [i42]Chuang Gan, Xiaoyu Chen, Phillip Isola, Antonio Torralba, Joshua B. Tenenbaum:
Noisy Agents: Self-supervised Exploration by Predicting Auditory Events. CoRR abs/2007.13729 (2020) - [i41]Peihao Chen, Yang Zhang, Mingkui Tan, Hongdong Xiao, Deng Huang, Chuang Gan:
Generating Visually Aligned Sound from Videos. CoRR abs/2008.00820 (2020) - [i40]Deng Huang, Peihao Chen, Runhao Zeng, Qing Du, Mingkui Tan, Chuang Gan:
Location-aware Graph Convolutional Networks for Video Question Answering. CoRR abs/2008.09105 (2020) - [i39]Xiaoxiao Guo, Mo Yu, Yupeng Gao, Chuang Gan, Murray Campbell, Shiyu Chang:
Interactive Fiction Game Playing as Multi-Paragraph Reading Comprehension with Reinforcement Learning. CoRR abs/2010.02386 (2020) - [i38]Yu Sun, Qian Bao, Wu Liu, Wenpeng Gao, Yili Fu, Chuang Gan, Tao Mei:
Synthetic Training for Monocular Human Mesh Recovery. CoRR abs/2010.14036 (2020) - [i37]Peihao Chen, Deng Huang, Dongliang He, Xiang Long, Runhao Zeng, Shilei Wen, Mingkui Tan, Chuang Gan:
RSPNet: Relative Speed Perception for Unsupervised Video Representation Learning. CoRR abs/2011.07949 (2020) - [i36]Wenhao Wu, Dongliang He, Tianwei Lin, Fu Li, Chuang Gan, Errui Ding:
MVFNet: Multi-View Fusion Network for Efficient Video Recognition. CoRR abs/2012.06977 (2020) - [i35]Jianwei Yang, Jiayuan Mao, Jiajun Wu, Devi Parikh, David D. Cox, Joshua B. Tenenbaum, Chuang Gan:
Object-Centric Diagnosis of Visual Reasoning. CoRR abs/2012.11587 (2020) - [i34]Zelin Zhao, Chuang Gan, Jiajun Wu, Xiaoxiao Guo, Joshua B. Tenenbaum:
Augmenting Policy Learning with Routines Discovered from a Demonstration. CoRR abs/2012.12469 (2020)
2010 – 2019
- 2019
- [j4]Wen-bing Huang, Lijie Fan, Mehrtash Harandi, Lin Ma, Huaping Liu, Wei Liu, Chuang Gan:
Toward Efficient Action Recognition: Principal Backpropagation for Training Two-Stream Networks. IEEE Trans. Image Process. 28(4): 1773-1782 (2019) - [j3]Runhao Zeng, Chuang Gan, Peihao Chen, Wenbing Huang, Qingyao Wu, Mingkui Tan:
Breaking Winner-Takes-All: Iterative-Winners-Out Networks for Weakly Supervised Temporal Action Localization. IEEE Trans. Image Process. 28(12): 5797-5808 (2019) - [c41]Lijie Fan, Wenbing Huang, Chuang Gan, Junzhou Huang, Boqing Gong:
Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation. AAAI 2019: 3510-3517 - [c40]Dongliang He, Zhichao Zhou, Chuang Gan, Fu Li, Xiao Liu, Yandong Li, Limin Wang, Shilei Wen:
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition. AAAI 2019: 8401-8408 - [c39]Xiangpeng Li, Jingkuan Song, Lianli Gao, Xianglong Liu, Wenbing Huang, Xiangnan He, Chuang Gan:
Beyond RNNs: Positional Self-Attention with Co-Attention for Video Question Answering. AAAI 2019: 8658-8665 - [c38]Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh H. McDermott, Antonio Torralba:
Self-Supervised Segmentation and Source Separation on Videos. CVPR Workshops 2019: 0 - [c37]Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh H. McDermott, Antonio Torralba:
Self-supervised Audio-visual Co-segmentation. ICASSP 2019: 2357-2361 - [c36]Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba:
The Sound of Motions. ICCV 2019: 1735-1744 - [c35]Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-Supervised Moving Vehicle Tracking With Stereo Sound. ICCV 2019: 7052-7061 - [c34]Ji Lin, Chuang Gan, Song Han:
TSM: Temporal Shift Module for Efficient Video Understanding. ICCV 2019: 7082-7092 - [c33]Runhao Zeng, Wenbing Huang, Chuang Gan, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang:
Graph Convolutional Networks for Temporal Action Localization. ICCV 2019: 7093-7102 - [c32]Ji Lin, Chuang Gan, Song Han:
Defensive Quantization: When Efficiency Meets Robustness. ICLR (Poster) 2019 - [c31]Jiayuan Mao, Chuang Gan, Pushmeet Kohli, Joshua B. Tenenbaum, Jiajun Wu:
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision. ICLR 2019 - [c30]Xuguang Duan, Qi Wu, Chuang Gan, Yiwei Zhang, Wenbing Huang, Anton van den Hengel, Wenwu Zhu:
Watch, Reason and Code: Learning to Represent Videos Using Program. ACM Multimedia 2019: 1543-1551 - [c29]Guangyao Shen, Wenbing Huang, Chuang Gan, Mingkui Tan, Junzhou Huang, Wenwu Zhu, Boqing Gong:
Facial Image-to-Video Translation by a Hidden Affine Transformation. ACM Multimedia 2019: 2505-2513 - [c28]Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan:
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement. NeurIPS 2019: 239-249 - [c27]Jianwei Yang, Zhile Ren, Chuang Gan, Hongyuan Zhu, Devi Parikh:
Cross-channel Communication Networks. NeurIPS 2019: 1295-1304 - [c26]Chi Han, Jiayuan Mao, Chuang Gan, Josh Tenenbaum, Jiajun Wu:
Visual Concept-Metaconcept Learning. NeurIPS 2019: 5002-5013 - [i33]Kaidi Xu, Sijia Liu, Gaoyuan Zhang, Mengshu Sun, Pu Zhao, Quanfu Fan, Chuang Gan, Xue Lin:
Interpreting Adversarial Examples by Activation Promotion and Suppression. CoRR abs/1904.02057 (2019) - [i32]Hang Zhao, Chuang Gan, Wei-Chiu Ma, Antonio Torralba:
The Sound of Motions. CoRR abs/1904.05979 (2019) - [i31]Ji Lin, Chuang Gan, Song Han:
Defensive Quantization: When Efficiency Meets Robustness. CoRR abs/1904.08444 (2019) - [i30]Andrew Rouditchenko, Hang Zhao, Chuang Gan, Josh H. McDermott, Antonio Torralba:
Self-Supervised Audio-Visual Co-Segmentation. CoRR abs/1904.09013 (2019) - [i29]Jiayuan Mao, Chuang Gan, Pushmeet Kohli, Joshua B. Tenenbaum, Jiajun Wu:
The Neuro-Symbolic Concept Learner: Interpreting Scenes, Words, and Sentences From Natural Supervision. CoRR abs/1904.12584 (2019) - [i28]Xin Li, Tianwei Lin, Xiao Liu, Chuang Gan, Wangmeng Zuo, Chao Li, Xiang Long, Dongliang He, Fu Li, Shilei Wen:
Deep Concept-wise Temporal Convolutional Networks for Action Localization. CoRR abs/1908.09442 (2019) - [i27]Han Cai, Chuang Gan, Song Han:
Once for All: Train One Network and Specialize it for Efficient Deployment. CoRR abs/1908.09791 (2019) - [i26]Runhao Zeng, Wenbing Huang, Mingkui Tan, Yu Rong, Peilin Zhao, Junzhou Huang, Chuang Gan:
Graph Convolutional Networks for Temporal Action Localization. CoRR abs/1909.03252 (2019) - [i25]Ji Lin, Chuang Gan, Song Han:
Training Kinetics in 15 Minutes: Large-scale Distributed Training on Videos. CoRR abs/1910.00932 (2019) - [i24]Kexin Yi, Chuang Gan, Yunzhu Li, Pushmeet Kohli, Jiajun Wu, Antonio Torralba, Joshua B. Tenenbaum:
CLEVRER: CoLlision Events for Video REpresentation and Reasoning. CoRR abs/1910.01442 (2019) - [i23]Chao Yang, Xiaojian Ma, Wenbing Huang, Fuchun Sun, Huaping Liu, Junzhou Huang, Chuang Gan:
Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement. CoRR abs/1910.04417 (2019) - [i22]Fan Yang, Xiao Liu, Dongliang He, Chuang Gan, Jian Wang, Chao Li, Fu Li, Shilei Wen:
TruNet: Short Videos Generation from Long Videos via Story-Preserving Truncation. CoRR abs/1910.05899 (2019) - [i21]Chuang Gan, Hang Zhao, Peihao Chen, David D. Cox, Antonio Torralba:
Self-supervised Moving Vehicle Tracking with Stereo Sound. CoRR abs/1910.11760 (2019) - [i20]Chuang Gan, Yiwei Zhang, Jiajun Wu, Boqing Gong, Joshua B. Tenenbaum:
Look, Listen, and Act: Towards Audio-Visual Embodied Navigation. CoRR abs/1912.11684 (2019) - 2018
- [j2]Xiang Long, Chuang Gan, Gerard de Melo:
Video Captioning with Multi-Faceted Attention. Trans. Assoc. Comput. Linguistics 6: 173-184 (2018) - [c25]Kun Liu, Wu Liu, Chuang Gan, Mingkui Tan, Huadong Ma:
T-C3D: Temporal Convolutional 3D Network for Real-Time Action Recognition. AAAI 2018: 7138-7145 - [c24]Xiang Long, Chuang Gan, Gerard de Melo, Xiao Liu, Yandong Li, Fu Li, Shilei Wen:
Multimodal Keyless Attention Fusion for Video Classification. AAAI 2018: 7202-7209 - [c23]Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman:
Sparse, Smart Contours to Represent and Edit Images. CVPR 2018: 3511-3520 - [c22]Chuang Gan, Boqing Gong, Kun Liu, Hao Su, Leonidas J. Guibas:
Geometry Guided Convolutional Neural Networks for Self-Supervised Video Representation Learning. CVPR 2018: 5589-5597 - [c21]Lijie Fan, Wen-bing Huang, Chuang Gan, Stefano Ermon, Boqing Gong, Junzhou Huang:
End-to-End Learning of Motion Representation for Video Understanding. CVPR 2018: 6016-6025 - [c20]Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen:
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification. CVPR 2018: 7834-7843 - [c19]Xingyi Zhou, Arjun Karpur, Chuang Gan, Linjie Luo, Qixing Huang:
Unsupervised Domain Adaptation for 3D Keypoint Estimation via View Consistency. ECCV (12) 2018: 141-157 - [c18]Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh H. McDermott, Antonio Torralba:
The Sound of Pixels. ECCV (1) 2018: 587-604 - [c17]Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, Josh Tenenbaum:
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. NeurIPS 2018: 1039-1050 - [c16]Xuguang Duan, Wen-bing Huang, Chuang Gan, Jingdong Wang, Wenwu Zhu, Junzhou Huang:
Weakly Supervised Dense Event Captioning in Videos. NeurIPS 2018: 3063-3073 - [i19]Lijie Fan, Wen-bing Huang, Chuang Gan, Stefano Ermon, Boqing Gong, Junzhou Huang:
End-to-End Learning of Motion Representation for Video Understanding. CoRR abs/1804.00413 (2018) - [i18]Hang Zhao, Chuang Gan, Andrew Rouditchenko, Carl Vondrick, Josh H. McDermott, Antonio Torralba:
The Sound of Pixels. CoRR abs/1804.03160 (2018) - [i17]Lijie Fan, Wen-bing Huang, Chuang Gan, Junzhou Huang, Boqing Gong:
Controllable Image-to-Video Translation: A Case Study on Facial Expression Generation. CoRR abs/1808.02992 (2018) - [i16]Kexin Yi, Jiajun Wu, Chuang Gan, Antonio Torralba, Pushmeet Kohli, Joshua B. Tenenbaum:
Neural-Symbolic VQA: Disentangling Reasoning from Vision and Language Understanding. CoRR abs/1810.02338 (2018) - [i15]Dongliang He, Zhichao Zhou, Chuang Gan, Fu Li, Xiao Liu, Yandong Li, Limin Wang, Shilei Wen:
StNet: Local and Global Spatial-Temporal Modeling for Action Recognition. CoRR abs/1811.01549 (2018) - [i14]Ji Lin, Chuang Gan, Song Han:
Temporal Shift Module for Efficient Video Understanding. CoRR abs/1811.08383 (2018) - [i13]Xuguang Duan, Wen-bing Huang, Chuang Gan, Jingdong Wang, Wenwu Zhu, Junzhou Huang:
Weakly Supervised Dense Event Captioning in Videos. CoRR abs/1812.03849 (2018) - 2017
- [c15]Chuang Gan, Chen Sun, Ram Nevatia:
DECK: Discovering Event Composition Knowledge from Web Images for Zero-Shot Event Detection and Recounting in Videos. AAAI 2017: 4032-4038 - [c14]Chuang Gan, Zhe Gan, Xiaodong He, Jianfeng Gao, Li Deng:
StyleNet: Generating Attractive Visual Captions with Styles. CVPR 2017: 955-964 - [c13]Zhe Gan, Chuang Gan, Xiaodong He, Yunchen Pu, Kenneth Tran, Jianfeng Gao, Lawrence Carin, Li Deng:
Semantic Compositional Networks for Visual Captioning. CVPR 2017: 1141-1150 - [c12]Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong:
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation. ICCV 2017: 1829-1838 - [c11]Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing:
Recurrent Topic-Transition GAN for Visual Paragraph Generation. ICCV 2017: 3382-3391 - [p1]Chuang Gan, Tianbao Yang, Boqing Gong:
A Multisource Domain Generalization Approach to Visual Attribute Detection. Domain Adaptation in Computer Vision Applications 2017: 277-289 - [i12]Xiaodan Liang, Zhiting Hu, Hao Zhang, Chuang Gan, Eric P. Xing:
Recurrent Topic-Transition GAN for Visual Paragraph Generation. CoRR abs/1703.07022 (2017) - [i11]Fu Li, Chuang Gan, Xiao Liu, Yunlong Bian, Xiang Long, Yandong Li, Zhichao Li, Jie Zhou, Shilei Wen:
Temporal Modeling Approaches for Large-scale Youtube-8M Video Understanding. CoRR abs/1707.04555 (2017) - [i10]Yunlong Bian, Chuang Gan, Xiao Liu, Fu Li, Xiang Long, Yandong Li, Heng Qi, Jie Zhou, Shilei Wen, Yuanqing Lin:
Revisiting the Effectiveness of Off-the-shelf Temporal Modeling Approaches for Large-scale Video Classification. CoRR abs/1708.03805 (2017) - [i9]Chuang Gan, Yandong Li, Haoxiang Li, Chen Sun, Boqing Gong:
VQS: Linking Segmentations to Questions and Answers for Supervised Attention in VQA and Question-Focused Semantic Segmentation. CoRR abs/1708.04686 (2017) - [i8]Xiang Long, Chuang Gan, Gerard de Melo, Jiajun Wu, Xiao Liu, Shilei Wen:
Attention Clusters: Purely Attention Based Local Feature Integration for Video Classification. CoRR abs/1711.09550 (2017) - [i7]Xingyi Zhou, Arjun Karpur, Chuang Gan, Linjie Luo, Qixing Huang:
Unsupervised Domain Adaptation for 3D Keypoint Prediction from a Single Depth Scan. CoRR abs/1712.05765 (2017) - [i6]Tali Dekel, Chuang Gan, Dilip Krishnan, Ce Liu, William T. Freeman:
Smart, Sparse Contours to Represent and Edit Images. CoRR abs/1712.08232 (2017) - 2016
- [j1]Chuang Gan, Yi Yang, Linchao Zhu, Deli Zhao, Yueting Zhuang:
Recognizing an Action Using Its Name: A Knowledge-Based Approach. Int. J. Comput. Vis. 120(1): 61-77 (2016) - [c10]Chuang Gan, Ming C. Lin, Yi Yang, Gerard de Melo, Alexander G. Hauptmann:
Concepts Not Alone: Exploring Pairwise Relationships for Zero-Shot Video Activity Recognition. AAAI 2016: 3487- - [c9]Chuang Gan, Tianbao Yang, Boqing Gong:
Learning Attributes Equals Multi-Source Domain Generalization. CVPR 2016: 87-97 - [c8]Chuang Gan, Ting Yao, Kuiyuan Yang, Yi Yang, Tao Mei:
You Lead, We Exceed: Labor-Free Video Concept Learning by Jointly Exploiting Web Videos and Images. CVPR 2016: 923-932 - [c7]Chuang Gan, Chen Sun, Lixin Duan, Boqing Gong:
Webly-Supervised Video Recognition by Mutually Voting for Relevant Web Images and Web Video Frames. ECCV (3) 2016: 849-866 - [i5]