18th ECCV 2024: Milan, Italy - Part XXII
- Ales Leonardis, Elisa Ricci, Stefan Roth, Olga Russakovsky, Torsten Sattler, Gül Varol: Computer Vision - ECCV 2024 - 18th European Conference, Milan, Italy, September 29-October 4, 2024, Proceedings, Part XXII. Lecture Notes in Computer Science 15080, Springer 2025, ISBN 978-3-031-72669-9
- Kai Zhang, Sai Bi, Hao Tan, Yuanbo Xiangli, Nanxuan Zhao, Kalyan Sunkavalli, Zexiang Xu: GS-LRM: Large Reconstruction Model for 3D Gaussian Splatting. 1-19
- Runyi Hu, Jie Zhang, Ting Xu, Jiwei Li, Tianwei Zhang: Robust-Wide: Robust Watermarking Against Instruction-Driven Image Editing. 20-37
- Qiao Mo, Yukang Ding, Jinhua Hao, Qiang Zhu, Ming Sun, Chao Zhou, Feiyu Chen, Shuyuan Zhu: OAPT: Offset-Aware Partition Transformer for Double JPEG Artifacts Removal. 38-56
- Ryosuke Yamada, Kensho Hara, Hirokatsu Kataoka, Koshi Makihara, Nakamasa Inoue, Rio Yokota, Yutaka Satoh: Formula-Supervised Visual-Geometric Pre-training. 57-74
- Yue Fan, Xiaojian Ma, Rujie Wu, Yuntao Du, Jiaqi Li, Zhi Gao, Qing Li: VideoAgent: A Memory-Augmented Multimodal Agent for Video Understanding. 75-92
- Guanghao Zheng, Yuchen Liu, Wenrui Dai, Chenglin Li, Junni Zou, Hongkai Xiong: Towards Unified Representation of Invariant-Specific Features in Missing Modality Face Anti-spoofing. 93-110
- Shangquan Sun, Wenqi Ren, Xinwei Gao, Rui Wang, Xiaochun Cao: Restoring Images in Adverse Weather Conditions via Histogram Transformer. 111-129
- Tongkun Guan, Chengyu Lin, Wei Shen, Xiaokang Yang: PosFormer: Recognizing Complex Handwritten Mathematical Expression with Position Forest Transformer. 130-147
- Yubin Hu, Xiaoyang Guo, Yang Xiao, Jingwei Huang, Yong-Jin Liu: NGP-RT: Fusing Multi-level Hash Features with Lightweight Attention for Real-Time Novel View Synthesis. 148-165
- Han Wang, Yongjie Ye, Yanjie Wang, Yuxiang Nie, Can Huang: Elysium: Exploring Object-Level Perception in Videos via MLLM. 166-185
- Shuxiang Xie, Shuyi Zhou, Ken Sakurada, Ryoichi Ishikawa, Masaki Onishi, Takeshi Oishi: G2fR: Frequency Regularization in Grid-Based Feature Encoding Neural Radiance Fields. 186-203
- Agneet Chatterjee, Gabriela Ben Melech Stan, Estelle Aflalo, Sayak Paul, Dhruba Ghosh, Tejas Gokhale, Ludwig Schmidt, Hannaneh Hajishirzi, Vasudev Lal, Chitta Baral, Yezhou Yang: Getting it Right: Improving Spatial Consistency in Text-to-Image Models. 204-222
- Xueqi Ma, Yilin Liu, Wenjun Zhou, Ruowei Wang, Hui Huang: Generating 3D House Wireframes with Semantics. 223-240
- Xiao Fu, Wei Yin, Mu Hu, Kaixuan Wang, Yuexin Ma, Ping Tan, Shaojie Shen, Dahua Lin, Xiaoxiao Long: GeoWizard: Unleashing the Diffusion Priors for 3D Geometry Estimation from a Single Image. 241-258
- Yiyao Ma, Kai Chen, Hon-Sing Tong, Ruofeng Wei, Yui-Lun Ng, Ka-Wai Kwok, Qi Dou: Shape-Guided Configuration-Aware Learning for Endoscopic-Image-Based Pose Estimation of Flexible Robotic Instruments. 259-276
- Jianan Wei, Tianfei Zhou, Yi Yang, Wenguan Wang: Nonverbal Interaction Detection. 277-295
- Jian Zou, Tianyu Huang, Guanglei Yang, Zhenhua Guo, Tao Luo, Chun-Mei Feng, Wangmeng Zuo: UniM2AE: Multi-modal Masked Autoencoders with Unified 3D Representation for 3D Perception in Autonomous Driving. 296-313
- Minheng Ni, Yeli Shen, Lei Zhang, Wangmeng Zuo: Responsible Visual Editing. 314-330
- Weijia Wu, Zhuang Li, Yuchao Gu, Rui Zhao, Yefei He, David Junhao Zhang, Mike Zheng Shou, Yan Li, Tingting Gao, Di Zhang: DragAnything: Motion Control for Anything Using Entity Representation. 331-348
- Shuting He, Henghui Ding, Xudong Jiang, Bihan Wen: SegPoint: Segment Any Point Cloud via Large Language Model. 349-367
- Sheng Fan, Rui Liu, Wenguan Wang, Yi Yang: Navigation Instruction Generation with BEV Perception and Large Language Models. 368-387
- Taemin Park, Hyuck Lee, Heeyoung Kim: Rebalancing Using Estimated Class Distribution for Imbalanced Semi-supervised Learning Under Class Distribution Mismatch. 388-404
- Qiuhong Shen, Xingyi Yang, Michael Bi Mi, Xinchao Wang: Vista3D: Unravel the 3D Darkside of a Single Image. 405-421
- Yi Yao, Chan-Feng Hsu, Jhe-Hao Lin, Hongxia Xie, Terence Lin, Yi-Ning Huang, Hong-Han Shuai, Wen-Huang Cheng: The Fabrication of Reality and Fantasy: Scene Generation with LLM-Assisted Prompt Interpretation. 422-438
- Junjie Huang, Yun Ye, Zhujin Liang, Yi Shan, Dalong Du: Detecting as Labeling: Rethinking LiDAR-Camera Fusion in 3D Object Detection. 439-455
- Qiuhong Shen, Xingyi Yang, Xinchao Wang: FlashSplat: 2D to 3D Gaussian Splatting Segmentation Solved Optimally. 456-472
- Guanting Dong, Yueyi Zhang, Xiaoyan Sun, Zhiwei Xiong: Exploiting Dual-Correlation for Multi-frame Time-of-Flight Denoising. 473-489