会议文集


文集名Computer Vision - ECCV 2024
会议名18th European Conference on Computer Vision (ECCV 2024)
中译名《第十八届欧洲计算机视觉会议,卷23》
机构European Computer Vision Association (ECVA)
会议日期September 29 - October 4, 2024
会议地点Milan, Italy
出版年2025
馆藏号354368


题名作者出版年
Weak-to-Strong Compositional Learning from Generative Models for Language-Based Object DetectionKwanyong Park; Kuniaki Saito; Donghyun Kim2025
Domesticating SAM for Breast Ultrasound Image Segmentation via Spatial-Frequency Fusion and Uncertainty CorrectionWanting Zhang; Huisi Wu; Jing Qin2025
CanonicalFusion: Generating Drivable 3D Human Avatars from Multiple ImagesJisu Shin; Junmyeong Lee; Seongmin Lee; Min-Gyu Park; Ju-Mi Kang; Ju Hong Yoon; Hae-Gon Jeon2025
Camera Height Doesn't Change: Unsupervised Training for Metric Monocular Road-Scene Depth EstimationGenki Kinoshita; Ko Nishino2025
Uni3DL: A Unified Model for 3D Vision-Language UnderstandingXiang Li; Jian Ding; Zhaoyang Chen; Mohamed Elhoseiny2025
Object-Aware NIR-to-Visible TranslationYunyi Gao; Lin Gu; Qiankun Liu; Ying Fu2025
PaPr: Training-Free One-Step Patch Pruning with Lightweight ConvNets for Faster InferenceTanvir Mahmud; Burhaneddin Yaman; Chun-Hao Liu; Diana Marculescu2025
GENIXER: Empowering Multimodal Large Language Model as a Powerful Data GeneratorHenry Hengyuan Zhao; Pan Zhou; Mike Zheng Shou2025
BLINK: Multimodal Large Language Models Can See but Not PerceiveXingyu Fu; Yushi Hu; Bangzheng Li; Yu Feng; Haoyu Wang; Xudong Lin; Dan Roth; Noah A. Smith; Wei-Chiu Ma; Ranjay Krishna2025
AFF-ttention! Affordances and Attention Models for Short-Term Object Interaction AnticipationLorenzo Mur-Labadia; Ruben Martinez-Cantin; Jose J. Guerrero; Giovanni Maria Farinella; Antonino Furnari2025
PreLAR: World Model Pre-training with Learnable Action RepresentationLixuan Zhang; Meina Kan; Shiguang Shan; Xilin Chen2025
Multi-HMR: Multi-person Whole-Body Human Mesh Recovery in a Single ShotFabien Baradel; Matthieu Armando; Salma Galaaoui; Romain Bregier; Philippe Weinzaepfel; Gregory Rogez; Thomas Lucas2025
De-confounded Gaze EstimationZiyang Liang; Yiwei Bao; Feng Lu2025
Diffusion Models for Monocular Depth Estimation: Overcoming Challenging ConditionsFabio Tosi; Pierluigi Zama Ramirez; Matteo Poggi2025
FreestyleRet: Retrieving Images from Style-Diversified QueriesHao Li; Yanhao Jia; Peng Jin; Zesen Cheng; Kehan Li; Jialu Sui; Chang Liu; Li Yuan2025
ReGround: Improving Textual and Spatial Grounding at No CostPhillip Y. Lee; Minhyuk Sung2025
CardiacNet: Learning to Reconstruct Abnormalities for Cardiac Disease Assessment from Echocardiogram VideosJiewen Yang; Yiqun Lin; Bin Pu; Jiarong Guo; Xiaowei Xu; Xiaomeng Li2025
LaMI-DETR: Open-Vocabulary Detection with Language Model InstructionPenghui Du; Yu Wang; Yifan Sun; Luting Wang; Yue Liao; Gang Zhang; Errui Ding; Yan Wang; Jingdong Wang; Si Liu2025
Unrolled Decomposed Unpaired Learning for Controllable Low-Light Video EnhancementLingyu Zhu; Wenhan Yang; Baoliang Chen; Hanwei Zhu; Zhangkai Ni; Qi Mao; Shiqi Wang2025
Efficient Image Pre-training with Siamese Cropped Masked AutoencodersAlexandre Eymael; Renaud Vandeghen; Anthony Cioppa; Silvio Giancola; Bernard Ghanem; Marc Van Droogenbroeck2025
12