会议文集


文集名Computer Vision - ECCV 2024
会议名18th European Conference on Computer Vision (ECCV 2024)
中译名《第十八届欧洲计算机视觉会议,卷10》
机构European Computer Vision Association (ECVA)
会议日期September 29 - October 4, 2024
会议地点Milan, Italy
出版年2025
馆藏号354363


题名作者出版年
Modeling and Driving Human Body Soundfields Through Acoustic PrimitivesChao Huang; Dejan Markovic; Chenliang Xu; Alexander Richard2025
m&m's: A Benchmark to Evaluate Tool-Use for multi-step multi-modal TasksZixian Ma; Weikai Huang; Jieyu Zhang; Tanmay Gupta; Ranjay Krishna2025
Label-Anticipated Event Disentanglement for Audio-Visual Video ParsingJinxing Zhou; Dan Guo; Yuxin Mao; Yiran Zhong; Xiaojun Chang; Meng Wang2025
High-Fidelity 3D Textured Shapes Generation by Sparse Encoding and Adversarial DecodingQi Zuo; Xiaodong Gu; Yuan Dong; Zhengyi Zhao; Weihao Yuan; Lingteng Qiu; Liefeng Bo; Zilong Dong2025
Semi-supervised Video Desnowing Network via Temporal Decoupling Experts and Distribution-Driven Contrastive RegularizationHongtao Wu; Yijun Yang; Angelica I. Aviles-Rivero; Jingjing Ren; Sixiang Chen; Haoyu Chen; Lei Zhu2025
I-MedSAM: Implicit Medical Image Segmentation with Segment AnythingXiaobao Wei; Jiajun Cao; Yizhu Jin; Ming Lu; Guangyu Wang; Shanghang Zhang2025
ReMamber: Referring Image Segmentation with Mamba TwisterYuhuan Yang; Chaofan Ma; Jiangchao Yao; Zhun Zhong; Ya Zhang; Yanfeng Wang2025
TalkingGaussian: Structure-Persistent 3D Talking Head Synthesis via Gaussian SplattingJiahe Li; Jiawei Zhang; Xiao Bai; Jin Zheng; Xin Ning; Jun Zhou; Lin Gu2025
CAT: Enhancing Multimodal Large Language Model to Answer Questions in Dynamic Audio-Visual ScenariosQilang Ye; Zitong Yu; Rui Shao; Xinyu Xie; Philip Torr; Xiaochun Cao2025
Segmentation-Guided Layer-Wise Image Vectorization with Gradient FillsHengyu Zhou; Hui Zhang; Bin Wang2025
Implicit Style-Content Separation Using B-LoRAYarden Frenkel; Yael Vinker; Ariel Shamir; Daniel Cohen-Or2025
OpenPSG: Open-Set Panoptic Scene Graph Generation via Large Multimodal ModelsZijian Zhou; Zheng Zhu; Holger Caesar; Miaojing Shi2025
ActionVOS: Actions as Prompts for Video Object SegmentationLiangyang Ouyang; Ruicong Liu; Yifei Huang; Ryosuke Furuta; Yoichi Sato2025
FALIP: Visual Prompt as Foveal Attention Boosts CLIP Zero-Shot PerformanceJiedong Zhuang; Jiaqi Hu; Lianrui Mu; Rui Hu; Xiaoyu Liang; Jiangnan Ye; Haoji Hu2025
U-COPE: Taking a Further Step to Universal 9D Category-Level Object Pose EstimationLi Zhang; Weiqing Meng; Yan Zhong; Bin Kong; Mingliang Xu; Jianming Du; Xue Wang; Rujing Wang; Liu Liu2025
Integrating Markov Blanket Discovery Into Causal Representation Learning for Domain GeneralizationNaiyu Yin; Hanjing Wang; Yue Yu; Tian Gao; Amit Dhurandhar; Qiang Ji2025
Rotary Position Embedding for Vision TransformerByeongho Heo; Song Park; Dongyoon Han; Sangdoo Yun2025
Local All-Pair Correspondence for Point TrackingSeokju Cho; Jiahui Huang; Jisu Nam; Honggyu An; Seungryong Kim; Joon-Young Lee2025
MonoWAD: Weather-Adaptive Diffusion Model for Robust Monocular 3D Object DetectionYoungmin Oh; Hyung-Il Kim; Seong Tae Kim; Jung Uk Kim2025
ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic EnvironmentsTaewoong Kim; Cheolhong Min; Byeonghwi Kim; Jinyeon Kim; Wonje Jeung; Jonghyun Choi2025
12