会议文集


文集名Computer Vision - ECCV 2024
会议名18th European Conference on Computer Vision (ECCV 2024)
中译名《第十八届欧洲计算机视觉会议,卷62》
机构European Computer Vision Association (ECVA)
会议日期September 29 - October 4, 2024
会议地点Milan, Italy
出版年2025
馆藏号354384


题名作者出版年
Generating Physically Realistic and Directable Human Motions from Multi-modal InputsAayam Shrestha; Pan Liu; German Ros; Kai Yuan; Alan Fern2025
CoTracker: It Is Better to Track TogetherNikita Karaev; Ignacio Rocco; Benjamin Graham; Natalia Neverova; Andrea Vedaldi; Christian Rupprecht2025
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language ModelsZiyi Lin; Dongyang Liu; Renrui Zhang; Peng Gao; Longtian Qiu; Han Xiao; Han Qiu; Wenqi Shao; Keqin Chen; Jiaming Han; Siyuan Huang; Yichi Zhang; Xuming He; Yu Qiao; Hongsheng Li2025
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in PathologyYuxuan Sun; Hao Wu; Chenglu Zhu; Sunyi Zheng; Qizi Chen; Kai Zhang; Yunlong Zhang; Dan Wan; Xiaoxiao Lan; Mengyue Zheng; Jingxiong Li; Xinheng Lyu; Tao Lin; Lin Yang2025
Improving Adversarial Transferability via Model AlignmentAvery Ma; Amir-massoud Farahmand; Yangchen Pan; Philip Torr; Jindong Gu2025
RealGen: Retrieval Augmented Generation for Controllable Traffic ScenariosWenhao Ding; Yulong Cao; Ding Zhao; Chaowei Xiao; Marco Pavone2025
ADen: Adaptive Density Representations for Sparse-View Camera Pose EstimationHao Tang; Weiyao Wang; Pierre Gleize; Matt Feiszli2025
Embodied Understanding of Driving ScenariosYunsong Zhou; Linyan Huang; Qingwen Bu; Jia Zeng; Tianyu Li; Hang Qiu; Hongzi Zhu; Minyi Guo; Yu Qiao; Hongyang Li2025
Learning to Drive via Asymmetric Self-PlayChris Zhang; Sourav Biswas; Kelvin Wong; Kion Fallah; Lunjun Zhang; Dian Chen; Sergio Casas; Raquel Urtasun2025
OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance SegmentationZhening Huang; Xiaoyang Wu; Xi Chen; Hengshuang Zhao; Lei Zhu; Joan Lasenby2025
ViLA: Efficient Video-Language Alignment for Video Question AnsweringXijun Wang; Junbang Liang; Chun-Kai Wang; Kenan Deng; Yu Lou; Ming C. Lin; Shan Yang2025
Factorizing Text-to-Video Generation by Explicit Image ConditioningRohit Girdhar; Mannat Singh; Andrew Brown; Quentin Duval; Samaneh Azadi; Sai Saketh Rambhatla; Akbar Shah; Xi Yin; Devi Parikh; Ishan Misra2025
MobileDiffusion: Instant Text-to-Image Generation on Mobile DevicesYang Zhao; Yanwu Xu; Zhisheng Xiao; Haolin Jia; Tingbo Hou2025
Open-Set Biometrics: Beyond Good Closed-Set ModelsYiyang Su; Minchul Kim; Feng Liu; Anil Jain; Xiaoming Liu2025
UNIT: Backdoor Mitigation via Automated Neural Distribution TighteningSiyuan Cheng; Guangyu Shen; Kaiyuan Zhang; Guanhong Tao; Shengwei An; Hanxi Guo; Shiqing Ma; Xiangyu Zhang2025
Which Model Generated This Image? A Model-Agnostic Approach for Origin AttributionFengyuan Liu; Haochen Luo; Yiming Li; Philip Torr; Jindong Gu2025
Osmosis: RGBD Diffusion Prior for Underwater Image RestorationOpher Bar Nathan; Deborah Levy; Tali Treibitz; Dan Rosenbaum2025
Towards Adaptive Pseudo-Label Learning for Semi-Supervised Temporal Action LocalizationFeixiang Zhou; Bryan Williams; Hossein Rahmani2025
Computing the Lipschitz Constant Needed for Fast Scene Recovery from CASSI MeasurementsAnders Holst; Niels Chr. Overgaard2025
DatasetNeRF: Efficient 3D-Aware Data Factory with Generative Radiance FieldsYu Chi; Fangneng Zhan; Sibo Wu; Christian Theobalt; Adam Kortylewski2025
12