主页
外文期刊
OA 期刊
电子期刊
外文会议
中文期刊
标准
网络数据库
专业机构
企业门户
起重机械
生产工程
高级检索
关于我们
版权声明
使用帮助
会议文集
文集名
Computer Vision - ECCV 2024
会议名
18th European Conference on Computer Vision (ECCV 2024)
中译名
《第十八届欧洲计算机视觉会议,卷62》
机构
European Computer Vision Association (ECVA)
会议日期
September 29 - October 4, 2024
会议地点
Milan, Italy
出版年
2025
馆藏号
354384
题名
作者
出版年
Generating Physically Realistic and Directable Human Motions from Multi-modal Inputs
Aayam Shrestha; Pan Liu; German Ros; Kai Yuan; Alan Fern
2025
CoTracker: It Is Better to Track Together
Nikita Karaev; Ignacio Rocco; Benjamin Graham; Natalia Neverova; Andrea Vedaldi; Christian Rupprecht
2025
SPHINX: A Mixer of Weights, Visual Embeddings and Image Scales for Multi-modal Large Language Models
Ziyi Lin; Dongyang Liu; Renrui Zhang; Peng Gao; Longtian Qiu; Han Xiao; Han Qiu; Wenqi Shao; Keqin Chen; Jiaming Han; Siyuan Huang; Yichi Zhang; Xuming He; Yu Qiao; Hongsheng Li
2025
PathMMU: A Massive Multimodal Expert-Level Benchmark for Understanding and Reasoning in Pathology
Yuxuan Sun; Hao Wu; Chenglu Zhu; Sunyi Zheng; Qizi Chen; Kai Zhang; Yunlong Zhang; Dan Wan; Xiaoxiao Lan; Mengyue Zheng; Jingxiong Li; Xinheng Lyu; Tao Lin; Lin Yang
2025
Improving Adversarial Transferability via Model Alignment
Avery Ma; Amir-massoud Farahmand; Yangchen Pan; Philip Torr; Jindong Gu
2025
RealGen: Retrieval Augmented Generation for Controllable Traffic Scenarios
Wenhao Ding; Yulong Cao; Ding Zhao; Chaowei Xiao; Marco Pavone
2025
ADen: Adaptive Density Representations for Sparse-View Camera Pose Estimation
Hao Tang; Weiyao Wang; Pierre Gleize; Matt Feiszli
2025
Embodied Understanding of Driving Scenarios
Yunsong Zhou; Linyan Huang; Qingwen Bu; Jia Zeng; Tianyu Li; Hang Qiu; Hongzi Zhu; Minyi Guo; Yu Qiao; Hongyang Li
2025
Learning to Drive via Asymmetric Self-Play
Chris Zhang; Sourav Biswas; Kelvin Wong; Kion Fallah; Lunjun Zhang; Dian Chen; Sergio Casas; Raquel Urtasun
2025
OpenIns3D: Snap and Lookup for 3D Open-Vocabulary Instance Segmentation
Zhening Huang; Xiaoyang Wu; Xi Chen; Hengshuang Zhao; Lei Zhu; Joan Lasenby
2025
ViLA: Efficient Video-Language Alignment for Video Question Answering
Xijun Wang; Junbang Liang; Chun-Kai Wang; Kenan Deng; Yu Lou; Ming C. Lin; Shan Yang
2025
Factorizing Text-to-Video Generation by Explicit Image Conditioning
Rohit Girdhar; Mannat Singh; Andrew Brown; Quentin Duval; Samaneh Azadi; Sai Saketh Rambhatla; Akbar Shah; Xi Yin; Devi Parikh; Ishan Misra
2025
MobileDiffusion: Instant Text-to-Image Generation on Mobile Devices
Yang Zhao; Yanwu Xu; Zhisheng Xiao; Haolin Jia; Tingbo Hou
2025
Open-Set Biometrics: Beyond Good Closed-Set Models
Yiyang Su; Minchul Kim; Feng Liu; Anil Jain; Xiaoming Liu
2025
UNIT: Backdoor Mitigation via Automated Neural Distribution Tightening
Siyuan Cheng; Guangyu Shen; Kaiyuan Zhang; Guanhong Tao; Shengwei An; Hanxi Guo; Shiqing Ma; Xiangyu Zhang
2025
Which Model Generated This Image? A Model-Agnostic Approach for Origin Attribution
Fengyuan Liu; Haochen Luo; Yiming Li; Philip Torr; Jindong Gu
2025
Osmosis: RGBD Diffusion Prior for Underwater Image Restoration
Opher Bar Nathan; Deborah Levy; Tali Treibitz; Dan Rosenbaum
2025
Towards Adaptive Pseudo-Label Learning for Semi-Supervised Temporal Action Localization
Feixiang Zhou; Bryan Williams; Hossein Rahmani
2025
Computing the Lipschitz Constant Needed for Fast Scene Recovery from CASSI Measurements
Anders Holst; Niels Chr. Overgaard
2025
DatasetNeRF: Efficient 3D-Aware Data Factory with Generative Radiance Fields
Yu Chi; Fangneng Zhan; Sibo Wu; Christian Theobalt; Adam Kortylewski
2025
1
2
国家科技图书文献中心
全球文献资源网
京ICP备05055788号-26
京公网安备11010202008970号 机械工业信息研究院 2018-2024