主页
外文期刊
OA 期刊
电子期刊
外文会议
中文期刊
标准
网络数据库
专业机构
企业门户
起重机械
生产工程
高级检索
关于我们
版权声明
使用帮助
会议文集
文集名
MultiMedia Modeling
会议名
30th International Conference on MultiMedia Modeling (MMM 2024)
中译名
《第三十届国际多媒体建模会议,卷4》
会议日期
January 29 - February 2, 2024
会议地点
Amsterdam, The Netherlands
出版年
2024
馆藏号
350938
题名
作者
出版年
Removing Stray-Light for Wild-Field Fundus Image Fusion Based on Large Generative Models
Jun Wu; Mingxin He; Yang Liu; Jingjie Lin; Zeyu Huang; Dayong Ding
2024
Training-Free Region Prediction with Stable Diffusion
Yuma Honbu; Keiji Yanai
2024
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption Rewrites
Lei Wang; Jiabang He; Shenshen Li; Ning Liu; Ee-Peng Lim
2024
GDTNet: A Synergistic Dilated Transformer and CNN by Gate Attention for Abdominal Multi-organ Segmentation
Can Zhang; Zhiqiang Wang; Yuan Zhang; Xuanya Li; Kai Hu
2024
Fine-Grained Multi-modal Fundus Image Generation Based on Diffusion Models for Glaucoma Classification
Xinyue Liu; Gang Yang; Yang Zhou; Yajie Yang; Weichen Huang; Dayong Ding; Jun Wu
2024
Adapting Pretrained Large-Scale Vision Models for Face Forgery Detection
Lantao Wang; Chao Ma
2024
Towards Cross-Modal Point Cloud Retrieval for Indoor Scenes
Fuyang Yu; Zhen Wang; Dongyuan Li; Peide Zhu; Xiaohui Liang; Xiaochuan Wang; Manabu Okumura
2024
Correlation Visualization Under Missing Values: A Comparison Between Imputation and Direct Parameter Estimation Methods
Nhat-Hao Pham; Khanh-Linh Vo; Mai Anh Vu; Thu Nguyen; Michael A. Riegler; Pal Halvorsen; Binh T. Nguyen
2024
IFI: Interpreting for Improving: A Multimodal Transformer with an Interpretability Technique for Recognition of Risk Events
Rupayan Mallick; Jenny Benois-Pineau; Akka Zemmari
2024
OOKPIK- A Collection of Out-of-Context Image-Caption Pairs
Kha-Luan Pham; Minh-Khoi Nguyen-Nhat; Anh-Huy Dinh; Quang-Tri Le; Manh-Thien Nguyen; Anh-Duy Tran; Minh-Triet Tran; Duc-Tien Dang-Nguyen
2024
LUMOS-DM: Landscape-Based Multimodal Scene Retrieval Enhanced by Diffusion Model
Viet-Tham Huynh; Trong-Thuan Nguyen; Quang-Thuc Nguyen; Mai-Khiem Tran; Tam V. Nguyen; Minh-Triet Tran
2024
Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video Collections
Helmut Neuschmied; Werner Bailer
2024
A Framework for 3D Modeling of Construction Sites Using Aerial Imagery and Semantic NeRFs
Panagiotis Vrachnos; Marios Krestenitis; Ilias Koulalis; Konstantinos Ioannidis; Stefanos Vrochidis
2024
Multimodal 3D Object Retrieval
Maria Pegia; Bjorn Por Jonsson; Anastasia Moumtzidou; Sotiris Diplaris; Ilias Gialampoukidis; Stefanos Vrochidis; Ioannis Kompatsiaris
2024
An Integrated System for Spatio-temporal Summarization of 360-Degrees Videos
Ioannis Kontostathis; Evlampios Apostolidis; Vasileios Mezaris
2024
Mutant Texts: A Technique for Uncovering Unexpected Inconsistencies in Large-Scale Vision-Language Models
Mingliang Liang; Zhouran Liu; Martha Larson
2024
Exploring Artificial Intelligence for Advancing Performance Processes and Events in Io3MT
Romulo Vieira; Debora Muchaluat-Saade; Pablo Cesar
2024
Implementation of Melody Slot Machines
Masatoshi Hamanaka
2024
E2Evideo: End to End Video and Image Pre-processing and Analysis Tool
Faiga Alawad; Pal Halvorsen; Michael A. Riegler
2024
Augmented Reality Photo Presentation and Content-Based Image Retrieval on Mobile Devices with AR-Explorer
Loris Sauter; Tim Bachmann; Heiko Schuldt; Luca Rossetto
2024
1
2
国家科技图书文献中心
全球文献资源网
京ICP备05055788号-26
机械工业信息研究院 2018-2024