会议文集


文集名MultiMedia Modeling
会议名30th International Conference on MultiMedia Modeling (MMM 2024)
中译名《第三十届国际多媒体建模会议,卷4》
会议日期January 29 - February 2, 2024
会议地点Amsterdam, The Netherlands
出版年2024
馆藏号350938


题名作者出版年
Removing Stray-Light for Wild-Field Fundus Image Fusion Based on Large Generative ModelsJun Wu; Mingxin He; Yang Liu; Jingjie Lin; Zeyu Huang; Dayong Ding2024
Training-Free Region Prediction with Stable DiffusionYuma Honbu; Keiji Yanai2024
Mitigating Fine-Grained Hallucination by Fine-Tuning Large Vision-Language Models with Caption RewritesLei Wang; Jiabang He; Shenshen Li; Ning Liu; Ee-Peng Lim2024
GDTNet: A Synergistic Dilated Transformer and CNN by Gate Attention for Abdominal Multi-organ SegmentationCan Zhang; Zhiqiang Wang; Yuan Zhang; Xuanya Li; Kai Hu2024
Fine-Grained Multi-modal Fundus Image Generation Based on Diffusion Models for Glaucoma ClassificationXinyue Liu; Gang Yang; Yang Zhou; Yajie Yang; Weichen Huang; Dayong Ding; Jun Wu2024
Adapting Pretrained Large-Scale Vision Models for Face Forgery DetectionLantao Wang; Chao Ma2024
Towards Cross-Modal Point Cloud Retrieval for Indoor ScenesFuyang Yu; Zhen Wang; Dongyuan Li; Peide Zhu; Xiaohui Liang; Xiaochuan Wang; Manabu Okumura2024
Correlation Visualization Under Missing Values: A Comparison Between Imputation and Direct Parameter Estimation MethodsNhat-Hao Pham; Khanh-Linh Vo; Mai Anh Vu; Thu Nguyen; Michael A. Riegler; Pal Halvorsen; Binh T. Nguyen2024
IFI: Interpreting for Improving: A Multimodal Transformer with an Interpretability Technique for Recognition of Risk EventsRupayan Mallick; Jenny Benois-Pineau; Akka Zemmari2024
OOKPIK- A Collection of Out-of-Context Image-Caption PairsKha-Luan Pham; Minh-Khoi Nguyen-Nhat; Anh-Huy Dinh; Quang-Tri Le; Manh-Thien Nguyen; Anh-Duy Tran; Minh-Triet Tran; Duc-Tien Dang-Nguyen2024
LUMOS-DM: Landscape-Based Multimodal Scene Retrieval Enhanced by Diffusion ModelViet-Tham Huynh; Trong-Thuan Nguyen; Quang-Thuc Nguyen; Mai-Khiem Tran; Tam V. Nguyen; Minh-Triet Tran2024
Mining Landmark Images for Scene Reconstruction from Weakly Annotated Video CollectionsHelmut Neuschmied; Werner Bailer2024
A Framework for 3D Modeling of Construction Sites Using Aerial Imagery and Semantic NeRFsPanagiotis Vrachnos; Marios Krestenitis; Ilias Koulalis; Konstantinos Ioannidis; Stefanos Vrochidis2024
Multimodal 3D Object RetrievalMaria Pegia; Bjorn Por Jonsson; Anastasia Moumtzidou; Sotiris Diplaris; Ilias Gialampoukidis; Stefanos Vrochidis; Ioannis Kompatsiaris2024
An Integrated System for Spatio-temporal Summarization of 360-Degrees VideosIoannis Kontostathis; Evlampios Apostolidis; Vasileios Mezaris2024
Mutant Texts: A Technique for Uncovering Unexpected Inconsistencies in Large-Scale Vision-Language ModelsMingliang Liang; Zhouran Liu; Martha Larson2024
Exploring Artificial Intelligence for Advancing Performance Processes and Events in Io3MTRomulo Vieira; Debora Muchaluat-Saade; Pablo Cesar2024
Implementation of Melody Slot MachinesMasatoshi Hamanaka2024
E2Evideo: End to End Video and Image Pre-processing and Analysis ToolFaiga Alawad; Pal Halvorsen; Michael A. Riegler2024
Augmented Reality Photo Presentation and Content-Based Image Retrieval on Mobile Devices with AR-ExplorerLoris Sauter; Tim Bachmann; Heiko Schuldt; Luca Rossetto2024
12