Advances in Multimodal Information Retrieval

A multimodal search engine is designed to imitate the flexibility and agility of how the human mind works to create, process and refuse irrelevant ideas. So, the more elements you have in the input of the search engine to compare, the more accurate the results can be. Multimodal search engines use different inputs of different nature and methods of search at the same time with the possibility of combining the results by merging all of the input elements of the search. There are also engines that can use a feedback of the results with the evaluation of the user to perform a more appropriate and relevant search.

In the present book, ten typical literatures about multimodal information retrieval published on international authoritative journals were selected to introduce the worldwide newest progress, which contains reviews or original researches on multimodal information retrieval. We hope this book can demonstrate advances in multimodal information retrieval as well as give references to the researchers, students and other related people.


Sample Chapter(s)
Preface (172 KB)
Components of the Book:
  • Chapter 1
    Cross‑Modality Sub‑Image Retrieval Using Contrastive Multimodal Image Representations
  • Chapter 2
    Gesture Retrieval and its Application to the Study of Multimodal Communication
  • Chapter 3
    Multimodal Video Retrieval with Clip: A User Study
  • Chapter 4
    Multimodal Biomedical Image Retrieval and Indexing System Using Handcrafted with Deep Convolution Neural Network Features
  • Chapter 5
    Interactive Multimodal Video Search: an Extended Post-Evaluation for the VBS 2022 Competition
  • Chapter 6
    Verite: A Robust Benchmark for Multimodal Misinformation Detection Accounting for Unimodal Bias
  • Chapter 7
    Rocov2: Radiology Objects in Context Version 2, an updated Multimodal Image Dataset
  • Chapter 8
    Modelling the “Transactive Memory System” Inmultimodalmultiparty Interactions
  • Chapter 9
    A Twin Convolutional Neural Network with Hybrid Binary Optimizer for Multimodal Breast Cancer Digital Image Classification
  • Chapter 10
    Detecting and Locating Trending Places Using Multimodal Social Network Data
Readership: Students, academics, teachers and other people attending or interested in multimodal information retrieval.
Mahnaz Parian-Scherb

Tayfun Alpay
HITeC − Hamburger Informatik Technologie−Center e.V., University of Hamburg, Vogt−Kölln−Str.30, 22527 Hamburg, Germany

and more...
Copyright © 2006-2026 Scientific Research Publishing Inc. All Rights Reserved.
Top