Search Page | SpringerLink

Article

Full access

Bilingual video captioning model for enhanced video retrieval

Many video platforms rely on the descriptions that uploaders provide for video retrieval. However, this reliance may cause inaccuracies. Although...

Norah Alrebdi, Amal A. Al-Shargabi in Journal of Big Data

16 January 2024 Open access

Article

Self-expressive induced clustered attention for video-text retrieval

Extensive research has proven that self-attention achieves impressive performance in video-text retrieval. However, most state-of-the-art methods...

Jingxuan Zhu, Xiangjun Shen, ... Yongzhao Zhan in Multimedia Systems

27 November 2024

Article

Learning Text-to-Video Retrieval from Image Captioning

We describe a protocol to study text-to-video retrieval training with unlabeled videos, where we assume (i) no access to labels for any videos, i.e.,...

Lucas Ventura, Cordelia Schmid, Gül Varol in International Journal of Computer Vision

22 October 2024

Article

Efficient text augmentation in latent space for video retrieval

With the popularity of video sharing applications and streaming platforms, video retrieval became an active research topic. The core technique behind...

Na-Hyun Lee, Seong-Min Kang, Yoon-Sik Cho in Multimedia Tools and Applications

18 October 2024

Article

LSECA: local semantic enhancement and cross aggregation for video-text retrieval

Recently video retrieval based on the pre-training models (e.g., CLIP) has achieved outstanding success. To further improve the search performance,...

Zhiwen Wang, Donglin Zhang, Zhikai Hu in International Journal of Multimedia Information Retrieval

22 July 2024

Article

Opposition-based optimized max pooled 3D convolutional features for action video retrieval

Key frame selection serves as a c bridge between raw video data and meaningful retrieval results. Effective key frame selection enhances the...

Alina Banerjee, Ravinder Megavath, Ela Kumar in International Journal of Information Technology

12 August 2024

Article

Attention-based deep supervised hashing for near duplicate video retrieval

With the explosive growth of video data on the Internet, near duplicate video retrieval (NDVR) has become an important and challenging issue in the...

Naifei Shi, Chong Fu, ... Chiu-Wing Sham in Neural Computing and Applications

27 December 2023

Article

Hierarchical bi-directional conceptual interaction for text-video retrieval

The large pre-trained vision-language models (VLMs) utilized in text-video retrieval have demonstrated strong cross image-text understanding ability....

Wenpeng Han, Guanglin Niu, ... Xiaowei Zhang in Multimedia Systems

15 October 2024

Article

Full access

MGSGA: Multi-grained and Semantic-Guided Alignment for Text-Video Retrieval

In the text-video retrieval task, the objective is to calculate the similarity between a text and a video, and rank the relevant candidates higher....

Xiaoyu Wu, Jiayao Qian, Lulu Yang in Neural Processing Letters

17 February 2024 Open access

Article

Full access

Multimodal video retrieval with CLIP: a user study

Recent machine learning advances demonstrate the effectiveness of zero-shot models trained on large amounts of data collected from the internet....

Tayfun Alpay, Sven Magg, ... Daniel Speck in Information Retrieval Journal

29 September 2023 Open access

Article

Particle swarm optimized deep spatio-temporal features for efficient video retrieval

In content-based video retrieval, the phases of video frame selection and 3-dimensional feature extraction are especially crucial. These stages...

Alina Banerjee, Ela Kumar, M. Ravinder in International Journal of Information Technology

14 February 2024

Article

SPSD: Similarity-preserving self-distillation for video–text retrieval

Most of existing methods solve cross-modal video and text retrieval via coarse-grained similarity computation based on global representations or...

Jiachen Wang, Yan Hua, ... Hongwei Kou in International Journal of Multimedia Information Retrieval

01 September 2023

Article

Learning optimal deep prototypes for video retrieval systems with hybrid SVM-softmax layer

The research focuses on optimizing training time for video retrieval by producing optimized prototypes for a hybrid SVM-softmax regression...

Alina Banerjee, Ela Kumar, Ravinder Megavath in International Journal of Data Science and Analytics

18 June 2024

Conference paper

EA-VTR: Event-Aware Video-Text Retrieval

Understanding the content of events occurring in the video and their inherent temporal logic is crucial for video-text retrieval. However,...

Zongyang Ma, Ziqi Zhang, ... Weiming Hu in Computer Vision – ECCV 2024

2025

Article

An intelligent surgical video retrieval for computer vision enhancement in medical diagnosis using deep learning techniques

This paper addresses the challenge of efficiently retrieving surgical videos from large databases for computer vision enhancement in medical...

Archana Mantri, Rahul Mishra in Multimedia Tools and Applications

29 May 2024

Article

Video–text retrieval via multi-modal masked transformer and adaptive attribute-aware graph convolutional network

Despite significant advancements in deep learning-based video–text retrieval methods, three challenges persist: the alignment of fine-grained...

Gang Lv, Yining Sun, Fudong Nian in Multimedia Systems

22 January 2024

Article

Deep learning for video-text retrieval: a review

Video-Text Retrieval (VTR) aims to search for the most relevant video related to the semantics in a given sentence, and vice versa. In general, this...

Cunjuan Zhu, Qi Jia, ... Yu Liu in International Journal of Multimedia Information Retrieval

23 February 2023

Article

A multi-modal lecture video indexing and retrieval framework with multi-scale residual attention network and multi-similarity computation

Due to technological development, the mass production of video and its storage on the Internet has increased. This made a huge amount of videos to be...

A. Debnath, K. Sreenivasa Rao, Partha P. Das in Signal, Image and Video Processing

23 December 2023