The dataset includes 41,387 text files, 65,371 images and 30,818 videos (about 1091 hours) which are correlated semantically with each other by 335 ...
A real-world web dataset collected from Google, Flickr and YouTube for cross-media research, which indicates that it is possible to perform multiple cross- ...
The dataset includes 41,387 text files, 65,371 images and 30,818 videos (about 1091 hours) which are correlated semantically with each other by 335 ...
The dataset includes 41,387 text files, 65,371 images and 30,818 videos (about 1091 hours) which are correlated semantically with each other by 335 ...
Jul 12, 2014 · Our dataset is the largest cross-media dataset comprising 41,387 text files, 65,371 images and 30,818 videos with associated 335 concepts. This ...
Mar 20, 2017 · Abstract—This paper contributes a new large-scale dataset for weakly supervised cross-media retrieval, named Twitter100k.
PKU XMedia dataset is the first cross-media retrieval dataset with 5 media types (text, image, video, audio and 3D model), and it can be used for more ...
Nov 5, 2024 · TIP-I2V is the first dataset comprising over 1.70 million unique user-provided text and image prompts.
A Real-World Web Cross-Media Dataset Containing Images, Texts and Videos ... A real-world web dataset collected from Google, Flickr and YouTube for cross ...
This paper contributes a new, real-world web image dataset for cross-media retrieval called FB5K. ... data, including images, texts, video, audio and etc.