Vayadande et al., 2024 - Google Patents

The Rise of AI‐Generated News Videos: A Detailed Review

Vayadande et al., 2024

Document ID: 4127042023873659753
Author: Vayadande K; Bohri M; Chawala M; Kulkarni A; Mursal A
Publication year: 2024
Publication venue: How Machine Learning is Innovating Today's World: A Concise Technical Guide

External Links

Cited by

Snippet

The rapid advancements in Artificial Intelligence (AI) have given rise to the possibility of automating news video creation. AI‐powered news videos will offer a fresh and dynamic perspective on the day's top stories, delivering the content people need in a way that is easy …

Continue reading at onlinelibrary.wiley.com (other versions)

238000012552 review 0 title abstract description 24

Classifications

- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
- G06F17/30023—Querying
- G06F17/30029—Querying by filtering; by personalisation, e.g. querying making use of user profiles
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30017—Multimedia data retrieval; Retrieval of more than one type of audiovisual media
- G06F17/30023—Querying
- G06F17/30038—Querying based on information manually generated or based on information not derived from the media content, e.g. tags, keywords, comments, usage information, user ratings
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2705—Parsing
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/21—Text processing
- G06F17/22—Manipulating or registering by use of codes, e.g. in sequence of text characters
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/3061—Information retrieval; Database structures therefor; File system structures therefor of unstructured textual data
- G06F17/30634—Querying
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30781—Information retrieval; Database structures therefor; File system structures therefor of video data
- G06F17/30784—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre
- G06F17/30796—Information retrieval; Database structures therefor; File system structures therefor of video data using features automatically derived from the video content, e.g. descriptors, fingerprints, signatures, genre using original textual content or text extracted from visual content or transcript of audio data
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30861—Retrieval from the Internet, e.g. browsers
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/20—Handling natural language data
- G06F17/27—Automatic analysis, e.g. parsing
- G06F17/2765—Recognition
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F17/00—Digital computing or data processing equipment or methods, specially adapted for specific functions
- G06F17/30—Information retrieval; Database structures therefor; File system structures therefor
- G06F17/30244—Information retrieval; Database structures therefor; File system structures therefor in image databases
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/62—Methods or arrangements for recognition using electronic means
- G06K9/6217—Design or setup of recognition systems and techniques; Extraction of features in feature space; Clustering techniques; Blind source separation
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06N—COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N99/00—Subject matter not provided for in other groups of this subclass
- G06N99/005—Learning machines, i.e. computer in which a programme is changed according to experience gained by the machine itself during a complete run
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/06—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids
- G10L21/10—Transformation of speech into a non-audible representation, e.g. speech visualisation or speech processing for tactile aids transforming into visible information
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition

Similar Documents

Publication	Publication Date	Title
Yang et al.	2018	Video captioning by adversarial LSTM
CN108986186B (en)	2023-05-05	Method and system for converting text into video
Jangra et al.	2023	A survey on multi-modal summarization
CN113569088B (en)	2021-12-21	Music recommendation method and device and readable storage medium
Ma et al.	2020	Learning to generate grounded visual captions without localization supervision
CN111026861B (en)	2023-07-04	Text abstract generation method, training device, training equipment and medium
CN112948708B (en)	2022-08-12	Short video recommendation method
CN116702737B (en)	2023-12-01	Document generation method, device, equipment, storage medium and product
Huddar et al.	2021	Attention-based multimodal contextual fusion for sentiment and emotion classification using bidirectional LSTM
CN107066464A (en)	2017-08-18	Semantic Natural Language Vector Space
CN112749326B (en)	2023-10-03	Information processing method, information processing device, computer equipment and storage medium
CN116975615A (en)	2023-10-31	Task prediction method and device based on video multi-mode information
Salur et al.	2022	A soft voting ensemble learning-based approach for multimodal sentiment analysis
CN116958997B (en)	2024-01-23	Graphic summary method and system based on heterogeneous graphic neural network
CN111985243A (en)	2020-11-24	Emotion model training method, emotion analysis device and storage medium
CN116977701A (en)	2023-10-31	Video classification model training method, video classification method and device
CN116955591A (en)	2023-10-27	Recommendation language generation method, related device and medium for content recommendation
He et al.	2018	Deep learning in natural language generation from images
Kalender et al.	2018	Videolization: knowledge graph based automated video generation from web content
CN116977992A (en)	2023-10-31	Text information identification method, apparatus, computer device and storage medium
CN116935170A (en)	2023-10-24	Processing method and device of video processing model, computer equipment and storage medium
CN118014086B (en)	2024-07-02	Data processing method, device, equipment, storage medium and product
CN114661951A (en)	2022-06-24	Video processing method and device, computer equipment and storage medium
Mei et al.	2020	Vision and language: from visual perception to content creation
Vayadande et al.	2024	The Rise of AI‐Generated News Videos: A Detailed Review