CN111918077A - Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition - Google Patents

Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition

Info

Publication number
CN111918077A
CN111918077A
Authority
CN
China
Prior art keywords
video
data packet
audio
desktop
video image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202010718614.9A
Other languages
Chinese (zh)
Inventor
虞焰兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Anhui Semxum Information Technology Co ltd
Original Assignee
Anhui Semxum Information Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Anhui Semxum Information Technology Co ltd filed Critical Anhui Semxum Information Technology Co ltd
Priority to CN202010718614.9A
Publication of CN111918077A
Pending legal-status Critical Current

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/21Server components or server architectures
    • H04N21/218Source of audio or video content, e.g. local disk arrays
    • H04N21/2187Live feed
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/236Assembling of a multiplex stream, e.g. transport stream, by combining a video stream with other content or additional data, e.g. inserting a URL [Uniform Resource Locator] into a video stream, multiplexing software data into a video stream; Remultiplexing of multiplex streams; Insertion of stuffing bits into the multiplex stream, e.g. to obtain a constant bit-rate; Assembling of a packetised elementary stream
    • H04N21/2368Multiplexing of audio and video streams
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/20Servers specifically adapted for the distribution of content, e.g. VOD servers; Operations thereof
    • H04N21/23Processing of content or additional data; Elementary server operations; Server middleware
    • H04N21/242Synchronization processes, e.g. processing of PCR [Program Clock References]
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/431Generation of visual interfaces for content selection or interaction; Content or additional data rendering
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N21/00Selective content distribution, e.g. interactive television or video on demand [VOD]
    • H04N21/40Client devices specifically adapted for the reception of or interaction with content, e.g. set-top-box [STB]; Operations thereof
    • H04N21/43Processing of content or additional data, e.g. demultiplexing additional data from a digital video stream; Elementary client operations, e.g. monitoring of home network or synchronising decoder's clock; Client middleware
    • H04N21/44Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs
    • H04N21/4402Processing of video elementary streams, e.g. splicing a video clip retrieved from local storage with an incoming video stream or rendering scenes according to encoded video stream scene graphs involving reformatting operations of video signals for household redistribution, storage or real-time display

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Databases & Information Systems (AREA)
  • Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)

Abstract

The invention discloses a method for the synchronous synthesis of live-broadcast video/audio capture and computer desktop capture, and relates to the technical field of streaming-media synthesis. The method comprises the following steps: acquiring the audio/position parameters of at least one display area pre-selected on the desktop; capturing the current whole-desktop video information and binding it into a first data packet; capturing the teacher's lecture video with a camera and binding it into a second data packet; analyzing and merging the first and second data packets; sending the merged video to the server side, where the audio data of the regional audio/position parameter set is matched with the video image; and overlaying the successfully matched video data at the designated area. The invention makes it convenient for students to remotely watch the teaching courseware together with the teacher's synchronized explanation, and the multiple video windows can be switched freely, improving the effect of remote teaching.

Description

Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition
Technical Field
The invention belongs to the technical field of streaming media synthesis, and particularly relates to a method for synchronously synthesizing live video image and sound acquisition and computer desktop acquisition.
Background
Multimedia is a composite of multiple media forms, generally including text, sound, and images. Informatization is a trend of social development, and multimedia education is part of it; vigorously promoting multimedia education has therefore become a trend in the development of education.
Because multimedia inherently combines media forms such as text, sound, and images, multimedia technology is widely used in teaching: it helps teachers raise their professional level and integrate teaching resources. Multimedia teaching technology can make up for the shortcomings of traditional teaching and is a new, scientific teaching technology.
At present, in remote training based on multimedia technology, the lecturer shares courseware (PPT or other documents) and the computer desktop with the students. However, students watching only the computer screen cannot see the teacher's expressions, actions, and body language, so this body-language information is lost, and interactivity and engagement are poor. In addition, streaming-media data frames are easily dropped during live broadcast, causing video stutter or audio/video desynchronization, which seriously affects teaching quality.
Disclosure of Invention
The invention aims to provide a synchronous synthesis method for live-broadcast video/audio capture and computer desktop capture. Desktop video information and the teacher's lecture video are captured, analyzed, and merged; the server performs region matching on the merged video, and after a successful match the matched video is overlaid on the designated region. This solves the problems in existing remote training that the classroom teacher's video cannot be synchronized with the computer desktop, teaching efficiency is low, and the live video is prone to stutter.
In order to solve the technical problems, the invention is realized by the following technical scheme:
the invention relates to a method for synchronously synthesizing live video image and sound acquisition and computer desktop acquisition, which comprises the following steps:
step S1: acquiring audio/position parameters of at least one display area pre-selected on a desktop;
step S2: capturing current whole desktop video information, and binding the current whole desktop video information into a first data packet;
step S3: the camera collects the video of the teacher in class and binds the video into a second data packet;
step S4: analyzing and merging the first data packet and the second data packet;
step S5: the merged video is sent to a server side to match audio data of the regional audio/position parameter set with the video image;
step S6: and covering the video data successfully matched at the position of the designated area.
Preferably, in step S1, the desktop video and the lesson video captured by the camera may be switched in the display area.
Preferably, in step S2, the client captures the video image of the whole desktop and encodes and compresses it to obtain a video coding data packet; the video image comprises at least one display area; the video coding data packet and at least one position parameter are bound into a corresponding second network data packet and sent to the target server.
Preferably, in step S4, audio-flag identification is performed on the desktop video to form a regional audio/position parameter set; the extracted, encoded, and compressed audio data and the regional audio/position parameter set are bound into a corresponding first network data packet and sent to the target server.
Preferably, the target server performs type analysis on each first or second network data packet it receives: if it is a first network data packet, the encoded and compressed audio data is decoded to obtain at least one group of audio data; if it is a second network data packet, the encoded and compressed video data is decoded to obtain the video image.
Preferably, the target server detects whether an audio flag exists in the display area corresponding to the position parameter; if so, it matches the audio data with the video image corresponding to the position parameter according to the regional audio/position parameter set corresponding to the audio flag and shares the result to the corresponding client desktop area; if not, it shares the video image corresponding to the position parameter to the corresponding client desktop area.
Preferably, in S5, if both the first data packet and the second data packet are found to contain the audio flag after analysis and merging, the merged audio data needs to be mixed.
The invention has the following beneficial effects:
according to the invention, the desktop video information and the teacher teaching video information are collected, analyzed and combined, the combined videos are subjected to region matching by the server, and the matched videos are covered in the designated region, so that students can conveniently watch courseware of teaching and synchronous explanation of teachers remotely, and meanwhile, a plurality of video frames are randomly switched, so that the students can conveniently watch and study, and the remote teaching effect of the students is improved.
Of course, it is not necessary for any product in which the invention is practiced to achieve all of the above-described advantages at the same time.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings used in the description of the embodiments will be briefly introduced below, and it is obvious that the drawings in the following description are only some embodiments of the present invention, and it is obvious for those skilled in the art that other drawings can be obtained according to the drawings without creative efforts.
Fig. 1 is a step diagram of a method for synchronously synthesizing live video image and sound acquisition and computer desktop acquisition according to the present invention.
Detailed Description
The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
Referring to fig. 1, the present invention is a method for synchronously synthesizing live video image and sound acquisition and computer desktop acquisition, comprising the following steps:
step S1: acquiring audio/position parameters of at least one display area pre-selected on a desktop;
step S2: capturing current whole desktop video information, and binding the current whole desktop video information into a first data packet;
step S3: the camera collects the video of the teacher in class and binds the video into a second data packet;
step S4: analyzing and merging the first data packet and the second data packet;
step S5: the merged video is sent to a server side to match audio data of the regional audio/position parameter set with the video image;
step S6: and covering the video data successfully matched at the position of the designated area.
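As a rough illustration of the data flow in steps S1-S6, the sketch below models the two data packets and the server-side matching plan with plain Python dicts. All names and the packet representation are hypothetical; the patent does not specify a concrete format.

```python
def synthesize(desktop_frame, teacher_frame, regions):
    """Schematic walk-through of steps S1-S6 (illustrative only)."""
    # S1: audio/position parameters of the pre-selected display areas
    params = [{"x": r["x"], "y": r["y"], "w": r["w"], "h": r["h"],
               "audio": r.get("audio", False)} for r in regions]
    # S2: bind the whole-desktop capture into the first data packet
    first = {"kind": "desktop", "video": desktop_frame, "params": params}
    # S3: bind the camera capture of the teacher into the second data packet
    second = {"kind": "camera", "video": teacher_frame}
    # S4: analyze and merge the two packets
    merged = {"layers": [first, second], "params": params}
    # S5/S6 happen server-side: match audio data to regions, then overlay the
    # matched video at the designated position (represented here as a plan)
    plan = [{"region": p, "source": "camera" if p["audio"] else "desktop"}
            for p in params]
    return merged, plan
```

A region carrying an audio flag is matched to the camera source; regions without one keep showing the shared desktop.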
In step S1, the desktop video and the lesson video captured by the camera may be switched in the display area.
In step S2, the client captures the video image of the whole desktop and encodes and compresses it to obtain a video coding data packet; the video image comprises at least one display area; the video coding data packet and at least one position parameter are bound into a corresponding second network data packet and sent to the target server.
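A minimal sketch of the client-side binding described above, assuming a hypothetical packet layout (4-byte metadata length, JSON position parameters, then the compressed frame) and using zlib as a stand-in for a real video encoder:

```python
import json
import struct
import zlib

def bind_desktop_packet(frame: bytes, regions: list) -> bytes:
    """Compress a raw desktop frame and bind it with the position
    parameters of the pre-selected display areas into one network
    data packet (layout is an assumption, not the patent's format)."""
    meta = json.dumps({"type": "desktop", "regions": regions}).encode("utf-8")
    return struct.pack(">I", len(meta)) + meta + zlib.compress(frame)

def unbind_desktop_packet(packet: bytes):
    """Inverse operation: split metadata from the compressed frame."""
    (mlen,) = struct.unpack(">I", packet[:4])
    meta = json.loads(packet[4:4 + mlen].decode("utf-8"))
    frame = zlib.decompress(packet[4 + mlen:])
    return meta, frame
```

The round trip recovers both the position parameters and the frame, which is all the target server needs for the later region matching.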
In step S4, audio-flag identification is performed on the desktop video to form a regional audio/position parameter set; the extracted, encoded, and compressed audio data and the regional audio/position parameter set are bound into a corresponding first network data packet and sent to the target server.
The target server performs type analysis on each first or second network data packet it receives: if it is a first network data packet, the encoded and compressed audio data is decoded to obtain at least one group of audio data; if it is a second network data packet, the encoded and compressed video data is decoded to obtain the video image.
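The server-side type analysis could be sketched as follows; the 1-byte type tag and the packet layout are assumptions, and zlib again stands in for the real audio/video codecs:

```python
import json
import struct
import zlib

AUDIO_PACKET, VIDEO_PACKET = 1, 2  # hypothetical type tags

def server_dispatch(packet: bytes):
    """Type-analyze an incoming network packet: decode audio data from a
    first (audio) packet, or the video image from a second (video) packet.
    Assumed layout: 1-byte type | 4-byte metadata length | JSON | payload."""
    ptype, mlen = struct.unpack(">BI", packet[:5])
    meta = json.loads(packet[5:5 + mlen].decode("utf-8"))
    payload = zlib.decompress(packet[5 + mlen:])
    if ptype == AUDIO_PACKET:
        return "audio", meta, payload   # at least one group of audio data
    if ptype == VIDEO_PACKET:
        return "video", meta, payload   # the decoded video image
    raise ValueError(f"unknown packet type {ptype}")
```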
The target server detects whether an audio flag exists in the display area corresponding to the position parameter; if so, it matches the audio data with the video image corresponding to the position parameter according to the regional audio/position parameter set corresponding to the audio flag and shares the result to the corresponding client desktop area; if not, it shares the video image corresponding to the position parameter to the corresponding client desktop area.
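Overlaying the matched video at the designated position amounts to copying a patch of pixels into the desktop frame. A minimal sketch with frames modeled as 2-D lists (a real implementation would operate on decoded frame buffers):

```python
def overlay(desktop, patch, x, y):
    """Cover the designated display area of the desktop frame with the
    successfully matched video image; pixels falling outside the desktop
    are clipped. Frames are 2-D lists of pixel values."""
    out = [row[:] for row in desktop]  # leave the input frame untouched
    for j, row in enumerate(patch):
        for i, px in enumerate(row):
            if 0 <= y + j < len(out) and 0 <= x + i < len(out[0]):
                out[y + j][x + i] = px
    return out
```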
In S5, if both the first data packet and the second data packet are found to contain the audio flag after analysis and merging, the merged audio data needs to be mixed.
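Mixing the merged audio data can be sketched as clamped sample-wise addition of two mono 16-bit PCM streams (a simplification; real mixers also handle resampling and channel layouts):

```python
def mix_pcm(a: list, b: list) -> list:
    """Mix two mono 16-bit PCM sample streams by clamped addition,
    zero-padding the shorter stream to the length of the longer one."""
    n = max(len(a), len(b))
    a = a + [0] * (n - len(a))
    b = b + [0] * (n - len(b))
    return [max(-32768, min(32767, x + y)) for x, y in zip(a, b)]
```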
It should be noted that, in the above system embodiment, each included unit is only divided according to functional logic, but is not limited to the above division as long as the corresponding function can be implemented; in addition, specific names of the functional units are only for convenience of distinguishing from each other, and are not used for limiting the protection scope of the present invention.
In addition, it is understood by those skilled in the art that all or part of the steps in the method for implementing the embodiments described above may be implemented by a program instructing associated hardware, and the corresponding program may be stored in a computer-readable storage medium.
The preferred embodiments of the invention disclosed above are intended to be illustrative only. The preferred embodiments are not intended to be exhaustive or to limit the invention to the precise embodiments disclosed. Obviously, many modifications and variations are possible in light of the above teaching. The embodiments were chosen and described in order to best explain the principles of the invention and the practical application, to thereby enable others skilled in the art to best utilize the invention. The invention is limited only by the claims and their full scope and equivalents.

Claims (7)

1. A method for synchronously synthesizing live video image and sound acquisition and computer desktop acquisition is characterized by comprising the following steps:
step S1: acquiring audio/position parameters of at least one display area pre-selected on a desktop;
step S2: capturing current whole desktop video information, and binding the current whole desktop video information into a first data packet;
step S3: the camera collects the video of the teacher in class and binds the video into a second data packet;
step S4: analyzing and merging the first data packet and the second data packet;
step S5: the merged video is sent to a server side to match audio data of the regional audio/position parameter set with the video image;
step S6: and covering the video data successfully matched at the position of the designated area.
2. The method as claimed in claim 1, wherein in step S1, the desktop video and the class video captured by the camera can be switched in the display area.
3. The method as claimed in claim 1, wherein in step S2, the client intercepts the video image of the whole desktop, and performs coding compression to obtain the video coding data packet; the video image comprises at least one display area; and binding the video coding data packet and at least one position parameter into a corresponding second network data packet, and sending the second network data packet to the target server.
4. The method as claimed in claim 1, wherein in step S4, the desktop video is identified by an audio marker, and a regional audio/position parameter set is formed, and the extracted, encoded and compressed audio data and the regional audio/position parameter set are bound to form a corresponding first network packet, and sent to the target server.
5. The method as claimed in claim 1 or 4, wherein the target server performs type analysis on each first or second network data packet it receives, decodes the encoded and compressed audio data to obtain at least one group of audio data if it is the first network data packet, and decodes the encoded and compressed video data to obtain the video image if it is the second network data packet.
6. The method as claimed in claim 4, wherein the target server detects whether there is an audio flag in a display area corresponding to the location parameter, if so, matches the audio data with the video image corresponding to the location parameter according to a regional audio/location parameter set corresponding to the audio flag, and shares a corresponding client desktop area, and if not, shares a corresponding client desktop area with the video image corresponding to the location parameter.
7. The method as claimed in claim 1, wherein in S5, if both the first data packet and the second data packet are found to contain the audio flag after analysis and merging, the merged audio data needs to be mixed.
CN202010718614.9A 2020-07-23 2020-07-23 Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition Pending CN111918077A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202010718614.9A CN111918077A (en) 2020-07-23 2020-07-23 Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202010718614.9A CN111918077A (en) 2020-07-23 2020-07-23 Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition

Publications (1)

Publication Number Publication Date
CN111918077A true CN111918077A (en) 2020-11-10

Family

ID=73281404

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202010718614.9A Pending CN111918077A (en) 2020-07-23 2020-07-23 Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition

Country Status (1)

Country Link
CN (1) CN111918077A (en)

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20140038781A1 (en) * 2012-07-31 2014-02-06 John Paul Foley Exercise system and method
CN105141883A (en) * 2015-08-18 2015-12-09 广东威创视讯科技股份有限公司 Desktop audio sharing method and system
US20160049082A1 (en) * 2014-08-14 2016-02-18 Albert Roy Leatherman, III System for Interactive Online Instruction
CN108769721A (en) * 2018-05-23 2018-11-06 福建掌搜科技有限公司 A kind of live scene intelligent switching system and its method
CN109168017A (en) * 2018-10-16 2019-01-08 深圳市三叶虫科技有限公司 A kind of net cast interaction systems and living broadcast interactive mode based on intelligent glasses
CN110213609A (en) * 2019-06-12 2019-09-06 珠海读书郎网络教育有限公司 The method, apparatus and storage medium of the company of progress wheat live streaming in Web education live streaming


Similar Documents

Publication Publication Date Title
CN100535959C (en) System for multi media real-time synchronous teaching based on network
CN103646573B (en) A kind of generation method of professional format file of panning mode tutoring system
CN111611434B (en) Online course interaction method and interaction platform
CN106781770A (en) A kind of normalization interaction recording and broadcasting system
CN104539436A (en) Lesson content real-time live broadcasting method and system
CN102209080A (en) Terminal system for synchronous teaching or conferences and control method thereof
CN112468822A (en) Multimedia recording and broadcasting course interaction method based on video SEI message
CN110072137A (en) A kind of data transmission method and transmitting device of net cast
Moe et al. Media institutions as a research field: Three phases of Norwegian broadcasting research
CN106454200A (en) Video interaction method and video interaction system based on scene switching
CN109862375B (en) Cloud recording and broadcasting system
CN110933350A (en) Electronic cloud mirror recording and broadcasting system, method and device
CN110276999A (en) A kind of remote interactive teaching system and method with synchronous writing on the blackboard and direct broadcast function
CN111918077A (en) Synchronous synthesis method for video live broadcast image and sound acquisition and computer desktop acquisition
Samčović Accessibility of services in digital television for hearing impaired consumers
Yu et al. A near-reality approach to improve the e-learning open courseware
CN203193773U (en) Multimedia panoramic recording system
CN103944888B (en) The method, apparatus and system of a kind of resource-sharing
CN112837574B (en) Interactive classroom system and method thereof
CN109889916B (en) Application system of recorded broadcast data
CN114938443A (en) Real-time scoring method for experimental practice test based on streaming media
CN101883270A (en) Method for inserting related media in independent video streams
WO2017149447A1 (en) A system and method for providing real time media recommendations based on audio-visual analytics
KR20110124048A (en) System and method for providing on-line eduacational portal service
CN111614869A (en) 4K high definition camera double-circuit picture collection system

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20201110