WO2024021008A1 - Data processing method, device and system, and storage medium - Google Patents

Data processing method, device and system, and storage medium Download PDF

Info

Publication number
WO2024021008A1
WO2024021008A1 PCT/CN2022/108901 CN2022108901W WO2024021008A1 WO 2024021008 A1 WO2024021008 A1 WO 2024021008A1 CN 2022108901 W CN2022108901 W CN 2022108901W WO 2024021008 A1 WO2024021008 A1 WO 2024021008A1
Authority
WO
WIPO (PCT)
Prior art keywords
data
node
processing
cross
information
Prior art date
Application number
PCT/CN2022/108901
Other languages
French (fr)
Chinese (zh)
Inventor
李佳徽
王鹏鸿
马梦瑶
范晓鹏
Original Assignee
华为技术有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 华为技术有限公司 filed Critical 华为技术有限公司
Priority to PCT/CN2022/108901 priority Critical patent/WO2024021008A1/en
Publication of WO2024021008A1 publication Critical patent/WO2024021008A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H03ELECTRONIC CIRCUITRY
    • H03MCODING; DECODING; CODE CONVERSION IN GENERAL
    • H03M7/00Conversion of a code where information is represented by a given sequence or number of digits to a code where the same, similar or subset of information is represented by a different sequence or number of digits
    • H03M7/30Compression; Expansion; Suppression of unnecessary data, e.g. redundancy reduction
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04WWIRELESS COMMUNICATION NETWORKS
    • H04W28/00Network traffic management; Network resource management
    • H04W28/02Traffic management, e.g. flow control or congestion control

Definitions

  • the present application relates to the field of communication technology, and in particular, to a data processing method, device, system and storage medium.
  • Distributed single/multi-modal signal processing is a typical scenario in the field of task-oriented source channel joint coding/semantic communication.
  • source acquisition units/nodes/terminals of the same or different modes need to process the collected information and send it to the server/base station for fusion processing to perform specific tasks.
  • the source can be images, videos, audio, sensor parameters and other signals.
  • audio-visual analysis tasks including the detection and identification of audio, visual and audio-visual events, and determine which of these events are visible and possible. audible as well as visible and audible.
  • the sending end in the existing technology uses 2D and 3D deep residual network ResNet and VGGish models respectively to extract audio-visual features, and then extracts them through visual encoders and audio encoders respectively.
  • the features are compressed and encoded, and finally sent out.
  • the sent signal is transmitted to the remote server through the noise channel for processing.
  • a Transformer model structure is used to jointly decode the received audio and video features, and finally the probability of the event is output through a fully connected layer and the activation function Softmax.
  • the video features collected by the first node and the audio features collected by the second node both correspond to the characteristics of the same environment.
  • the video features There may be some correlation information between audio features.
  • the server needs to process the two pieces of data. Since there is some related information in the two pieces of data, it will cause the server to process data redundantly, causing a certain degree of redundancy. Waste of transmission resources and loss of transmission performance.
  • This application discloses a data processing method, device, system and storage medium, which can reduce bandwidth resource consumption and improve the robustness of data transmission.
  • embodiments of the present application provide a data processing method, applied to the first node, including:
  • the first data is de-redundantly processed according to the cross-node auxiliary information to obtain the second data.
  • the cross-node auxiliary information is the first initial data collected by the first node and the second node. Information related to the second initial data collected;
  • the first node performs de-redundant processing on the first data based on the received cross-node auxiliary information from the server or base station, and combines it with the first initial data collected by the first node and the second data collected by the second node. Information related to the initial data is removed to obtain second data, and the second data is sent to the server.
  • the data transmission of the node can be reduced. amount, thereby reducing bandwidth resource consumption and improving transmission robustness.
  • the method further includes:
  • the first modality may be, for example, video, audio, image, etc.
  • the first initial data collected by the first node is data of the first modality.
  • the de-redundancy processing of the first data according to the cross-node auxiliary information to obtain the second data includes:
  • Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered.
  • the first default model is used.
  • the model By triggering the training model when the system's received signal-to-noise ratio change value exceeds the threshold, the model is continuously trained and updated, resulting in better model performance and higher transmission robustness during de-redundancy processing.
  • embodiments of the present application provide a data processing method, applied to a server or base station, including:
  • the third data is processed to obtain cross-node auxiliary information.
  • the cross-node auxiliary information is information related to the first initial data collected by the first node and the second initial data collected by the second node. ;
  • the server processes the third data from the second node to obtain the cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node.
  • This helps the first node to perform de-redundant processing on the first data based on the received cross-node auxiliary information, which further helps the server to finally receive data from the first node as de-redundant data.
  • Using this method can reduce the data transmission volume of nodes, thereby reducing bandwidth resource consumption, improving transmission robustness, and improving server processing efficiency.
  • the method further includes:
  • the data for fusion processing by the server is deredundant, which means that there is no duplicate or related information between the second data and the third data, which improves the processing efficiency of the server.
  • the method further includes:
  • the first initial data collected by the first node is data of the first modality
  • the second initial data collected by the second node is data of the second modality.
  • processing the third data to obtain cross-node auxiliary information includes:
  • the third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
  • the model By triggering the training model when the system's received signal-to-noise ratio change value exceeds the threshold, the model is continuously trained and updated, so that the model performs better during de-redundancy processing and has higher transmission robustness.
  • embodiments of the present application provide a data processing device, including:
  • the first processing module is used to compress and encode the collected first initial data to obtain the first data
  • a receiving module used to receive cross-node auxiliary information from the server or base station
  • the second processing module is configured to perform de-redundant processing on the first data according to the cross-node auxiliary information to obtain the second data.
  • the cross-node auxiliary information is the same as the third data collected by the first node. Information related to the first initial data and the second initial data collected by the second node;
  • a sending module configured to send the second data.
  • the receiving module is also used to:
  • the first initial data collected by the first node is data of the first modality.
  • the second processing module is used to:
  • Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered.
  • the first default model is used.
  • this application provides a data processing device, including:
  • a receiving module configured to receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
  • a processing module configured to process the third data to obtain cross-node auxiliary information, where the cross-node auxiliary information is related to the first initial data collected by the first node and the second data collected by the second node. Information related to initial data;
  • a sending module configured to send the cross-node auxiliary information to the first node.
  • the receiving module is also configured to receive second data from the first node
  • the processing module is also used to perform fusion processing on the second data and the third data.
  • the sending module is also used to:
  • the first initial data collected by the first node is data of the first modality
  • the second initial data collected by the second node is data of the second modality.
  • processing module is also used to:
  • the third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
  • the application provides a data processing device, including a processor and a communication interface.
  • the communication interface is used to receive and/or send data, and/or the communication interface is used to provide the processor with Output and/or output, the processor is used to call computer instructions to implement the method provided by any possible implementation manner of the first aspect, and/or to implement the method provided by any possible implementation manner of the second aspect method.
  • this application provides a data processing system.
  • the system includes a server or a base station, and also includes a first node, wherein:
  • the server or base station is configured to implement the method provided in any possible implementation manner of the second aspect; and the first node is configured to implement the method provided in any possible implementation manner of the first aspect.
  • the present application provides a computer storage medium, including computer instructions.
  • the computer instructions When the computer instructions are run on an electronic device, the electronic device causes the electronic device to execute any possible implementation manner and/or as in the first aspect.
  • the method provided by any possible implementation of the second aspect.
  • embodiments of the present application provide a computer program product.
  • the computer program product When the computer program product is run on a computer, it causes the computer to execute any possible implementation manner of the first aspect and/or any possible implementation method of the second aspect. Methods provided by the embodiments.
  • the computer program products described in the eight aspects are all used to execute the method provided in any one of the first aspects and the method provided in any one of the second aspects. Therefore, the beneficial effects it can achieve can be referred to the beneficial effects in the corresponding methods, and will not be described again here.
  • Figure 1 is a schematic architectural diagram of a data processing system provided by an embodiment of the present application.
  • FIG. 2 is a schematic flowchart of a data processing method provided by an embodiment of the present application.
  • Figure 3 is a schematic diagram of a data processing method provided by an embodiment of the present application.
  • FIG. 4 is a schematic diagram of another data processing method provided by an embodiment of the present application.
  • FIG. 5 is a schematic diagram of another data processing method provided by the embodiment of the present application.
  • Figure 6 is a schematic diagram of a model training method provided by an embodiment of the present application.
  • Figure 7a is a schematic diagram of the framework structure of Attention A provided by the embodiment of the present application.
  • Figure 7b is a schematic diagram of the framework structure of Attention B provided by the embodiment of the present application.
  • Figure 8a is a schematic diagram of the frame structure of an Encoder A1 provided by an embodiment of the present application.
  • Figure 8b is a schematic diagram of the frame structure of an Encoder B2 provided by an embodiment of the present application.
  • Figure 9 is a schematic diagram of the frame structure of Encoder B1 provided by the embodiment of the present application.
  • Figure 10a is a schematic diagram of the frame structure of another Encoder A1 provided by the embodiment of the present application.
  • Figure 10b is a schematic diagram of the frame structure of another Encoder B2 provided by the embodiment of the present application.
  • Figure 10c is a schematic diagram of the frame structure of another Encoder B1 provided by the embodiment of the present application.
  • Figure 11 is a schematic structural diagram of a data processing device provided by an embodiment of the present application.
  • Figure 12 is a schematic structural diagram of another data processing device provided by an embodiment of the present application.
  • Figure 13 is a schematic structural diagram of yet another data processing device provided by an embodiment of the present application.
  • this application provides a data processing method, device, system and storage medium, which can reduce bandwidth resource consumption and improve the robustness of data transmission.
  • Figure 1 is a schematic diagram of a data processing system applicable to the embodiment of the present application.
  • the system includes any one of node 1, node 2, server or base station.
  • the server is a device with centralized computing capabilities, which can be implemented through servers, virtual machines, clouds, or robots.
  • servers include but are not limited to general computers, dedicated server computers (such as personal computers, servers, UNIX servers, or mid-range servers, etc.), blade servers, etc.
  • the server is implemented by a server, as shown in Figure 1, the number of servers it contains can be one or multiple (such as a server cluster).
  • a virtual machine is a virtualized computing module.
  • the cloud is a software platform that uses application virtualization technology, which allows one or more software and applications to be developed and run in an independent virtualized environment.
  • the cloud can be deployed on public cloud, private cloud, or hybrid cloud.
  • Access network equipment refers to the radio access network (RAN) node (or equipment) that connects terminals to the wireless network, and can also be called a base station.
  • RAN nodes are: evolved Node B (gNB), transmission reception point (TRP), evolved Node B (evolved Node B, eNB), radio network controller, RNC), Node B (Node B, NB), base station controller (BSC), base transceiver station (BTS), home base station (for example, home evolved NodeB, or home Node B, HNB) , baseband unit (base band unit, BBU), or wireless fidelity (wireless fidelity, Wifi) access point (access point, AP), etc.
  • gNB evolved Node B
  • TRP transmission reception point
  • eNB evolved Node B
  • RNC radio network controller
  • Node B Node B
  • BSC base station controller
  • BTS base transceiver station
  • home base station for example, home evolved NodeB, or home Node B, HNB
  • access network equipment may include centralized unit (CU) nodes, distributed unit (DU) nodes, or RAN equipment including CU nodes and DU nodes.
  • the RAN equipment including CU nodes and DU nodes separates the protocol layer of gnb in the NR system. Some of the protocol layer functions are centralized and controlled by the CU. The remaining part or all of the protocol layer functions are distributed in the DU and centralized by the CU. Control DU.
  • the functions of CU can be implemented by one entity or by different entities.
  • the functions of the CU can be further divided, for example, the control plane (CP) and the user plane (UP) are separated, that is, the control plane (CU-CP) and the CU user plane (CU-UP) of the CU.
  • CU-CP and CU-UP can be implemented by different functional entities, and the CU-CP and CU-UP can be coupled with DU to jointly complete the functions of the base station.
  • node 1 and node 2 may be terminal devices.
  • the terminal equipment (Terminal Equipment) in the embodiment of this application can also be called a terminal, user equipment (User Equipment, UE), mobile station (Mobile Station, MS), mobile terminal (Mobile Terminal, MT), etc.
  • the terminal device can be a mobile phone (mobile phone), a tablet computer (Pad), a computer with wireless transceiver function, a virtual reality (Virtual Reality, VR) terminal device, an augmented reality (Augmented Reality, AR) terminal device, or an industrial control (Industrial Control) ), wireless terminals in Self Driving, wireless terminals in Remote Medical Surgery, wireless terminals in Smart Grid, wireless terminals in Transportation Safety Terminals, wireless terminals in Smart City, wireless terminals in Smart Home, etc.
  • node 1 and node 2 may be information source collection units.
  • both Node 1 and Node 2 can be used to collect signals such as images, videos, audios, and sensor parameters.
  • Node 1 and node 2 are also used to encode the collected signals (data) and send the encoded data to the server/base station via the wireless transmission channel.
  • terminal equipment and terminal equipment may partially overlap.
  • mobile phones can collect images, videos, audio and other signals, and this solution does not limit this.
  • the server/base station stores and fuses the data received from node 1 and node 2, and then performs one or more tasks. For example, performing audio-visual analysis tasks based on the received video data and audio data, specifically including the detection and identification of audio, visual and audio-visual events, and determining which of these events are visible, audible, and both visible and audible; further, You can also determine the start time and end time of each event.
  • this solution can be oriented to future cellular standards and Wi-Fi standards, and the products involved can include future cellular and Wi-Fi related products, for example, it can be applied to: mobile phones; tablets/watches that access cellular and Wi-Fi /Notebooks/Other IoT devices; base stations/wireless routers and other devices.
  • the node obtains the first data by compressing and encoding the collected initial data; and then performs de-redundant processing on the first data based on the cross-node auxiliary information received from the server/base station,
  • the cross-node auxiliary information is the relevant information between the initial data collected by the first node and the initial data collected by the second node; and send the second data to the server/base station.
  • FIG. 2 is a schematic flow chart of a data processing method provided by an embodiment of the present application.
  • this method can be applied to the aforementioned data processing system, such as the data processing system shown in Figure 1 .
  • the data processing method shown in Figure 2 may include steps 201-211. It should be understood that, for convenience of description, this application is described through the sequence 201-211, and is not intended to limit the execution to the above sequence. The embodiments of the present application do not limit the execution sequence, execution time, number of executions, etc. of one or more of the above steps.
  • the execution subject of steps 201-202 of the data processing method is the second node (such as the information source collection unit), the execution subject of steps 203-205, 210, and 211 is the server, and the execution subject of steps 206-209 is the first node (such as The information source acquisition unit) is described as an example, and this application is also applicable to other execution entities.
  • Steps 201-211 are as follows:
  • the second node performs compression encoding on the collected second initial data to obtain the third data
  • the coding methods of the compression coding process can be divided into three categories: (1) According to the statistical characteristics of the information source, methods such as predictive coding, transform coding, vector quantization coding, sub-band coding, and neural network coding are used. (2) According to the visual characteristics of the human eye, image coding based on directional filtering, image contour-ethical coding based on image, coding based on wavelet analysis and other methods are used. (3) According to the characteristics of the transferred scene: use methods such as fractal coding and model-based coding.
  • the second node may send the third data to the server through a wireless transmission channel.
  • the server receives the third data from the second node
  • the cross-node auxiliary information is related to the first initial data collected by the first node and the second initial data collected by the second node. Information;
  • the cross-node auxiliary information can be understood as information related to the first initial data collected by the first node and the second initial data collected by the second node.
  • the relevant information can be understood as repeated information with the same value in the first initial data and the second initial data, or related information.
  • node 1 collects the audio information of environment 1
  • node 2 collects the video information of environment 1.
  • This relevant information means that the data collected by node 1 and node 2 both correspond to the same environment.
  • node 1 collects sky information during the day
  • node 2 collects sky information at night. The relevant information is that the data collected by node 1 and node 2 both correspond to the sky.
  • cross-node auxiliary information is obtained by inputting the third data into a preset model for processing.
  • the preset model is a trained model. Through neural network training, a model that can obtain cross-node auxiliary information is obtained.
  • the server sends the extracted cross-node auxiliary information between the first node and the second node to the first node, so that the first node can perform redundancy processing.
  • the server processes the third data from the second node to obtain cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node.
  • This helps the first node to perform de-redundant processing on the first data based on the received cross-node auxiliary information, which further helps the server to finally receive data from the first node as de-redundant data.
  • Using this method can reduce the data transmission volume of nodes, thereby reducing bandwidth resource consumption, improving transmission robustness, and improving server processing efficiency.
  • the first node performs compression encoding on the collected first initial data to obtain the first data
  • the first data is data obtained by compressing and encoding the initial data.
  • the execution order of step 206 may be before steps 202-205, which is not strictly limited in this solution.
  • the method further includes:
  • the server sends first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
  • each node receives the instruction information sent by the server respectively. That is, the first initial data collected by the first node is data of the first modality, and the second initial data collected by the second node is data of the second modality.
  • the data in the first mode or the data in the second mode may be, for example, one or more of audio signals, video signals, image signals, or environmental monitoring sensor signals.
  • the server can directly send the above instruction information, or send the instruction information based on the request sent by each node. This solution does not impose strict restrictions on this.
  • the first node sends a request to the server to instruct the server to perform initial configuration.
  • the server may perform initial configuration directly.
  • the initial configuration may include the above-mentioned first instruction information to instruct the first node to collect data of the first mode.
  • the embodiment of this application only introduces the initial configuration including instructing the first node to collect data of the first mode as an example.
  • the initial configuration may also include other information.
  • initial configuration may include:
  • a) Notify the compression parameters, including modal identification (which can indicate different modalities to achieve single/multi-modal processing), and the final layer network output dimension or length.
  • Feature transmission data type indication that is, the specific format of the data code stream when each part of the feature is transmitted, such as directly sending the output parameters of the network (such as two adjacent real numbers forming a complex symbol, resulting in a string of complex symbols), Or the value after quantization (requires channel coding and modulation to obtain a series of complex symbols).
  • c) Indicate timestamp information, which can include specifying the maximum time difference (the corresponding data packet will be discarded when timeout occurs, a value of 0 means no setting), period information (0 means no period is set, greater than 0 means setting the period), etc.
  • the configured signaling can be in the following format (the node can actively set it and then request the server/base station, or the server/base station can configure it directly):
  • the quantization bit width only needs to be sent when the data type is "quantized value".
  • the above parameters can be sent in one signaling, or they can be encapsulated in different signaling and sent at different times. This solution does not limit this.
  • the number of transmission resources required for a single inference can be calculated through the output dimension or length and data type, which can correspond to the number of symbols, for example.
  • the data type is "quantized value”
  • adding a timestamp is used to indicate the current time of sending characteristic data, which facilitates data synchronization between multiple nodes.
  • the absolute time value t can be recorded (the time interval can be one or more time slots, subframes, symbols, etc., and is not specifically limited; the length of the interval between t and t-1 can be determined by the sending end and the receiving end through negotiation, or it can Determine the interval length by predefinition or preconfiguration), or set a certain period T0 (T0>0), and only record mod(t, T0), mod represents the remainder operation.
  • the timestamp can also be converted into binary data and then channel coded and modulated to obtain a complex signal. For example, low code rate channel coding and low-order modulation can be performed to ensure reliable transmission.
  • the length of the processed code stream to be sent (a string of complex symbols) is consistent with the number of transmission resources.
  • a characteristic data length field (which can be channel coded and modulated together with the timestamp) can be added before the characteristic data code stream to indicate the actual length of the code stream.
  • the above corresponding code streams can be transmitted through PUSCH (applicable to nodes 1 and 2 ⁇ server/base station) or PDSCH (applicable to server/base station ⁇ nodes 1 and 2). In addition, they can also be sent at the physical layer after MAC packetization. .
  • the corresponding data packet will be discarded and no subsequent sending operation will be performed.
  • the redundancy removal process may include, for example, removing cross-node auxiliary information from the first data, or removing information related to the cross-node auxiliary information from the first data, thereby obtaining second data after redundancy removal. That is to say, the second data and the first data have no duplicate information, or in other words, no related information.
  • the node inputs the received cross-node auxiliary information from the server and the first data into a first preset model for processing to obtain the second data.
  • the first preset model is obtained after training.
  • the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the first preset model is triggered.
  • the first node sends the second data to the server through the wireless transmission channel, so that the server can process the corresponding task.
  • the first node performs de-redundant processing on the first data based on the received cross-node auxiliary information from the server or base station, and correlates the first initial data collected by the first node and the second initial data collected by the second node.
  • the information is removed to obtain second data, and the second data is sent to the server.
  • the data transmission of the node can be reduced. amount, thereby reducing bandwidth resource consumption and improving transmission robustness.
  • the server receives the second data from the first node
  • the server obtains the cross-node auxiliary information from the third data sent by the second node, and sends the cross-node auxiliary information to the first node so that the first node can perform de-redundancy processing and obtain the second data, and then the server will send the cross-node auxiliary information from
  • the second data from the first node and the third data from the second node are fused and processed, and corresponding tasks are performed. That is to say, the data finally processed by the server for fusion is deredundant data without duplicate or related information. This not only reduces the bandwidth resources used by the first node to transmit data, but also improves the processing efficiency of the server.
  • the server processes the third data from the second node to obtain the cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node.
  • the first node performs de-redundant processing on the first data according to the received cross-node auxiliary information, removes the relevant information between the first node and the second node, obtains the second data, and sends the second data to the server.
  • the second data helps the server to finally receive the data from the first node as deredundant data.
  • the first node removes the relevant information between it and the second node, and then sends the de-redundant data to the server. This can reduce the data transmission volume of the node, thereby reducing bandwidth resource consumption and improving It improves transmission robustness and improves server processing efficiency.
  • FIG. 3 is a schematic diagram of a data processing method provided by an embodiment of the present application.
  • this method can be applied to the aforementioned data processing system, such as the data processing system shown in Figure 1 .
  • node 1 passes the data or features of the specific modality through the encoder.
  • Encoder A1 compresses and encodes the data and sends it to the server/base station through the wireless channel.
  • the server/base station After the server/base station receives the data sent by Node 1, it obtains the cross-node auxiliary information in the data based on the encoder Encoder A2 .
  • the cross-node auxiliary information can be part of the data sent by Node 1, or it can be all the data. .
  • the server/base station sends the cross-node assistance information to node 2 (which may be the aforementioned first node, for example).
  • Node 2 compresses and encodes the data or features of the specific modality through the encoder Encoder B1 , decodes the received cross-node auxiliary information through the decoder A2 , and then combines the data encoded by Encoder B1 with the decoded data by Decoder A2 .
  • the cross-node auxiliary information is operated by the attention mechanism to guide the feature encoding, that is, the feature extraction shown in Figure 3 is performed to obtain the deredundant processed data, which is encoded by the encoder Encoder B2 , and then the obtained The data is sent to the server/base station.
  • the server/base station also decodes the data sent by Node 1 through Decoder A1 , and performs an attention mechanism operation on the decoded data to better restore the data sent by Node 1. Moreover, the server/base station decodes the received data from node 2, and then fuses the data of the two nodes to perform corresponding tasks.
  • one node only transmits data that is not relevant to the data of the other node, which can reduce the amount of feature transmission. and bandwidth resource occupancy. Further, when the total bandwidth occupancy is similar, the accuracy of the final task execution can be improved.
  • FIG. 4 is a schematic diagram of another data processing method provided by an embodiment of the present application.
  • node 1 for example, it can be the aforementioned second node
  • modal data with smaller dimensions (such as audio, environmental monitoring sensor signals)
  • Node 2 for example, it may be the aforementioned first node
  • the data collected by node 1 can be compressed into fewer symbols for transmission, so it only takes up less bandwidth resources.
  • the server/base station receives the low-dimensional modal data, and then sends part or all of the data to node 2, such as a high-dimensional modal data collection terminal.
  • Node 2 performs attention operations on the data sent by the server/base station and the high-dimensional modal data extracted by the node 2 to guide feature encoding, thereby effectively reducing the transmission volume and bandwidth resource occupation of high-dimensional modal data.
  • the method includes: first performing initial configuration.
  • the initial configuration for example, configures compression parameters, transmission data types, resource allocation, timestamp information, etc.
  • the initial configuration for example, configures compression parameters, transmission data types, resource allocation, timestamp information, etc.
  • Node 1 compresses the collected data/features into smaller dimensions and transmits them to the remote server/base station through the noise channel.
  • the remote server/base station receives the characteristics sent by node 1, obtains the cross-node auxiliary information, and then transmits the cross-node auxiliary information to node 2.
  • Node 2 compresses and encodes the collected data/features, decodes the received cross-node auxiliary information, and then performs attention calculation on the two to extract features that are not related to node 1, and converts the calculated node 2 Features are transmitted to the remote server/base station for further information fusion and calculation.
  • the characteristic data (complex signal) is obtained after processing according to the configured data type.
  • the characteristic data and channel coding and modulation are concatenated and sent to the server/base station.
  • the server/base station it is based on the network output (real number) of Encoder A2 , which is processed according to the configured data type to obtain characteristic data (complex signal); the characteristic data and the timestamp code stream and characteristic data after channel coding and modulation are The length is concatenated and sent to node 2.
  • the characteristic data (complex signal) is obtained after processing according to the configured data type; the characteristic data is combined with the timestamp code stream and characteristic data length after channel coding and modulation. After splicing, it is sent to the server/base station.
  • the encoders Encoder A1 , Encoder A2 , Encoder B1 , Encoder B2 shown in Figure 3 can respectively correspond to Neural network model.
  • the embodiments of the present application do not limit the number and types of models.
  • this embodiment of the present application also provides a model training method.
  • the training process includes: initial configuration, regularly sending pilot sequences to monitor changes in received signal-to-noise ratio (SNR), and model training/updating. It includes the following steps:
  • Initial configuration It can mainly configure the compression parameters, transmission data types, resource allocation, timestamp information, training parameters, etc. of each model.
  • This configuration can include:
  • a) Notify the compression parameters, including modal identification (which can indicate different modalities and achieve single/multi-modal processing), the final layer network output dimension or length, and the reverse gradient dimension or length;
  • Feature transmission data type indication that is, the specific format of the data stream when each part of the feature is transmitted, such as directly sending the output parameters of the network (two adjacent real values form a complex symbol), or quantized values (requires channel coding and modulation before sending);
  • c) Indicate training timestamp information, mainly specifying the maximum time difference (the corresponding data packet will be discarded when timeout occurs, a value of 0 means no setting), period information (0 means no period is set, greater than 0 means setting the period), etc.;
  • d) Indicate training parameters, including conventional model training parameters (such as update rate, batch size/number, maximum number of iterations, loss function threshold, etc.) as well as SNR change threshold and monitoring period for monitoring model updates;
  • the configured signaling can be in the following format (the node can actively set it and then request it from the server/base station, or the server/base station can configure it directly):
  • the above parameters can be sent in one signaling, or they can be encapsulated in different signaling and sent at different times;
  • the number of transmission resources required to train a single batch can be calculated through the output dimension or length and data type:
  • Each node sends a pilot sequence to the server/base station respectively, and the server/base station sends a pilot sequence to node 2 to estimate the SNR of the relevant transmission link.
  • the purpose of this processing is: under different SNR, the compression rate or coding rate of the transmitted data will become an important factor affecting the accuracy of the server or base station's task execution. Therefore, the compression rate or coding rate of the transmitted data can be adjusted by the calculated SNR. Code rate; in addition, different models can be selected for online inference based on different SNR.
  • model training is triggered.
  • this solution does not impose strict restrictions on this.
  • the existing pilot sequence can be reused and regular detection can be performed according to the initially configured monitoring period.
  • the server/base station sends training requests or instructions to node 1 and node 2.
  • Each node prepares data after receiving the instructions, starts training after completing data synchronization, and sends a training completion instruction after the training is completed.
  • the data set is synchronized.
  • Training data interaction mainly includes forward transmission (sending features) and reverse transmission (sending gradients).
  • the forward transmission includes: transmitting the characteristics of node 1 to the server/base station, the server/base station sending the characteristics of node 1 to node 2, and transmitting the characteristics of node 2 to the server/base station.
  • the following characteristic transmission format can be used:
  • Reverse transmission includes: the server/base station sends gradient information to node 2 (for updating the node 2 encoder), node 2 sends gradient information to the server/base station (for updating the cross-node auxiliary information encoder), the server/base station Send gradient information to node 1 (for node 1 encoder update).
  • the gradient information can be sent in the following transmission format, for example:
  • training timestamp code stream used to indicate the time of the current gradient information to facilitate data synchronization between multiple nodes
  • Gradient data code stream According to the initially configured compression parameters and data type, the processed code stream to be sent (the optional processing method configured in the initial stage can be consistent with the corresponding processing method of the configured characteristic transmission data type: directly convert real numbers
  • the gradient values are spliced into a complex signal; or quantized first, followed by channel coding and modulation to obtain a complex signal); optionally, the gradient data length field can be added before the gradient data code stream (channel coding and modulation are performed together with the timestamp ), indicating the actual length of the code stream;
  • the corresponding code stream can be transmitted through PUSCH (applicable to nodes 1 and 2 ⁇ server/base station) or PDSCH (applicable to server/base station ⁇ nodes 2 and 1). In addition, it can also be sent at the physical layer after being packaged by MAC.
  • the embodiment of the present application also provides an architecture of an encoder and a decoder.
  • Figure 7a shows the frame structure of Attention A shown in Figure 3
  • Figure 7b shows the frame structure of Attention B shown in Figure 3.
  • the network structure shown in Figure 3 contains multiple codecs and Attention.
  • the input of Attention A is the audio feature Y A decoded by the server/base station.
  • the data size can be 32*10*64, which is then calculated by self-attention and a feed-forward network (FFN). Get the output.
  • FNN feed-forward network
  • the input of Attention B is the visual feature x B of the video capture terminal and the decoded audio feature Z A fed back to the video capture terminal (the data sizes of x B and Z A are both 32*10*64), and then through crossover Attention calculation and a feed-forward neural network get the output result (the size of the output result is 32*10*64).
  • Figure 8a shows a network structure of the audio encoder Encoder A1
  • Figure 8b shows a network structure of the visual encoder Encoder B2
  • the audio encoder Encoder A1 uses a residual fully connected network to reduce the loss of audio features during the compression process, where the data input dimension is 32*10*128.
  • the visual encoder Encoder B2 splices the encoded visual feature x B and the x B * after audio-visual attention calculation (the data dimensions of x B * and x B are both 32*10*64), and then Use a layer of fully connected network for compression (the dimension of the compressed data is 32*10*48).
  • FC is the fully connected layer (Fully Connected, FC).
  • Figure 9 shows a network structure of the visual encoder Encoder B1 .
  • Encoder B1 compresses and encodes the read video image features and timing features.
  • the dimension size of the image features is 32*80*2048, and the dimension size of the timing features is 32*10*512.
  • Its input includes image features and temporal features.
  • an average pooling operation is performed on the image features (the data output after pooling is 32*10*2048), and then the data is compressed through a layer of full connection.
  • the final output size 32*10*64 is obtained through a layer of fully connected layers.
  • Decoder A1 and Decoder B2 can respectively use a layer of fully connected networks to expand the dimensions of audio features and video features respectively, so that Y A * and Y B have the same dimensions and are both 32*10*64 in size.
  • Encoder A2 transmits and encodes the feedback audio features. It can use a layer of fully connected network to calculate the input features. The input and output sizes are both 32*10*26.
  • Decoder A2 uses a layer of fully connected network to mainly implement dimension expansion of the delivered audio auxiliary information, that is, to make the dimension of Z A consistent with the dimension of x B for subsequent Attention operations (its input dimension is 32*10*26 , the output dimension is 32*10*64).
  • the embodiment of the present application also provides another encoder and decoder architecture. Take the scenario where node 1 collects image signals and node 2 collects video signals as an example.
  • Figure 10a shows another frame structure of the encoder Encoder A1 shown in Figure 3.
  • Figure 10b shows the frame structure of the encoder Encoder B2 shown in Figure 3.
  • Figure 10c shows the encoder shown in Figure 3.
  • the frame structure of Encoder B1 is a residual convolutional network, which compresses and codes the collected image features. The residual design is used to reduce the loss of information.
  • Encoder B2 splices the attention-operated information and visual features through a layer of convolutional network (for example, the convolution kernel size is 1*1) to achieve the fusion of multiple modal information.
  • Encoder B1 compresses and encodes the image features and timing features of the read video. The compression process is implemented by using a 3*3 convolution kernel and a 1*1 convolution kernel respectively.
  • the dimension size of the input data and output data of the above encoder is consistent with the previous embodiment.
  • Decoder A1 and Decoder B2 can also use other network structures, such as long short-term memory (LSTM) networks to achieve compression and dimension expansion of input features.
  • LSTM long short-term memory
  • the division of multiple units or modules is only a logical division based on functions and does not limit the specific structure of the device.
  • some of the functional modules may be subdivided into more small functional modules, and some of the functional modules may also be combined into one functional module.
  • some devices include a receiving unit and a transmitting unit.
  • the sending unit and the receiving unit can also be integrated into a communication unit, and the communication unit can realize the functions realized by the receiving unit and the sending unit.
  • each unit corresponds to its own program code (or program instruction).
  • program codes corresponding to these units are run on the processor, the unit is controlled by the processing unit and executes the corresponding process to achieve the corresponding function. .
  • Embodiments of the present application also provide a device for implementing any of the above methods.
  • a data processing device is provided that includes modules (or means) for implementing each step performed by the first node in any of the above methods.
  • another data processing device is also provided, including modules (or means) used to implement each step performed by the server or base station in any of the above methods.
  • FIG. 11 is a schematic structural diagram of a data processing device provided by an embodiment of the present application.
  • the data processing device is used to implement the aforementioned data processing method, such as the data processing method executed by the first node shown in FIG. 2 .
  • the device may include a first processing module 1101, a receiving module 1102, a second processing module 1103 and a sending module 1104, specifically as follows:
  • the first processing module 1101 is used to compress and encode the collected first initial data to obtain the first data
  • the receiving module 1102 is used to receive cross-node auxiliary information from the server or base station;
  • the second processing module 1103 is configured to perform de-redundant processing on the first data according to the cross-node auxiliary information to obtain second data, where the cross-node auxiliary information is related to the data collected by the first node. Information related to the first initial data and the second initial data collected by the second node;
  • the sending module 1104 is used to send the second data.
  • the receiving module 1102 is also used to:
  • the first initial data collected by the first node is data of the first modality.
  • the second processing module 1103 is used to:
  • Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered.
  • the first default model is used.
  • FIG. 12 is a schematic structural diagram of another data processing device provided by an embodiment of the present application.
  • the data processing device is used to implement the aforementioned data processing method, such as the data processing method executed by the server shown in FIG. 2 .
  • the device may include a receiving module 1201, a processing module 1202 and a sending module 1203, specifically as follows:
  • the receiving module 1201 is configured to receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
  • the processing module 1202 is configured to process the third data to obtain cross-node auxiliary information, where the cross-node auxiliary information is related to the first initial data collected by the first node and the third data collected by the second node. 2. Information related to initial data;
  • Sending module 1203, configured to send the cross-node auxiliary information to the first node.
  • the receiving module 1201 is also configured to receive second data from the first node
  • the processing module 1202 is also used to perform fusion processing on the second data and the third data.
  • the sending module 1203 is also used to:
  • the first initial data collected by the first node is data of the first modality
  • the second initial data collected by the second node is data of the second modality.
  • processing module 1202 is also used to:
  • the third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
  • each module in each device above is only a division of logical functions. In actual implementation, it can be fully or partially integrated into a physical entity, or it can also be physically separated.
  • the modules in the data processing device can be implemented in the form of the processor calling software; for example, the data processing device includes a processor, the processor is connected to a memory, instructions are stored in the memory, and the processor calls the instructions stored in the memory to achieve the above. Any method or function of each module of the device is implemented, where the processor is, for example, a general-purpose processor, such as a central processing unit (CPU) or a microprocessor, and the memory is a memory within the device or a memory outside the device.
  • CPU central processing unit
  • microprocessor a microprocessor
  • the modules in the device can be implemented in the form of hardware circuits, and some or all of the unit functions can be implemented through the design of the hardware circuits, which can be understood as one or more processors; for example, in one implementation,
  • the hardware circuit is an application-specific integrated circuit (ASIC), which realizes the functions of some or all of the above units through the design of the logical relationships of the components in the circuit; for another example, in another implementation, the hardware circuit is It can be realized by programmable logic device (PLD), taking field programmable gate array (FPGA) as an example, which can include a large number of logic gate circuits, and the logic gate circuits are configured through configuration files. connection relationships, thereby realizing the functions of some or all of the above units. All modules of the above device may be fully implemented by the processor calling software, or all may be implemented by hardware circuits, or part of the modules may be implemented by the processor calling software, and the remaining part may be implemented by hardware circuits.
  • PLD programmable logic device
  • FPGA field programmable gate array
  • FIG. 13 is a schematic diagram of the hardware structure of another data processing device provided by an embodiment of the present application.
  • the data processing device 1300 shown in FIG. 13 includes a memory 1301, a processor 1302, a communication interface 1303 and a bus 1304.
  • the memory 1301, the processor 1302, and the communication interface 1303 implement communication connections between each other through the bus 1304.
  • the memory 1301 may be a read only memory (ROM), a static storage device, a dynamic storage device or a random access memory (RAM).
  • ROM read only memory
  • RAM random access memory
  • the memory 1301 can store programs. When the program stored in the memory 1301 is executed by the processor 1302, the processor 1302 and the communication interface 1303 are used to execute various steps of the data processing method in the embodiment of the present application.
  • the processor 1302 is a circuit with signal processing capabilities.
  • the processor 1302 can be a circuit with the ability to read and run instructions, such as a central processing unit (CPU), a microprocessor, a graphics processor (graphics processor) processing unit (GPU) (can be understood as a microprocessor), or digital signal processor (digital signal processor, DSP), etc.; in another implementation, the processor 1302 can implement certain functions through the logical relationship of the hardware circuit , the logical relationship of the hardware circuit is fixed or reconfigurable, for example, the processor 1302 implements a hardware circuit for an ASIC or a programmable logic device PLD, such as an FPGA.
  • a programmable logic device PLD such as an FPGA.
  • the process of the processor loading the configuration file and realizing the hardware circuit configuration can be understood as the process of the processor loading instructions to realize the functions of some or all of the above modules.
  • it can also be a hardware circuit designed for artificial intelligence, which can be understood as an ASIC, such as a neural network processing unit (NPU), tensor processing unit (TPU), deep learning processing Unit (deep learning processing unit, DPU), etc.
  • the processor 1302 is used to execute relevant programs to implement the functions required to be performed by the units in the data processing device in the embodiment of the present application, or to execute the data processing method in the method embodiment of the present application.
  • each module in the above device can be one or more processors (or processing circuits) configured to implement the above method, such as: CPU, GPU, NPU, TPU, DPU, microprocessor, DSP, ASIC, FPGA , or a combination of at least two of these processor forms.
  • the SOC may include at least one processor for implementing any of the above methods or implementing the functions of each module of the device.
  • the at least one processor may be of different types, such as a CPU and an FPGA, or a CPU and an artificial intelligence processor. CPU and GPU etc.
  • the communication interface 1303 uses a transceiver device such as but not limited to a transceiver to implement communication between the device 1300 and other devices or communication networks. For example, data can be obtained through communication interface 1303.
  • a transceiver device such as but not limited to a transceiver to implement communication between the device 1300 and other devices or communication networks. For example, data can be obtained through communication interface 1303.
  • Bus 1304 may include a path that carries information between various components of device 1300 (eg, memory 1301, processor 1302, communication interface 1303).
  • the device 1300 shown in Figure 13 only shows a memory, a processor, and a communication interface, during specific implementation, those skilled in the art will understand that the device 1300 also includes other devices necessary for normal operation. . At the same time, based on specific needs, those skilled in the art should understand that the device 1300 may also include hardware devices that implement other additional functions. In addition, those skilled in the art should understand that the device 1300 may only include components necessary to implement the embodiments of the present application, and does not necessarily include all components shown in FIG. 13 .
  • Embodiments of the present application also provide a data processing system.
  • the system includes a server or a base station, and also includes a first node, wherein: the server or base station is used to implement the data processing method provided in the second aspect. One or more steps; the first node is used to implement one or more steps in the data processing method provided by the first aspect.
  • Embodiments of the present application also provide a computer-readable storage medium.
  • the computer-readable storage medium stores instructions, which when run on a computer or processor, cause the computer or processor to execute one of the above methods. or multiple steps.
  • An embodiment of the present application also provides a computer program product containing instructions.
  • the computer program product is run on a computer or processor, the computer or processor is caused to perform one or more steps in any of the above methods.
  • a, b, or c can mean: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple .
  • words such as “first” and “second” are used to distinguish identical or similar items with basically the same functions and effects. Those skilled in the art can understand that words such as “first” and “second” do not limit the number and execution order, and words such as “first” and “second” do not limit the number and execution order.
  • words such as “exemplary” or “for example” are used to represent examples, illustrations or explanations. Any embodiment or design described as “exemplary” or “such as” in the embodiments of the present application is not to be construed as preferred or advantageous over other embodiments or designs. Rather, the use of words such as “exemplary” or “such as” is intended to present related concepts in a concrete manner that is easier to understand.
  • a unit described as a separate component may or may not be physically separate.
  • a component shown as a unit may or may not be a physical unit, that is, it may be located in one place, or it may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
  • the computer program product includes one or more computer instructions.
  • the computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device.
  • the computer instructions may be stored in or transmitted over a computer-readable storage medium.
  • the computer instructions can be transmitted from one website, computer, server or data center to another through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means A website site, computer, server or data center for transmission.
  • the computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device such as a server or data center integrated with one or more available media.
  • the available media may be read-only memory (ROM), random access memory (RAM), or magnetic media, such as floppy disks, hard disks, tapes, disks, or optical media, such as , digital versatile disc (digital versatile disc, DVD), or semiconductor media, such as solid state drive (solid state disk, SSD), etc.

Landscapes

  • Engineering & Computer Science (AREA)
  • Theoretical Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Mobile Radio Communication Systems (AREA)

Abstract

Embodiments of the present application provide a data processing method, device and system, and a storage medium. The method may comprise: performing compression encoding processing on collected first initial data to obtain first data; receiving cross-node auxiliary information from a server or a base station; performing redundancy removal processing on the first data according to the cross-node auxiliary information to obtain second data, wherein the cross-node auxiliary information is information related to the first initial data collected by a first node and second initial data collected by a second node; and sending the second data. By using the means, the information related to the first initial data collected by the first node and the second initial data collected by the second node is removed, and then the data subjected to redundancy removal is sent to the server, so that the data transmission volume of the node can be reduced, thereby reducing the bandwidth resource consumption, and improving the transmission robustness.

Description

数据处理方法、装置、系统以及存储介质Data processing methods, devices, systems and storage media 技术领域Technical field
本申请涉及通信技术领域,尤其涉及一种数据处理方法、装置、系统以及存储介质。The present application relates to the field of communication technology, and in particular, to a data processing method, device, system and storage medium.
背景技术Background technique
分布式单/多模态信号处理,是面向任务的信源信道联合编码/语义通信领域的一个典型场景。该场景下,相同或者不同模态的信源采集单元/节点/终端需要将采集到的信息经过处理后发送到服务器/基站,进行融合处理,以执行特定的任务。该信源可以是图像、视频、音频、传感器参数等信号。例如,当一个节点采集的是视频信息,一个节点采集的是音频信息,则可以联合执行视听解析任务,具体包括对音频、视觉以及视听事件的检测识别,并判断这些事件哪些是可见的、可听的以及可见且可听的。Distributed single/multi-modal signal processing is a typical scenario in the field of task-oriented source channel joint coding/semantic communication. In this scenario, source acquisition units/nodes/terminals of the same or different modes need to process the collected information and send it to the server/base station for fusion processing to perform specific tasks. The source can be images, videos, audio, sensor parameters and other signals. For example, when one node collects video information and another node collects audio information, they can jointly perform audio-visual analysis tasks, including the detection and identification of audio, visual and audio-visual events, and determine which of these events are visible and possible. audible as well as visible and audible.
在分布式无线传输的情况下,现有技术中的发送端,对于视听特征的提取分别采用了2D和3D的深度残差网络ResNet和VGGish模型,然后通过视觉编码器和音频编码器分别对提取的特征进行压缩编码,最后发送出去。该发送的信号经过噪声信道传输到远端的服务器进行处理。在接收端,采用一个Transformer模型的结构对接收的音频和视频特征进行联合解码,最后经过一层的全连接层和激活函数Softmax输出事件的概率。In the case of distributed wireless transmission, the sending end in the existing technology uses 2D and 3D deep residual network ResNet and VGGish models respectively to extract audio-visual features, and then extracts them through visual encoders and audio encoders respectively. The features are compressed and encoded, and finally sent out. The sent signal is transmitted to the remote server through the noise channel for processing. At the receiving end, a Transformer model structure is used to jointly decode the received audio and video features, and finally the probability of the event is output through a fully connected layer and the activation function Softmax.
由于该方案未充分考虑不同节点、模态信源之间的相关性,例如第一节点采集的视频特征和第二节点采集的音频特征均对应的是同一个环境的特征,此时该视频特征和音频特征之间可能存在一些关联信息。则当两个节点各自独立编码并传输自己的数据到服务器,服务器需要对两份数据进行处理,由于该两份数据中存在一些关联信息,因此会导致服务器处理数据比较冗余,带来一定程度传输资源浪费和传输性能损失。Since this solution does not fully consider the correlation between different nodes and modal sources, for example, the video features collected by the first node and the audio features collected by the second node both correspond to the characteristics of the same environment. At this time, the video features There may be some correlation information between audio features. Then when two nodes independently encode and transmit their own data to the server, the server needs to process the two pieces of data. Since there is some related information in the two pieces of data, it will cause the server to process data redundantly, causing a certain degree of redundancy. Waste of transmission resources and loss of transmission performance.
发明内容Contents of the invention
本申请公开了一种数据处理方法、装置、系统以及存储介质,可以减少带宽资源消耗,并提升数据传输的鲁棒性。This application discloses a data processing method, device, system and storage medium, which can reduce bandwidth resource consumption and improve the robustness of data transmission.
第一方面,本申请实施例提供一种数据处理方法,应用于第一节点,包括:In the first aspect, embodiments of the present application provide a data processing method, applied to the first node, including:
对采集的第一初始数据进行压缩编码处理,以得到第一数据;Perform compression encoding processing on the collected first initial data to obtain the first data;
接收来自服务器或基站的跨节点辅助信息;Receive cross-node assistance information from the server or base station;
根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The first data is de-redundantly processed according to the cross-node auxiliary information to obtain the second data. The cross-node auxiliary information is the first initial data collected by the first node and the second node. Information related to the second initial data collected;
发送所述第二数据。Send the second data.
本申请实施例,第一节点基于接收到的来自服务器或基站的跨节点辅助信息对第一数据进行去冗余处理,将与第一节点采集的第一初始数据和第二节点采集的第二初始数据相关的信息去除,得到第二数据,并向所述服务器发送该第二数据。采用该手段,通过将与第一节点采集的第一初始数据和第二节点采集的第二初始数据相关的信息去除,然后向服务器发送该去冗余的数据,这样可以降低该节点的数据传输量,进而减少带宽资源消耗,并提升了传输鲁棒性。In this embodiment of the present application, the first node performs de-redundant processing on the first data based on the received cross-node auxiliary information from the server or base station, and combines it with the first initial data collected by the first node and the second data collected by the second node. Information related to the initial data is removed to obtain second data, and the second data is sent to the server. Using this method, by removing the information related to the first initial data collected by the first node and the second initial data collected by the second node, and then sending the de-redundant data to the server, the data transmission of the node can be reduced. amount, thereby reducing bandwidth resource consumption and improving transmission robustness.
在一种可能的实现方式中,所述方法还包括:In a possible implementation, the method further includes:
接收来自所述服务器或基站的第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据。Receive first instruction information from the server or base station, where the first instruction information is used to instruct the first node to collect data in the first mode.
该第一模态例如可以是视频、音频、图像等。The first modality may be, for example, video, audio, image, etc.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为所述第一模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality.
在一种可能的实现方式中,所述根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,包括:In a possible implementation, the de-redundancy processing of the first data according to the cross-node auxiliary information to obtain the second data includes:
将所述跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered. The first default model.
通过在系统的接收信噪比变化值超出阈值时,触发训练模型,这样不断训练、更新模型,使得模型在进行去冗余处理时性能更好,传输鲁棒性更高。By triggering the training model when the system's received signal-to-noise ratio change value exceeds the threshold, the model is continuously trained and updated, resulting in better model performance and higher transmission robustness during de-redundancy processing.
第二方面,本申请实施例提供一种数据处理方法,应用于服务器或基站,包括:In the second aspect, embodiments of the present application provide a data processing method, applied to a server or base station, including:
接收来自第二节点的第三数据,所述第三数据是所述第二节点对采集的第二初始数据进行压缩编码处理后得到的;Receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The third data is processed to obtain cross-node auxiliary information. The cross-node auxiliary information is information related to the first initial data collected by the first node and the second initial data collected by the second node. ;
向所述第一节点发送所述跨节点辅助信息。Send the cross-node assistance information to the first node.
本申请实施例,服务器通过对来自第二节点的第三数据进行处理,得到第二节点和第一节点之间的跨节点辅助信息,然后将该跨节点辅助信息发送给第一节点。这样有助于第一节点根据接收到的跨节点辅助信息对第一数据进行去冗余处理,进而有助于服务器最终接收到的来自第一节点的数据是去冗余的数据。采用该手段,可以降低节点的数据传输量,进而减少带宽资源消耗,并提升了传输鲁棒性,同时也提高了服务器的处理效率。In this embodiment of the present application, the server processes the third data from the second node to obtain the cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node. This helps the first node to perform de-redundant processing on the first data based on the received cross-node auxiliary information, which further helps the server to finally receive data from the first node as de-redundant data. Using this method can reduce the data transmission volume of nodes, thereby reducing bandwidth resource consumption, improving transmission robustness, and improving server processing efficiency.
在一种可能的实现方式中,所述方法还包括:In a possible implementation, the method further includes:
接收来自所述第一节点的第二数据;receiving second data from the first node;
对所述第二数据和所述第三数据进行融合处理。Perform fusion processing on the second data and the third data.
该方案中服务器进行融合处理的数据是去冗余后的,也就是说第二数据和第三数据之间没有重复或相关的信息,这样提高了服务器的处理效率。In this solution, the data for fusion processing by the server is deredundant, which means that there is no duplicate or related information between the second data and the third data, which improves the processing efficiency of the server.
在一种可能的实现方式中,所述方法还包括:In a possible implementation, the method further includes:
向所述第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;Send first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality, and the second initial data collected by the second node is data of the second modality. .
在一种可能的实现方式中,所述对所述第三数据进行处理,以得到跨节点辅助信息,包括:In a possible implementation, processing the third data to obtain cross-node auxiliary information includes:
将所述第三数据输入至第二预设模型中进行处理,以得到所述跨节点辅助信息,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第二预设模型。The third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
通过在系统的接收信噪比变化值超出阈值时,触发训练模型,这样不断训练、更新模型, 使得模型在进行去冗余处理时性能更好,传输鲁棒性更高。By triggering the training model when the system's received signal-to-noise ratio change value exceeds the threshold, the model is continuously trained and updated, so that the model performs better during de-redundancy processing and has higher transmission robustness.
第三方面,本申请实施例提供一种数据处理装置,包括:In a third aspect, embodiments of the present application provide a data processing device, including:
第一处理模块,用于对采集的第一初始数据进行压缩编码处理,以得到第一数据;The first processing module is used to compress and encode the collected first initial data to obtain the first data;
接收模块,用于接收来自服务器或基站的跨节点辅助信息;A receiving module used to receive cross-node auxiliary information from the server or base station;
第二处理模块,用于根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The second processing module is configured to perform de-redundant processing on the first data according to the cross-node auxiliary information to obtain the second data. The cross-node auxiliary information is the same as the third data collected by the first node. Information related to the first initial data and the second initial data collected by the second node;
发送模块,用于发送所述第二数据。A sending module, configured to send the second data.
在一种可能的实现方式中,所述接收模块,还用于:In a possible implementation, the receiving module is also used to:
接收来自所述服务器或基站的第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据。Receive first instruction information from the server or base station, where the first instruction information is used to instruct the first node to collect data in the first mode.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为所述第一模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality.
在一种可能的实现方式中,所述第二处理模块,用于:In a possible implementation, the second processing module is used to:
将所述跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered. The first default model.
第四方面,本申请提供了一种数据处理装置,包括:In a fourth aspect, this application provides a data processing device, including:
接收模块,用于接收来自第二节点的第三数据,所述第三数据是所述第二节点对采集的第二初始数据进行压缩编码处理后得到的;A receiving module, configured to receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
处理模块,用于对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;A processing module, configured to process the third data to obtain cross-node auxiliary information, where the cross-node auxiliary information is related to the first initial data collected by the first node and the second data collected by the second node. Information related to initial data;
发送模块,用于向所述第一节点发送所述跨节点辅助信息。A sending module, configured to send the cross-node auxiliary information to the first node.
在一种可能的实现方式中,所述接收模块,还用于接收来自所述第一节点的第二数据;In a possible implementation, the receiving module is also configured to receive second data from the first node;
所述处理模块,还用于对所述第二数据和所述第三数据进行融合处理。The processing module is also used to perform fusion processing on the second data and the third data.
在一种可能的实现方式中,所述发送模块,还用于:In a possible implementation, the sending module is also used to:
向所述第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;Send first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality, and the second initial data collected by the second node is data of the second modality. .
在一种可能的实现方式中,所述处理模块,还用于:In a possible implementation, the processing module is also used to:
将所述第三数据输入至第二预设模型中进行处理,以得到所述跨节点辅助信息,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第二预设模型。The third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
第五方面,本申请提供了一种数据处理装置,包括处理器和通信接口,所述通信接口用于接收和/或发送数据,和/或,所述通信接口用于为所述处理器提供输出和/或输出,所述处理器用于调用计算机指令,以实现如第一方面任一种可能的实施方式提供的方法,和/或,实现如第二方面任一种可能的实施方式提供的方法。In a fifth aspect, the application provides a data processing device, including a processor and a communication interface. The communication interface is used to receive and/or send data, and/or the communication interface is used to provide the processor with Output and/or output, the processor is used to call computer instructions to implement the method provided by any possible implementation manner of the first aspect, and/or to implement the method provided by any possible implementation manner of the second aspect method.
第六方面,本申请提供了一种数据处理系统,所述系统包括服务器或基站,还包括第一节点,其中:In a sixth aspect, this application provides a data processing system. The system includes a server or a base station, and also includes a first node, wherein:
所述服务器或基站用于实现如第二方面任一种可能的实施方式提供的方法;所述第一节点用于实现如第一方面任一种可能的实施方式提供的方法。The server or base station is configured to implement the method provided in any possible implementation manner of the second aspect; and the first node is configured to implement the method provided in any possible implementation manner of the first aspect.
第七方面,本申请提供了一种计算机存储介质,包括计算机指令,当所述计算机指令在电子设备上运行时,使得所述电子设备执行如第一方面任一种可能的实施方式和/或第二方面任一种可能的实施方式提供的方法。In a seventh aspect, the present application provides a computer storage medium, including computer instructions. When the computer instructions are run on an electronic device, the electronic device causes the electronic device to execute any possible implementation manner and/or as in the first aspect. The method provided by any possible implementation of the second aspect.
第八方面,本申请实施例提供一种计算机程序产品,当计算机程序产品在计算机上运行时,使得计算机执行如第一方面任一种可能的实施方式和/或第二方面任一种可能的实施方式提供的方法。In an eighth aspect, embodiments of the present application provide a computer program product. When the computer program product is run on a computer, it causes the computer to execute any possible implementation manner of the first aspect and/or any possible implementation method of the second aspect. Methods provided by the embodiments.
可以理解地,上述提供的第三方面所述的装置、第四方面所述的装置、第五方面所述的装置、第六方面所述的系统、第七方面所述的计算机存储介质或者第八方面所述的计算机程序产品均用于执行第一方面中任一所提供的方法以及第二方面中任一所提供的方法。因此,其所能达到的有益效果可参考对应方法中的有益效果,此处不再赘述。It can be understood that the above-mentioned device described in the third aspect, the device described in the fourth aspect, the device described in the fifth aspect, the system described in the sixth aspect, the computer storage medium described in the seventh aspect or the third aspect The computer program products described in the eight aspects are all used to execute the method provided in any one of the first aspects and the method provided in any one of the second aspects. Therefore, the beneficial effects it can achieve can be referred to the beneficial effects in the corresponding methods, and will not be described again here.
附图说明Description of drawings
下面对本申请实施例用到的附图进行介绍。The drawings used in the embodiments of this application are introduced below.
图1是本申请实施例提供的一种数据处理系统的架构示意图;Figure 1 is a schematic architectural diagram of a data processing system provided by an embodiment of the present application;
图2是本申请实施例提供的一种数据处理方法的流程示意图;Figure 2 is a schematic flowchart of a data processing method provided by an embodiment of the present application;
图3是本申请实施例提供的一种数据处理方法的示意图;Figure 3 is a schematic diagram of a data processing method provided by an embodiment of the present application;
图4是本申请实施例提供的另一种数据处理方法的示意图;Figure 4 is a schematic diagram of another data processing method provided by an embodiment of the present application;
图5是本申请实施例提供的又一种数据处理方法的示意图;Figure 5 is a schematic diagram of another data processing method provided by the embodiment of the present application;
图6是本申请实施例提供的一种模型训练方法示意图;Figure 6 is a schematic diagram of a model training method provided by an embodiment of the present application;
图7a是本申请实施例提供的一种Attention A的框架结构示意图; Figure 7a is a schematic diagram of the framework structure of Attention A provided by the embodiment of the present application;
图7b是本申请实施例提供的一种Attention B的框架结构示意图; Figure 7b is a schematic diagram of the framework structure of Attention B provided by the embodiment of the present application;
图8a是本申请实施例提供的一种Encoder A1的框架结构示意图; Figure 8a is a schematic diagram of the frame structure of an Encoder A1 provided by an embodiment of the present application;
图8b是本申请实施例提供的一种Encoder B2的框架结构示意图; Figure 8b is a schematic diagram of the frame structure of an Encoder B2 provided by an embodiment of the present application;
图9是本申请实施例提供的一种Encoder B1的框架结构示意图; Figure 9 is a schematic diagram of the frame structure of Encoder B1 provided by the embodiment of the present application;
图10a是本申请实施例提供的另一种Encoder A1的框架结构示意图; Figure 10a is a schematic diagram of the frame structure of another Encoder A1 provided by the embodiment of the present application;
图10b是本申请实施例提供的另一种Encoder B2的框架结构示意图; Figure 10b is a schematic diagram of the frame structure of another Encoder B2 provided by the embodiment of the present application;
图10c是本申请实施例提供的另一种Encoder B1的框架结构示意图; Figure 10c is a schematic diagram of the frame structure of another Encoder B1 provided by the embodiment of the present application;
图11是本申请实施例提供的一种数据处理装置的结构示意图;Figure 11 is a schematic structural diagram of a data processing device provided by an embodiment of the present application;
图12是本申请实施例提供的另一种数据处理装置的结构示意图;Figure 12 is a schematic structural diagram of another data processing device provided by an embodiment of the present application;
图13是本申请实施例提供的又一种数据处理装置的结构示意图。Figure 13 is a schematic structural diagram of yet another data processing device provided by an embodiment of the present application.
具体实施方式Detailed ways
下面结合本申请实施例中的附图对本申请实施例进行描述。本申请实施例的实施方式部分使用的术语仅用于对本申请的具体实施例进行解释,而非旨在限定本申请。The embodiments of the present application will be described below with reference to the drawings in the embodiments of the present application. The terms used in the implementation part of the embodiments of the present application are only used to explain the specific embodiments of the present application and are not intended to limit the present application.
由于现有技术未充分考虑不同节点、模态信源之间的相关性,导致服务器收到的处理数据比较冗余,会带来一定程度传输资源浪费和传输性能损失。有鉴于此,本申请提供一种数据处理方法、装置、系统以及存储介质,能够减少带宽资源消耗,并提升数据传输的鲁棒性。Since the existing technology does not fully consider the correlation between different nodes and modal sources, the processing data received by the server is relatively redundant, which will cause a certain degree of waste of transmission resources and loss of transmission performance. In view of this, this application provides a data processing method, device, system and storage medium, which can reduce bandwidth resource consumption and improve the robustness of data transmission.
以下将结合附图,来详细介绍本申请实施例的系统架构。请参见图1,图1是本申请实施例适用的一种数据处理系统的示意图,该系统包括节点1、节点2、服务端或者基站中的任一种等。The system architecture of the embodiment of the present application will be introduced in detail below with reference to the accompanying drawings. Please refer to Figure 1. Figure 1 is a schematic diagram of a data processing system applicable to the embodiment of the present application. The system includes any one of node 1, node 2, server or base station.
服务端是具有集中计算能力的装置,可以通过服务器、虚拟机、云端或机器人等装置实现。其中,服务器包含但不限于是通用计算机、专用服务器计算机(例如个人计算机、服务器、UNIX服务器、或中端服务器等)、刀片式服务器等。当服务端由服务器实现时,例如图1所示,其所包含的服务器的数量也可以是一个,也可以是多个(如服务器集群)。虚拟机是被虚拟化的计算模块。云端是采用应用程序虚拟化技术的软件平台,能够让一个或者多个软件、应用在独立的虚拟化环境中开发、运行。可选的,云端可以部署在公有云、私有云、或者混合云上等。The server is a device with centralized computing capabilities, which can be implemented through servers, virtual machines, clouds, or robots. Among them, servers include but are not limited to general computers, dedicated server computers (such as personal computers, servers, UNIX servers, or mid-range servers, etc.), blade servers, etc. When the server is implemented by a server, as shown in Figure 1, the number of servers it contains can be one or multiple (such as a server cluster). A virtual machine is a virtualized computing module. The cloud is a software platform that uses application virtualization technology, which allows one or more software and applications to be developed and run in an independent virtualized environment. Optionally, the cloud can be deployed on public cloud, private cloud, or hybrid cloud.
需要说明的是,本申请实施例以服务器为例进行说明,其还可以是服务端的其他任意实现,本方案对此不作限制。It should be noted that the embodiments of this application take a server as an example for explanation, and it can also be any other implementation of the server, and this solution does not limit this.
接入网设备,是指将终端接入到无线网络的无线接入网(radio access network,RAN)节点(或设备),又可以称为基站。目前,一些RAN节点的举例为:继续演进的节点B(gNB)、传输接收点(transmission reception point,TRP)、演进型节点B(evolved Node B,eNB)、无线网络控制器(radio network controller,RNC)、节点B(Node B,NB)、基站控制器(base station controller,BSC)、基站收发台(base transceiver station,BTS)、家庭基站(例如,home evolved NodeB,或home Node B,HNB)、基带单元(base band unit,BBU),或无线保真(wireless fidelity,Wifi)接入点(access point,AP)等。另外,在一种网络结构中,接入网设备可以包括集中单元(centralized unit,CU)节点、或分布单元(distributed unit,DU)节点、或包括CU节点和DU节点的RAN设备。其中包括CU节点和DU节点的RAN设备将NR系统中gnb的协议层拆分开,部分协议层的功能放在CU集中控制,剩下部分或全部协议层的功能分布在DU中,由CU集中控制DU。其中CU的功能可以由一个实体来实现也可以由不同的实体实现。例如,可以对CU的功能进行进一步切分,例如,将控制面(CP)和用户面(UP)分离,即CU的控制面(CU-CP)和CU用户面(CU-UP)。例如,CU-CP和CU-UP可以由不同的功能实体来实现,所述CU-CP和CU-UP可以与DU相耦合,共同完成基站的功能。Access network equipment refers to the radio access network (RAN) node (or equipment) that connects terminals to the wireless network, and can also be called a base station. Currently, some examples of RAN nodes are: evolved Node B (gNB), transmission reception point (TRP), evolved Node B (evolved Node B, eNB), radio network controller, RNC), Node B (Node B, NB), base station controller (BSC), base transceiver station (BTS), home base station (for example, home evolved NodeB, or home Node B, HNB) , baseband unit (base band unit, BBU), or wireless fidelity (wireless fidelity, Wifi) access point (access point, AP), etc. In addition, in a network structure, access network equipment may include centralized unit (CU) nodes, distributed unit (DU) nodes, or RAN equipment including CU nodes and DU nodes. The RAN equipment including CU nodes and DU nodes separates the protocol layer of gnb in the NR system. Some of the protocol layer functions are centralized and controlled by the CU. The remaining part or all of the protocol layer functions are distributed in the DU and centralized by the CU. Control DU. The functions of CU can be implemented by one entity or by different entities. For example, the functions of the CU can be further divided, for example, the control plane (CP) and the user plane (UP) are separated, that is, the control plane (CU-CP) and the CU user plane (CU-UP) of the CU. For example, CU-CP and CU-UP can be implemented by different functional entities, and the CU-CP and CU-UP can be coupled with DU to jointly complete the functions of the base station.
在一种可能的实现方式中,节点1和节点2可以是终端设备。本申请实施例中的终端设备(Terminal Equipment)也可以称为终端、用户设备(User Equipment,UE)、移动台(Mobile Station,MS)、移动终端(Mobile Terminal,MT)等。终端设备可以是手机(mobile phone)、平板电脑(Pad)、带无线收发功能的电脑、虚拟现实(Virtual Reality,VR)终端设备、增强现实(Augmented Reality,AR)终端设备、工业控制(Industrial Control)中的无线终端、无人驾驶(Self Driving)中的无线终端、远程手术(Remote Medical Surgery)中的无线终端、智能电网(Smart Grid)中的无线终端、运输安全(Transportation Safety)中的无线终端、智慧城市(Smart City)中的无线终端、智慧家庭(Smart Home)中的无线终端等等。In a possible implementation, node 1 and node 2 may be terminal devices. The terminal equipment (Terminal Equipment) in the embodiment of this application can also be called a terminal, user equipment (User Equipment, UE), mobile station (Mobile Station, MS), mobile terminal (Mobile Terminal, MT), etc. The terminal device can be a mobile phone (mobile phone), a tablet computer (Pad), a computer with wireless transceiver function, a virtual reality (Virtual Reality, VR) terminal device, an augmented reality (Augmented Reality, AR) terminal device, or an industrial control (Industrial Control) ), wireless terminals in Self Driving, wireless terminals in Remote Medical Surgery, wireless terminals in Smart Grid, wireless terminals in Transportation Safety Terminals, wireless terminals in Smart City, wireless terminals in Smart Home, etc.
在另一种可能的实现方式中,节点1和节点2可以是信源采集单元。例如,节点1和节点2均可以用于采集图像、视频、音频、传感器参数等信号。节点1和节点2还用于对采集的信号(数据)进行编码,并将编码的数据经无线传输信道发送给服务器/基站。In another possible implementation, node 1 and node 2 may be information source collection units. For example, both Node 1 and Node 2 can be used to collect signals such as images, videos, audios, and sensor parameters. Node 1 and node 2 are also used to encode the collected signals (data) and send the encoded data to the server/base station via the wireless transmission channel.
需要说明的是,终端设备和终端设备可能有部分重叠,例如手机可以采集图像、视频、音频等信号,本方案对此不作限制。It should be noted that terminal equipment and terminal equipment may partially overlap. For example, mobile phones can collect images, videos, audio and other signals, and this solution does not limit this.
服务器/基站对接收到的分别来自节点1和节点2的数据进行存储以及融合计算处理,进而执行一个或多个任务。例如基于接收到的视频数据和音频数据执行视听解析任务,具体包括对音频、视觉以及视听事件的检测识别,并判断这些事件哪些是可见的、可听的以及可见且可听的;进一步地,还可以判断每个事件发生的起始时间和终止时间等。The server/base station stores and fuses the data received from node 1 and node 2, and then performs one or more tasks. For example, performing audio-visual analysis tasks based on the received video data and audio data, specifically including the detection and identification of audio, visual and audio-visual events, and determining which of these events are visible, audible, and both visible and audible; further, You can also determine the start time and end time of each event.
需要说明的是,本申请实施例以两个节点为例进行说明,其还可以是三个节点、四个节点等,本方案对此不作严格限制。It should be noted that the embodiment of this application takes two nodes as an example for description. It can also be three nodes, four nodes, etc. This solution does not strictly limit this.
进一步地,本方案可以面向未来的蜂窝标准、Wi-Fi标准,涉及的产品可包括未来蜂窝、Wi-Fi的相关产品,例如可以应用到:手机;接入蜂窝、Wi-Fi的平板/手表/笔记本/其他IoT设备;基站/无线路由器等设备中。Furthermore, this solution can be oriented to future cellular standards and Wi-Fi standards, and the products involved can include future cellular and Wi-Fi related products, for example, it can be applied to: mobile phones; tablets/watches that access cellular and Wi-Fi /Notebooks/Other IoT devices; base stations/wireless routers and other devices.
本申请实施例中,节点通过对采集的初始数据进行压缩编码处理,以得到第一数据;然后根据接收到的来自服务器/基站的跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为第一节点采集的所述初始数据和第二节点采集的初始数据之间的相关信息;并向所述服务器/基站发送所述第二数据。这样该节点在进行数据传输时,不需要传输两节点之间的重复、相关的信息,进而可以减少带宽资源消耗,并提升数据传输的鲁棒性。In the embodiment of this application, the node obtains the first data by compressing and encoding the collected initial data; and then performs de-redundant processing on the first data based on the cross-node auxiliary information received from the server/base station, To obtain the second data, the cross-node auxiliary information is the relevant information between the initial data collected by the first node and the initial data collected by the second node; and send the second data to the server/base station. In this way, when the node transmits data, it does not need to transmit repeated and related information between the two nodes, which can reduce bandwidth resource consumption and improve the robustness of data transmission.
上面说明了本申请实施例的架构,下面对本申请实施例的方法进行详细介绍。The architecture of the embodiment of the present application is described above, and the method of the embodiment of the present application is introduced in detail below.
参照图2所示,是本申请实施例提供的一种数据处理方法的流程示意图。可选的,该方法可以应用于前述的数据处理系统,例如图1所示的数据处理系统。如图2所示的数据处理方法可以包括步骤201-211。应理解,本申请为了方便描述,故通过201-211这一顺序进行描述,并不旨在限定一定通过上述顺序进行执行。本申请实施例对于上述一个或多个步骤的执行的先后顺序、执行的时间、执行的次数等不做限定。下文以数据处理方法的步骤201-202的执行主体为第二节点(如信源采集单元)、203-205、210、211的执行主体为服务器、206-209的执行主体为第一节点(如信源采集单元)为例进行描述,对于其他执行主体本申请同样也适用。步骤201-211具体如下:Refer to FIG. 2 , which is a schematic flow chart of a data processing method provided by an embodiment of the present application. Optionally, this method can be applied to the aforementioned data processing system, such as the data processing system shown in Figure 1 . The data processing method shown in Figure 2 may include steps 201-211. It should be understood that, for convenience of description, this application is described through the sequence 201-211, and is not intended to limit the execution to the above sequence. The embodiments of the present application do not limit the execution sequence, execution time, number of executions, etc. of one or more of the above steps. In the following, the execution subject of steps 201-202 of the data processing method is the second node (such as the information source collection unit), the execution subject of steps 203-205, 210, and 211 is the server, and the execution subject of steps 206-209 is the first node (such as The information source acquisition unit) is described as an example, and this application is also applicable to other execution entities. Steps 201-211 are as follows:
201、第二节点对采集的第二初始数据进行压缩编码处理,以得到第三数据;201. The second node performs compression encoding on the collected second initial data to obtain the third data;
其中,该压缩编码处理的编码方法可以分为三类:(1)根据信息源的统计特性,采用预测编码、变换编码、矢量量化编码、子带编码、神经网络编码等方法。(2)根据人眼视觉特性,采用基于方向滤波的图像编码、基于图像轮廓一伦理编码,基于小波分析的编码等方法。(3)根据传递景物特征:采用分形编码、基于模型的编码等方法。Among them, the coding methods of the compression coding process can be divided into three categories: (1) According to the statistical characteristics of the information source, methods such as predictive coding, transform coding, vector quantization coding, sub-band coding, and neural network coding are used. (2) According to the visual characteristics of the human eye, image coding based on directional filtering, image contour-ethical coding based on image, coding based on wavelet analysis and other methods are used. (3) According to the characteristics of the transferred scene: use methods such as fractal coding and model-based coding.
202、向服务器发送所述第三数据;202. Send the third data to the server;
第二节点可通过无线传输信道向服务器发送该第三数据。The second node may send the third data to the server through a wireless transmission channel.
203、服务器接收来自所述第二节点的第三数据;203. The server receives the third data from the second node;
204、对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;204. Process the third data to obtain cross-node auxiliary information. The cross-node auxiliary information is related to the first initial data collected by the first node and the second initial data collected by the second node. Information;
该跨节点辅助信息,可以理解为,与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息。The cross-node auxiliary information can be understood as information related to the first initial data collected by the first node and the second initial data collected by the second node.
该相关的信息,可以理解为,所述第一初始数据和所述第二初始数据中取值相同的重复信息,或者有关联的信息。The relevant information can be understood as repeated information with the same value in the first initial data and the second initial data, or related information.
可选的,节点1和节点2采集的数据中的某一部分相同,或者是具有相关关系。例如,节点1采集的是环境1的音频信息,节点2采集的是该环境1的视频信息。该相关信息即为节点1和节点2采集的数据均对应同一环境。再如,节点1采集的是白天的天空信息,节点2采集的是夜晚的天空信息。该相关的信息即为节点1和节点2采集的数据均对应天空。Optionally, a certain part of the data collected by node 1 and node 2 is the same, or is related. For example, node 1 collects the audio information of environment 1, and node 2 collects the video information of environment 1. This relevant information means that the data collected by node 1 and node 2 both correspond to the same environment. For another example, node 1 collects sky information during the day, and node 2 collects sky information at night. The relevant information is that the data collected by node 1 and node 2 both correspond to the sky.
在一种可能的实现方式中,通过将所述第三数据输入至预设模型中进行处理,以得到跨节点辅助信息。In a possible implementation manner, cross-node auxiliary information is obtained by inputting the third data into a preset model for processing.
例如,该预设模型是训练好的模型。通过进行神经网络训练,进而得到可以获取跨节点辅助信息的模型。For example, the preset model is a trained model. Through neural network training, a model that can obtain cross-node auxiliary information is obtained.
在一种可能的实现方式中,当系统的接收信噪比变化值超出阈值时,触发训练所述预设模型。通过不断的训练该模型,使得该模型性能更好。In a possible implementation, when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the preset model is triggered. By continuously training the model, the model performs better.
针对该模型介绍可参阅后续实施例中编码器Encoder A2的记载,在此不再赘述。 For an introduction to this model, please refer to the records of the encoder Encoder A2 in subsequent embodiments, and will not be described again here.
205、向第一节点发送所述跨节点辅助信息;205. Send the cross-node auxiliary information to the first node;
服务器将提取的第一节点和第二节点之间的跨节点辅助信息发送给第一节点,以便第一节点进行去冗余处理。The server sends the extracted cross-node auxiliary information between the first node and the second node to the first node, so that the first node can perform redundancy processing.
其中,服务器通过对来自第二节点的第三数据进行处理,得到第二节点和第一节点之间的跨节点辅助信息,然后将该跨节点辅助信息发送给第一节点。这样有助于第一节点根据接收到的跨节点辅助信息对第一数据进行去冗余处理,进而有助于服务器最终接收到的来自第一节点的数据是去冗余的数据。采用该手段,可以降低节点的数据传输量,进而减少带宽资源消耗,并提升了传输鲁棒性,同时也提高了服务器的处理效率。The server processes the third data from the second node to obtain cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node. This helps the first node to perform de-redundant processing on the first data based on the received cross-node auxiliary information, which further helps the server to finally receive data from the first node as de-redundant data. Using this method can reduce the data transmission volume of nodes, thereby reducing bandwidth resource consumption, improving transmission robustness, and improving server processing efficiency.
206、第一节点对采集的第一初始数据进行压缩编码处理,以得到第一数据;206. The first node performs compression encoding on the collected first initial data to obtain the first data;
该第一数据即为将初始数据进行压缩编码后得到的数据。The first data is data obtained by compressing and encoding the initial data.
在一种可能的实现方式中,步骤206的执行顺序可以是在步骤202-205之前,本方案对此不作严格限制。In a possible implementation, the execution order of step 206 may be before steps 202-205, which is not strictly limited in this solution.
在一种可能的实现方式中,在步骤201、206之前,所述方法还包括:In a possible implementation, before steps 201 and 206, the method further includes:
服务器向第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;The server sends first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
相应地,各节点分别接收服务器发送的指示信息。也即,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。Correspondingly, each node receives the instruction information sent by the server respectively. That is, the first initial data collected by the first node is data of the first modality, and the second initial data collected by the second node is data of the second modality.
可选的,上述第一模态的数据或者第二模态的数据,例如可以是音频信号、视频信号、图像信号或环境监测传感器信号等中的一种或多种信号。Optionally, the data in the first mode or the data in the second mode may be, for example, one or more of audio signals, video signals, image signals, or environmental monitoring sensor signals.
其中,服务器可以是直接发送上述指示信息,或者是基于各节点发送的请求进而发送指示信息,本方案对此不作严格限制。Among them, the server can directly send the above instruction information, or send the instruction information based on the request sent by each node. This solution does not impose strict restrictions on this.
在一种可能的实现方式中,第一节点向服务器发送请求,以指示服务器进行初始配置。In a possible implementation, the first node sends a request to the server to instruct the server to perform initial configuration.
在另一种可能的实现方式中,服务器可以是直接进行初始配置。In another possible implementation, the server may perform initial configuration directly.
其中,该初始配置可包括上述第一指示信息,以指示第一节点采集第一模态的数据。Wherein, the initial configuration may include the above-mentioned first instruction information to instruct the first node to collect data of the first mode.
本申请实施例仅以初始配置包括指示第一节点采集第一模态的数据为例进行介绍,该初始配置还可以包括其他信息。The embodiment of this application only introduces the initial configuration including instructing the first node to collect data of the first mode as an example. The initial configuration may also include other information.
在一种可能的实现方式中,初始配置可包括:In one possible implementation, initial configuration may include:
a)通报压缩参数,包括模态标识(可以指示不同的模态,实现单/多模态的处理)、最后 一层网络输出维度或长度。a) Notify the compression parameters, including modal identification (which can indicate different modalities to achieve single/multi-modal processing), and the final layer network output dimension or length.
b)特征传输数据类型指示,即各部分特征传输时数据码流的具体格式,如直接发送网络的输出参数(如2个相邻实数取值组成1个复数符号,得到一串复数符号),或者经过量化后的取值(需信道编码和调制后得到一串复数符号)。b) Feature transmission data type indication, that is, the specific format of the data code stream when each part of the feature is transmitted, such as directly sending the output parameters of the network (such as two adjacent real numbers forming a complex symbol, resulting in a string of complex symbols), Or the value after quantization (requires channel coding and modulation to obtain a series of complex symbols).
c)指示时间戳信息,可包括指定最大时间差(超时即丢弃对应数据包,取值为0表示不设置)、周期信息(0表示不设置周期,大于0表示设定周期)等。c) Indicate timestamp information, which can include specifying the maximum time difference (the corresponding data packet will be discarded when timeout occurs, a value of 0 means no setting), period information (0 means no period is set, greater than 0 means setting the period), etc.
d)配置的信令可以是如下格式(可以是节点主动设定后向服务器/基站请求,也可以是服务器/基站直接进行配置):d) The configured signaling can be in the following format (the node can actively set it and then request the server/base station, or the server/base station can configure it directly):
模态标识modal identifier 输出维度或长度Output dimensions or length 数据类型type of data 最大时间差maximum time difference 周期信息Period information 量化比特位宽Quantization bit width
可选的,量化比特位宽仅在数据类型为“经过量化后的取值”时需要发送。上述参数可以在一个信令中发送,也可以封装在不同的信令中,在不同的时刻发送,本方案对此不作限制。Optionally, the quantization bit width only needs to be sent when the data type is "quantized value". The above parameters can be sent in one signaling, or they can be encapsulated in different signaling and sent at different times. This solution does not limit this.
经过上述配置,可以通过输出维度或长度以及数据类型计算得到单次推理所需的传输资源数,例如可以对应到符号数。After the above configuration, the number of transmission resources required for a single inference can be calculated through the output dimension or length and data type, which can correspond to the number of symbols, for example.
例如,数据类型为“直接发送网络的输出参数”,传输资源数=输出维度或长度/2。再如,数据类型为“经过量化后的取值”,传输资源数=输出维度或长度*量化比特位宽,或者传输资源数=输出维度或长度*信道码率,或者传输资源数=输出维度或长度*调制比特数等。For example, the data type is "directly sending the output parameters of the network", and the number of transmission resources = output dimension or length/2. For another example, if the data type is "quantized value", the number of transmission resources = output dimension or length * quantization bit width, or the number of transmission resources = output dimension or length * channel code rate, or the number of transmission resources = output dimension Or length*number of modulation bits, etc.
其中,上述b)~d)的通用特征传输格式为:Among them, the general feature transmission format of the above b) ~ d) is:
时间戳码流Timestamp code stream 特征数据长度Characteristic data length 特征数据码流Characteristic data stream
对于时间戳码流,通过增加时间戳用于指示当前发送特征数据的时间,便于多个节点之间同步数据。例如可以记录绝对时间取值t(时间间隔可以是一个或多个时隙、子帧、符号等,不具体限定;可由发送端与接收端通过协商确定t与t–1时刻间隔长度,或者可以通过预定义或预配置等方式确定间隔长度),或者设定一定的周期T0(T0>0),只记录mod(t,T0),mod表示求余运算。For the timestamp code stream, adding a timestamp is used to indicate the current time of sending characteristic data, which facilitates data synchronization between multiple nodes. For example, the absolute time value t can be recorded (the time interval can be one or more time slots, subframes, symbols, etc., and is not specifically limited; the length of the interval between t and t-1 can be determined by the sending end and the receiving end through negotiation, or it can Determine the interval length by predefinition or preconfiguration), or set a certain period T0 (T0>0), and only record mod(t, T0), mod represents the remainder operation.
还可以将时间戳变换为二进制数据后进行信道编码和调制得到复数信号,例如进行低码率信道编码和低阶调制,以保证可靠传输。The timestamp can also be converted into binary data and then channel coded and modulated to obtain a complex signal. For example, low code rate channel coding and low-order modulation can be performed to ensure reliable transmission.
对于特征数据码流,根据初始配置的压缩参数和数据类型,经过处理后的待发送码流(一串复数符号),其长度与传输资源数一致。可选的,可以在特征数据码流之前加入特征数据长度字段(可以与时间戳一起进行信道编码和调制),指示码流实际长度。For the characteristic data code stream, according to the initially configured compression parameters and data type, the length of the processed code stream to be sent (a string of complex symbols) is consistent with the number of transmission resources. Optionally, a characteristic data length field (which can be channel coded and modulated together with the timestamp) can be added before the characteristic data code stream to indicate the actual length of the code stream.
对于上述相应码流可以通过PUSCH(适用于节点1、2→服务器/基站)或PDSCH(适用于服务器/基站→节点1、2)进行传输,此外,也可以经过MAC组包后在物理层发送。The above corresponding code streams can be transmitted through PUSCH (applicable to nodes 1 and 2 → server/base station) or PDSCH (applicable to server/base station → nodes 1 and 2). In addition, they can also be sent at the physical layer after MAC packetization. .
其中,如果服务器/基站或节点2收到的数据时间戳与当前时间超过初始配置的最大时间差,则丢弃对应数据包,不再进行后续发送操作。Among them, if the data timestamp received by the server/base station or node 2 and the current time exceed the initially configured maximum time difference, the corresponding data packet will be discarded and no subsequent sending operation will be performed.
207、接收来自服务器的跨节点辅助信息;207. Receive cross-node auxiliary information from the server;
208、根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据;208. Perform redundancy processing on the first data according to the cross-node auxiliary information to obtain the second data;
该去冗余处理,例如可以是将第一数据中的跨节点辅助信息去除,或者是将第一数据中与跨节点辅助信息有关联的信息去除,进而得到去冗余后的第二数据。也即,第二数据与第一数据没有重复信息,或者说没有关联信息。The redundancy removal process may include, for example, removing cross-node auxiliary information from the first data, or removing information related to the cross-node auxiliary information from the first data, thereby obtaining second data after redundancy removal. That is to say, the second data and the first data have no duplicate information, or in other words, no related information.
在一种可能的实现方式中,节点将接收到的来自服务器的跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据。In a possible implementation, the node inputs the received cross-node auxiliary information from the server and the first data into a first preset model for processing to obtain the second data.
该第一预设模型是经过训练得到的。其中当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。The first preset model is obtained after training. When the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the first preset model is triggered.
针对该模型介绍可参阅后续实施例中编码器Encoder B2、Attention B的记载,在此不再赘述。 For an introduction to this model, please refer to the records of the encoder Encoder B2 and Attention B in subsequent embodiments, and will not be described again here.
209、向所述服务器发送所述第二数据;209. Send the second data to the server;
进而第一节点将该第二数据通过无线传输信道发送给服务器,以便服务器进行相应任务的处理等。Then, the first node sends the second data to the server through the wireless transmission channel, so that the server can process the corresponding task.
其中,第一节点基于接收到的来自服务器或基站的跨节点辅助信息对第一数据进行去冗余处理,将与第一节点采集的第一初始数据和第二节点采集的第二初始数据相关的信息去除,得到第二数据,并向所述服务器发送该第二数据。采用该手段,通过将与第一节点采集的第一初始数据和第二节点采集的第二初始数据相关的信息去除,然后向服务器发送该去冗余的数据,这样可以降低该节点的数据传输量,进而减少带宽资源消耗,并提升了传输鲁棒性。Wherein, the first node performs de-redundant processing on the first data based on the received cross-node auxiliary information from the server or base station, and correlates the first initial data collected by the first node and the second initial data collected by the second node. The information is removed to obtain second data, and the second data is sent to the server. Using this method, by removing the information related to the first initial data collected by the first node and the second initial data collected by the second node, and then sending the de-redundant data to the server, the data transmission of the node can be reduced. amount, thereby reducing bandwidth resource consumption and improving transmission robustness.
210、服务器接收来自所述第一节点的第二数据;210. The server receives the second data from the first node;
211、对所述第二数据和所述第三数据进行融合处理。211. Perform fusion processing on the second data and the third data.
其中,服务器从第二节点发送的第三数据中获取到跨节点辅助信息,将跨节点辅助信息发送给第一节点,以便第一节点进行去冗余处理,得到第二数据,然后服务器将来自第一节点的第二数据和来自第二节点的第三数据进行融合处理,并执行相应任务。也即,服务器最终进行融合处理的数据是去冗余的数据,没有重复或相关信息。这样不仅可以减少第一节点传输数据的带宽资源,还提高了服务器的处理效率。Among them, the server obtains the cross-node auxiliary information from the third data sent by the second node, and sends the cross-node auxiliary information to the first node so that the first node can perform de-redundancy processing and obtain the second data, and then the server will send the cross-node auxiliary information from The second data from the first node and the third data from the second node are fused and processed, and corresponding tasks are performed. That is to say, the data finally processed by the server for fusion is deredundant data without duplicate or related information. This not only reduces the bandwidth resources used by the first node to transmit data, but also improves the processing efficiency of the server.
本申请实施例,服务器通过对来自第二节点的第三数据进行处理,得到第二节点和第一节点之间的跨节点辅助信息,然后将该跨节点辅助信息发送给第一节点。第一节点通过根据接收到的跨节点辅助信息对第一数据进行去冗余处理,将第一节点和第二节点之间的相关信息去除,得到第二数据,并向所述服务器发送该第二数据,进而有助于服务器最终接收到的来自第一节点的数据是去冗余的数据。采用该手段,第一节点通过将其与第二节点之间的相关信息去除,然后向服务器发送该去冗余的数据,这样可以降低该节点的数据传输量,进而减少带宽资源消耗,并提升了传输鲁棒性,同时也提高了服务器的处理效率。In this embodiment of the present application, the server processes the third data from the second node to obtain the cross-node auxiliary information between the second node and the first node, and then sends the cross-node auxiliary information to the first node. The first node performs de-redundant processing on the first data according to the received cross-node auxiliary information, removes the relevant information between the first node and the second node, obtains the second data, and sends the second data to the server. The second data, in turn, helps the server to finally receive the data from the first node as deredundant data. Using this method, the first node removes the relevant information between it and the second node, and then sends the de-redundant data to the server. This can reduce the data transmission volume of the node, thereby reducing bandwidth resource consumption and improving It improves transmission robustness and improves server processing efficiency.
参照图3所示,是本申请实施例提供的一种数据处理方法的示意图。可选的,该方法可以应用于前述的数据处理系统,例如图1所示的数据处理系统。Refer to FIG. 3 , which is a schematic diagram of a data processing method provided by an embodiment of the present application. Optionally, this method can be applied to the aforementioned data processing system, such as the data processing system shown in Figure 1 .
在一种可能的实现方式中,如图3所示的分布式单/多模态信号传输的场景,其中节点1(例如可以是前述第二节点)将特定模态的数据或特征经过编码器Encoder A1进行压缩和编码后通过无线信道发送给服务器/基站。 In a possible implementation, in the distributed single/multi-modal signal transmission scenario as shown in Figure 3, node 1 (for example, it can be the aforementioned second node) passes the data or features of the specific modality through the encoder. Encoder A1 compresses and encodes the data and sends it to the server/base station through the wireless channel.
服务器/基站接收到节点1发送的数据后,基于编码器Encoder A2获取该数据中的跨节点辅助信息,其中该跨节点辅助信息可以是节点1发送的数据中的部分数据,还可以是全部数据。然后,服务器/基站将该跨节点辅助信息发送至节点2(例如可以是前述第一节点)。节点2将特定模态的数据或特征经过编码器Encoder B1进行压缩和编码,并将接收到的跨节点辅助信息经解码器Decoder A2进行解码,然后将Encoder B1编码后的数据与Decoder A2解码后的跨节点辅助信息进行注意力Attention机制操作,对特征编码进行引导,即进行图3中所示的特征提取,得到去冗余处理后的数据,经过编码器Encoder B2进行编码,然后将得到的数据发送 给服务器/基站。 After the server/base station receives the data sent by Node 1, it obtains the cross-node auxiliary information in the data based on the encoder Encoder A2 . The cross-node auxiliary information can be part of the data sent by Node 1, or it can be all the data. . Then, the server/base station sends the cross-node assistance information to node 2 (which may be the aforementioned first node, for example). Node 2 compresses and encodes the data or features of the specific modality through the encoder Encoder B1 , decodes the received cross-node auxiliary information through the decoder A2 , and then combines the data encoded by Encoder B1 with the decoded data by Decoder A2 . The cross-node auxiliary information is operated by the attention mechanism to guide the feature encoding, that is, the feature extraction shown in Figure 3 is performed to obtain the deredundant processed data, which is encoded by the encoder Encoder B2 , and then the obtained The data is sent to the server/base station.
服务器/基站还对节点1发送的数据经过解码器Decoder A1解码,并对该解码后的数据进行注意力Attention机制操作,以便更好的还原节点1发送的数据。且,服务器/基站对接收到的来自节点2的数据进行解码,然后对该两节点的数据进行融合处理,以执行相应任务。 The server/base station also decodes the data sent by Node 1 through Decoder A1 , and performs an attention mechanism operation on the decoded data to better restore the data sent by Node 1. Moreover, the server/base station decodes the received data from node 2, and then fuses the data of the two nodes to perform corresponding tasks.
经过上述处理后,相较于现有技术中将两节点的数据直接进行传输的方式,本方案中其中一个节点仅传输与另一节点的数据不存在相关性的数据,这样可以减少特征传输量和带宽资源占用,进一步在总带宽占用相近的情况下,可以提升最终任务执行的精度。After the above processing, compared with the current method of directly transmitting data from two nodes, in this solution one node only transmits data that is not relevant to the data of the other node, which can reduce the amount of feature transmission. and bandwidth resource occupancy. Further, when the total bandwidth occupancy is similar, the accuracy of the final task execution can be improved.
参照图4所示,是本申请实施例提供的另一种数据处理方法的示意图。该示例与图3所示示例不同的地方在于,图4所示示例中节点1(例如可以是前述第二节点)采集和传输维度较小的模态数据(例如音频、环境监测传感器信号),节点2(例如可以是前述第一节点)采集和传输维度较大的模态数据(例如图像、视频信号)。其中,节点1采集的数据可压缩至较少符号传输,因此仅占用较少的带宽资源。服务器/基站接收到低维度模态数据,然后将其中的部分或全部数据发送至节点2,例如高维度模态数据采集终端。节点2将服务器/基站发送的数据与该节点2提取的高维度模态数据进行Attention操作,对特征编码进行引导,从而有效减少高维度模态数据的传输量和带宽资源占用。Refer to FIG. 4 , which is a schematic diagram of another data processing method provided by an embodiment of the present application. The difference between this example and the example shown in Figure 3 is that in the example shown in Figure 4, node 1 (for example, it can be the aforementioned second node) collects and transmits modal data with smaller dimensions (such as audio, environmental monitoring sensor signals), Node 2 (for example, it may be the aforementioned first node) collects and transmits larger-dimensional modal data (for example, images, video signals). Among them, the data collected by node 1 can be compressed into fewer symbols for transmission, so it only takes up less bandwidth resources. The server/base station receives the low-dimensional modal data, and then sends part or all of the data to node 2, such as a high-dimensional modal data collection terminal. Node 2 performs attention operations on the data sent by the server/base station and the high-dimensional modal data extracted by the node 2 to guide feature encoding, thereby effectively reducing the transmission volume and bandwidth resource occupation of high-dimensional modal data.
基于前述图2、图3和图4所示数据处理方法,本申请实施例还提供一种数据处理方法。其中,如图5所示,该方法包括:首先进行初始配置。该初始配置,例如是对压缩参数、传输数据类型、资源分配、时间戳信息等进行配置。针对该初始配置的介绍可参阅前述实施例中的记载,在此不再赘述。Based on the data processing methods shown in Figure 2, Figure 3 and Figure 4, embodiments of the present application also provide a data processing method. Among them, as shown in Figure 5, the method includes: first performing initial configuration. The initial configuration, for example, configures compression parameters, transmission data types, resource allocation, timestamp information, etc. For an introduction to this initial configuration, please refer to the records in the foregoing embodiments and will not be described again here.
然后,节点1将采集的数据/特征压缩至较小的维度,经过噪声信道传输至远端服务器/基站。远端服务器/基站接收节点1发送的特征,通过获取跨节点辅助信息,然后将跨节点辅助信息传输给节点2。Then, Node 1 compresses the collected data/features into smaller dimensions and transmits them to the remote server/base station through the noise channel. The remote server/base station receives the characteristics sent by node 1, obtains the cross-node auxiliary information, and then transmits the cross-node auxiliary information to node 2.
节点2对采集的数据/特征进行压缩编码,并对接收的跨节点辅助信息进行解码,然后将两者进行注意力计算,以便提取出与节点1没有关联的特征,并将该计算后的节点2特征传输至远端服务器/基站进行进一步信息融合与计算。Node 2 compresses and encodes the collected data/features, decodes the received cross-node auxiliary information, and then performs attention calculation on the two to extract features that are not related to node 1, and converts the calculated node 2 Features are transmitted to the remote server/base station for further information fusion and calculation.
在一种可能的实现方式中,对于节点1,其基于Encoder A1的网络输出(实数),根据配置的数据类型进行处理后得到特征数据(复数信号),通过将特征数据和经过信道编码、调制后的时间戳码流、特征数据长度拼接后发送给服务器/基站。 In a possible implementation, for node 1, based on the network output (real number) of Encoder A1 , the characteristic data (complex signal) is obtained after processing according to the configured data type. By combining the characteristic data and channel coding and modulation The final timestamp code stream and characteristic data length are concatenated and sent to the server/base station.
对于服务器/基站,其基于Encoder A2的网络输出(实数),根据配置的数据类型进行处理后得到特征数据(复数信号);将特征数据和经过信道编码、调制后的时间戳码流、特征数据长度拼接后发送给节点2。 For the server/base station, it is based on the network output (real number) of Encoder A2 , which is processed according to the configured data type to obtain characteristic data (complex signal); the characteristic data and the timestamp code stream and characteristic data after channel coding and modulation are The length is concatenated and sent to node 2.
对于节点2,其基于Encoder B2的网络输出(实数),根据配置的数据类型进行处理后得到特征数据(复数信号);将特征数据和经过信道编码、调制后的时间戳码流、特征数据长度拼接后发送给服务器/基站。 For node 2, based on the network output (real number) of Encoder B2 , the characteristic data (complex signal) is obtained after processing according to the configured data type; the characteristic data is combined with the timestamp code stream and characteristic data length after channel coding and modulation. After splicing, it is sent to the server/base station.
针对上述数据处理方法中对应的各个编码器、解码器,例如图3所示编码器Encoder A1、Encoder A2、Encoder B1、Encoder B2,解码器Decoder A1、Decoder A2、Decoder B2,其可以分别对应有神经网络模型。需要说明的是,本申请实施例对于模型的个数和种类等不作限制。如图6所示,本申请实施例还提供一种模型训练方法。该训练过程包括:初始配置、定期发送导 频序列监测接收信噪比(Signal-to-Noise Ratio,SNR)变化以及模型的训练/更新。其包括如下步骤: For each corresponding encoder and decoder in the above data processing method, for example, the encoders Encoder A1 , Encoder A2 , Encoder B1 , Encoder B2 shown in Figure 3, and the decoders Decoder A1 , Decoder A2 , and Decoder B2 can respectively correspond to Neural network model. It should be noted that the embodiments of the present application do not limit the number and types of models. As shown in Figure 6, this embodiment of the present application also provides a model training method. The training process includes: initial configuration, regularly sending pilot sequences to monitor changes in received signal-to-noise ratio (SNR), and model training/updating. It includes the following steps:
1.初始配置。其主要可以是对各模型的压缩参数、传输数据类型、资源分配、时间戳信息、训练参数等进行配置。1. Initial configuration. It can mainly configure the compression parameters, transmission data types, resource allocation, timestamp information, training parameters, etc. of each model.
对于初始配置,例如节点向服务器/基站请求,或者服务器/基站直接配置,本方案对此不作严格限制。For initial configuration, such as node requesting from the server/base station, or direct configuration by the server/base station, this solution does not place strict restrictions on this.
该配置可包括:This configuration can include:
a)通报压缩参数,包括模态标识(可以指示不同的模态,实现单/多模态的处理)、最后一层网络输出维度或长度、反向梯度维度或长度;a) Notify the compression parameters, including modal identification (which can indicate different modalities and achieve single/multi-modal processing), the final layer network output dimension or length, and the reverse gradient dimension or length;
b)特征传输数据类型指示,即各部分特征传输时数据码流的具体格式,如直接发送网络的输出参数(2个相邻实数取值组成1个复数符号),或者经过量化后的取值(需信道编码和调制后发送);b) Feature transmission data type indication, that is, the specific format of the data stream when each part of the feature is transmitted, such as directly sending the output parameters of the network (two adjacent real values form a complex symbol), or quantized values (requires channel coding and modulation before sending);
c)指示训练时间戳信息,主要是指定最大时间差(超时即丢弃对应数据包,取值为0表示不设置)、周期信息(0表示不设置周期,大于0表示设定周期)等;c) Indicate training timestamp information, mainly specifying the maximum time difference (the corresponding data packet will be discarded when timeout occurs, a value of 0 means no setting), period information (0 means no period is set, greater than 0 means setting the period), etc.;
d)指示训练参数,包括常规的模型训练参数(如:更新率、批次大小/数量、最大迭代次数、损失函数阈值等)以及监测模型更新的SNR变化阈值和监测周期;d) Indicate training parameters, including conventional model training parameters (such as update rate, batch size/number, maximum number of iterations, loss function threshold, etc.) as well as SNR change threshold and monitoring period for monitoring model updates;
e)配置的信令可以是如下格式(可以是节点主动设定后向服务器/基站请求,也可以是服务器/基站直接进行配置):e) The configured signaling can be in the following format (the node can actively set it and then request it from the server/base station, or the server/base station can configure it directly):
Figure PCTCN2022108901-appb-000001
Figure PCTCN2022108901-appb-000001
f)量化比特位宽仅在数据类型为“经过量化后的取值”时需要发送;f) The quantization bit width only needs to be sent when the data type is "quantized value";
g)如上参数可以在一个信令中发送,也可以封装在不同的信令中,在不同的时刻发送;g) The above parameters can be sent in one signaling, or they can be encapsulated in different signaling and sent at different times;
h)经过上述配置,可以通过输出维度或长度以及数据类型计算得到训练单个批次所需的传输资源数:h) After the above configuration, the number of transmission resources required to train a single batch can be calculated through the output dimension or length and data type:
例如,数据类型为“直接发送网络的输出参数”:传输资源数=正反向长度和/2;For example, the data type is "directly sending the output parameters of the network": number of transmission resources = sum of forward and reverse lengths/2;
再如,数据类型为“经过量化后的取值”:传输资源数=正反向长度和*量化比特位宽/信道码率/调制比特数;For another example, the data type is "quantized value": number of transmission resources = forward and reverse length and * quantization bit width/channel code rate/number of modulation bits;
其中:正反向长度和=输出维度或长度*批次大小+梯度维度或长度。Among them: the sum of forward and reverse lengths = output dimension or length * batch size + gradient dimension or length.
2.各节点分别发送导频序列至服务器/基站,服务器/基站发送导频序列至节点2,估算相关传输链路的SNR。这样处理的目的是:在不同SNR下,传输数据的压缩率或编码码率会成为影响服务器或基站执行任务精度的一个重要因素,因此可以通过计算得到的SNR来调整传输数据的压缩率或编码码率;此外可以根据不同SNR选择不同的模型进行在线推理。2. Each node sends a pilot sequence to the server/base station respectively, and the server/base station sends a pilot sequence to node 2 to estimate the SNR of the relevant transmission link. The purpose of this processing is: under different SNR, the compression rate or coding rate of the transmitted data will become an important factor affecting the accuracy of the server or base station's task execution. Therefore, the compression rate or coding rate of the transmitted data can be adjusted by the calculated SNR. Code rate; in addition, different models can be selected for online inference based on different SNR.
3.模型更新的监测机制。监测根据步骤2中估计出的SNR,当监测到SNR变化过大时,及时触发开启更新训练模型。3. Monitoring mechanism for model updates. Monitor the SNR estimated in step 2. When the SNR changes too much, trigger and update the training model in time.
例如当系统的接收SNR变化值超过3dB、6dB等时,触发模型训练。当然还可以是其他数值,本方案对此不作严格限制。For example, when the system's receiving SNR change value exceeds 3dB, 6dB, etc., model training is triggered. Of course, it can also be other values, and this solution does not impose strict restrictions on this.
对于步骤2和步骤3,可复用现有导频序列,根据初始配置的监测周期,进行定期的检测。For steps 2 and 3, the existing pilot sequence can be reused and regular detection can be performed according to the initially configured monitoring period.
在一种可能的实现方式中,通过记录初始SNR估计值SNR0和当前时间SNR估计值 SNR1,如果|SNR1–SNR0|≥设定的SNR变化阈值,则触发进行模型的更新,并更新SNR0=SNR1。In one possible implementation, by recording the initial SNR estimate value SNR0 and the current time SNR estimate value SNR1, if |SNR1–SNR0| ≥ the set SNR change threshold, an update of the model is triggered, and SNR0=SNR1 is updated. .
4.服务器/基站发送训练请求或指令给节点1和节点2,各节点接收到指令后进行数据准备,完成数据同步后开始训练,训练结束后发送训练完成的指示。4. The server/base station sends training requests or instructions to node 1 and node 2. Each node prepares data after receiving the instructions, starts training after completing data synchronization, and sends a training completion instruction after the training is completed.
对于模型训练/更新,其中:For model training/updating, where:
a)服务器/基站发送训练请求/指令后,进行数据集的同步。a) After the server/base station sends the training request/instruction, the data set is synchronized.
b)训练数据交互:主要包括正向传输(发送特征)和反向传输(发送梯度)。b) Training data interaction: mainly includes forward transmission (sending features) and reverse transmission (sending gradients).
其中,正向传输包括:节点1特征传输至服务器/基站、服务器/基站发送节点1特征至节点2、节点2特征传输至服务器/基站。例如可以采用如下特征传输格式:Among them, the forward transmission includes: transmitting the characteristics of node 1 to the server/base station, the server/base station sending the characteristics of node 1 to node 2, and transmitting the characteristics of node 2 to the server/base station. For example, the following characteristic transmission format can be used:
时间戳码流Timestamp code stream 特征数据长度Characteristic data length 特征数据码流Characteristic data stream
反向传输包括:服务器/基站发送梯度信息至节点2(用于节点2编码器的更新),节点2发送梯度信息至服务器/基站(用于跨节点辅助信息编码器的更新),服务器/基站发送梯度信息至节点1(用于节点1编码器的更新)。Reverse transmission includes: the server/base station sends gradient information to node 2 (for updating the node 2 encoder), node 2 sends gradient information to the server/base station (for updating the cross-node auxiliary information encoder), the server/base station Send gradient information to node 1 (for node 1 encoder update).
c)梯度信息的发送方式例如可以采用如下传输格式:c) The gradient information can be sent in the following transmission format, for example:
训练时间戳码流Training timestamp code stream 梯度数据长度Gradient data length 梯度数据码流gradient data stream
其中,训练时间戳码流:用于指示当前梯度信息的时间,便于多个节点之间同步数据;Among them, training timestamp code stream: used to indicate the time of the current gradient information to facilitate data synchronization between multiple nodes;
梯度数据码流:根据初始配置的压缩参数和数据类型,经过处理后的待发送码流(在初始阶段配置的可选处理方式,可以与配置的特征传输数据类型对应处理方式一致:直接将实数的梯度取值拼接为复数信号;或先量化,接着信道编码和调制,得到复数信号);可选的,可以在梯度数据码流之前加入梯度数据长度字段(与时间戳一起进行信道编码和调制),指示码流实际长度;Gradient data code stream: According to the initially configured compression parameters and data type, the processed code stream to be sent (the optional processing method configured in the initial stage can be consistent with the corresponding processing method of the configured characteristic transmission data type: directly convert real numbers The gradient values are spliced into a complex signal; or quantized first, followed by channel coding and modulation to obtain a complex signal); optionally, the gradient data length field can be added before the gradient data code stream (channel coding and modulation are performed together with the timestamp ), indicating the actual length of the code stream;
相应码流可以通过PUSCH(适用于节点1、2→服务器/基站)或PDSCH(适用于服务器/基站→节点2、1)进行传输,此外,也可以经过MAC组包后在物理层发送。The corresponding code stream can be transmitted through PUSCH (applicable to nodes 1 and 2 → server/base station) or PDSCH (applicable to server/base station → nodes 2 and 1). In addition, it can also be sent at the physical layer after being packaged by MAC.
本申请实施例还提供一种编码器和解码器的架构。如图7a所示为图3所示Attention A的框架结构,如图7b所示为图3所示Attention B的框架结构。其中以节点1采集音频信号,节点2采集视频信号的场景为例。在图3所示的网络结构中包含了多个编解码器和Attention。Attention A的输入为服务器/基站解码后的音频特征Y A,其数据大小可以为32*10*64,然后经过自注意力self-attention计算和一个前馈神经网络(feed-forward network,FFN)得到输出结果。类似地,Attention B的输入为视频采集终端的视觉特征x B和反馈至视频采集终端解码后的音频特征Z A(x B和Z A的数据大小均为32*10*64),然后经过交叉注意力计算和一个前馈神经网络得到输出结果(输出结果的大小为32*10*64)。 The embodiment of the present application also provides an architecture of an encoder and a decoder. Figure 7a shows the frame structure of Attention A shown in Figure 3, and Figure 7b shows the frame structure of Attention B shown in Figure 3. Take the scenario where node 1 collects audio signals and node 2 collects video signals as an example. The network structure shown in Figure 3 contains multiple codecs and Attention. The input of Attention A is the audio feature Y A decoded by the server/base station. The data size can be 32*10*64, which is then calculated by self-attention and a feed-forward network (FFN). Get the output. Similarly, the input of Attention B is the visual feature x B of the video capture terminal and the decoded audio feature Z A fed back to the video capture terminal (the data sizes of x B and Z A are both 32*10*64), and then through crossover Attention calculation and a feed-forward neural network get the output result (the size of the output result is 32*10*64).
图8a示出了音频编码器Encoder A1的一种网络结构,图8b示出了视觉编码器Encoder B2的一种网络结构。其中,音频编码器Encoder A1采用了一个残差的全连接网络来减少压缩过程中音频特征的丢失,其中数据输入维度为32*10*128。视觉编码器Encoder B2是将编码后的视觉特征x B和经过音频-视觉注意力计算后的x B*进行拼接(x B*和x B的数据维度大小均为32*10*64),然后利用一层的全连接网络进行压缩(压缩后数据的维度为32*10*48)。FC为全连接层(Fully Connected,FC)。 Figure 8a shows a network structure of the audio encoder Encoder A1 , and Figure 8b shows a network structure of the visual encoder Encoder B2 . Among them, the audio encoder Encoder A1 uses a residual fully connected network to reduce the loss of audio features during the compression process, where the data input dimension is 32*10*128. The visual encoder Encoder B2 splices the encoded visual feature x B and the x B * after audio-visual attention calculation (the data dimensions of x B * and x B are both 32*10*64), and then Use a layer of fully connected network for compression (the dimension of the compressed data is 32*10*48). FC is the fully connected layer (Fully Connected, FC).
图9示出了视觉编码器Encoder B1的一种网络结构。Encoder B1将读入的视频图像特征和时序特征进行压缩编码,其中图像特征的维度大小为32*80*2048,时序特征维度大小为32*10*512。其输入包括图像特征和时序特征。其中先对图像特征进行一个平均池化操作(池化后数据输出为32*10*2048),然后经过一层的全连接将数据进行压缩。通过对时序特征经过一层的全连接进行压缩,然后将两侧的特征进行拼接,经过一层的全连接层获得最后的输出(大小为32*10*64)。 Figure 9 shows a network structure of the visual encoder Encoder B1 . Encoder B1 compresses and encodes the read video image features and timing features. The dimension size of the image features is 32*80*2048, and the dimension size of the timing features is 32*10*512. Its input includes image features and temporal features. First, an average pooling operation is performed on the image features (the data output after pooling is 32*10*2048), and then the data is compressed through a layer of full connection. By compressing the temporal features through a layer of fully connected layers, and then splicing the features on both sides, the final output (size 32*10*64) is obtained through a layer of fully connected layers.
示例性的,解码器Decoder A1和Decoder B2可分别使用一层全连接网络,分别对音频特征和视频特征进行维度扩充,使得Y A*和Y B维度相同,大小均为32*10*64。Encoder A2对反馈的音频特征进行传输编码,其可利用一层的全连接网络对输入的特征进行计算,输入和输出大小均为32*10*26。Decoder A2使用一层全连接网络,主要实现对下发的音频辅助信息进行维度扩充,即使得Z A的维度与x B维度相一致,以进行后续Attention操作(其输入维度为32*10*26,输出维度为32*10*64)。 For example, Decoder A1 and Decoder B2 can respectively use a layer of fully connected networks to expand the dimensions of audio features and video features respectively, so that Y A * and Y B have the same dimensions and are both 32*10*64 in size. Encoder A2 transmits and encodes the feedback audio features. It can use a layer of fully connected network to calculate the input features. The input and output sizes are both 32*10*26. Decoder A2 uses a layer of fully connected network to mainly implement dimension expansion of the delivered audio auxiliary information, that is, to make the dimension of Z A consistent with the dimension of x B for subsequent Attention operations (its input dimension is 32*10*26 , the output dimension is 32*10*64).
本申请实施例还提供另一种编码器和解码器的架构。其中以节点1采集图像信号,节点2采集视频信号的场景为例。如图10a所示为图3所示编码器Encoder A1的另一种框架结构,如图10b所示为图3所示编码器Encoder B2的框架结构,如图10c所示为图3所示编码器Encoder B1的框架结构。其中,Encoder A1为残差的卷积网络,其对采集的图像特征进行压缩编码,采用残差设计是为了减少信息的丢失。Encoder B2是将注意力操作后的信息与视觉特征进行拼接后,通过一层卷积网络(例如卷积核大小为1*1),以实现多个模态信息的融合。Encoder B1将读入的视频的图像特征和时序特征进行压缩编码,压缩过程分别采用了3*3的卷积核以及1*1的卷积核来实现。可选的,上述编码器的输入数据和输出数据的维度大小和前述实施例一致。 The embodiment of the present application also provides another encoder and decoder architecture. Take the scenario where node 1 collects image signals and node 2 collects video signals as an example. Figure 10a shows another frame structure of the encoder Encoder A1 shown in Figure 3. Figure 10b shows the frame structure of the encoder Encoder B2 shown in Figure 3. Figure 10c shows the encoder shown in Figure 3. The frame structure of Encoder B1 . Among them, Encoder A1 is a residual convolutional network, which compresses and codes the collected image features. The residual design is used to reduce the loss of information. Encoder B2 splices the attention-operated information and visual features through a layer of convolutional network (for example, the convolution kernel size is 1*1) to achieve the fusion of multiple modal information. Encoder B1 compresses and encodes the image features and timing features of the read video. The compression process is implemented by using a 3*3 convolution kernel and a 1*1 convolution kernel respectively. Optionally, the dimension size of the input data and output data of the above encoder is consistent with the previous embodiment.
类似地,Decoder A1和Decoder B2也可以利用其它的网络结构,例如长短期记忆(Long short-term memory,LSTM)网络来实现对输入特征的压缩和维度扩充等,本方案对此不作严格限制。 Similarly, Decoder A1 and Decoder B2 can also use other network structures, such as long short-term memory (LSTM) networks to achieve compression and dimension expansion of input features. This solution does not impose strict restrictions on this.
需要说明的是,在本申请的各个实施例中,如果没有特殊说明以及逻辑冲突,各个实施例之间的术语和/或描述具有一致性、且可以相互引用,不同的实施例中的技术特征根据其内在的逻辑关系可以组合形成新的实施例。It should be noted that in the various embodiments of this application, if there are no special instructions or logical conflicts, the terminology and/or descriptions between the various embodiments are consistent and can be referenced to each other. The technical features in different embodiments New embodiments can be formed based on their internal logical relationships.
上述详细阐述了本申请实施例的方法,下面提供了本申请实施例的装置。可以理解的,本申请各个装置实施例中,对多个单元或者模块的划分仅是一种根据功能进行的逻辑划分,不作为对装置具体的结构的限定。在具体实现中,其中部分功能模块可能被细分为更多细小的功能模块,部分功能模块也可能组合成一个功能模块,但无论这些功能模块是进行了细分还是组合,装置所执行的大致流程是相同的。例如,一些装置中包含接收单元和发送单元。一些设计中,发送单元和接收单元也可以集成为通信单元,该通信单元可以实现接收单元和发送单元所实现的功能。通常,每个单元都对应有各自的程序代码(或者说程序指令),这些单元各自对应的程序代码在处理器上运行时,使得该单元受处理单元的控制而执行相应的流程从而实现相应功能。The method of the embodiment of the present application is described in detail above, and the device of the embodiment of the present application is provided below. It can be understood that in each device embodiment of the present application, the division of multiple units or modules is only a logical division based on functions and does not limit the specific structure of the device. In specific implementation, some of the functional modules may be subdivided into more small functional modules, and some of the functional modules may also be combined into one functional module. However, no matter whether these functional modules are subdivided or combined, the roughly what the device performs is The process is the same. For example, some devices include a receiving unit and a transmitting unit. In some designs, the sending unit and the receiving unit can also be integrated into a communication unit, and the communication unit can realize the functions realized by the receiving unit and the sending unit. Usually, each unit corresponds to its own program code (or program instruction). When the program codes corresponding to these units are run on the processor, the unit is controlled by the processing unit and executes the corresponding process to achieve the corresponding function. .
本申请实施例还提供用于实现以上任一种方法的装置,例如,提供一种数据处理装置包括用以实现以上任一种方法中第一节点所执行的各步骤的模块(或手段)。再如,还提供另一 种数据处理装置,包括用以实现以上任一种方法中服务器或基站所执行的各步骤的模块(或手段)。Embodiments of the present application also provide a device for implementing any of the above methods. For example, a data processing device is provided that includes modules (or means) for implementing each step performed by the first node in any of the above methods. As another example, another data processing device is also provided, including modules (or means) used to implement each step performed by the server or base station in any of the above methods.
例如,参照图11所示,是本申请实施例提供的一种数据处理装置的结构示意图。该数据处理装置用于实现前述的数据处理方法,例如图2中所示的第一节点所执行的数据处理方法。For example, refer to FIG. 11 , which is a schematic structural diagram of a data processing device provided by an embodiment of the present application. The data processing device is used to implement the aforementioned data processing method, such as the data processing method executed by the first node shown in FIG. 2 .
如图11所示,该装置可包括第一处理模块1101、接收模块1102、第二处理模块1103和发送模块1104,具体如下:As shown in Figure 11, the device may include a first processing module 1101, a receiving module 1102, a second processing module 1103 and a sending module 1104, specifically as follows:
第一处理模块1101,用于对采集的第一初始数据进行压缩编码处理,以得到第一数据;The first processing module 1101 is used to compress and encode the collected first initial data to obtain the first data;
接收模块1102,用于接收来自服务器或基站的跨节点辅助信息;The receiving module 1102 is used to receive cross-node auxiliary information from the server or base station;
第二处理模块1103,用于根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The second processing module 1103 is configured to perform de-redundant processing on the first data according to the cross-node auxiliary information to obtain second data, where the cross-node auxiliary information is related to the data collected by the first node. Information related to the first initial data and the second initial data collected by the second node;
发送模块1104,用于发送所述第二数据。The sending module 1104 is used to send the second data.
在一种可能的实现方式中,所述接收模块1102,还用于:In a possible implementation, the receiving module 1102 is also used to:
接收来自所述服务器或基站的第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据。Receive first instruction information from the server or base station, where the first instruction information is used to instruct the first node to collect data in the first mode.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为所述第一模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality.
在一种可能的实现方式中,所述第二处理模块1103,用于:In a possible implementation, the second processing module 1103 is used to:
将所述跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered. The first default model.
针对该模块对应的处理的具体介绍可参阅前述实施例中相应的记载,在此不再赘述。For a detailed introduction to the processing corresponding to this module, please refer to the corresponding records in the foregoing embodiments, and will not be described again here.
参照图12所示,是本申请实施例提供的另一种数据处理装置的结构示意图。该数据处理装置用于实现前述的数据处理方法,例如图2中所示的服务器所执行的数据处理方法。Refer to FIG. 12 , which is a schematic structural diagram of another data processing device provided by an embodiment of the present application. The data processing device is used to implement the aforementioned data processing method, such as the data processing method executed by the server shown in FIG. 2 .
如图12所示,该装置可包括接收模块1201、处理模块1202和发送模块1203,具体如下:As shown in Figure 12, the device may include a receiving module 1201, a processing module 1202 and a sending module 1203, specifically as follows:
接收模块1201,用于接收来自第二节点的第三数据,所述第三数据是所述第二节点对采集的第二初始数据进行压缩编码处理后得到的;The receiving module 1201 is configured to receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
处理模块1202,用于对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The processing module 1202 is configured to process the third data to obtain cross-node auxiliary information, where the cross-node auxiliary information is related to the first initial data collected by the first node and the third data collected by the second node. 2. Information related to initial data;
发送模块1203,用于向所述第一节点发送所述跨节点辅助信息。Sending module 1203, configured to send the cross-node auxiliary information to the first node.
在一种可能的实现方式中,所述接收模块1201,还用于接收来自所述第一节点的第二数据;In a possible implementation, the receiving module 1201 is also configured to receive second data from the first node;
所述处理模块1202,还用于对所述第二数据和所述第三数据进行融合处理。The processing module 1202 is also used to perform fusion processing on the second data and the third data.
在一种可能的实现方式中,所述发送模块1203,还用于:In a possible implementation, the sending module 1203 is also used to:
向所述第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;Send first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
在一种可能的实现方式中,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。In a possible implementation, the first initial data collected by the first node is data of the first modality, and the second initial data collected by the second node is data of the second modality. .
在一种可能的实现方式中,所述处理模块1202,还用于:In a possible implementation, the processing module 1202 is also used to:
将所述第三数据输入至第二预设模型中进行处理,以得到所述跨节点辅助信息,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第二预设模型。The third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
针对该模块对应的处理的具体介绍可参阅前述实施例中相应的记载,在此不再赘述。For a detailed introduction to the processing corresponding to this module, please refer to the corresponding records in the foregoing embodiments, and will not be described again here.
应理解以上各个装置中各模块的划分仅是一种逻辑功能的划分,实际实现时可以全部或部分集成到一个物理实体上,也可以物理上分开。此外,数据处理装置中的模块可以以处理器调用软件的形式实现;例如数据处理装置包括处理器,处理器与存储器连接,存储器中存储有指令,处理器调用存储器中存储的指令,以实现以上任一种方法或实现该装置各模块的功能,其中处理器例如为通用处理器,比如中央处理单元(central processing unit,CPU)或微处理器,存储器为装置内的存储器或装置外的存储器。或者,装置中的模块可以以硬件电路的形式实现,可以通过对硬件电路的设计实现部分或全部单元的功能,该硬件电路可以理解为一个或多个处理器;例如,在一种实现中,该硬件电路为专用集成电路(application-specific integrated circuit,ASIC),通过对电路内元件逻辑关系的设计,实现以上部分或全部单元的功能;再如,在另一种实现中,该硬件电路为可以通过可编程逻辑器件(programmable logic device,PLD)实现,以现场可编程门阵列(field programmable gate array,FPGA)为例,其可以包括大量逻辑门电路,通过配置文件来配置逻辑门电路之间的连接关系,从而实现以上部分或全部单元的功能。以上装置的所有模块可以全部通过处理器调用软件的形式实现,或全部通过硬件电路的形式实现,或部分通过处理器调用软件的形式实现,剩余部分通过硬件电路的形式实现。It should be understood that the division of each module in each device above is only a division of logical functions. In actual implementation, it can be fully or partially integrated into a physical entity, or it can also be physically separated. In addition, the modules in the data processing device can be implemented in the form of the processor calling software; for example, the data processing device includes a processor, the processor is connected to a memory, instructions are stored in the memory, and the processor calls the instructions stored in the memory to achieve the above. Any method or function of each module of the device is implemented, where the processor is, for example, a general-purpose processor, such as a central processing unit (CPU) or a microprocessor, and the memory is a memory within the device or a memory outside the device. Alternatively, the modules in the device can be implemented in the form of hardware circuits, and some or all of the unit functions can be implemented through the design of the hardware circuits, which can be understood as one or more processors; for example, in one implementation, The hardware circuit is an application-specific integrated circuit (ASIC), which realizes the functions of some or all of the above units through the design of the logical relationships of the components in the circuit; for another example, in another implementation, the hardware circuit is It can be realized by programmable logic device (PLD), taking field programmable gate array (FPGA) as an example, which can include a large number of logic gate circuits, and the logic gate circuits are configured through configuration files. connection relationships, thereby realizing the functions of some or all of the above units. All modules of the above device may be fully implemented by the processor calling software, or all may be implemented by hardware circuits, or part of the modules may be implemented by the processor calling software, and the remaining part may be implemented by hardware circuits.
参照图13所示,是本申请实施例提供的又一种数据处理装置的硬件结构示意图。如图13所示的数据处理装置1300(该装置1300具体可以是一种计算机设备)包括存储器1301、处理器1302、通信接口1303以及总线1304。其中,存储器1301、处理器1302、通信接口1303通过总线1304实现彼此之间的通信连接。Refer to FIG. 13 , which is a schematic diagram of the hardware structure of another data processing device provided by an embodiment of the present application. The data processing device 1300 shown in FIG. 13 (the device 1300 may specifically be a computer device) includes a memory 1301, a processor 1302, a communication interface 1303 and a bus 1304. Among them, the memory 1301, the processor 1302, and the communication interface 1303 implement communication connections between each other through the bus 1304.
存储器1301可以是只读存储器(read only memory,ROM),静态存储设备,动态存储设备或者随机存取存储器(random access memory,RAM)。The memory 1301 may be a read only memory (ROM), a static storage device, a dynamic storage device or a random access memory (RAM).
存储器1301可以存储程序,当存储器1301中存储的程序被处理器1302执行时,处理器1302和通信接口1303用于执行本申请实施例的数据处理方法的各个步骤。The memory 1301 can store programs. When the program stored in the memory 1301 is executed by the processor 1302, the processor 1302 and the communication interface 1303 are used to execute various steps of the data processing method in the embodiment of the present application.
处理器1302是一种具有信号的处理能力的电路,在一种实现中,处理器1302可以是具有指令读取与运行能力的电路,例如中央处理单元CPU、微处理器、图形处理器(graphics processing unit,GPU)(可以理解为一种微处理器)、或数字信号处理器(digital singnal processor,DSP)等;在另一种实现中,处理器1302可以通过硬件电路的逻辑关系实现一定功能,该硬件电路的逻辑关系是固定的或可以重构的,例如处理器1302为ASIC或可编程逻辑器件PLD实现的硬件电路,比如FPGA。在可重构的硬件电路中,处理器加载配置文档,实现硬件电路配置的过程,可以理解为处理器加载指令,以实现以上部分或全部模块的功能的过程。此外,还可以是针对人工智能设计的硬件电路,其可以理解为一种ASIC,例如神经网络处理单元(neural network processing unit,NPU)、张量处理单元(tensor processing unit,TPU)、深度学习处理单元(deep learning processing unit,DPU)等。处理器1302用于执行相关程序,以实现本申请实施例的数据处理装置中的单元所需执行的功能,或者执行本申请方法实施例的数据处理方法。The processor 1302 is a circuit with signal processing capabilities. In one implementation, the processor 1302 can be a circuit with the ability to read and run instructions, such as a central processing unit (CPU), a microprocessor, a graphics processor (graphics processor) processing unit (GPU) (can be understood as a microprocessor), or digital signal processor (digital signal processor, DSP), etc.; in another implementation, the processor 1302 can implement certain functions through the logical relationship of the hardware circuit , the logical relationship of the hardware circuit is fixed or reconfigurable, for example, the processor 1302 implements a hardware circuit for an ASIC or a programmable logic device PLD, such as an FPGA. In a reconfigurable hardware circuit, the process of the processor loading the configuration file and realizing the hardware circuit configuration can be understood as the process of the processor loading instructions to realize the functions of some or all of the above modules. In addition, it can also be a hardware circuit designed for artificial intelligence, which can be understood as an ASIC, such as a neural network processing unit (NPU), tensor processing unit (TPU), deep learning processing Unit (deep learning processing unit, DPU), etc. The processor 1302 is used to execute relevant programs to implement the functions required to be performed by the units in the data processing device in the embodiment of the present application, or to execute the data processing method in the method embodiment of the present application.
可见,以上装置中的各模块可以是被配置成实施以上方法的一个或多个处理器(或处理 电路),例如:CPU、GPU、NPU、TPU、DPU、微处理器、DSP、ASIC、FPGA,或这些处理器形式中至少两种的组合。It can be seen that each module in the above device can be one or more processors (or processing circuits) configured to implement the above method, such as: CPU, GPU, NPU, TPU, DPU, microprocessor, DSP, ASIC, FPGA , or a combination of at least two of these processor forms.
此外,以上装置中的各模块可以全部或部分可以集成在一起,或者可以独立实现。在一种实现中,这些模块集成在一起,以片上系统(system-on-a-chip,SOC)的形式实现。该SOC中可以包括至少一个处理器,用于实现以上任一种方法或实现该装置各模块的功能,该至少一个处理器的种类可以不同,例如包括CPU和FPGA,CPU和人工智能处理器,CPU和GPU等。In addition, all or part of the modules in the above device may be integrated together, or may be implemented independently. In one implementation, these modules are integrated together and implemented as a system-on-a-chip (SOC). The SOC may include at least one processor for implementing any of the above methods or implementing the functions of each module of the device. The at least one processor may be of different types, such as a CPU and an FPGA, or a CPU and an artificial intelligence processor. CPU and GPU etc.
通信接口1303使用例如但不限于收发器一类的收发装置,来实现装置1300与其他设备或通信网络之间的通信。例如,可以通过通信接口1303获取数据。The communication interface 1303 uses a transceiver device such as but not limited to a transceiver to implement communication between the device 1300 and other devices or communication networks. For example, data can be obtained through communication interface 1303.
总线1304可包括在装置1300各个部件(例如,存储器1301、处理器1302、通信接口1303)之间传送信息的通路。Bus 1304 may include a path that carries information between various components of device 1300 (eg, memory 1301, processor 1302, communication interface 1303).
应注意,尽管图13所示的装置1300仅仅示出了存储器、处理器、通信接口,但是在具体实现过程中,本领域的技术人员应当理解,装置1300还包括实现正常运行所必须的其他器件。同时,根据具体需要,本领域的技术人员应当理解,装置1300还可包括实现其他附加功能的硬件器件。此外,本领域的技术人员应当理解,装置1300也可仅仅包括实现本申请实施例所必须的器件,而不必包括图13中所示的全部器件。It should be noted that although the device 1300 shown in Figure 13 only shows a memory, a processor, and a communication interface, during specific implementation, those skilled in the art will understand that the device 1300 also includes other devices necessary for normal operation. . At the same time, based on specific needs, those skilled in the art should understand that the device 1300 may also include hardware devices that implement other additional functions. In addition, those skilled in the art should understand that the device 1300 may only include components necessary to implement the embodiments of the present application, and does not necessarily include all components shown in FIG. 13 .
本申请实施例还提供了一种数据处理系统,所述系统包括服务器或基站,还包括第一节点,其中:所述服务器或基站用于实现所述的第二方面提供的数据处理方法中的一个或多个步骤;所述第一节点用于实现所述的第一方面提供的数据处理方法中的一个或多个步骤。Embodiments of the present application also provide a data processing system. The system includes a server or a base station, and also includes a first node, wherein: the server or base station is used to implement the data processing method provided in the second aspect. One or more steps; the first node is used to implement one or more steps in the data processing method provided by the first aspect.
本申请实施例还提供了一种计算机可读存储介质,该计算机可读存储介质中存储有指令,当其在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。Embodiments of the present application also provide a computer-readable storage medium. The computer-readable storage medium stores instructions, which when run on a computer or processor, cause the computer or processor to execute one of the above methods. or multiple steps.
本申请实施例还提供了一种包含指令的计算机程序产品。当该计算机程序产品在计算机或处理器上运行时,使得计算机或处理器执行上述任一个方法中的一个或多个步骤。An embodiment of the present application also provides a computer program product containing instructions. When the computer program product is run on a computer or processor, the computer or processor is caused to perform one or more steps in any of the above methods.
应理解,在本申请的描述中,除非另有说明,“/”表示前后关联的对象是一种“或”的关系,例如,A/B可以表示A或B;其中A,B可以是单数或者复数。并且,在本申请的描述中,除非另有说明,“多个”是指两个或多于两个。“以下至少一项(个)”或其类似表达,是指的这些项中的任意组合,包括单项(个)或复数项(个)的任意组合。例如,a,b,或c中的至少一项(个),可以表示:a,b,c,a-b,a-c,b-c,或a-b-c,其中a,b,c可以是单个,也可以是多个。另外,为了便于清楚描述本申请实施例的技术方案,在本申请的实施例中,采用了“第一”、“第二”等字样对功能和作用基本相同的相同项或相似项进行区分。本领域技术人员可以理解“第一”、“第二”等字样并不对数量和执行次序进行限定,并且“第一”、“第二”等字样也并不限定一定不同。同时,在本申请实施例中,“示例性的”或者“例如”等词用于表示作例子、例证或说明。本申请实施例中被描述为“示例性的”或者“例如”的任何实施例或设计方案不应被解释为比其它实施例或设计方案更优选或更具优势。确切而言,使用“示例性的”或者“例如”等词旨在以具体方式呈现相关概念,便于理解。It should be understood that in the description of this application, unless otherwise stated, "/" indicates that the related objects are in an "or" relationship. For example, A/B can mean A or B; where A and B can be singular numbers. Or plural. Furthermore, in the description of this application, unless otherwise specified, "plurality" means two or more than two. "At least one of the following" or similar expressions thereof refers to any combination of these items, including any combination of a single item (items) or a plurality of items (items). For example, at least one of a, b, or c can mean: a, b, c, a-b, a-c, b-c, or a-b-c, where a, b, c can be single or multiple . In addition, in order to facilitate a clear description of the technical solutions of the embodiments of the present application, in the embodiments of the present application, words such as “first” and “second” are used to distinguish identical or similar items with basically the same functions and effects. Those skilled in the art can understand that words such as "first" and "second" do not limit the number and execution order, and words such as "first" and "second" do not limit the number and execution order. At the same time, in the embodiments of this application, words such as "exemplary" or "for example" are used to represent examples, illustrations or explanations. Any embodiment or design described as "exemplary" or "such as" in the embodiments of the present application is not to be construed as preferred or advantageous over other embodiments or designs. Rather, the use of words such as "exemplary" or "such as" is intended to present related concepts in a concrete manner that is easier to understand.
在本申请所提供的几个实施例中,应该理解到,所揭露的系统、装置和方法,可以通过其它的方式实现。例如,该单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如,多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。所显示或讨论的相互之间的耦合、或直接耦合、或通信连接可以是通过一 些接口,装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed systems, devices and methods can be implemented in other ways. For example, the division of this unit is only a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components may be combined or integrated into another system, or some features may be ignored or not used. implement. The mutual coupling, direct coupling, or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of devices or units may be in electrical, mechanical, or other forms.
作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。A unit described as a separate component may or may not be physically separate. A component shown as a unit may or may not be a physical unit, that is, it may be located in one place, or it may be distributed to multiple network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.
在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。该计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行该计算机程序指令时,全部或部分地产生按照本申请实施例的流程或功能。该计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。该计算机指令可以存储在计算机可读存储介质中,或者通过该计算机可读存储介质进行传输。该计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字用户线(digital subscriber line,DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。该计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。该可用介质可以是只读存储器(read-only memory,ROM),或随机存取存储器(random access memory,RAM),或磁性介质,例如,软盘、硬盘、磁带、磁碟、或光介质,例如,数字通用光盘(digital versatile disc,DVD)、或者半导体介质,例如,固态硬盘(solid state disk,SSD)等。In the above embodiments, it may be implemented in whole or in part by software, hardware, firmware, or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present application are generated in whole or in part. The computer may be a general purpose computer, a special purpose computer, a computer network, or other programmable device. The computer instructions may be stored in or transmitted over a computer-readable storage medium. The computer instructions can be transmitted from one website, computer, server or data center to another through wired (such as coaxial cable, optical fiber, digital subscriber line (DSL)) or wireless (such as infrared, wireless, microwave, etc.) means A website site, computer, server or data center for transmission. The computer-readable storage medium can be any available medium that can be accessed by a computer or a data storage device such as a server or data center integrated with one or more available media. The available media may be read-only memory (ROM), random access memory (RAM), or magnetic media, such as floppy disks, hard disks, tapes, disks, or optical media, such as , digital versatile disc (digital versatile disc, DVD), or semiconductor media, such as solid state drive (solid state disk, SSD), etc.
以上所述,仅为本申请实施例的具体实施方式,但本申请实施例的保护范围并不局限于此,任何在本申请实施例揭露的技术范围内的变化或替换,都应涵盖在本申请实施例的保护范围之内。因此,本申请实施例的保护范围应以所述权利要求的保护范围为准。The above are only specific implementation modes of the embodiments of the present application, but the protection scope of the embodiments of the present application is not limited thereto. Any changes or substitutions within the technical scope disclosed in the embodiments of the present application shall be covered by this application. within the protection scope of the application embodiment. Therefore, the protection scope of the embodiments of the present application should be subject to the protection scope of the claims.

Claims (21)

  1. 一种数据处理方法,应用于第一节点,其特征在于,包括:A data processing method, applied to the first node, is characterized by including:
    对采集的第一初始数据进行压缩编码处理,以得到第一数据;Perform compression encoding processing on the collected first initial data to obtain the first data;
    接收来自服务器或基站的跨节点辅助信息;Receive cross-node assistance information from the server or base station;
    根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The first data is de-redundantly processed according to the cross-node auxiliary information to obtain the second data. The cross-node auxiliary information is the first initial data collected by the first node and the second node. Information related to the second initial data collected;
    发送所述第二数据。Send the second data.
  2. 根据权利要求1所述的方法,其特征在于,所述方法还包括:The method of claim 1, further comprising:
    接收来自所述服务器或基站的第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据。Receive first instruction information from the server or base station, where the first instruction information is used to instruct the first node to collect data in the first mode.
  3. 根据权利要求2所述的方法,其特征在于,所述第一节点采集的所述第一初始数据为所述第一模态的数据。The method of claim 2, wherein the first initial data collected by the first node is data of the first modality.
  4. 根据权利要求1至3任一项所述的方法,其特征在于,所述根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,包括:The method according to any one of claims 1 to 3, characterized in that: performing redundancy processing on the first data according to the cross-node auxiliary information to obtain the second data includes:
    将所述跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered. The first default model.
  5. 一种数据处理方法,应用于服务器或基站,其特征在于,包括:A data processing method, applied to servers or base stations, is characterized by including:
    接收来自第二节点的第三数据,所述第三数据是所述第二节点对采集的第二初始数据进行压缩编码处理后得到的;Receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
    对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The third data is processed to obtain cross-node auxiliary information. The cross-node auxiliary information is information related to the first initial data collected by the first node and the second initial data collected by the second node. ;
    向所述第一节点发送所述跨节点辅助信息。Send the cross-node assistance information to the first node.
  6. 根据权利要求5所述的方法,其特征在于,所述方法还包括:The method of claim 5, further comprising:
    接收来自所述第一节点的第二数据;receiving second data from the first node;
    对所述第二数据和所述第三数据进行融合处理。Perform fusion processing on the second data and the third data.
  7. 根据权利要求5或6所述的方法,其特征在于,所述方法还包括:The method according to claim 5 or 6, characterized in that, the method further includes:
    向所述第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;Send first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
    向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
  8. 根据权利要求7所述的方法,其特征在于,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。The method of claim 7, wherein the first initial data collected by the first node is data of a first modality, and the second initial data collected by the second node is data of a second modality. Modal data.
  9. 根据权利要求5至8任一项所述的方法,其特征在于,所述对所述第三数据进行处理,以得到跨节点辅助信息,包括:The method according to any one of claims 5 to 8, characterized in that processing the third data to obtain cross-node auxiliary information includes:
    将所述第三数据输入至第二预设模型中进行处理,以得到所述跨节点辅助信息,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第二预设模型。The third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
  10. 一种数据处理装置,其特征在于,包括:A data processing device, characterized in that it includes:
    第一处理模块,用于对采集的第一初始数据进行压缩编码处理,以得到第一数据;The first processing module is used to compress and encode the collected first initial data to obtain the first data;
    接收模块,用于接收来自服务器或基站的跨节点辅助信息;A receiving module used to receive cross-node auxiliary information from the server or base station;
    第二处理模块,用于根据所述跨节点辅助信息对所述第一数据进行去冗余处理,以得到第二数据,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;The second processing module is configured to perform de-redundant processing on the first data according to the cross-node auxiliary information to obtain the second data. The cross-node auxiliary information is the same as the third data collected by the first node. Information related to the first initial data and the second initial data collected by the second node;
    发送模块,用于发送所述第二数据。A sending module, configured to send the second data.
  11. 根据权利要求10所述的装置,其特征在于,所述接收模块,还用于:The device according to claim 10, characterized in that the receiving module is also used to:
    接收来自所述服务器或基站的第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据。Receive first instruction information from the server or base station, where the first instruction information is used to instruct the first node to collect data in the first mode.
  12. 根据权利要求11所述的装置,其特征在于,所述第一节点采集的所述第一初始数据为所述第一模态的数据。The device according to claim 11, wherein the first initial data collected by the first node is data of the first modality.
  13. 根据权利要求10至12任一项所述的装置,其特征在于,所述第二处理模块,用于:The device according to any one of claims 10 to 12, characterized in that the second processing module is used for:
    将所述跨节点辅助信息和所述第一数据均输入至第一预设模型中进行处理,以得到第二数据,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第一预设模型。Both the cross-node auxiliary information and the first data are input into the first preset model for processing to obtain the second data, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, the training is triggered. The first default model.
  14. 一种数据处理装置,包括:A data processing device including:
    接收模块,用于接收来自第二节点的第三数据,所述第三数据是所述第二节点对采集的第二初始数据进行压缩编码处理后得到的;A receiving module, configured to receive third data from the second node, where the third data is obtained by the second node after compressing and encoding the collected second initial data;
    处理模块,用于对所述第三数据进行处理,以得到跨节点辅助信息,所述跨节点辅助信息为与所述第一节点采集的所述第一初始数据和第二节点采集的第二初始数据相关的信息;A processing module, configured to process the third data to obtain cross-node auxiliary information, where the cross-node auxiliary information is related to the first initial data collected by the first node and the second data collected by the second node. Information related to initial data;
    发送模块,用于向所述第一节点发送所述跨节点辅助信息。A sending module, configured to send the cross-node auxiliary information to the first node.
  15. 根据权利要求14所述的装置,其特征在于,所述接收模块,还用于接收来自所述第一节点的第二数据;The device according to claim 14, wherein the receiving module is further configured to receive second data from the first node;
    所述处理模块,还用于对所述第二数据和所述第三数据进行融合处理。The processing module is also used to perform fusion processing on the second data and the third data.
  16. 根据权利要求14或15所述的装置,其特征在于,所述发送模块,还用于:The device according to claim 14 or 15, characterized in that the sending module is also used to:
    向所述第一节点发送第一指示信息,所述第一指示信息用于指示所述第一节点采集第一模态的数据;Send first instruction information to the first node, where the first instruction information is used to instruct the first node to collect data in the first mode;
    向所述第二节点发送第二指示信息,所述第二指示信息用于指示所述第二节点采集第二模态的数据。Send second instruction information to the second node, where the second instruction information is used to instruct the second node to collect data in the second mode.
  17. 根据权利要求16所述的装置,其特征在于,所述第一节点采集的所述第一初始数据为第一模态的数据,所述第二节点采集的所述第二初始数据为第二模态的数据。The device according to claim 16, characterized in that the first initial data collected by the first node is data of a first modality, and the second initial data collected by the second node is second Modal data.
  18. 根据权利要求14至17任一项所述的装置,其特征在于,所述处理模块,还用于:The device according to any one of claims 14 to 17, characterized in that the processing module is also used to:
    将所述第三数据输入至第二预设模型中进行处理,以得到所述跨节点辅助信息,其中,当系统的接收信噪比变化值超出阈值时,触发训练所述第二预设模型。The third data is input into a second preset model for processing to obtain the cross-node auxiliary information, wherein when the change value of the system's received signal-to-noise ratio exceeds a threshold, training of the second preset model is triggered. .
  19. 一种数据处理装置,其特征在于,包括处理器和通信接口,所述通信接口用于接收和/或发送数据,和/或,所述通信接口用于为所述处理器提供输出和/或输出,所述处理器用于调用计算机指令,以实现权利要求1-4任一项所述的方法,和/或,实现权利要求5-9任一项所述的方法。A data processing device, characterized in that it includes a processor and a communication interface, the communication interface is used to receive and/or send data, and/or the communication interface is used to provide output to the processor and/or Output, the processor is used to call computer instructions to implement the method described in any one of claims 1-4, and/or to implement the method described in any one of claims 5-9.
  20. 一种数据处理系统,其特征在于,所述系统包括服务器或基站,还包括第一节点,其中:A data processing system, characterized in that the system includes a server or a base station, and also includes a first node, wherein:
    所述服务器或基站用于实现如权利要求5-9中任一项所述的数据处理方法;所述第一节点用于实现如权利要求1-4中任一项所述的数据处理方法。The server or base station is used to implement the data processing method as described in any one of claims 5-9; the first node is used to implement the data processing method as described in any one of claims 1-4.
  21. 一种计算机可读存储介质,其特征在于,所述计算机可读存储介质中存储有计算机程序,所述计算机程序用于实现权利要求1-4任一项所述的方法,和/或,实现权利要求5-9任一项所述的方法。A computer-readable storage medium, characterized in that a computer program is stored in the computer-readable storage medium, and the computer program is used to implement the method described in any one of claims 1-4, and/or, implement The method according to any one of claims 5-9.
PCT/CN2022/108901 2022-07-29 2022-07-29 Data processing method, device and system, and storage medium WO2024021008A1 (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/108901 WO2024021008A1 (en) 2022-07-29 2022-07-29 Data processing method, device and system, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2022/108901 WO2024021008A1 (en) 2022-07-29 2022-07-29 Data processing method, device and system, and storage medium

Publications (1)

Publication Number Publication Date
WO2024021008A1 true WO2024021008A1 (en) 2024-02-01

Family

ID=89705048

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2022/108901 WO2024021008A1 (en) 2022-07-29 2022-07-29 Data processing method, device and system, and storage medium

Country Status (1)

Country Link
WO (1) WO2024021008A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118035316A (en) * 2024-02-28 2024-05-14 广州市时代物业管理有限公司 Transaction management system and data processing method thereof

Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110298610A1 (en) * 2010-06-02 2011-12-08 Raul Hernan Etkin Compressing data in a wireless network
CN105744562A (en) * 2016-03-25 2016-07-06 中国地质大学(武汉) Method and system for compressing and reconstructing data of wireless sensor network based on symbolic aggregate approximation
CN105979564A (en) * 2016-04-19 2016-09-28 北京邮电大学 Method and device for sending data
CN113346912A (en) * 2020-03-02 2021-09-03 通用汽车环球科技运作有限责任公司 System and method for near lossless universal data compression using correlated data sequences
WO2021259336A1 (en) * 2020-06-23 2021-12-30 华为技术有限公司 Modal information complementation method, apparatus and device

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110298610A1 (en) * 2010-06-02 2011-12-08 Raul Hernan Etkin Compressing data in a wireless network
CN105744562A (en) * 2016-03-25 2016-07-06 中国地质大学(武汉) Method and system for compressing and reconstructing data of wireless sensor network based on symbolic aggregate approximation
CN105979564A (en) * 2016-04-19 2016-09-28 北京邮电大学 Method and device for sending data
CN113346912A (en) * 2020-03-02 2021-09-03 通用汽车环球科技运作有限责任公司 System and method for near lossless universal data compression using correlated data sequences
WO2021259336A1 (en) * 2020-06-23 2021-12-30 华为技术有限公司 Modal information complementation method, apparatus and device

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN118035316A (en) * 2024-02-28 2024-05-14 广州市时代物业管理有限公司 Transaction management system and data processing method thereof

Similar Documents

Publication Publication Date Title
US10511696B2 (en) System and method for aggregation, archiving and compression of internet of things wireless sensor data
CN110087084B (en) Video processing apparatus and video processing method
TWI674790B (en) Method of picture data encoding and decoding and apparatus
KR20230074137A (en) Instance adaptive image and video compression using machine learning systems
TW202016875A (en) Electronic device for high-speed compression processing of feature map of cnn utilizing system and controlling method thereof
EP3635953B1 (en) Electronic device for high-speed compression processing of feature map of cnn utilizing system and controlling method thereof
TWI667914B (en) Picture data encoding and decoding method and apparatus
CN114766035A (en) Pre-processing and data enhancement of frequency domain data
WO2017128632A1 (en) Method, apparatus and system for image compression and image reconstruction
WO2024021008A1 (en) Data processing method, device and system, and storage medium
US11910017B2 (en) Method for predicting point cloud attribute, encoder, decoder, and storage medium
KR20200022798A (en) System and method for 3d model compression and decompression
WO2012163059A1 (en) Method, device and system for device redirection data transmission
US9997132B2 (en) Data transmission method, data transmission system and portable display device of transmitting compressed data
CN110633798B (en) Parameter updating method and device in distributed training
CN115883658A (en) Cross-platform image information transmission method, system, equipment and readable storage medium
CN113784143A (en) Video transcoding method, device, electronic equipment and computer readable medium
US20210118186A1 (en) Image Decoding During Bitstream Interruptions
WO2024164219A1 (en) Data transmission method and apparatus
WO2024065803A1 (en) Data compression transmission method, apparatus, device, and storage medium
EP4250684B1 (en) Method and apparatus for processing streaming media service, and storage medium
EP4325754A1 (en) Coding and decoding methods, communication apparatus, and system
WO2023029689A1 (en) Multimedia data sharing method, media sharing server, terminal, electronic device, and computer-readable storage medium
WO2024051465A1 (en) Differential file restoration method and electronic device
WO2023169303A1 (en) Encoding and decoding method and apparatus, device, storage medium, and computer program product

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 22952459

Country of ref document: EP

Kind code of ref document: A1