1. Introduction
In the past decade, OFDM with MIMO has become a widely adopted wireless transmission technique due to its ability to achieve high data rates [
1,
2] and enhance diversity gain and system capacity, particularly in scenarios with dynamic, time-varying, and frequency-selective channels [
3,
4]. In the context of big data processing, tensor-decomposition-based channel estimation methods [
5,
6] have attracted a significant amount of attention in MIMO-OFDM systems due to their high efficiency in processing large complex datasets with improved estimation accuracy for high-dimensional problems.
Tensor-decomposition-based channel estimation algorithms generally consist of two steps: the first step involves the estimation of rank, which corresponds to the number of multipath components, and the second step utilizes the obtained rank to estimate the multipath component parameters. It is widely acknowledged that determining the tensor rank is an NP-hard problem, as discussed in [
7]. The predominant approaches to estimate the rank use information-theoretic methods, among which, the most popular methods are Akaike information criterion (AIC) and Bayesian information criterion (BIC), but these have drawbacks of oversimplification and overfitting [
8]. Further, the minimum description length (MDL) [
9] method demonstrates a significant reliance on prior knowledge and exhibits sensitivity. Based on the obtained rank, tensor decomposition can be applied to estimate the channel parameters. Based on the model of Tucker, M. Haardt [
10] extended the high-order singular-value decomposition (HOSVD) to the estimation of channel parameters using the estimation of signal parameters via rotational invariance techniques (ESPRIT). Further application has been expanded to 5G localization mapping as described in [
11]. Based on the model of CP, an enhanced approach was proposed in [
12] to address the downlink channel estimation problem in MIMO-OFDM systems with large antenna arrays. Further enhancement was conducted in [
13], where a tensor-space-assisted estimation scheme was proposed by exploiting the Vandermonde structure of the factor matrix. In addition, based on the two aforementioned models, the sequential unfolding singular-value decomposition (SUSVD) was proposed in [
14] by utilizing a distinctive hierarchical tree structure to obtain orthogonal factor matrices: also called the PARATREE method.
Due to the presence of interference, the performance of traditional tensor-decomposition-based channel estimation methods is severely degraded, as the actual channel interference cannot be simply modeled as colored noise. The degradation of rank estimation performance significantly reduces the performance of channel parameter estimation. This is particularly true in MIMO systems, where interference exhibits high correlation. Interference primarily arises from imperfect designs in MIMO-OFDM systems and front-end circuits, leading to signal distortion, including harmonic distortion, intermodulation distortion, and phase distortion, as demonstrated by radio frequency FEI [
15,
16]. Additionally, frequency reuse and bandwidth congestion may also result in CCI [
17,
18,
19]. When the same frequency bands are allocated to multiple transmitters, signal overlap and degradation usually occur. Traditional methods for addressing interference have relied on additional hardware and post-processing algorithms to filter out interference. However, emerging efficient spectrum allocation technologies based on spectrum sensing and interference identification have significant research implications [
20,
21].
Based on a non-Gaussian and non-stationary interference model, tensor decomposition can be employed to reduce the dimensionality of multi-dimensional data. Qibin Zhao proposed a tensor-based variational Bayesian approach in [
22] for channel estimation by eliminating interference in the channel matrix and utilizing the spatial coupling relationships of partially received tensors. References [
23,
24] proposed to use threshold-based interference exclusion methods for low-rank approximation and channel parameter estimation with incomplete data. In [
25], a multiplicative Gamma process (MGP) was used to reduce the complexity and enhance the speed of automatic rank determination (ARD). Similarly, the use of a generalized hyperbolic (GH) distribution can achieve more flexible sparsity awareness [
26]. It is worth mentioning that variational methods with incomplete observations still suffer from information entropy loss with the presence of interference.
Therefore, it is essential to incorporate channel information, including the additive interference structure. Traditional methods like adaptive filtering [
27], prior-knowledge-based MIMO systems [
28], and radio frequency (RF) front-end feedback networks [
29] focus on removing rather than estimating the interference. Moreover, these conventional approaches have drawbacks of high complexity and costs. An additive RCP [
30] was proposed by inferring interference terms for each pixel in image processing to enhance the precision of image processing, which was expanded to channel parameter estimation in [
31]. However, so far, the actual types of interference in the tensor have not yet been thoroughly considered in the model, which makes accurate interference estimation difficult.
In this paper, we propose an RCP based on alternate prior hypothesis (APH) for channel estimation in MIMO-OFDM systems, hereinafter referred to as RCP-APH. We first separate the interference tensor space and then construct spatial correlations through actual interferences, either FEI or CCI. Consequently, we perform variational iterations in the separated tensor space by alternately modifying the interference prior hypotheses conditions. The main contributions of this paper are summarized as follows:
We adopt an additive interference model for which the parameters are jointly estimated with channel parameters rather than mitigating the interference. As such, it has profound implications in anti-interference applications and dynamic spectrum allocation.
We propose to jointly estimate the channel and interference parameters without increasing the complexity and degrading the estimation performance. The proposed method enables simultaneously estimating the number of paths and the channel and interference parameters in MIMO-OFDM systems.
The structure of this paper is as follows.
Section 2 describes the preliminaries and basic concepts.
Section 3 presents the MIMO-OFDM system model. In
Section 4, we propose the RCP-APH algorithm, and
Section 5 shows the experimental analysis of the proposed algorithm. Finally,
Section 6 summarizes this paper.
2. Preliminaries and Notations
In this paper, we introduce the term “mode”, denoted by
n, to represent the order of a tensor, which is also referred to as the dimension in various disciplines. An
N-th order complex tensor is represented using calligraphic letters, as illustrated by
, for which the
entry is denoted by
,
,
. Furthermore, the unfolding of the tensor
with respect to the
n-th mode is represented by
in accordance with [
10].
Tensors are sliced along different dimensions to form a sub-tensor, which is also known as tensor slicing. Slices of a three-dimensional tensor are represented as matrices and are denoted by uppercase bold letters. A set of data along a specific dimension of the tensor is referred to as a fiber and is represented in vector form and denoted by lowercase bold letters. Therefore, in the context of a three-dimensional tensor, the relationship between a slice and a fiber is expressed as , where the row vector of the slice is represented as . Throughout this paper, We use the symbols ∗, T, H, , , , and to denote the conjugate, transposition, Hermitian transposition, matrix inversion, estimated value, and set of the same and Frobenius norm operations, respectively.
For multilinear mathematical operations, the complex inner product of vectors is defined by
. The Hadamard product is performed in an entrywise way between two items of the same size, such as
and
matrices, and the result is
. The Kronecker product of matrices
and
is a matrix of size
, denoted by
. The Khatri–Rao product of matrices,
and
, is
, which is defined by a columnwise Kronecker product. Without loss of generality, the Hadamard product and Khatri–Rao product of a set of matrices, except the
n-th matrix, can be simply denoted by
3. MIMO-OFDM System Model
We consider a typical traffic multipath scenario with the presence of interference as depicted in
Figure 1. The transmit and receive array consist of
and
antennas with equidistant spacing of
and
, respectively. The linear arrays at both ends form a MIMO system designed to estimate channel parameters, including the angle of departure (AoD)
, the angle of arrival (AoA)
, the delay
, and the complex amplitude
. It can be observed that the parameter set for the
l-th multipath is
, and the
-th path has angular differences in the transmission angle
and arrival angle
compared to the former path. In this paper, we use an OFDM signal with a bandwidth of
B and modulated by
K subcarriers for transmission. For convenience, the
K subcarriers with a spacing of
are all used to transmit periodic known training pilots. The periodicity of the signal ensures that the end of each OFDM symbol naturally connects with the beginning of the next symbol. We assume that the signal has been detected and synchronized, where the whole piece of hte signal symbol is recovered for channel estimation [
32]. And there are
L paths in the propagation channel. At the receiver, by utilizing the orthogonality of transmission symbols and stacking the channel matrices of
K frequency points, we can get the channel tensor
in the form of CP factorization as follows:
where “∘” indicates the outer product, factor matrices
are composed of the corresponding antenna array response,
,
, and
. At the same time, the phases of these are respectively represented by
and
, where
is the signal wavelength.
We assume additive interference, as seen in
Figure 2. The frequency power composition of the received tensor is composed as
, where
,
, and
represent a channel tensor with the channel information, channel interference, and the noise tensor, respectively, which all follow an independent and identical distribution (i.i.d.). It is essential to note that the yellow lightning inside the red circle in
Figure 1 indicates the FEI of the transmitter antenna, denoted as
. Similarly, the yellow lightning inside the red square represents the FEI of the receiver antenna, represented by
. Following that, the yellow lightning appearing on both sides of the road indicates CCI generated by other electronic devices and neighboring cells, referred to as
. Therefore, the interference tensor of FEI-R is made of a row fiber with a size of
, indicating this FEI-R from a particular receiving antenna to all transmitting sub-channels. In the same way, the interference tensor FEI-T is made of a column fiber with a size of
, indicating this FEI-T is from a particular transmitting antenna and affects all receiving sub-channels. In addition, the interference is assumed to occur at any possible spectrum location and to have an arbitrary amplitude and phase.
Section 5.4 describes the characteristics of the proposed algorithm for different interference bandwidths.
5. Simulation Analysis
In this section, a comprehensive simulation analysis was conducted to assess the performance of our algorithm. Each testing condition underwent 200 independent experiments and was accompanied by random noise and interference. Firstly, the rank estimation performance of the RCP-APH algorithm was compared with traditional information methods [
9] and traditional RCP [
30]. Secondly, under the assumption of accurate rank estimation, the parameter estimation performance of RCP-APH was compared with the performance of two mainstream tensor decompositions such as CP [
12] and Tucker [
10] as well as the RCP algorithm. Lastly, a detailed interference positioning performance comparison was conducted between the two variational methods.
According to the simulation conditions illustrated in
Figure 1, the configuration is set as follows. The transmitting array is located at (0 m, 0 m), while the receiving array is positioned at (30 m, 0 m). The actual number of multipaths is 2, with the line-of-sight (LOS) path being obstructed. Simultaneously, the actual parameters for the dual-path channel are set as
,
, and
ns. The signal bandwidth used is
MHz, with
. It is noted that the delay harmonic parameters are highly indistinguishable. Omnidirectional linear array antennas are equipped at both the transmitting and receiving ends and comprise
antennas with spacing of
. Under the above conditions, the uniqueness condition for CP decomposition is satisfied, as described in [
14]. Moreover, the complex gains follow a circularly symmetric Gaussian distribution
, where
c is the speed of light, the LOS distance
m, and the carrier frequency
is
GHz. Considering the maximum aperture of the receiving array
, we obtain
m, satisfying the far-field assumption and belonging to the Fraunhofer zone for channel testing. Lastly and most importantly, CCI and FEI are taken into account in the simulation. Thus, at the tensor lattice level, we introduce the parameter of the interference ratio
, which describes the proportion of interference terms in the received tensor.
5.1. Initialization and Termination Conditions
For the variational methods, after performing variance normalization on the received tensor, we should also assume the Gaussian distribution of
for the factor matrices, which allows for the initialization of factor matrices without prior information. The initial rank
is chosen as three times the number of true paths, i.e., six paths, satisfying the requirements of the weak upper bound, i.e.,
. In our model, the top-level hyperparameters, including
and
, are set to 1 ×
, resulting in a noninformative prior. Thus, the expectation of hyperparameters can be initialized by
,
, and
.
is simply set to
. For each category of interference,
is drawn from
, while
is set to
. The entire inference process of the model is summarized in Algorithm 1, where the posterior factors in Equation (
9) are sequentially updated from bottom to top, as depicted in
Figure 3. To enhance the speed of ARD for two variational algorithms in the presence of interference, redundant multipaths corresponding to
under the condition of
are eliminated. Additionally, for CP decomposition with known rank, the initial factor matrices are obtained using SVD operations. For the Tucker decomposition with known rank, we used the unitary ESPRIT algorithm with forward smoothing and HOSVD techniques.
Algorithm 1 The proposed RCP-APH |
Input: a third-order complete received tensor , the IPTH of , and the termination condition of ; Initialization: , , , , , , , the initial number of multipath is L, and is used to indicate the dimension in which variational operations are in progress;
1: while
do
2:
3: Increment variable by 1;
4: Increment variable by 1;
5: for to N do
6: Update the posterior using (11);
7: end for
8: Update the posterior using (12);
9: Update the posterior using (15);
10: Update the posterior using (13);
11: Update the posterior using (14);
12: Evaluate the lower bound using (17);
13: Reduce rank L by eliminating components of ;
14: Ensuring alternating execution between dimensions ;
15: end while
16: Calculate the channel parameters of .
|
5.2. Algorithm Performance
In the following, we choose the iteration number as
, the threshold of
, signal-to-noise ratio (SNR)
dB, and number of subcarriers
. To control the iterations, we set
. Interference power is set as five times the noise power, i.e.,
. The CCI items ratio is 0.5. We consider narrowband interference, i.e., it appears at a limited number of consecutive frequency sampling (CFSs). In this standard setup, three CFSs are occupied by FEI, while two CFSs are occupied by CCI, as shown in
Figure 2. The simulation results are depicted in
Figure 4.
In
Figure 4a, we primarily conduct a feasibility study on the proposed algorithm RCP-APH, where the interference power is calculated as the absolute power magnitude after variance normalization of the received tensor
. The blue lines (solid, dotted, and dashed lines) depict the variations of ELBO for three different interferences, while the red solid line represents the estimated rank, i.e., the number of paths. The maximum number of iterations is 166, and at the 57th iteration, the algorithm achieves the true rank as indicated by the red dotted line. It is noteworthy that at the 57th iteration the ELBO unexpectedly decreases slightly, which can be explained by the fact that the redundant paths are eliminated, resulting in the loss of information entropy due to the small value of
. However, though choosing a large value of
may solve the problem of “unexpectedly decreases” in ELBO values, a large
value would also increase the number of iterations and, in turn, the complexity. Therefore, the threshold
must be selected appropriately in order to balance between the complexity and accuracy.
Figure 4b mainly analyzes the estimation performance of the RCP and RCP-APH algorithms. Firstly, the two curves in the figure represent the interference power distributions estimated by the two algorithms. It can be observed that, compared to the distribution estimated by the RCP-APH algorithm, the interference power estimated by the RCP shows a concentrated distribution, making it difficult to distinguish the true interference. Secondly, the solid and dashed vertical lines in the figure represent the noise power, signal power, and noise precision estimated by the both algorithms. The RCP-APH estimates the SNR more accurately compared to the RCP, as evidenced by the difference between the estimated signal power and noise power. Finally, in selecting the interference threshold, we consider the three aforementioned estimation metrics. If the estimated noise power is used as the threshold, the RCP would be unable to capture interference information. Therefore, this paper uses noise precision as the threshold for extracting interference terms. This threshold has the advantage of not only extracting the high-power interference estimated by the RCP but also facilitating subsequent performance comparisons of both algorithms.
5.3. Channel Estimation Performance
Within this section, we evaluate the performance of rank estimation and channel parameter estimation. Firstly, a comparison of performance under different interference power ratios is conducted, as illustrated in
Figure 5. In the rank estimation of
Figure 5a, it is observed that information-theoretic methods, i.e., MDL and AIC, are ineffective in the presence of strong interference. This confirms the unsuitability of traditional information-theoretic approaches in the case of interference due to overfitting. As a result, channel parameter estimation algorithms based on matrix processing that strongly depend on rank estimation are significantly degraded. It is also evident that algorithms based on the variational model outperform information-theoretic methods.
Additionally, under low interference power (), the RCP-APH algorithm surpasses RCP for all interference ratios. Under high interference power (), RCP-APH only slightly lags behind RCP in the extremely unfavorable scenario of . This reveals the robustness of the proposed algorithm.
In
Figure 5b,c, it can be observed that the proposed RCP-APH outperforms other algorithms and reveals its robustness against changes in interference power as indicated by the black line. Furthermore, traditional RCP exhibits a certain degree of robustness. However, due to the lack of actual interference modeling, its performance is comparatively inferior, as indicated by the green lines. Moreover, for methods that require the number of multipaths to be known, such as the CP and Tucker decomposition methods, CP shows better performance due to its effective reduction of interference in single dimensions through multidimensional iterations. On the other hand, Tucker decomposition, due to HOSVD, encompasses interference information from multiple dimensions, resulting in the poorest performance.
5.4. Interference Estimation Performance
A comparison is conducted in terms of the performance of time–frequency position estimation for interference. In this context, “time” represents the large-scale sampling time, denoted as
t, not to be confused with the small-scale delay
. The term “frequency” denotes the position of frequency sampling points for interference. Since this paper processes all sub-channel snapshots at a single sampling time
t, the discussion is thereby simplified to identifying the interference position at frequency sampling points. Here, a simulation of the RCP-APH algorithm is performed under conditions of
and
dB. The received tensor is illustrated in
Figure 2, and the interference parameter estimation is depicted in
Figure 6.
Figure 6b depicts the unfolding form of
Figure 6a along the 1-mode pattern. The vertical axis has a size of
, and the horizontal axis has a size of
. The green color in
Figure 6b denotes the specific positions of the interference in the channel tensor. The red box visualizes an FEI-T that is composed of three vertical green lines, indicating that all receiving antennas are affected by interference from the same transmitter antenna for three CFSs. The blue box shows an FEI-R that occupies
units in the horizontal direction and that lasts for three CFSs. Further, the black block represents a CCI that spans over both the vertical and horizontal directions with two CFSs. In
Figure 6c, the RCP-APH accomplishes interference estimation for a single realization. It is evident in
Figure 6a that the lower-power regions, indicated by lighter colors, cannot be identified due to their power approaching the noise level. In
Figure 6d, which is the unfolding of
Figure 6c, interference items underestimated by the algorithm are represented in blue. Notably, the majority of interferences are accurately estimated, as indicated by the green color.
In
Figure 7, a single experimental comparison of two variational algorithms is conducted under different interference ratios. To better demonstrate the difference in performance, we use the same coordinate systems as in
Figure 4b and
Figure 6b. These plots in the first row depict the PDF of the interference power at different values of
. The second row represents the specific positions, where the true interference is in the unfolded form. The third and fourth rows show the estimation of interference positions for both the RCP-APH and RCP algorithms. Here, the performance metric for position estimation based on the binary classification model in [
33] is adopted. True positives (TPs) indicate accurately identified interference positions as depicted in green; false negatives (FNs) represent missed detections of interference positions, shown in blue; false positives (FPs) denote incorrectly identified interference positions as shown in red; and true negatives (TNs) signify correctly identified positions without interference, depicted in white. Seen from
Figure 7, the proposed RCP-APH algorithm can distinguish interference by an optimal threshold of noise precision. As evident in the subsequent three rows of the figure, both algorithms exhibit a decreasing trend in red and an increasing trend in blue with the rise of
. This corresponds to the actual mapping: transitioning from overestimation to underestimation. The distinct advantages of the proposed algorithm include: 1. There are rare occurrences of singular interference item estimation, enabling direct mapping between interference items and actual interference. 2. The proposed algorithm can discern the actual interference ratio, while the traditional RCP fails under
.
The statistical characteristics of the interference in the 200 independent experiments maintain the same conditions as in
Section 5.3. The subsequent analysis employs three binary performance parameters as follows: 1.
is utilized to depict the accuracy of estimations; 2.
signifies how many of the actual estimations are captured; 3.
provides a comprehensive balance between the first two metrics.
As illustrated in
Figure 8, the three performance metrics of the RCP algorithm increase with the growth of interference power. However, the performance gains associated with the interference power gradually diminish as
increases. A notable distinction between the RCP-APH and the RCP is that for
, there is a decline in recall, leading to a corresponding decrease in the F1 score. Importantly, the most crucial point is that across various interference powers, all three performance metrics of the proposed algorithm consistently surpass those of the RCP algorithm by a significant margin.
In
Figure 9, it is evident that as
decreases to 10 dB, all three performance metrics of both methods decline. The proposed algorithm exhibits slightly inferior performance compared to the RCP under low-
and high-
conditions. However, under high-
conditions and low-
with low-
conditions, RCP-APH demonstrates superior performance.
From
Figure 10a, it can be observed that with the increase in frequency points
K, there is a slight decrease in precision for RCP-APH, while recall and F1 score exhibit a monotonic increase, significantly outperforming the RCP. As shown in
Figure 10b, the three performance metrics remain nearly constant. However, due to incomplete observations, this outlier appears when the interference ratio reaches
. According to
Figure 10c, widening in the interference bandwidth results in an improvement in precision for both algorithms, while recall and F1 score decline. Importantly, the estimation performance of RCP-APH consistently outperforms that of RCP.
6. Conclusions
In this paper, we propose a robust RCP based on the APH to interference. With the strong correlation of the interference, the proposed algorithm is capable of simultaneous estimation of the rank, channel, and interference parameters. In comparison with the RCP, the proposed algorithm has the following features: 1. Increasing the model sparsity reduces the computational complexity. 2. The noise precision, from which interference items can be inferred, is reasonably and accurately estimated. 3. The estimated interference items show spatial correlation, enabling more accurate identification of the type of interference. 4. The prior hypothesis aligns more closely with real interference, enhancing the overall performance of communication systems. Through a simulation analysis, a comprehensive examination was conducted using different SNRs, interference powers, tensor spatial structures, proportions of interference items occupied by CCI, and lengths of the interference bandwidth. This analysis provides conclusive evidence of the superior estimation performance of rank and channel parameters using the RCP-APH algorithm. Finally, the accurate interference time–frequency position estimation performance of the proposed algorithm is validated.