A Bayesian Tensor Decomposition Method for Joint Estimation of Channel and Interference Parameters

Sun, Yuzhe; Wang, Wei; Wang, Yufan; He, Yuanfeng

doi:10.3390/s24165284


School of Information Engineering, Chang’an University, Xi’an 710064, China


Sensors 2024, 24(16), 5284; https://doi.org/10.3390/s24165284

Submission received: 28 June 2024 / Revised: 3 August 2024 / Accepted: 13 August 2024 / Published: 15 August 2024

(This article belongs to the Special Issue Integrated Localization and Communication: Advances and Challenges)


Abstract:

Bayesian tensor decomposition has been widely applied in channel parameter estimation, particularly in the presence of interference. However, existing Bayesian tensor decomposition methods do not consider the types of interference, making it difficult to accurately estimate the interference parameters. In this paper, we present a robust tensor variational method using a CANDECOMP/PARAFAC (CP)-based additive interference model for multiple-input multiple-output (MIMO) systems with orthogonal frequency division multiplexing (OFDM). The interference model, which is more realistic than traditional colored noise, accounts for co-channel interference (CCI) and front-end interference (FEI). In contrast to conventional algorithms that filter out interference, the proposed method jointly estimates the channel and interference parameters in the time–frequency domain. Simulation results validate the proposed method through the evidence lower bound (ELBO) and show that it outperforms traditional information-theoretic methods, tensor decomposition models, and the robust CP (RCP) model in terms of estimation accuracy. Further, the interference parameter estimation technique has profound implications for anti-interference applications and dynamic spectrum allocation.

Keywords:

robust Bayesian; automatic rank determination; interference estimation; tensor decomposition; channel estimation

1. Introduction

In the past decade, OFDM with MIMO has become a widely adopted wireless transmission technique due to its ability to achieve high data rates [1,2] and enhance diversity gain and system capacity, particularly in scenarios with dynamic, time-varying, and frequency-selective channels [3,4]. In the context of big data processing, tensor-decomposition-based channel estimation methods [5,6] have attracted a significant amount of attention in MIMO-OFDM systems due to their high efficiency in processing large complex datasets with improved estimation accuracy for high-dimensional problems.

Tensor-decomposition-based channel estimation algorithms generally consist of two steps: the first step estimates the rank, which corresponds to the number of multipath components, and the second step uses the obtained rank to estimate the multipath component parameters. Determining the tensor rank is known to be an NP-hard problem, as discussed in [7]. The predominant approaches to rank estimation are information-theoretic methods, the most popular of which are the Akaike information criterion (AIC) and the Bayesian information criterion (BIC), but these suffer from oversimplification and overfitting [8]. Further, the minimum description length (MDL) method [9] relies heavily on prior knowledge and exhibits sensitivity. Based on the obtained rank, tensor decomposition can be applied to estimate the channel parameters. Based on the Tucker model, M. Haardt [10] extended the high-order singular-value decomposition (HOSVD) to the estimation of channel parameters using the estimation of signal parameters via rotational invariance techniques (ESPRIT). Its application has been further expanded to 5G localization mapping, as described in [11]. Based on the CP model, an enhanced approach was proposed in [12] to address the downlink channel estimation problem in MIMO-OFDM systems with large antenna arrays. Further enhancement was conducted in [13], where a tensor-space-assisted estimation scheme was proposed by exploiting the Vandermonde structure of the factor matrix. In addition, based on the two aforementioned models, the sequential unfolding singular-value decomposition (SUSVD), also called the PARATREE method, was proposed in [14]; it uses a distinctive hierarchical tree structure to obtain orthogonal factor matrices.

Due to the presence of interference, the performance of traditional tensor-decomposition-based channel estimation methods is severely degraded, as the actual channel interference cannot be simply modeled as colored noise. The degradation of rank estimation performance significantly reduces the performance of channel parameter estimation. This is particularly true in MIMO systems, where interference exhibits high correlation. Interference primarily arises from imperfect designs in MIMO-OFDM systems and front-end circuits, leading to signal distortion, including harmonic distortion, intermodulation distortion, and phase distortion, as demonstrated by radio frequency FEI [15,16]. Additionally, frequency reuse and bandwidth congestion may also result in CCI [17,18,19]. When the same frequency bands are allocated to multiple transmitters, signal overlap and degradation usually occur. Traditional methods for addressing interference have relied on additional hardware and post-processing algorithms to filter out interference. However, emerging efficient spectrum allocation technologies based on spectrum sensing and interference identification have significant research implications [20,21].

Based on a non-Gaussian and non-stationary interference model, tensor decomposition can be employed to reduce the dimensionality of multi-dimensional data. Qibin Zhao proposed a tensor-based variational Bayesian approach in [22] for channel estimation by eliminating interference in the channel matrix and utilizing the spatial coupling relationships of partially received tensors. References [23,24] proposed to use threshold-based interference exclusion methods for low-rank approximation and channel parameter estimation with incomplete data. In [25], a multiplicative Gamma process (MGP) was used to reduce the complexity and enhance the speed of automatic rank determination (ARD). Similarly, the use of a generalized hyperbolic (GH) distribution can achieve more flexible sparsity awareness [26]. It is worth mentioning that variational methods with incomplete observations still suffer from information entropy loss with the presence of interference.

Therefore, it is essential to incorporate channel information, including the additive interference structure. Traditional methods like adaptive filtering [27], prior-knowledge-based MIMO systems [28], and radio frequency (RF) front-end feedback networks [29] focus on removing rather than estimating the interference. Moreover, these conventional approaches have drawbacks of high complexity and costs. An additive RCP [30] was proposed by inferring interference terms for each pixel in image processing to enhance the precision of image processing, which was expanded to channel parameter estimation in [31]. However, so far, the actual types of interference in the tensor have not yet been thoroughly considered in the model, which makes accurate interference estimation difficult.

In this paper, we propose an RCP based on alternate prior hypothesis (APH) for channel estimation in MIMO-OFDM systems, hereinafter referred to as RCP-APH. We first separate the interference tensor space and then construct spatial correlations through actual interferences, either FEI or CCI. Consequently, we perform variational iterations in the separated tensor space by alternately modifying the interference prior hypotheses conditions. The main contributions of this paper are summarized as follows:

We adopt an additive interference model for which the parameters are jointly estimated with channel parameters rather than mitigating the interference. As such, it has profound implications in anti-interference applications and dynamic spectrum allocation.
We propose to jointly estimate the channel and interference parameters without increasing the complexity and degrading the estimation performance. The proposed method enables simultaneously estimating the number of paths and the channel and interference parameters in MIMO-OFDM systems.

The structure of this paper is as follows. Section 2 describes the preliminaries and basic concepts. Section 3 presents the MIMO-OFDM system model. In Section 4, we propose the RCP-APH algorithm, and Section 5 shows the experimental analysis of the proposed algorithm. Finally, Section 6 summarizes this paper.

2. Preliminaries and Notations

In this paper, we introduce the term “mode”, denoted by n, to represent the order of a tensor, which is also referred to as the dimension in various disciplines. An N-th order complex tensor is represented using calligraphic letters, as illustrated by

X \in C^{I_{1} \times I_{2} \times \dots \times I_{N}}

, for which the

(i_{1}, i_{2}, \dots, i_{N})

entry is denoted by

X_{i_{1}, i_{2}, \dots, i_{N}}

,

i_{n} = 1, 2, \dots, I_{n}

,

n = 1, 2, \dots, N

. Furthermore, the unfolding of the tensor

X

with respect to the n-th mode is represented by

X_{(n)}

in accordance with [10].

Tensors are sliced along different dimensions to form a sub-tensor, which is also known as tensor slicing. Slices of a three-dimensional tensor are represented as matrices and are denoted by uppercase bold letters. A set of data along a specific dimension of the tensor is referred to as a fiber and is represented in vector form and denoted by lowercase bold letters. Therefore, in the context of a three-dimensional tensor, the relationship between a slice and a fiber is expressed as

X_{:, :, i_{3}} = X = {[x_{1}, \dots, x_{i_{1}}, \dots, x_{I_{1}}]}^{T}

, where the row vector of the slice is represented as

X_{i_{1}, :, i_{3}} = x_{i_{1}} = {[x_{i_{1}, 1}, \dots, x_{i_{1}, i_{2}}, \dots, x_{i_{1}, I_{2}}]}^{T}

. Throughout this paper, we use the symbols ∗, T, H,

- 1

,

\tilde{▪}

,

\{▪\}

, and

{∥▪∥}_{F}

to denote the conjugate, transposition, Hermitian transposition, matrix inversion, estimated value, set, and Frobenius norm operations, respectively.

For multilinear mathematical operations, the complex inner product of vectors is defined by

〈 x_{i_{1}}^{(1)}, x_{i_{2}}^{(2)}, \dots, x_{i_{N}}^{(N)} 〉 = \sum_{r} \prod_{n} x_{i_{n}, r}^{{(n)}^{*}} = x_{i_{n}}^{(n) H} (\underset{k \neq n}{⊛} x_{i_{k}}^{(k) *})

. The Hadamard product is performed in an entrywise way between two items of the same size, such as

A \in C^{I \times J}

and

B \in C^{I \times J}

matrices, and the result is

A ⊛ B \in C^{I \times J}

. The Kronecker product of matrices

A \in C^{I \times J}

and

B \in C^{K \times L}

is a matrix of size

I K \times J L

, denoted by

A \otimes B

. The Khatri–Rao product of matrices,

A \in C^{I \times K}

and

B \in C^{J \times K}

, is

A ⊙ B \in C^{I J \times K}

, which is defined by a columnwise Kronecker product. Without loss of generality, the Hadamard product and Khatri–Rao product of a set of matrices, except the n-th matrix, can be simply denoted by

\begin{matrix} \underset{k \neq n}{⊛} A^{(k)} = A^{(N)} ⊛ \dots ⊛ A^{(n + 1)} ⊛ A^{(n - 1)} \dots ⊛ A^{(1)}, \\ \underset{k \neq n}{⊙} A^{(k)} = A^{(N)} ⊙ \dots ⊙ A^{(n + 1)} ⊙ A^{(n - 1)} \dots ⊙ A^{(1)} . \end{matrix}

(1)
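The three matrix products defined above can be checked numerically. The following is a minimal NumPy sketch, not part of the original paper; the helper name `khatri_rao` and the matrix sizes are assumptions chosen for illustration.

```python
import numpy as np

def khatri_rao(A, B):
    """Column-wise Kronecker product of A (I x K) and B (J x K), giving (I*J x K)."""
    if A.shape[1] != B.shape[1]:
        raise ValueError("A and B must have the same number of columns")
    # Result row index runs over (i, j) with j varying fastest,
    # which matches np.kron applied to each pair of columns.
    return np.einsum("ik,jk->ijk", A, B).reshape(-1, A.shape[1])

rng = np.random.default_rng(0)
A = rng.standard_normal((3, 2)) + 1j * rng.standard_normal((3, 2))
B = rng.standard_normal((4, 2)) + 1j * rng.standard_normal((4, 2))
C = rng.standard_normal((3, 2)) + 1j * rng.standard_normal((3, 2))

hadamard = A * C            # entrywise product, same size as A and C
kronecker = np.kron(A, B)   # size (3*4) x (2*2)
kr = khatri_rao(A, B)       # size (3*4) x 2
```

A property that the later posterior updates rely on is that the Gram matrix of a Khatri–Rao product equals the Hadamard product of the individual Gram matrices, i.e., `kr.conj().T @ kr` equals `(A.conj().T @ A) * (B.conj().T @ B)`.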

3. MIMO-OFDM System Model

We consider a typical traffic multipath scenario in the presence of interference, as depicted in Figure 1. The transmit and receive arrays consist of

N_{B S - T}

and

N_{B S - R}

antennas with equidistant spacing of

d_{t}

and

d_{r}

, respectively. The linear arrays at both ends form a MIMO system designed to estimate channel parameters, including the angle of departure (AoD)

θ

, the angle of arrival (AoA)

ϕ

, the delay

τ

, and the complex amplitude

α

. It can be observed that the parameter set for the l-th multipath is

\{θ_{l}, ϕ_{l}, τ_{l}, α_{l}\}

, and the

(l + 1)

-th path has angular differences in the transmission angle

Δ θ

and arrival angle

Δ ϕ

compared to the former path. In this paper, we use an OFDM signal with a bandwidth of B and modulated by K subcarriers for transmission. For convenience, the K subcarriers with a spacing of

B / K

are all used to transmit periodic known training pilots. The periodicity of the signal ensures that the end of each OFDM symbol naturally connects with the beginning of the next symbol. We assume that the signal has been detected and synchronized, so that the whole signal symbol is recovered for channel estimation [32]. There are L paths in the propagation channel. At the receiver, by utilizing the orthogonality of transmission symbols and stacking the channel matrices of K frequency points, we obtain the channel tensor

H \in C^{N_{B S - R} \times N_{B S - T} \times K}

in the form of CP factorization as follows:

\begin{matrix} \begin{matrix} H = \sum_{l = 1}^{L} a_{BS - R} (ϕ_{l}) \circ a_{BS - T} (θ_{l}) \circ (α_{l} g (τ_{l})) = [[A^{(1)}, A^{(2)}, A^{(3)}]] \end{matrix}, \\ A^{(1)} ≜ [a_{BS - R} (ϕ_{1}), a_{BS - R} (ϕ_{2}), \dots, a_{BS - R} (ϕ_{L})], \\ A^{(2)} ≜ [a_{BS - T} (θ_{1}), a_{BS - T} (θ_{2}), \dots, a_{BS - T} (θ_{L})], \\ A^{(3)} ≜ [α_{1} g (τ_{1}), α_{2} g (τ_{2}), \dots, α_{L} g (τ_{L})], \end{matrix}

(2)

where “∘” indicates the outer product, factor matrices

\{A^{(n)}\}_{n = 1, 2, 3}

are composed of the corresponding antenna array response,

g (τ_{l}) ≜ {[exp (- j 2 π τ_{l} B (1 / K)), exp (- j 2 π τ_{l} B (2 / K)), \dots, exp (- j 2 π τ_{l} B)]}^{T}

,

a_{BS - R} (ϕ_{l}) = {[1, e^{j μ (ϕ_{l})}, \dots, e^{j (N_{B S - R} - 1) μ (ϕ_{l})}]}^{T}

, and

a_{BS - T} (θ_{l}) = {[1, e^{j μ (θ_{l})}, \dots, e^{j (N_{B S - T} - 1) μ (θ_{l})}]}^{T}

. At the same time, the phases of these are respectively represented by

μ (ϕ_{l}) = (2 π / λ_{c}) d_{r} sin ϕ_{l}

and

μ (θ_{l}) = (2 π / λ_{c}) d_{t} sin θ_{l}

, where

λ_{c}

is the signal wavelength.
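The CP construction in Equation (2) can be sketched in a few lines of NumPy. This is an illustrative sketch rather than the authors' code: the function names, the number of subcarriers K = 64, and the complex amplitudes are assumptions, while the angles, delays, bandwidth, and array sizes echo the simulation settings of Section 5.

```python
import numpy as np

def steering(n_ant, angle_rad, spacing_over_lambda=0.5):
    """ULA response [1, e^{j mu}, ..., e^{j (n_ant-1) mu}], mu = 2*pi*(d/lambda)*sin(angle)."""
    mu = 2 * np.pi * spacing_over_lambda * np.sin(angle_rad)
    return np.exp(1j * mu * np.arange(n_ant))

def delay_response(K, tau, B):
    """g(tau) with entries exp(-j 2 pi tau B k / K) for k = 1..K."""
    k = np.arange(1, K + 1)
    return np.exp(-2j * np.pi * tau * B * k / K)

def channel_tensor(phi, theta, tau, alpha, n_r, n_t, K, B):
    """Build H = [[A1, A2, A3]] as a sum of L rank-one outer products."""
    A1 = np.stack([steering(n_r, p) for p in phi], axis=1)      # N_R x L
    A2 = np.stack([steering(n_t, t) for t in theta], axis=1)    # N_T x L
    A3 = np.stack([a * delay_response(K, t, B)
                   for a, t in zip(alpha, tau)], axis=1)        # K x L
    H = np.einsum("il,jl,kl->ijk", A1, A2, A3)
    return H, (A1, A2, A3)

phi = np.deg2rad([45.0, 60.0])           # AoA of each path
theta = np.deg2rad([45.0, 30.0])         # AoD of each path
tau = np.array([142.13e-9, 136.60e-9])   # delays in seconds
alpha = np.array([1.0 + 0.0j, 0.5j])     # illustrative amplitudes (assumed)
H, (A1, A2, A3) = channel_tensor(phi, theta, tau, alpha,
                                 n_r=5, n_t=5, K=64, B=100e6)
```

Because every steering vector starts with 1, the fiber `H[0, 0, :]` is simply the sum of the columns of `A3`, which gives a quick sanity check on the construction.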

We assume additive interference, as seen in Figure 2. The received tensor is composed as

Y = H + S + W

, where

H

,

S

, and

W

represent the channel tensor carrying the channel information, the channel interference tensor, and the noise tensor, respectively, each of which is assumed to be independent and identically distributed (i.i.d.). It is essential to note that the yellow lightning inside the red circle in Figure 1 indicates the FEI of the transmitter antenna, denoted as

S^{F E I - T}

. Similarly, the yellow lightning inside the red square represents the FEI of the receiver antenna, represented by

S^{F E I - R}

. Following that, the yellow lightning appearing on both sides of the road indicates CCI generated by other electronic devices and neighboring cells, referred to as

S^{C C I}

. Therefore, the interference tensor of FEI-R is made of a row fiber with a size of

1 \times N_{B S - T}

, indicating that this FEI-R originates at a particular receiving antenna and affects all transmitting sub-channels. In the same way, the interference tensor of FEI-T is made of a column fiber with a size of

N_{B S - R} \times 1

, indicating that this FEI-T originates at a particular transmitting antenna and affects all receiving sub-channels. In addition, the interference is assumed to occur at any possible spectrum location and to have an arbitrary amplitude and phase. Section 5.4 describes the characteristics of the proposed algorithm for different interference bandwidths.
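The spatial structure of the three interference types can be illustrated with NumPy broadcasting. This is a hedged sketch under assumed values: the tensor sizes, the interference locations and amplitudes, the noise level, and the zero placeholder for the channel tensor are all hypothetical. Each interference is stored as one value per affected index set and then copied over the dimensions it spans.

```python
import numpy as np

rng = np.random.default_rng(1)
I1, I2, I3 = 5, 5, 16   # receive antennas, transmit antennas, subcarriers (assumed)

# One complex value per index set; zero means "no interference here".
s_cci = np.zeros(I3, dtype=complex)          # indexed by subcarrier i3
s_fei_r = np.zeros((I1, I3), dtype=complex)  # indexed by (i1, i3)
s_fei_t = np.zeros((I2, I3), dtype=complex)  # indexed by (i2, i3)
s_cci[3] = 2.0 + 1.0j     # CCI hits the whole slice at subcarrier 3
s_fei_r[1, 7] = -1.5j     # FEI-R at receive antenna 1, subcarrier 7
s_fei_t[4, 10] = 0.8      # FEI-T at transmit antenna 4, subcarrier 10

# Broadcast each value over the dimensions it affects.
S_cci = np.broadcast_to(s_cci[None, None, :], (I1, I2, I3))
S_fei_r = np.broadcast_to(s_fei_r[:, None, :], (I1, I2, I3))
S_fei_t = np.broadcast_to(s_fei_t[None, :, :], (I1, I2, I3))
S = S_cci + S_fei_r + S_fei_t

H = np.zeros((I1, I2, I3), dtype=complex)    # placeholder channel tensor
W = 0.01 * (rng.standard_normal((I1, I2, I3))
            + 1j * rng.standard_normal((I1, I2, I3))) / np.sqrt(2)
Y = H + S + W                                # additive model Y = H + S + W
```

In this toy case the CCI occupies a full 5 × 5 slice, while each FEI occupies a single fiber of five entries, so the interference tensor has 35 nonzero entries in total.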

4. Bayesian Tensor Factorization

Diverging from the traditional RCP algorithm [30], this paper involves a strong correlation assumption about the prior information about interference. Spatially, this correlation is established on the entire slice and on the fibers in both the horizontal and vertical directions. Under the condition of maximizing the evidence, different interferences are alternately estimated.

4.1. Alternate Prior Hypotheses

To alleviate the complexity in the description, a third-order CP generative model is employed. As previously mentioned, the full-set representation of subscripts is denoted as

Ω = {i_{1}, i_{2}, i_{3}}

, where

i_{n} \in [1, \dots, I_{n}]

and

n = 1, 2, 3

. It corresponds to the actual configuration, as

I_{1} = N_{B S - R}, I_{2} = N_{B S - T}

and

I_{3} = K

. In order to achieve RCP within a probabilistic framework, an observation model is introduced:

\begin{matrix} p (Y_{Ω} ∣ {\{A^{(n)}\}}_{n = 1}^{3}, S_{Ω}, ν) = \prod_{i_{1}, i_{2}, i_{3}} C N (Y_{i_{1}, i_{2}, i_{3}} ∣ 〈a_{i_{1}}^{(1)}, a_{i_{2}}^{(2)}, a_{i_{3}}^{(3)}〉 + S_{i_{1}, i_{2}, i_{3}}, ν^{- 1}), \end{matrix}

(3)

where

ν

denotes the noise precision,

S_{Ω}

represents all items of interference, and one interference term is denoted as

S_{i_{1}, i_{2}, i_{3}}

. Each vector

a_{i_{n}}^{(n)}

influences a sub-tensor with index

i_{n}

under mode-n. The generalized inner product

〈 a_{i_{1}}^{(1)}, a_{i_{2}}^{(2)}, a_{i_{3}}^{(3)} 〉

of the three latent vectors enables us to capture multilinear interactions reflecting the intrinsic structural characteristics of the tensor data. But this “inner product” complicates the learning process of the model. Therefore, an attempt is made to minimize the dimensionality of the latent space by inducing sparsity in the columns of the factor matrices:

\begin{matrix} p (A^{(n)} ∣ λ) = \prod_{i_{n}} C N (a_{i_{n}}^{(n)}| 0, Λ^{- 1}), \\ p (λ) = \prod_{l = 1}^{L} Ga (λ_{l}| c_{0}, d_{0}), \end{matrix}

(4)

where

Λ = diag (λ)

represents the inverse covariance matrix, and

λ = [λ_{1}, λ_{2}, \dots, λ_{L}]

is shared by the factor matrices across all modes. Because the channel multipath components are uncorrelated, the hyperpriors for

λ

assume an i.i.d. hypothesis. The Gamma distribution is denoted as

Ga (x |m, n) = n^{m} x^{m - 1} e^{- n x} / Γ (m)

, where

Γ (m)

is the Gamma function. Furthermore, considering the zero-mean complex Gaussian distribution characteristics of the actual channel’s amplitudes, we can draw a conclusion for the ARD: that is, when the precision

λ_{l}

of a certain path is sufficiently large, the corresponding l-th column of the latent factor matrix tends to zero, thereby removing the corresponding redundant path.

For the convenience of subsequent discussions, a category of interferences is denoted as

S^{T p}

, as previously discussed in Figure 2. An individual interference from this category is represented as

S_{I n d (T p)}^{T p}

, and its specific position in the time–frequency domain is determined by the subscript index

I n d (T p)

, including

I n d (C C I) = \{i_{3}\}

,

I n d (F E I - R) = \{i_{1}, i_{3}\}

, and

I n d (F E I - T) = \{i_{2}, i_{3}\}

. The complete set of interference terms for a certain category is represented as

S_{Ω}^{T p}

, with an individual interference term denoted as

S_{i_{1}, i_{2}, i_{3}}^{T p}

. Thus, taking the condition of mutual independence between different interferences into account, the following interference prior assumptions are made:

\begin{matrix} p (S^{T p}| γ^{T p}) = \prod_{I n d (T p)} C N (S_{I n d (T p)}^{T p}| 0, 1 / γ_{I n d (T p)}^{T p}), \\ p (γ^{T p}) = \prod_{I n d (T p)} Ga (γ_{I n d (T p)}^{T p}| a_{0}^{T p}, b_{0}^{T p}), \end{matrix}

(5)

where the different types of

T p

correspond to the different hyperparameters

γ^{T p}

. Moreover, according to the different resolution priors described in Section 3, the relationships between the interference and interference terms are given by

1_{I_{1} \times I_{2}} S_{i_{3}}^{C C I} = S_{:, :, i_{3}}^{C C I}

,

1_{I_{2}}^{T} S_{i_{1}, i_{3}}^{F E I - R} = S_{i_{1}, :, i_{3}}^{F E I - R}

, and

1_{I_{1}} S_{i_{2}, i_{3}}^{F E I - T} = S_{:, i_{2}, i_{3}}^{F E I - T}

. This also indicates that the interference cannot be simply considered to be a Gaussian distribution, nor can it be simply modeled as colored noise. Finally, a hyperprior is placed on the noise precision of the environment:

p (ν) = Ga (ν| e_{0}, f_{0}) .

(6)

All the prior assumptions above are collected in the probabilistic graphical model illustrated in Figure 3. In this figure, white circles and squares represent hidden random variables and hyperparameters, respectively, while yellow circles denote the observed tensor. The blue region shows that the variational method is implemented through alternating iterations between two priors, CCI and FEI.

4.2. Variational Bayesian Inference

For simplicity, all factor matrices and hyperparameters are integrated into the parameter set

Θ = \{A^{(1)}, A^{(2)}, A^{(3)}, λ, S^{Ξ}, γ^{Ξ}, ν\}

, and

Ξ = \{C C I, F E I - R, F E I - T\}

. Consequently, with different types of interference, the likelihood function is obtained as follows:

\begin{matrix} p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}, Θ) = & p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p} ∣ {\{A^{(n)}\}}_{n = 1}^{3}, S^{T p}, ν) \prod_{n = 1}^{3} p (A^{(n)} ∣ λ) \\ & \cdot p (S^{T p} ∣ γ^{T p}) p (λ) p (γ^{T p}) p (ν), \end{matrix}

(7)

where the symbol “∖” denotes the complement of the set—for example, when

T p = C C I

,

\ T p = \{F E I - R, F E I - T\}

—and

{\tilde{S}}^{\ T p}

represents the estimated values of the remaining two types of interferences, such as

{\tilde{S}}^{\ C C I} = {\tilde{S}}^{F E I - R} + {\tilde{S}}^{F E I - T}

. The variational approach involves approximating the posterior distribution

p (Θ ∣ Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p})

with the distribution of

q (Θ)

, and the relationship is as follows:

\begin{matrix} ln p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}) = KL (q (Θ) ∥ p (Θ | Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p})) + L (q, T p) . \end{matrix}

(8)

In the above equation, the evidence of

p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p})

is constant with respect to q, so maximizing the ELBO

L (q, T p)

necessarily minimizes the Kullback–Leibler (KL) divergence, thereby completing the inference for the posterior distribution. In this process, given the uncorrelated characteristics of the actual parameters, the mean-field approximation is employed as follows:

q (Θ) = \prod_{n = 1}^{3} q (A^{(n)}) q (S^{T p}) q (λ) q (γ^{T p}) q (ν) .

(9)

Finally, by computing the expectation of the log-likelihood function

ln p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}, Θ)

under the posterior distribution

q (Θ \ Θ_{j})

of the remaining parameters, precise posterior inference for this parameter

Θ_{j}

is obtained as:

ln q_{j} (Θ_{j}) = E_{q_{(Θ \ Θ_{j})}} [ln p (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}, Θ)] + const .

(10)

4.2.1. Posterior Distribution of Factor Matrices $A^{(n)}$

By using Equation (10), after taking the posterior expectation over all unknown latent variables and hyperparameters, except the n-mode matrix

A^{(n)}

, the distribution follows a complex Gaussian distribution

q_{n} (A^{(n)}) = C N (A^{(n)}| {\tilde{A}}^{(n)}, V^{(n)})

, for which the mean and covariance are

\begin{matrix} {\tilde{A}}^{(n)} & = ({[Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}]}_{(n)} - E_{q} [{(S_{Ω}^{T p})}_{(n)}]) \cdot E_{q} [A^{(\ n) *}] V^{(n)} E_{q} [ν], \\ V^{(n)} & = {\{E_{q} {[A^{(\ n) H} A^{(\ n)}]}^{T} \cdot E_{q} [ν] + E_{q} [Λ]\}}^{- 1}, \end{matrix}

(11)

where

E_{q} [\cdot]

represents the posterior expectation,

A^{(\ n)} = \underset{k \neq n}{⊙} A^{(k)}

, and

E_{q} [Λ] = diag (E_{q} [λ]) = \tilde{Λ}

. It should be noted that the derivation in this paper utilizes the uncorrelated characteristics between factor matrices from different modes as well as the uncorrelated characteristics among different row vectors of the same factor matrix. These assumptions are consistent with the prior assumptions on the actual channel tensor.
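A simplified version of the update in Equation (11) can be sketched for mode 1 of a third-order tensor. This is an illustration under stated assumptions, not the authors' implementation: the other two factors are treated as point estimates (the covariance contributions to the expectation of the Gram matrix are dropped), the input `Y` is assumed to be the received tensor with all interference estimates already subtracted, and the function name `update_mode1` is hypothetical.

```python
import numpy as np

def update_mode1(Y, A2, A3, nu, lam):
    """One Eq. (11)-style update of the mode-1 factor from point estimates A2, A3."""
    # Gram matrix of the Khatri-Rao product of the other modes,
    # computed as a Hadamard product of the individual Gram matrices.
    G = (A2.conj().T @ A2) * (A3.conj().T @ A3)
    # Posterior covariance shared by all rows of the mode-1 factor.
    V = np.linalg.inv(nu * G.T + np.diag(lam))
    # Mode-1 unfolding times the conjugated Khatri-Rao product,
    # written with einsum so that no explicit unfolding order is needed.
    M = np.einsum("ijk,jl,kl->il", Y, A2.conj(), A3.conj())
    return nu * M @ V, V

rng = np.random.default_rng(5)
A1 = rng.standard_normal((5, 2)) + 1j * rng.standard_normal((5, 2))
A2 = rng.standard_normal((5, 2)) + 1j * rng.standard_normal((5, 2))
A3 = rng.standard_normal((8, 2)) + 1j * rng.standard_normal((8, 2))
Y = np.einsum("il,jl,kl->ijk", A1, A2, A3)   # noise-free rank-2 tensor

A1_hat, V1 = update_mode1(Y, A2, A3, nu=1e8, lam=np.ones(2))
```

With noise-free data, exact values of the other two factors, and a very large noise precision, the prior term becomes negligible and the update returns the true mode-1 factor, which is a convenient correctness check.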

4.2.2. Posterior Distribution of Hyperparameters $λ$

As assumed in Equation (4), we have:

q_{λ} (λ) = \prod_{l = 1}^{L} Ga (λ_{l}| c_{M}^{l}, d_{M}^{l})

, where

c_{M}^{l}, d_{M}^{l}

denote the posterior parameters learned from observations and can be updated by:

\begin{matrix} c_{M} = (c_{0} + \sum_{n = 1}^{3} I_{n}) 1_{L}, \\ d_{M} = d_{0} 1_{L} + \sum_{n = 1}^{3} \{d i a g ({\tilde{A}}^{(n) H} {\tilde{A}}^{(n)} + I_{n} V^{(n)})\}, \end{matrix}

(12)

where vector

c_{M} = [c_{M}^{1}, c_{M}^{2}, \dots, c_{M}^{L}]

, and

d_{M} = [d_{M}^{1}, d_{M}^{2}, \dots, d_{M}^{L}]

. From the mean of the Gamma distribution, the posterior expectation is

E_{q} [λ] = [c_{M}^{1} / d_{M}^{1}, \dots, c_{M}^{L} / d_{M}^{L}]

, which directly determines whether a certain path should be eliminated. Because the mapping relationship between interference disrupts the Gaussian prior distribution at the lattice level, it reduces the algorithm’s generalization capability and decreases the speed of calculating the variational expectation. Therefore, in this paper, we set a multiplicative threshold

η

to perform principal component analysis (PCA) according to the maximum and minimum values in

E_{q} [λ]

. This method significantly improves the speed of convergence, as shown in Section 5.2.
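The update in Equation (12) and a threshold-based pruning step can be sketched as follows. The exact pruning rule is not spelled out in the text above, so the rule used here (keep a column if its expected precision is at most η times the smallest expected precision) is an assumption, as are the function names, the hyperparameter values, and the toy factors.

```python
import numpy as np

def update_lambda(factors, covs, c0=1e-6, d0=1e-6):
    """Eq. (12): Gamma posterior parameters and the posterior mean of lambda."""
    L = factors[0].shape[1]
    c_M = (c0 + sum(A.shape[0] for A in factors)) * np.ones(L)
    d_M = d0 * np.ones(L)
    for A, V in zip(factors, covs):
        d_M += np.real(np.diag(A.conj().T @ A + A.shape[0] * V))
    return c_M, d_M, c_M / d_M

def prune(factors, lam_mean, eta=100.0):
    """Hypothetical rule: keep columns with precision <= eta * min precision."""
    keep = lam_mean <= eta * lam_mean.min()
    return [A[:, keep] for A in factors], keep

rng = np.random.default_rng(2)
dims, L = (5, 5, 8), 3
factors = []
for I in dims:
    A = rng.standard_normal((I, L)) + 1j * rng.standard_normal((I, L))
    A[:, 2] *= 1e-4                      # third column is nearly empty
    factors.append(A)
covs = [1e-6 * np.eye(L) for _ in dims]  # small posterior covariances (assumed)

c_M, d_M, lam_mean = update_lambda(factors, covs)
pruned, keep = prune(factors, lam_mean, eta=100.0)
```

In this toy example the third column carries almost no energy, so its expected precision is several orders of magnitude larger than that of the other two columns and it is removed, leaving a rank-2 model.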

4.2.3. Posterior Distribution of Hyperparameters $S$

In practical situations, different interferences are uncorrelated, and interferences of the same type at different frequency points are also uncorrelated. Therefore, the posterior distribution factorizes as

q (S^{T p}) = \prod_{I n d (T p)} C N (S_{I n d (T p)}^{T p} | {\tilde{S}}_{I n d (T p)}^{T p}, {(σ_{I n d (T p)}^{T p})}^{2})

, with mean and variance given by:

\begin{matrix} {\tilde{S}}_{I n d (T p)}^{T p} = \frac{{(σ_{I n d (T p)}^{T p})}^{2} E_{q} [ν]}{C s t [\ I n d (T p)]} \cdot \sum_{\ I n d (T p)} \{Y_{i_{1}, i_{2}, i_{3}} - {\tilde{S}}_{i_{1}, i_{2}, i_{3}}^{\ T p} - E_{q} [〈a_{i_{1}}^{(1)}, a_{i_{2}}^{(2)}, a_{i_{3}}^{(3)}〉]\}, \\ {(σ_{I n d (T p)}^{T p})}^{2} = {(E_{q} [ν] + E_{q} [γ_{I n d (T p)}^{T p}])}^{- 1} . \end{matrix}

(13)

In the above equation, when

T p

is equal to

C C I

, the remaining index set is

\ I n d (C C I) = \{i_{1}, i_{2}\}

.

C s t [\cdot]

denotes the product of the maximum indices, resulting in

C s t [i_{1}, i_{2}] = I_{1} I_{2}

. It is evident that in the posterior estimation of each interference type, the global information

E_{q} [〈 a_{i_{1}}^{(1)}, a_{i_{2}}^{(2)}, a_{i_{3}}^{(3)} 〉]

of the channel must be utilized, while the interference variance is determined by environmental Gaussian noise

E_{q} [ν]

. Therefore, accurately estimating the noise is a crucial prerequisite for precisely assessing interference, as illustrated in Figure 4b.
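For the CCI case, Equation (13) amounts to averaging the residual over each frequency slice and shrinking it by a factor that depends on the noise and interference precisions. The following sketch follows the formula as written above; the function name `update_cci`, the tensor sizes, and the precision values are assumptions for illustration.

```python
import numpy as np

def update_cci(Y_res, X_hat, nu, gamma):
    """Eq. (13) for Tp = CCI: one interference value per subcarrier i3.

    Y_res : received tensor with the other interference estimates removed
    X_hat : expected CP reconstruction of the channel (same shape as Y_res)
    nu    : expected noise precision (scalar)
    gamma : expected CCI precisions, shape (I3,)
    """
    I1, I2, _ = Y_res.shape
    var = 1.0 / (nu + gamma)                      # second line of Eq. (13)
    resid_sum = (Y_res - X_hat).sum(axis=(0, 1))  # sum over the indices {i1, i2}
    mean = var * nu / (I1 * I2) * resid_sum       # Cst[i1, i2] = I1 * I2
    return mean, var

rng = np.random.default_rng(4)
I1, I2, I3 = 5, 5, 16
X_hat = rng.standard_normal((I1, I2, I3)) + 1j * rng.standard_normal((I1, I2, I3))
s_true = np.zeros(I3, dtype=complex)
s_true[3] = 2.0 + 1.0j                            # CCI at subcarrier 3 only
Y_res = X_hat + s_true[None, None, :]

s_hat, s_var = update_cci(Y_res, X_hat, nu=1e3, gamma=np.full(I3, 1e-6))
s_off, _ = update_cci(Y_res, X_hat, nu=1e3, gamma=np.full(I3, 1e9))
```

When the interference precision is small relative to the noise precision, the estimate approaches the slice-averaged residual; when it is very large, the estimate is shrunk toward zero, which is how the precision encodes the absence of interference.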

4.2.4. Posterior Distribution of Hyperparameters $γ$

In the prior assumption of Equation (5), the estimation of interference precision directly dictates the presence of interference, so we need to set an appropriate interference power threshold (IPTH), as discussed in Section 5.2. The posterior distribution,

q (γ^{T p}) = \prod_{I n d (T p)} G a (γ_{I n d (T p)}^{T p}| a_{M}^{γ_{I n d (T p)}}, b_{M}^{γ_{I n d (T p)}})

, is represented by the following equation:

\begin{matrix} a_{M}^{γ_{I n d (T p)}} = C s t (\ I n d (T p)) + a_{0}^{T p}, \\ b_{M}^{γ_{I n d (T p)}} = b_{0}^{T p} + \sum_{\ I n d (T p)} E_{q} [{|S_{i_{1}, i_{2}, i_{3}}^{T p}|}^{2}], \end{matrix}

(14)

where the prior hyperparameter

b_{0}^{T p}

is assumed for a specific type of interference, and the posterior hyperparameter

b_{M}^{γ_{I n d (T p)}}

varies with different indices

I n d (T p)

.

4.2.5. Posterior Distribution of Hyperparameters $ν$

The inference of noise precision is achieved through three factor matrices and observed data. Its posterior follows a Gamma distribution

q (ν) = Ga (ν | e_{M}, f_{M})

, determined by the following:

\begin{matrix} e_{M} & = \prod_{n = 1}^{3} I_{n} + e_{0}, \\ f_{M} & = f_{0} + E_{q (Θ \ ν)} [{∥Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p} - [[A^{(1)}, A^{(2)}, A^{(3)}]] - S_{Ω}^{T p}∥}_{F}^{2}], \end{matrix}

(15)

where the expectation is expanded in Equation (16), in which the

i_{n}

-th row of

F^{(n)}

is denoted by

f_{i_{n}}^{(n)} = v e c [E_{q} {(a_{i_{n}}^{(n)} a_{i_{n}}^{(n) H})}^{T}]

.

\begin{matrix} E_{q (Θ \ ν)} [{∥Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p} - [[A^{(1)}, A^{(2)}, A^{(3)}]] - S_{Ω}^{T p}∥}_{F}^{2}] \\ = {∥Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}∥}_{F}^{2} - 2 Re \{v e c^{H} (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}) v e c ({\tilde{S}}_{Ω}^{T p})\} + 1_{\prod_{n} I_{n}}^{T} (\underset{n}{⊙} F^{(n)}) 1_{L^{2}} \\ - 2 Re \{v e c^{H} (Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p}) v e c ([[{\tilde{A}}^{(1)}, {\tilde{A}}^{(2)}, {\tilde{A}}^{(3)}]])\} + E_{q} [{∥S_{Ω}^{T p}∥}_{F}^{2}] \\ + 2 Re \{v e c^{H} ([[{\tilde{A}}^{(1)}, {\tilde{A}}^{(2)}, {\tilde{A}}^{(3)}]]) v e c ({\tilde{S}}_{Ω}^{T p})\} \end{matrix} .

(16)
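The noise precision update can be sketched with point estimates in place of the full expectation. This is a simplification and not the authors' implementation: the posterior variance terms of the factor matrices and of the interference in Equation (16) are dropped, and the function name, the hyperparameters, and the test data are assumptions.

```python
import numpy as np

def update_noise_precision(Y_res, X_hat, S_hat, e0=1e-6, f0=1e-6):
    """Eq. (15) with point estimates: returns (e_M, f_M, posterior mean of nu)."""
    e_M = e0 + Y_res.size                                   # product of I_n plus e0
    f_M = f0 + np.sum(np.abs(Y_res - X_hat - S_hat) ** 2)   # squared Frobenius norm
    return e_M, f_M, e_M / f_M

rng = np.random.default_rng(3)
shape, nu_true = (5, 5, 64), 50.0
X_hat = rng.standard_normal(shape) + 1j * rng.standard_normal(shape)
S_hat = np.zeros(shape, dtype=complex)
# Circular complex Gaussian noise with variance 1 / nu_true per entry.
W = (rng.standard_normal(shape)
     + 1j * rng.standard_normal(shape)) / np.sqrt(2 * nu_true)

e_M, f_M, nu_mean = update_noise_precision(X_hat + S_hat + W, X_hat, S_hat)
```

With exact channel and interference estimates, the residual contains only noise, so the posterior mean of the precision should land close to the true value used to generate the noise.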

4.3. Evidence Lower Bound

From Equation (8), it can be observed that the algorithm conducts variational inference from three dimensions. Naturally, under the accurate elimination of redundant paths, the evidence lower bound (ELBO) undergoes a monotonically non-decreasing iterative process. The concept of maximizing ELBO involves the posterior expectation of the joint distribution and the entropy of the posterior distribution. The derivation of ELBO can be expressed as Equation (17) (see Appendix A for details), where we divide different dimensions with Tp, allowing for the validation of each dimension separately, as shown in Figure 4a.

\begin{matrix} L (q, T p) = E_{q (Θ)} [ln p (Y - {\tilde{S}}^{\ T p}, Θ)] + H (q (Θ)) \\ = - \frac{e_{M}}{f_{M}} E_{q} [{∥Y_{Ω} - {\tilde{S}}_{Ω}^{\ T p} - [[A^{(1)}, A^{(2)}, A^{(3)}]] - S_{Ω}^{T p}∥}_{F}^{2}] - Tr \{\tilde{Λ} \sum_{n} ({\tilde{A}}^{(n) H} {\tilde{A}}^{(n)} + I_{n} V^{(n)})\} \\ + \sum_{n} I_{n} |V^{(n)}| + \sum_{l} \{ln Γ (c_{M}^{l}) + c_{M}^{l} (1 - ln d_{M}^{l} - \frac{d_{0}^{l}}{d_{M}^{l}})\} + e_{M} (1 - \frac{f_{0}}{f_{M}} - ln f_{M}) \\ + \sum_{I n d (T p)} \{ln Γ (a_{M}^{γ_{I n d (T p)}}) + a_{M}^{γ_{I n d (T p)}} (1 - \frac{b_{0}^{T p}}{b_{M}^{γ_{I n d (T p)}}} - ln b_{M}^{γ_{I n d (T p)}})\} + ln Γ (e_{M}) \\ - \sum_{I n d (T p)} (\sum_{\ I n d (T p)} [\frac{a_{M}^{γ_{I n d (T p)}}}{b_{M}^{γ_{I n d (T p)}}} [{(σ_{i_{1}, i_{2}, i_{3}}^{T p})}^{2} + {|{\tilde{S}}_{i_{1}, i_{2}, i_{3}}^{T p}|}^{2}] - ln {(σ_{i_{1}, i_{2}, i_{3}}^{T p})}^{2}]) \\ + C s t (I n d (T p)) (a_{0}^{T p} ln b_{0}^{T p} - ln Γ (a_{0}^{T p})) + const \end{matrix} .

(17)

4.4. Computational Complexity

The time complexity of the three factor matrices in Equation (11) is

O (3 L^{3} + 3 M L + \sum_{n} I_{n} L^{2})

, where the total size of the observed data is

M = \prod_{n} I_{n}

, and L represents the number of multipaths and the model complexity. The computational cost for

λ

in Equation (12) is

O (\sum_{n} I_{n} L^{2})

. Similarly, the computational cost for

ν

is

O (M L^{2})

. So far, for the calculation of the above parameters, the proposed method has the same computational complexity as the traditional RCP algorithm. Furthermore, since this algorithm performs iterations at different resolutions for the classified interference

S^{T p}

, the computational complexities for each iteration in terms of CCI, FEI-R, and FEI-T are

O (3 I_{3} L)

,

O (3 I_{1} I_{3} L)

, and

O (3 I_{2} I_{3} L)

, respectively. These values are less than the complexity of

O (3 M L)

for the RCP algorithm. In summary, compared to the traditional algorithm, the proposed RCP-APH significantly reduces computational complexity when the number of iterations is large.
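As a rough worked illustration of these ratios, assume the array sizes used later in Section 5, i.e., I_1 = I_2 = 5, and recall that M = I_1 I_2 I_3. Treating the stated orders as operation counts, which is an approximation, the per-iteration interference costs relative to RCP are:

```latex
\begin{aligned}
\text{CCI:}   \quad & \frac{3 I_{3} L}{3 M L} = \frac{1}{I_{1} I_{2}} = \frac{1}{25}, \\
\text{FEI-R:} \quad & \frac{3 I_{1} I_{3} L}{3 M L} = \frac{1}{I_{2}} = \frac{1}{5}, \\
\text{FEI-T:} \quad & \frac{3 I_{2} I_{3} L}{3 M L} = \frac{1}{I_{1}} = \frac{1}{5}.
\end{aligned}
```

Under this assumption, even one sweep over all three interference types costs about 1/25 + 1/5 + 1/5 = 0.44 of a single RCP interference update, independent of the number of subcarriers and paths.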

5. Simulation Analysis

In this section, a comprehensive simulation analysis was conducted to assess the performance of our algorithm. Each testing condition underwent 200 independent experiments with random noise and interference. Firstly, the rank estimation performance of the RCP-APH algorithm was compared with traditional information-theoretic methods [9] and traditional RCP [30]. Secondly, under the assumption of accurate rank estimation, the parameter estimation performance of RCP-APH was compared with that of two mainstream tensor decompositions, CP [12] and Tucker [10], as well as the RCP algorithm. Lastly, a detailed interference positioning performance comparison was conducted between the two variational methods.

According to the simulation conditions illustrated in Figure 1, the configuration is set as follows. The transmitting array is located at (0 m, 0 m), while the receiving array is positioned at (30 m, 0 m). The actual number of multipaths is 2, with the line-of-sight (LOS) path being obstructed. Simultaneously, the actual parameters for the dual-path channel are set as

θ = [45^{\circ}, 30^{\circ}]

,

ϕ = [45^{\circ}, 60^{\circ}]

, and

τ = [142.13, 136.60]

ns. The signal bandwidth used is

B = 100

MHz, with

Δ τ \cdot B = 0.553

. It is noted that the delay harmonic parameters are highly indistinguishable. Omnidirectional linear array antennas are equipped at both the transmitting and receiving ends and comprise

N_{B S - R} = N_{B S - T} = 5

antennas with spacing of

d_{t} = d_{r} = λ_{c} / 2

. Under the above conditions, the uniqueness condition for CP decomposition is satisfied, as described in [14]. Moreover, the complex gains follow a circularly symmetric Gaussian distribution

α_{l} \sim C N (0, 1 / {(4 π D f_{c} / c)}^{2})

, where c is the speed of light, the LOS distance

D = 30

m, and the carrier frequency

f_{c}

is

5.9

GHz. Considering the maximum aperture of the receiving array

A_{p} = (5 - 1) d_{r}

, we obtain

D \geq 2 A_{p}^{2} / λ_{c} = 0.41

m, satisfying the far-field assumption and belonging to the Fraunhofer zone for channel testing. Lastly and most importantly, CCI and FEI are taken into account in the simulation. Thus, at the tensor lattice level, we introduce the parameter of the interference ratio

β

, which describes the proportion of interference terms in the received tensor.
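The numerical values quoted in this setup can be checked with a few lines of arithmetic. The sketch below recomputes the delay-bandwidth product, the Fraunhofer distance, and the variance of the complex gains from the stated parameters; the variable names are illustrative, and the exact value used for the speed of light is an assumption.

```python
import math

C = 299_792_458.0             # speed of light in m/s (assumed exact value)
f_c = 5.9e9                   # carrier frequency in Hz
B = 100e6                     # signal bandwidth in Hz
tau = (142.13e-9, 136.60e-9)  # path delays in s
n_rx = 5                      # number of receiving antennas
D = 30.0                      # LOS distance in m

wavelength = C / f_c
d_r = wavelength / 2                       # half-wavelength antenna spacing
aperture = (n_rx - 1) * d_r                # A_p = (5 - 1) d_r
fraunhofer = 2 * aperture**2 / wavelength  # far-field boundary 2 A_p^2 / lambda_c
delay_bandwidth = abs(tau[0] - tau[1]) * B
gain_variance = 1.0 / (4 * math.pi * D * f_c / C) ** 2

print(f"delta_tau * B       = {delay_bandwidth:.3f}")
print(f"Fraunhofer distance = {fraunhofer:.2f} m")
print(f"gain variance       = {gain_variance:.3e}")
```

A delay-bandwidth product below 1 means the two paths are separated by less than one delay resolution cell, which is why the delays are described as hard to distinguish.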

5.1. Initialization and Termination Conditions

For the variational methods, after performing variance normalization on the received tensor, we should also assume the Gaussian distribution of

C N (0, I)

for the factor matrices, which allows for the initialization of factor matrices without prior information. The initial rank

R_{i n t}

is chosen as three times the number of true paths, i.e., six paths, satisfying the requirements of the weak upper bound, i.e.,

R_{i n t} \leq \min_{n} (\sum_{i \neq n} I_{i})

. In our model, the top-level hyperparameters, including

c_{0}, d_{0}, e_{0}, f_{0}, a_{0}^{Ξ},

and

b_{0}^{Ξ}

, are set to 1 ×

10^{- 6}

, resulting in a noninformative prior. Thus, the expectation of hyperparameters can be initialized by

E [Λ] = I_{L}

,

E [ν] = 1

, and

E [γ^{Ξ}] = 1

.

V^{(n)}

is simply set to

E [Λ^{- 1}]

. For each category of interference,

E [S^{T p}]

is drawn from

C N (0, I_{C s t [I n d (T p)]})

, while

{(σ^{T p})}^{2}

is set to

E [{(γ^{T p})}^{- 1}]

. The entire inference process of the model is summarized in Algorithm 1, where the posterior factors in Equation (9) are sequentially updated from bottom to top, as depicted in Figure 3. To enhance the speed of ARD for two variational algorithms in the presence of interference, redundant multipaths corresponding to

{\tilde{λ}}_{l}

under the condition of

E [λ] / min (E [λ]) > η

are eliminated. Additionally, for CP decomposition with known rank, the initial factor matrices are obtained using SVD operations. For the Tucker decomposition with known rank, we use the unitary ESPRIT algorithm with forward smoothing and HOSVD techniques.
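The pruning rule and the weak upper bound on the initial rank described above can be sketched as follows. The helper names and the example precision values are hypothetical; only the threshold rule and the bound are taken from the text.

```python
def prune_components(lam_mean, eta):
    """Indices of components kept by the rule E[lambda_l] / min(E[lambda]) <= eta."""
    smallest = min(lam_mean)
    return [l for l, lam in enumerate(lam_mean) if lam / smallest <= eta]

def weak_rank_bound(dims):
    """Weak upper bound min_n sum_{i != n} I_i on the initial rank."""
    total = sum(dims)
    return min(total - I_n for I_n in dims)

# Hypothetical posterior means of the six precisions after some iterations.
lam_mean = [1.2, 0.9, 35.0, 410.0, 88.0, 1500.0]
kept = prune_components(lam_mean, eta=10)
print("kept components:", kept)
print("initial rank 6 within bound:", 6 <= weak_rank_bound((5, 5, 64)))
```

A large precision corresponds to a factor column with small variance, so components whose precision exceeds the smallest one by more than the threshold are treated as redundant paths.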

Algorithm 1 The proposed RCP-APH

Input: a third-order complete received tensor

Y

, the IPTH of

η

, and the termination condition of

M_{I t e r s}

;
Initialization:

{\tilde{A}}^{(n)}

,

V_{i_{n}}^{(n)}

,

\forall n \in [1, N]

,

\forall i_{n} \in [1, I_{n}]

,

c_{0}, d_{0}, e_{0}, f_{0}, a_{0}^{Ξ}, b_{0}^{Ξ}

,

E [γ^{T p}] = 1

,

{\tilde{S}}^{T p} \sim C N (0, I_{C s t [I n d (T p)]})

,

T p \in Ξ = [C C I, F E I - R, F E I - T]

, the initial number of multipaths is L, and

W a y = 1

is used to indicate the dimension in which variational operations are in progress;
1: while

i t \leq M_{I t e r s}

do
2:

T p = Ξ [W a y]

3: Increment variable

i t

by 1;
4: Increment variable

W a y

by 1;
5: for

n = 1

to N do
6: Update the posterior

q_{n} (A^{(n)})

using (11);
7: end for
8: Update the posterior

q (λ)

using (12);
9: Update the posterior

q (ν)

using (15);
10: Update the posterior

q (S^{T p})

using (13);
11: Update the posterior

q (γ^{T p})

using (14);
12: Evaluate the lower bound

E L B O^{T p} (i t)

using (17);
13: Reduce rank L by eliminating components satisfying

E [λ] / min (E [λ]) > η

;
14: Ensure alternating execution between dimensions

W a y = (W a y = = 4) ? 1 : W a y

;
15: end while
16: Calculate the channel parameters of

[θ_{l}, ϕ_{l}, τ_{l}, α_{l}], \forall l \in [1, L]

.
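The control flow of Algorithm 1, in particular how the Way counter cycles through the three interference categories, can be sketched as below. This is only a skeleton under stated assumptions: the posterior updates themselves are omitted, the function name is hypothetical, and the starting value it = 1 is assumed because the listing does not state it.

```python
XI = ("CCI", "FEI-R", "FEI-T")  # interference categories in the order used by Algorithm 1

def interference_schedule(max_iters):
    """Yield (iteration, category) pairs following the Way counter of Algorithm 1."""
    way = 1  # 1-based, as in the listing
    it = 1   # assumed starting value; not stated in the listing
    while it <= max_iters:
        tp = XI[way - 1]  # step 2: select the category for this iteration
        yield it, tp
        it += 1           # step 3
        way += 1          # step 4
        # Steps 5-13 (posterior updates, ELBO evaluation, rank pruning) would go here.
        if way == 4:      # step 14: wrap around to the first category
            way = 1

schedule = list(interference_schedule(7))
for it, tp in schedule:
    print(it, tp)
```

With this schedule, each interference category is updated in roughly one third of the total iterations.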

5.2. Algorithm Performance

In the following, we choose the iteration number as

M_{I t e r s} = 500

, the threshold of

η = 10

, signal-to-noise ratio (SNR)

ρ = 20

dB, and number of subcarriers

K = 64

. To control the iterations, we set

β = 0.2

. Interference power is set as five times the noise power, i.e.,

1 / γ^{T p} = {(σ^{T p})}^{2} = 5 / ν = 5 σ_{N o i s e}^{2}

. The ratio of CCI items is 0.5. We consider narrowband interference, i.e., interference that appears at a limited number of consecutive frequency samples (CFSs). In this standard setup, three CFSs are occupied by FEI, while two CFSs are occupied by CCI, as shown in Figure 2. The simulation results are depicted in Figure 4.

In Figure 4a, we primarily conduct a feasibility study on the proposed algorithm RCP-APH, where the interference power is calculated as the absolute power magnitude after variance normalization of the received tensor

Y

. The blue lines (solid, dotted, and dashed lines) depict the variations of ELBO for three different interferences, while the red solid line represents the estimated rank, i.e., the number of paths. The maximum number of iterations is 166, and at the 57th iteration, the algorithm achieves the true rank as indicated by the red dotted line. It is noteworthy that at the 57th iteration the ELBO unexpectedly decreases slightly, which can be explained by the fact that the redundant paths are eliminated, resulting in the loss of information entropy due to the small value of

η

. Although choosing a larger value of

η

may avoid this unexpected decrease in the ELBO, a larger

η

value would also increase the number of iterations and, in turn, the complexity. Therefore, the threshold

η

must be selected appropriately in order to balance between the complexity and accuracy.

Figure 4b mainly analyzes the estimation performance of the RCP and RCP-APH algorithms. Firstly, the two curves in the figure represent the interference power distributions estimated by the two algorithms. It can be observed that, compared to the distribution estimated by the RCP-APH algorithm, the interference power estimated by the RCP shows a concentrated distribution, making it difficult to distinguish the true interference. Secondly, the solid and dashed vertical lines in the figure represent the noise power, signal power, and noise precision estimated by both algorithms. The RCP-APH estimates the SNR more accurately than the RCP, as evidenced by the difference between the estimated signal power and noise power. Finally, in selecting the interference threshold, we consider the three aforementioned estimation metrics. If the estimated noise power were used as the threshold, the RCP would be unable to capture interference information. Therefore, this paper uses noise precision as the threshold for extracting interference terms. This threshold has the advantage of not only extracting the high-power interference estimated by the RCP but also facilitating subsequent performance comparisons of both algorithms.

5.3. Channel Estimation Performance

Within this section, we evaluate the performance of rank estimation and channel parameter estimation. Firstly, a comparison of performance under different interference power ratios is conducted, as illustrated in Figure 5. In the rank estimation of Figure 5a, it is observed that information-theoretic methods, i.e., MDL and AIC, are ineffective in the presence of strong interference. This confirms the unsuitability of traditional information-theoretic approaches in the case of interference due to overfitting. As a result, channel parameter estimation algorithms based on matrix processing that strongly depend on rank estimation are significantly degraded. It is also evident that algorithms based on the variational model outperform information-theoretic methods.

Additionally, under low interference power (

{(σ^{T p})}^{2} = 5 σ_{N o i s e}^{2}

), the RCP-APH algorithm surpasses RCP for all interference ratios. Under high interference power (

{(σ^{T p})}^{2} = 10 σ_{N o i s e}^{2}

), RCP-APH only slightly lags behind RCP in the extremely unfavorable scenario of

β = 0.8

. This reveals the robustness of the proposed algorithm.

In Figure 5b,c, it can be observed that the proposed RCP-APH outperforms other algorithms and reveals its robustness against changes in interference power as indicated by the black line. Furthermore, traditional RCP exhibits a certain degree of robustness. However, due to the lack of actual interference modeling, its performance is comparatively inferior, as indicated by the green lines. Moreover, for methods that require the number of multipaths to be known, such as the CP and Tucker decomposition methods, CP shows better performance due to its effective reduction of interference in single dimensions through multidimensional iterations. On the other hand, Tucker decomposition, due to HOSVD, encompasses interference information from multiple dimensions, resulting in the poorest performance.

5.4. Interference Estimation Performance

A comparison is conducted in terms of the performance of time–frequency position estimation for interference. In this context, “time” represents the large-scale sampling time, denoted as t, not to be confused with the small-scale delay

τ

. The term “frequency” denotes the position of frequency sampling points for interference. Since this paper processes all sub-channel snapshots at a single sampling time t, the discussion is thereby simplified to identifying the interference position at frequency sampling points. Here, a simulation of the RCP-APH algorithm is performed under conditions of

β = 0.2

and

ρ = 20

dB. The received tensor is illustrated in Figure 2, and the interference parameter estimation is depicted in Figure 6.

Figure 6b depicts the unfolding form of Figure 6a along the 1-mode pattern. The vertical axis has a size of

N_{B S - R}

, and the horizontal axis has a size of

N_{B S - T} \cdot K

. The green color in Figure 6b denotes the specific positions of the interference in the channel tensor. The red box visualizes an FEI-T that is composed of three vertical green lines, indicating that all receiving antennas are affected by interference from the same transmitter antenna for three CFSs. The blue box shows an FEI-R that occupies

N_{B S - T}

units in the horizontal direction and that lasts for three CFSs. Further, the black block represents a CCI that spans over both the vertical and horizontal directions with two CFSs. In Figure 6c, the RCP-APH accomplishes interference estimation for a single realization. It is evident in Figure 6a that the lower-power regions, indicated by lighter colors, cannot be identified due to their power approaching the noise level. In Figure 6d, which is the unfolding of Figure 6c, interference items underestimated by the algorithm are represented in blue. Notably, the majority of interferences are accurately estimated, as indicated by the green color.
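The geometric patterns described above follow directly from how each interference type occupies the tensor and from the mode-1 unfolding. The sketch below builds a small label map and unfolds it. The specific antenna indices and subcarrier positions are hypothetical, and the column ordering (transmit index varying fastest) is an assumption chosen because it reproduces the described vertical lines and horizontal runs.

```python
N_R, N_T, K = 5, 5, 64  # receive antennas, transmit antennas, subcarriers

def build_mask():
    """Map (i1, i2, i3) -> interference label, using 0-based indices."""
    mask = {}
    for k in (10, 11):                # CCI: every antenna pair, two CFSs
        for r in range(N_R):
            for t in range(N_T):
                mask[(r, t, k)] = "CCI"
    for k in (30, 31, 32):            # FEI-R: one receive antenna, all transmit antennas
        for t in range(N_T):
            mask[(1, t, k)] = "FEI-R"
    for k in (50, 51, 52):            # FEI-T: one transmit antenna, all receive antennas
        for r in range(N_R):
            mask[(r, 3, k)] = "FEI-T"
    return mask

def unfold_mode1(mask):
    """Mode-1 unfolding: row index i1, column index i2 + N_T * i3."""
    rows = [[None] * (N_T * K) for _ in range(N_R)]
    for (r, t, k), label in mask.items():
        rows[r][t + N_T * k] = label
    return rows

unfolded = unfold_mode1(build_mask())
counts = {name: sum(row.count(name) for row in unfolded) for name in ("CCI", "FEI-R", "FEI-T")}
print(len(unfolded), "x", len(unfolded[0]), counts)
```

In the unfolded matrix, the FEI-T entries fill three complete columns, the FEI-R entries form one contiguous run in a single row, and the CCI entries form a full-height block.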

In Figure 7, a single experimental comparison of two variational algorithms is conducted under different interference ratios. To better demonstrate the difference in performance, we use the same coordinate systems as in Figure 4b and Figure 6b. The plots in the first row depict the PDF of the interference power at different values of

β

. The second row shows the positions of the true interference in the unfolded form. The third and fourth rows show the estimated interference positions for the RCP-APH and RCP algorithms. Here, the performance metric for position estimation based on the binary classification model in [33] is adopted. True positives (TPs) indicate accurately identified interference positions, depicted in green; false negatives (FNs) represent missed detections of interference positions, shown in blue; false positives (FPs) denote incorrectly identified interference positions, shown in red; and true negatives (TNs) signify correctly identified positions without interference, depicted in white. As seen in Figure 7, the proposed RCP-APH algorithm can distinguish interference by an optimal threshold of noise precision. As evident in the subsequent three rows of the figure, both algorithms exhibit a decreasing trend in red and an increasing trend in blue with the rise of

β

. This corresponds to a transition from overestimation to underestimation. The distinct advantages of the proposed algorithm are as follows: 1. Isolated (singular) interference item estimates are rare, enabling direct mapping between estimated interference items and actual interference. 2. The proposed algorithm can discern the actual interference ratio, while the traditional RCP fails under

β = 0

.

The statistical characteristics of the interference in the 200 independent experiments maintain the same conditions as in Section 5.3. The subsequent analysis employs three binary performance parameters as follows: 1.

\mathrm{Precision} = TP / (TP + FP)

is utilized to depict the accuracy of estimations; 2.

\mathrm{Recall} = TP / (TP + FN)

signifies the fraction of actual interference positions that are captured; 3.

\mathrm{F1\,Score} = 2 \cdot \mathrm{Precision} \cdot \mathrm{Recall} / (\mathrm{Precision} + \mathrm{Recall})

provides a comprehensive balance between the first two metrics.
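As a minimal sketch, the three metrics can be computed from a true and an estimated interference mask as follows. The function name and the toy masks are hypothetical, and returning 0.0 when a denominator is zero is an assumed convention that the text does not specify.

```python
def binary_metrics(true_mask, est_mask):
    """Precision, recall, and F1 score for two equally long sequences of booleans."""
    pairs = list(zip(true_mask, est_mask))
    tp = sum(t and e for t, e in pairs)        # interference correctly identified
    fp = sum((not t) and e for t, e in pairs)  # interference reported where there is none
    fn = sum(t and (not e) for t, e in pairs)  # interference missed
    precision = tp / (tp + fp) if tp + fp else 0.0  # assumed convention for empty denominators
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# Toy example: 4 true interference positions, 3 found, 1 missed, 1 false alarm.
true_mask = [True, True, True, True, False, False, False, False]
est_mask = [True, True, True, False, True, False, False, False]
precision, recall, f1 = binary_metrics(true_mask, est_mask)
print(precision, recall, f1)
```

In practice the masks would be the flattened unfolded interference maps, with the estimated mask obtained by thresholding the estimated interference power.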

As illustrated in Figure 8, the three performance metrics of the RCP algorithm increase with the growth of interference power. However, the performance gains associated with the interference power gradually diminish as

β

increases. A notable distinction between the RCP-APH and the RCP is that for

β \geq 0.6

, there is a decline in recall, leading to a corresponding decrease in the F1 score. Most importantly, across the tested interference powers, all three performance metrics of the proposed algorithm consistently surpass those of the RCP algorithm by a significant margin.

In Figure 9, it is evident that as

ρ

decreases to 10 dB, all three performance metrics of both methods decline. The proposed algorithm exhibits slightly inferior performance compared to the RCP under low-

ρ

and high-

β

conditions. However, under high-

ρ

conditions and low-

ρ

with low-

β

conditions, RCP-APH demonstrates superior performance.

From Figure 10a, it can be observed that as the number of frequency points K increases, there is a slight decrease in precision for RCP-APH, while recall and F1 score exhibit a monotonic increase, significantly outperforming the RCP. As shown in Figure 10b, the three performance metrics remain nearly constant. However, due to incomplete observations, an outlier appears when the interference ratio reaches

50 %

. According to Figure 10c, widening the interference bandwidth improves precision for both algorithms, while recall and F1 score decline. Importantly, the estimation performance of RCP-APH consistently exceeds that of RCP.

6. Conclusions

In this paper, we propose a robust CP (RCP) method based on alternate prior hypotheses (APH) for interference. By exploiting the strong correlation of the interference, the proposed algorithm is capable of simultaneously estimating the rank, channel, and interference parameters. In comparison with the RCP, the proposed algorithm has the following features: 1. Increasing the model sparsity reduces the computational complexity. 2. The noise precision, from which interference items can be inferred, is reasonably and accurately estimated. 3. The estimated interference items show spatial correlation, enabling more accurate identification of the type of interference. 4. The prior hypothesis aligns more closely with real interference, enhancing the overall performance of communication systems. Through a simulation analysis, a comprehensive examination was conducted using different SNRs, interference powers, tensor spatial structures, proportions of interference items occupied by CCI, and lengths of the interference bandwidth. This analysis demonstrates the superior estimation performance of rank and channel parameters using the RCP-APH algorithm. Finally, the accurate interference time–frequency position estimation performance of the proposed algorithm is validated.

Author Contributions

Y.S.: conceptualization, analysis, and original draft preparation and writing; W.W.: methodology, original draft preparation, and review and editing; Y.W.: investigation and validation; Y.H.: investigation and validation. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under grant 61871059 and in part by the Innovation Capability Support Program of Shaanxi under grant 2022TD-41.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

\begin{matrix} L (q, T p) = E_{q (Θ)} [ln p (Y - {\tilde{S}}^{\ T p}, Θ)] + H (q (Θ)) \\ = E_{q (\{A^{(n)}\}, S^{T p}, ν)} [ln p (Y - {\tilde{S}}^{\ T p} ∣ \{A^{(n)}\}, S^{T p}, ν^{- 1})] + E_{q (\{A^{(n)}\}, λ)} [\sum_{n = 1}^{3} ln p (A^{(n)} ∣ λ)] \\ + E_{q (S^{T p}, γ^{T p})} [ln p (S^{T p} ∣ γ^{T p})] + E_{q (λ)} [ln p (λ)] + E_{q (γ^{T p})} [ln p (γ^{T p})] + E_{q (ν)} [ln p (ν)] \\ - E_{q (\{A^{(n)}\})} [\sum_{n = 1}^{3} ln q (A^{(n)})] - E_{q (S^{T p})} [ln q (S^{T p})] - E_{q (λ)} [ln q (λ)] \\ - E_{q (γ^{T p})} [ln q (γ^{T p})] - E_{q (ν)} [ln q (ν)], \end{matrix}

where

\begin{matrix} E_{q (\{A^{(n)}\}, S^{T p}, ν)} [ln p (Y - {\tilde{S}}^{\ T p} ∣ \{A^{(n)}\}, S^{T p}, ν^{- 1})] \\ = - \prod_{n} I_{n} ln (π) + \prod_{n} I_{n} E_{q} [ln ν] - E_{q} [ν] E_{q} [{∥Y - {\tilde{S}}^{\ T p} - [[A^{(1)}, A^{(2)}, A^{(3)}]] - S^{T p}∥}_{F}^{2}] \\ = - \prod_{n} I_{n} ln (π) + \prod_{n} I_{n} [ψ (e_{M}) - ln f_{M}] - \frac{e_{M}}{f_{M}} E_{q} [{∥Y - {\tilde{S}}^{\ T p} - [[A^{(1)}, A^{(2)}, A^{(3)}]] - S^{T p}∥}_{F}^{2}], \end{matrix}

\begin{matrix} E_{q (\{A^{(n)}\}, λ)} [\sum_{n = 1}^{3} ln p (A^{(n)} ∣ λ)] = E_{q} [\sum_{n} \sum_{i_{n}} \{- L ln π + ln | Λ | - a_{i_{n}}^{(n) H} Λ a_{i_{n}}^{(n)}\}] \\ = \sum_{n} \{- L I_{n} ln π + I_{n} \sum_{l} E_{q} [ln λ_{l}] - \sum_{i_{n}} E_{q} [a_{i_{n}}^{(n) H} Λ a_{i_{n}}^{(n)}]\} \\ = - L \sum_{n} I_{n} ln π + \sum_{n} I_{n} \sum_{l} E_{q} [ln λ_{l}] - \sum_{n} \sum_{i_{n}} \{\sum_{l} E_{q} [a_{i_{n}, l}^{(n) *} a_{i_{n}, l}^{(n)}] E_{q} [λ_{l}]\} \\ = - L \sum_{n} I_{n} ln π + \sum_{n} I_{n} \sum_{l} (ψ (c_{M}^{l}) - ln d_{M}^{l}) - Tr \{\tilde{Λ} \sum_{n} ({\tilde{A}}^{(n) H} {\tilde{A}}^{(n)} + I_{n} V^{(n)})\}, \end{matrix}

\begin{matrix} E_{q (S^{T p}, γ^{T p})} [ln p (S^{T p} ∣ γ^{T p})] \\ = E_{q} [\sum_{I n d (T p)} C s t (\ I n d (T p)) (\ln γ_{I n d (T p)}^{T p} - ln π) - γ_{I n d (T p)}^{T p} \sum_{\ I n d (T p)} {|S_{i_{1}, i_{2}, i_{3}}^{T p}|}^{2}] \\ = - \prod_{n} I_{n} ln π + C s t (\ I n d (T p)) \sum_{I n d (T p)} (ψ (a_{M}^{γ_{I n d (T p)}}) - ln b_{M}^{γ_{I n d (T p)}}) \\ - \sum_{I n d (T p)} (\sum_{\ I n d (T p)} \frac{a_{M}^{γ_{I n d (T p)}}}{b_{M}^{γ_{I n d (T p)}}} ({(σ_{i_{1}, i_{2}, i_{3}}^{T p})}^{2} + {|{\tilde{S}}_{i_{1}, i_{2}, i_{3}}^{T p}|}^{2})), \end{matrix}

E_{q} [ln p (λ)] = - L ln Γ (c_{0}^{l}) + L c_{0}^{l} ln d_{0}^{l} + \sum_{l} \{(c_{0}^{l} - 1) (ψ (c_{M}^{l}) - ln d_{M}^{l}) - d_{0}^{l} \frac{c_{M}^{l}}{d_{M}^{l}}\},

\begin{matrix} E_{q} [ln p (γ^{T p})] = - C s t (I n d (T p)) ln Γ (a_{0}^{T p}) + C s t (I n d (T p)) a_{0}^{T p} ln b_{0}^{T p} \\ + \sum_{I n d (T p)} \{(a_{0}^{T p} - 1) (ψ (a_{M}^{γ_{I n d (T p)}}) - ln b_{M}^{γ_{I n d (T p)}}) - b_{0}^{T p} \frac{a_{M}^{γ_{I n d (T p)}}}{b_{M}^{γ_{I n d (T p)}}}\}, \end{matrix}

E_{q} [ln p (ν)] = - ln Γ (e_{0}) + e_{0} ln f_{0} + (e_{0} - 1) (ψ (e_{M}) - ln f_{M}) - f_{0} \frac{e_{M}}{f_{M}},

- E_{q} [\sum_{n = 1}^{3} ln q (A^{(n)})] = \sum_{n} I_{n} L (ln π + 1) + \sum_{n} I_{n} |V^{(n)}|,

- E_{q} [ln q (S_{Ω}^{T p})] = \prod_{n} I_{n} (1 + ln π) + \sum_{Ω} ln {(σ_{Ω}^{T p})}^{2},

- E_{q} [ln q (λ)] = \sum_{l = 1}^{L} \{ln Γ (c_{M}^{l}) - (c_{M}^{l} - 1) ψ (c_{M}^{l}) - ln d_{M}^{l} + c_{M}^{l}\},

- E_{q} [ln q (γ^{T p})] = \sum_{I n d (T p)} \{ln Γ (a_{M}^{γ_{I n d (T p)}}) - (a_{M}^{γ_{I n d (T p)}} - 1) ψ (a_{M}^{γ_{I n d (T p)}}) - ln b_{M}^{γ_{I n d (T p)}} + a_{M}^{γ_{I n d (T p)}}\},

- E_{q} [ln q (ν)] = ln Γ (e_{M}) - (e_{M} - 1) ψ (e_{M}) - ln f_{M} + e_{M} .
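The closed-form entropy of the Gamma posterior in the last expression can be checked numerically. The sketch below compares it with a direct midpoint-rule integration of the entropy integral, assuming a shape-rate parameterization with shape e_M and rate f_M. The parameter values are hypothetical, and the digamma function is approximated by a central difference of the log-Gamma function because the Python standard library does not provide it.

```python
import math

def digamma(x, h=1e-5):
    """Central-difference approximation of psi(x) = d/dx ln Gamma(x)."""
    return (math.lgamma(x + h) - math.lgamma(x - h)) / (2 * h)

def gamma_entropy_closed_form(e, f):
    """-E_q[ln q(nu)] for q = Gamma(shape e, rate f), as written in Appendix A."""
    return math.lgamma(e) - (e - 1) * digamma(e) - math.log(f) + e

def gamma_entropy_numeric(e, f, n=100_000):
    """Midpoint-rule evaluation of the entropy integral on a truncated range."""
    upper = (e + 15 * math.sqrt(e)) / f  # mean plus 15 standard deviations
    step = upper / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * step
        log_q = e * math.log(f) - math.lgamma(e) + (e - 1) * math.log(x) - f * x
        total -= math.exp(log_q) * log_q * step
    return total

e_M, f_M = 3.5, 2.0  # hypothetical posterior shape and rate
closed = gamma_entropy_closed_form(e_M, f_M)
numeric = gamma_entropy_numeric(e_M, f_M)
print(f"closed form: {closed:.6f}, numeric: {numeric:.6f}")
```

The same check applies to the entropy terms for λ and γ, which have the same functional form with their own shape and rate parameters.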

References

1. Andrews, J.G.; Buzzi, S.; Choi, W.; Hanly, S.V.; Lozano, A.; Soong, A.C.; Zhang, J.C. What will 5G be? IEEE J. Sel. Areas Commun. 2014, 32, 1065–1082.
2. Wei, Z.; Cai, Y.; Sun, Z.; Ng, D.W.K.; Yuan, J.; Zhou, M.; Sun, L. Sum-rate maximization for IRS-assisted UAV OFDMA communication systems. IEEE Trans. Wirel. Commun. 2020, 20, 2530–2550.
3. Harkat, H.; Monteiro, P.; Gameiro, A.; Guiomar, F.; Farhana Thariq Ahmed, H. A survey on MIMO-OFDM systems: Review of recent trends. Signals 2022, 3, 359–395.
4. Patil, P.; Patil, M.; Itraj, S.; Bomble, U. A review on MIMO OFDM technology basics and more. In Proceedings of the 2017 International Conference on Current Trends in Computer, Electrical, Electronics and Communication (CTCEEC), Mysore, India, 8–9 September 2017.
5. Lin, Y.; Jin, S.; Matthaiou, M.; You, X. Tensor-based algebraic channel estimation for hybrid IRS-assisted MIMO-OFDM. IEEE Trans. Wirel. Commun. 2021, 20, 3770–3784.
6. Araújo, D.C.; De Almeida, A.L.; Da Costa, J.P.; de Sousa, R.T. Tensor-based channel estimation for massive MIMO-OFDM systems. IEEE Access 2019, 7, 42133–42147.
7. Hillar, C.J.; Lim, L.H. Most tensor problems are NP-hard. J. ACM (JACM) 2013, 60, 1–39.
8. Stoica, P.; Selen, Y. Model-order selection: A review of information criterion rules. IEEE Signal Process. Mag. 2004, 21, 36–47.
9. Lam, W.; Bacchus, F. Learning Bayesian belief networks: An approach based on the MDL principle. Comput. Intell. 1994, 10, 269–293.
10. Haardt, M.; Roemer, F.; Del Galdo, G. Higher-order SVD-based subspace estimation to improve the parameter estimation accuracy in multidimensional harmonic retrieval problems. IEEE Trans. Signal Process. 2008, 56, 3198–3213.
11. Wen, F.; Wymeersch, H. 5G synchronization, positioning, and mapping from diffuse multipath. IEEE Wirel. Commun. Lett. 2020, 10, 43–47.
12. Zhou, Z.; Fang, J.; Yang, L.; Li, H.; Chen, Z.; Blum, R.S. Low-rank tensor decomposition-aided channel estimation for millimeter wave MIMO-OFDM systems. IEEE J. Sel. Areas Commun. 2017, 35, 1524–1538.
13. Li, J.; Wu, Z.; Wan, Z.; Zhu, P.; Wang, D.; You, X. Structured tensor CP decomposition-aided pilot decontamination for UAV communication in cell-free massive MIMO systems. IEEE Commun. Lett. 2022, 26, 2156–2160.
14. Salmi, J.; Richter, A.; Koivunen, V. Sequential unfolding SVD for tensors with applications in array signal processing. IEEE Trans. Signal Process. 2009, 57, 4719–4733.
15. Li, Y.; Liu, D.; Wang, K.; Tan, Z. Failure mechanisms and reliability evaluation of RF front-end integrated circuit. In Proceedings of the 12th International Conference on Quality, Reliability, Risk, Maintenance, and Safety Engineering (QR2MSE 2022), Emeishan, China, 27–30 July 2022.
16. Shin, S.; Naglich, E.J.; Guyette, A.C. Autonomously Tunable Filters for Interference Mitigation: Advances in Autonomously Switchable/Tunable RF/Microwave Filters for Interference Mitigation without Operator Intervention. IEEE Microw. Mag. 2020, 21, 79–87.
17. Yang, X.; Petropulu, A.P. Co-channel interference modeling and analysis in a Poisson field of interferers in wireless communications. IEEE Trans. Signal Process. 2003, 51, 64–76.
18. Feng, C.; Cui, H.; Ma, M.; Jiao, B. On statistical properties of co-channel interference in OFDM systems. IEEE Commun. Lett. 2013, 17, 2328–2331.
19. Xu, D.; Zhang, G.; Ding, X. Analysis of co-channel interference in low-orbit satellite Internet of Things. In Proceedings of the 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco, 24–28 June 2019.
20. Rakovic, V.; Denkovski, D.; Atanasovski, V.; Mähönen, P.; Gavrilovska, L. Capacity-aware cooperative spectrum sensing based on noise power estimation. IEEE Trans. Commun. 2015, 63, 2428–2441.
21. Arjoune, Y.; Kaabouch, N. A comprehensive survey on spectrum sensing in cognitive radio networks: Recent advances, new challenges, and future research directions. Sensors 2019, 19, 126.
22. Zhao, Q.; Zhang, L.; Cichocki, A. Bayesian CP factorization of incomplete tensors with automatic rank determination. IEEE Trans. Pattern Anal. Mach. Intell. 2015, 37, 1751–1763.
23. Du, J.; Dong, J.; Jin, L.; Gao, F. Bayesian Robust Tensor Factorization for Angle Estimation in Bistatic MIMO Radar with Unknown Spatially Colored Noise. IEEE Trans. Signal Process. 2022, 70, 6051–6064.
24. Sun, Y.; Wang, W.; Chai, J.; Lv, Y. Tensor Based Channel Parameter Estimation for Positioning Applications. In Proceedings of the 2023 17th European Conference on Antennas and Propagation (EuCAP), Florence, Italy, 26–31 March 2023.
25. Takayama, H.; Zhao, Q.; Hontani, H.; Yokota, T. Bayesian Tensor Completion and Decomposition with Automatic CP Rank Determination Using MGP Shrinkage Prior. SN Comput. Sci. 2022, 3, 225.
26. Cheng, L.; Chen, Z.; Shi, Q.; Wu, Y.C.; Theodoridis, S. Towards flexible sparsity-aware modeling: Automatic tensor rank learning using the generalized hyperbolic prior. IEEE Trans. Signal Process. 2022, 70, 1834–1849.
27. Alaei, M.A.; Golabighezelahmad, S.; De Boer, P.T.; van Vliet, F.E.; Klumperink, E.A.; Kokkeler, A.B. Interference mitigation by adaptive analog spatial filtering for MIMO receivers. IEEE Trans. Microw. Theory Tech. 2021, 69, 4169–4179.
28. Domizioli, C.P.; Hughes, B.L. Front-end design for compact MIMO receivers: A communication theory perspective. IEEE Trans. Commun. 2012, 60, 2938–2949.
29. Irazoqui, R.W.; Fulton, C.J. Spatial interference nulling before RF frontend for fully digital phased arrays. IEEE Access 2019, 7, 151261–151272.
30. Zhao, Q.; Zhou, G.; Zhang, L.; Cichocki, A.; Amari, S.I. Bayesian robust tensor factorization for incomplete multiway data. IEEE Trans. Neural Netw. Learn. Syst. 2015, 27, 736–748.
31. Sun, Y.; Wang, W.; Yue, H.; Lyu, Y. Robust Tensor Positioning Based on Channel Parameter Estimation under Spatially Colored Noise. In Proceedings of the 2024 18th European Conference on Antennas and Propagation (EuCAP), Glasgow, UK, 17–22 March 2024.
32. Wang, W.; Jost, T.; Gentner, C.; Zhang, S.; Dammann, A. A semiblind tracking algorithm for joint communication and ranging with OFDM signals. IEEE Trans. Veh. Technol. 2015, 65, 5237–5250.
33. Hand, D.J. Classifier technology and the illusion of progress. Statist. Sci. 2006, 21, 1–14.

Figure 1. A typical traffic scenario.

Figure 2. The power composition of the received tensor.

Figure 3. Probabilistic graphical model.

Figure 4. (a) The changes in the number of paths and the three variations of ELBO for RCP-APH. (b) The probability density function (PDF) of the interference item power distribution and other estimated parameters for RCP and RCP-APH.

Figure 5. For different interference item ratios, a comparison of rank and parameter estimation performance is conducted for interference powers of

5 σ_{N o i s e}^{2}

and

10 σ_{N o i s e}^{2}

. (a) Rank estimation. (b) Angle estimation. (c) Delay estimation. Here, (b,c) share a common legend.

Figure 6. Study on the performance of interference estimation for the RCP-APH. Green indicates accurately estimated interference positions, while blue represents unestimated interference positions. (a) True interference; (b) matrix unfolding of true interference; (c) estimated interference; (d) matrix unfolding of estimated interference.

Figure 7. The estimations of the interference positions are compared between two variational algorithms under different interference ratios. To clearly depict the performance differences between the algorithms, coordinate annotations for all subplots are omitted. The first row illustrates the estimated noise precision and PDF of the interference item power for both the RCP-APH and RCP algorithms. The coordinate scales are consistent with Figure 4b. The second row represents the actual interference, while the third and fourth rows depict the estimations of the interference positions for both algorithms. The coordinate scales align with those in Figure 6b.

Figure 8. Under different interference item ratios, a comparison of interference estimation is conducted for interference powers of

5 σ_{N o i s e}^{2}

and

10 σ_{N o i s e}^{2}

. (a) Recall. (b) Precision. (c) F1 Score. Here, all subplots share a common legend.

Figure 9. Interference estimation performance is compared for different interference item ratios for both 10 dB and 20 dB of

ρ

. (a) Recall. (b) Precision. (c) F1 Score. Here, all subplots share a common legend.

Figure 10. Performance metrics for interference estimation for different spatial structures and interference characteristics. (a) Different sampling K. (b) Different ratio of CCI. (c) Different bandwidth of FEI. All subplots share a common legend.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Sun, Y.; Wang, W.; Wang, Y.; He, Y. A Bayesian Tensor Decomposition Method for Joint Estimation of Channel and Interference Parameters. Sensors 2024, 24, 5284. https://doi.org/10.3390/s24165284
