1. Introduction
With the exceeding demand for faster and more reliable wireless communication systems, accurate models for high-power amplifier (HPA) designs that involve both active devices and passive components become increasingly essential [
1]. The HPA plays a key role for conditioning the transmitted signals and consumes a large amount of power in system designs. Typically, realizing any radio frequency (RF) power amplifier (PA) starts with modeling of circuit components, and then continues with verification of the models with several measurements. The designers have to consider various topologies and techniques manually to achieve the desired design goals [
2]. However, due to the increased complexity of the modern wideband communication systems, such as fifth generation (5G) and sixth generation (6G), intelligent design steps and advanced optimization methods are required [
1,
3].
Generally,
scattering parameters (S-parameters) can be used for designing HPAs [
4]. However, proper design and analysis of active circuits working with large signals require additional parameters that are dynamically linked to each other, such as amplitude of the input signal, input impedance, load impedance at fundamental and harmonic frequencies, biasing voltages/currents at the input and output, and temperature specifications [
5,
6]. Therefore, S-parameters should be extended to analyze nonlinear behavior of the circuit with a sufficient number of harmonics and adequate range of biasing.
X-parameters are an extension form of S-parameters and can be extracted from large signal measurement data [
7] or existing wideband nonlinear models. These parameters are based on the poly-harmonic distortion (PHD) and can be employed for modeling the nonlinear high-frequency components [
5] and for designing amplifiers, as reported in [
7,
8,
9,
10]. Despite the fact that considerable research has been performed in this area, the usage of these factors to produce post-layouts is lacking. Hence, in this study, we construct our proposed optimization method based on X-parameters for providing comprehensive solutions.
After determining the proper modeling parameters, a suitable platform which incorporates different optimization schemes with X-parameters must be considered. Nowadays, popular optimizations such as Bayesian optimization, space mapping, genetic algorithm, and differential evolution [
11] are not useful enough when the design parameters are in high dimension and advanced optimization techniques through machine learning and neural networks (NNs) for nonlinear designs are required substantially [
12]. Constructing an accurate NN for these nonlinear circuits is not straightforward, and efficient methods are needed for finding the optimal hyperparameters of any NN (i.e., number of neurons and number of hidden layers) [
13]. In [
14], the usefulness of
space mapping in the amplifier designs is proved; hence, this method can be a good solution for training and constructing NNs. The
coarse and fine modeling method is a space mapping algorithm where the fine model achieves the optimal parameters without going to the direct optimization and uses the coarse model with updated values. It helps the designer to approximate and obtain accurate modeling in high-fidelity validations [
15].
In this study, we present a superior experiment coarse and fine modeling approach based on multi-objective optimization algorithms where four various deep neural networks (DNNs) are employed, automatically. Due to the effective accuracy of DNNs (i.e., networks with more than two hidden layers), these types of NNs are used instead of shallow neural networks [
16]. The proposed optimization method consists of two sequential phases for modeling: (i) active device (i.e., transistor), and (ii) HPA design, respectively. In the
“fine modeling” phase, the gallium nitride (GaN) high-electron mobility transistor (HEMT) is modeled with X-parameters, and optimal hyperparameters that will be applied in the HPA modeling are achieved. Afterwards, three DNNs are developed for (i) selecting the best HPA configuration, (ii) modeling the selected HPA structure with S-parameters for re-evaluating the accuracy of achieved hyperparameters in the fine modeling phase (
“coarse modeling” stage), and (iii) modeling the behavior of HPA by using multi-objective multi-verse optimizer (MOMVO) [
17] for optimizing both one-tone and two-tone continuous wave (CW) harmonic balance (HB) performances, concurrently. By employing the multi-objective optimization, one-tone CW performances (i.e., output power (
), power gain (
), and drain efficiency (
)) and also two-tone CW performances (includes harmonic and intermodulation distortions (IMDs (dBc))) are optimized once together [
18,
19].
In combination with [
12,
20], our proposed technique is demonstrated to be more effective for solving microwave modeling problems, and for optimizing multi-objective nonlinear HPA specifications jointly and automatically. Furthermore, by applying coarse and fine modeling method, higher-accuracy hidden layers’ structures are determined in an effective way. The HPA is modeled and optimized for nonlinear design specifications where not only one-tone performances are optimized but also two-tone performances up to
IMD are considered and optimized, concurrently. For accelerating the optimization process, the classification DNN is applied for selecting the well-matched HPA topology among various matching network (MN) configurations. The MOMVO algorithm is applied as it can beneficially approximate the Pareto-optimal front (POF) for more than three objectives in a cheaper computation [
17]. At the final stage, for providing ready-to-fabricate layout, the constructed regression DNN with MOMVO method is applied. It optimizes the design parameters of the selected HPA model for achieving desired design specifications in conjunction with a full-wave electromagnetic (EM) analysis.
This article is organized as follows:
Section 2 is devoted to describe the framework of the proposed optimization method including modeling of GaN HEMT transistor and HPA design optimizations through DNNs.
Section 3 describes the practical implementation of proposed optimization method.
Section 4 validates the proposed method by designing and measuring a 10 W HPA design. Lastly, conclusions are provided in
Section 5.
2. Optimization-Oriented Strategy in a Nutshell
In microwave devices, optimizing nonlinear objective functions is not straightforward and needs powerful attempt. Employing NN with multi-layers (i.e., DNN) is a proficient technique for learning nonlinear behavior between input and output corresponding data [
21]. Hence, in this section, we explain the framework of the proposed optimization method that leads to an automated HPA design with the aid of DNNs for improving output power, efficiency, gain, and linearity performances, concurrently.
The proposed optimization-oriented strategy is based on a “coarse and fine modeling” approach and it provides an automated optimization method for (i) modeling the active nonlinear GaN HEMT transistor, and (ii) optimizing the HPA’s performance in terms of one-tone and two-tone HB specifications, sequentially. In the first phase, the aimed transistor model (i.e., GaN model in this paper) is modeled with the X-parameters (fine modeling) using the “
regression DNN”. After that, the “
classification DNN” is employed for selecting the best configuration of HPA among various topologies achieved from the simplified real frequency technique (SRFT) [
22]. Then, the
DNN (i.e., regression DNN) is used for modeling the selected HPA configuration, from the previous step, using the S-parameters (coarse modeling). For accelerating the optimization process, the hidden layer structure achieved from the fine modeling is used for constructing this
DNN and for verifying the accuracy of the trained network. This performance will accelerate the optimization process, leading to define the hyperparameters of HPA’s DNN, effectively. Finally, the last regression DNN (i.e.,
DNN) is constructed, where the hyperparameters are the ones obtained from fine and coarse modeling. This network is employed for sizing the design parameters using multi-objective MOMVO algorithm where fabrication rules and constraints are also employed inside the optimization process. For polishing and achieving ready-to-fabricate layouts, various transmission line (TL)-microstrip models can be replaced, added to, or removed from the MNs of optimized HPA design.
Each NN to be constructed needs three kinds of data as training, validation, and testing data, defined as
,
, and
, with the division rate of 70%, 15%, and 15%, respectively. The corresponding responses of each data can also be provided as
,
, and
. In all the presented networks, input layer features donate to
,
, and
data and the output layer features yield to
,
, and
data. After generating suitable data, the NNs can be trained using (
1). The accuracy of any NN is also measured with the difference amount between
and
, predicted outputs by
, as clarified in (
2). These groups of data are achieved by arranging co-simulation environment between electronic design automation (EDA) tools (such as ADS) and numerical analyzer (such as MATLAB) [
23] and by setting HB simulation environments in the ADS platform.
In this work for the proposed classification DNN, softmax layer is used as an activation function and the crossentropyex layer is used as a loss function to the network with long short-term memory (LSTM) layers and one fully connected layer size of
k. Additionally, for three regression DNNs, the rectified linear unit (ReLU) function is employed as the activation function, and the loss function is defined as the root mean squared error (RSME) along with LSTM layers. An overview of the automated proposed methodology is shown in
Figure 1 and Algorithm 1 (at the end of this section).
Algorithm 1: Automated multi-tone HB optimization method |
Initial preparation: Prepare the co-simulation environment between the EDA tool and numerical analyzer; Determine the GaN HEMT transistor model; Provide the SRFT method on the numerical analyzer; Adjust one-tone and two-tone HB simulation setups in the EDA tool; Determine activation and loss functions for classification and regression DNNs;
Fine modeling through the regression DNN: Prediction of best HPA configuration by the classification DNN: |
2.1. Modeling of GaN HEMT with X-Parameters (Fine Modeling)
Selecting a suitable semiconductor device with a wider bandgap plays a significant role in HPA designs. Hence, we prefer using GaN technology in this work due to its superior power density and durability that enable higher power operation at high frequencies in comparison with Si and GaAs devices [
24]. Output spectrum of an active device such as GaN HEMT, operating in nonlinear region, carries not only the fundamental signal, but also the signals at harmonic frequencies. Hence, a considerable question is: How can this nonlinear device be modeled and characterized?
X-parameters are based on the poly-harmonic distortion and are used as an accurate and fast nonlinear modeling approach for representing small and large signal nonlinearity of S-parameters [
7,
8]. These harmonics consist of three terms:
,
, and
in the output spectrum that are defined in (
3) and (
4).
captures large signal harmonic response and
with
captures the small signal sensitivity by representing the incident and scattered waves. Functions for
(labeled with port
p and harmonic
m) are given small extraction tones as
(labeled with port
q and harmonic
n).
where
These X-parameters of active device (i.e., GaN HEMT in this study) can be extracted by preparing suitable simulation setup in EDA tool such as Keysight ADS [
25]. For this case, firstly, four parameters, input frequency (
), input power (
), gate-source (
), and drain-source (
) voltage biasing are swept with appropriate step sizes. Afterwards, all the data appear in a file name such as
“.xnp” (
Step-
). By using all the data and terms presented in the .xnp file and by constructing different output data (i.e.,
), the regression DNN presenting the active device model can be constructed. Input layer consists of inputs such as
,
,
, and
, where the output layer includes the X-parameters determined in (
3) for
p = 2 and
m = 5.
To accurately train and construct the NN, a suitable and optimal set of hyperparameters (i.e., number of neurons and hidden layers) must be determined. For this case, we apply
Bayesian optimization (BO) as it is well suited to optimize hyperparameters and it is faster than grid search and randomized search [
26]. The BO aims to model hyperparameters that yield the lowest value of the score (i.e., error rate) by keeping Gaussian process (GP) model, internally. By applying BO, initial network hyperparameters for constructing DNN can be achieved (
Step-
).
2.2. Modeling and Optimizing HPA Design with DNNs Includes Coarse Modeling
In this section, we model the behavior of the HPA through DNNs for both one-tone and two-tone HB specifications. For this case, we firstly determine the best HPA configuration and topology using the classification DNN. Then, by using two other regression DNNs, the overall performance of HPA is optimized. Classification and regression DNNs can be constructed by defining input, hidden, and output layers with the activation and loss functions. The following is the explanation of each employed DNN and optimization process.
2.2.1. Predicting Suitable HPA Configuration Using Classification DNN
The SRFT method is a suitable method for obtaining initial guess for the input and output MNs with distributed elements and design parameters for PAs. This method depends highly on the definition of ad hoc matrix, namely, . The matrix is defined as [∓1, ∓1,…, ∓1] where t ≥ 3 results in k various PA models differ in the number of TLs at input and output MNs. For selecting the most suitable topology and configuration among various k structures, classification DNN can be useful enough. By constructing this kind of DNN, any designer can define the desired output specifications and the DNN would predict which type (i.e., label) of PA can be appropriate among k various constructions. The following are the explanations for training the classification DNN.
A. Dataset Generation
Firstly, the selected GaN HEMT transistor model is inserted into the SRFT method that results in k different PAs, where these amplifiers vary in the number of TLs. Then an appropriate large amount of sampling dataset for training the classification DNN can be obtained by employing Gaussian random distribution with 5% of standard deviation () around the achieved component values from the SRFT method, which corresponds to 15% of 3. deviation. For k different amplifier configurations, p data in terms of input layer features are obtained. Hence, k×p data including , , and can be generated.
B. Features of Input, Hidden, and Output Layers for Classification DNN
As the classification DNN in
Step-
shows, the features of input layer consist of one-tone HB specifications such as (
,
,
), and two-tone specifications such as (
,
,
IMDs).
For obtaining
,
, and
specifications, a
single-tone HB simulation environment, and for achieving IMDs (i.e.,
,
, and
), a
two-tone HB simulation environment must be set, where various values for each frequency appear. Even-order IMDs such as second, forth, and so on appear inside the input tones; however, odd-order tones such as third, fifth, and so on interface with the fundamental tones, and these interruptions can make problems for RF designs. For illustrating the
characteristic,
Figure 2 shows third-order distortion products in a wide range of frequencies where there are certain mixing tones and distortion products for the
. The extended version of
Figure 2 for the
and
IMDs can be developed in special tones, and the following is the summary of distortion products for different tones.
The first tone mix is [{1,0}, {0,1}]. The third, fifth, and seventh order distortion orders are , , and , , sequentially. All of these data are presented in a file named “spectra.raw” in the arranged co-simulation environment and all the data are in the format of voltage and current. Hence, by using MATLAB tool and defining the especial expressions, the aimed specifications can be calculated.
Each set of p data is denoted by “1”,“2”,…,“k” categorical labels or numeric responses for presenting the output layer features (i.e., , , and ). For this DNN, the initial number of LSTM layers with neuron numbers is estimated using the BO algorithm.
2.2.2. Coarse Modeling
HPAs are nonlinear circuits that include a large number of design parameters. For this case, modeling of this kind of design is not straightforward and requires serious attention. For our problem, coarse modeling can be performed by training the regression DNN through S-parameters, which are the simplified version of X-parameters. This type of modeling can help designers in finding the optimal hyperparameters of regression DNNs with reduced difficulty for designing HPAs. For this
DNN, the features of input layer are
and
and the feature of output layer is
. The sampling data for this network can be generated by iterating the component values of selected HPA design in (
Step-
) in the range of
%,
%,
%,
%, and
%.
In this phase, the optimal hyperparameters from the “fine modeling” step are used to model the selected HPA configuration through the classification DNN. If the trained DNN does not respond with acceptable accuracy, the fine model’s hyperparameters are changed and re-evaluated in the coarse modeling phase until the desired accuracy is achieved (i.e., more than 90%) (
Step-
).
2.2.3. Optimizing Design Parameters with Multi-Objective-Based Regression DNN
After achieving the optimal hyperparameters of regression DNN from coarse modeling, it is time to predict the optimal design parameters of the selected topology.
Step-
results in high-performance one-tone and two-tone specifications, concurrently. Here, the regression DNN is trained where the hyperparamers are the ones achieved from the fine and coarse modeling phases (
Step-
). In this step, it is targeted to optimize HPAs that can pass the EM-based simulations, resulting in generating ready-to-fabricate layouts. Hence, some design rules and fabrication criteria must be employed inside the optimization process, as
Table 1 clarifies. The following also presents the features of input layer and output layer, and also the method of generating data for training the
regression DNN.
B. Data Generation for Training the DNN
After determining the various features of DNN layers, it is time to introduce how the sampling data are generated for training and constructing the final regression DNN. For this case, we prepare sampling data summarized in (
6), where
k denotes number of various PA models with TLs achieved from the SRFT method and
j represents the iteration of component values randomly for each PA model within the range of [
%–
%] with step size of 5%. By sweeping the sampling design parameters, multi-tone HB results are optimized by the MOMVO algorithm and are exerted to the output layer (
Step-
).
2.3. Electromagnetic-Based HPA Design with TLs
After modeling and optimizing the HPA design, the post-layout must be generated for fabrication. With the target of polishment to the final layout, any designer can remodel the TLs of the optimized HAP in
Step-
with other
TLines microstrip library palettes existing in the ADS platform (
Step-
). This step helps the designers to generate more reliable layouts that have passed the EM simulations and are more suitable schemes for fabrication in the RF companies.
3. Practical Implementation of Coarse and Fine Modeling Approach
Our proposed automated optimization method is exerted on the CPU execution environment, featuring an Intel Core i7-4790 CPU @ 3.60 GHz with 32.0 GB RAM. In this work, we use Ampleon CLF1G0060-10 GaN HEMT as the dynamic load-line model of this transistor, which has been verified in [
20]. The transistor is modeled by the regression DNN that is based on X-parameters by obtaining boundary knowledge of
,
,
and
from the data sheet of the transistor and by generating the
items that are described in (
3) and (
4).
The fine modeling phase starts with one hidden layer, and the number of neurons are predicted using the BO method.
Figure 3 shows the normalized RMSE in one hidden layer that is around 0.5 and is not acceptable. Hence, sequentially, the number of hidden layers are increased such that in the
layer with 300 neurons, the normalized RMSE is decreased to 0.07. The total sampling data contains 1700 sequences in 200 sweeping points and includes multi-segment output results.
After modeling the GaN HEMT, it is time to predict the best HPA topology and configuration that fits best to the used transistor model. For this case, the classification DNN is employed. In order to achieve various topologies and configurations, the SRFT method is employed, resulting in 11 different amplifier models that differ in the number of TLs. For each model, 500 sequences, including multi-segment one-tone and two-tone CW values (i.e., , , , , , and ), over the operation bandwidth are prepared, where in total, data are generated. The hyperparameters of the classification DNN are predicted using the BO method, where, in the LSTM layer with 350 neurons, it obtains the testing accuracy of 98.2%.
After constructing the classification DNN, the determined one-tone and two-tone features of the used transistor model are entered to the DNN. The trained DNN predicts that the eighth amplifier model achieved from the SRFT method can be a suitable topology and structure to start designing (see
Figure 4).
Afterwards, the coarse modeling including the regression DNN is applied and the hyperparameters are taken directly from the fine modeling that involves four LSTM layers and 300 neurons in each hidden layer. The input layer and output layer of this DNN include S-parameters , , and , respectively. The accuracy of this DNN (i.e., network) is significant since it paves the way of constructing DNN for sizing the design parameters. In simple words, modeling the HPA design with S-parameters can make the designer sure of the accuracy of the constructed DNN.
The last regression DNN is constructed for sizing the design parameters of the selected topology (i.e.,
amplifier model from SRFT method). For this case, easily, the hyperparameters are taken from coarse modeling, and input layer and output layer features are constructed. As shown in
Figure 1, the input layer consists of
,
,
, and IMDs, where the output layer consists of the optimized values of these input features using the MOMVO algorithm (see Equation (
5)). It must be noted that in this phase, design rules are also applied for preparing the layout that can pass the EM simulation. The total sampling data for this regression DNN is around 10,000, which includes multi-segment input layer features over the operational frequency band. Finally, the polishing process is employed for finding the appropriate TL microstrip model among various existing libraries in the ADS tool and also for generating a post-layout that passes the EM simulation and is ready to be fabricated in the companies.
4. Fabrication and Measurement of Optimized HPA
The proposed optimization method using vendor component models is applied for optimizing the design of a 10 W HPA consisting of a CLF1G0060-10 GaN HEMT transistor. This transistor model is a 6 GHz general-purpose RF power transistor that is without any in-package matching network, and its wideband verified nonlinear model is provided by the manufacturer. The targeted HPA design is optimized for enhancing efficiency, power gain, output power, and linearity in the operational frequency band of 1.8 GHZ to 2.2 GHz (i.e., cellular frequency band used in mobile wireless communications).
The optimized HPA is biased at a drain voltage of 50 V with a quiescent drain current of 40 mA and fabricated on a Rogers RO4350B substrate with
= 3.66 and a thickness of 0.508 mm. The final HPA layout with design parameters is depicted in
Figure 5. The implemented HPA’s performance is measured with CW signals using the Maury Microwave’s MT2000 mixed-signal characterization system. The measurement setup enables the large signal measurements with frequency and power sweep using single-tone, two-tone, and modulated signals.
The wideband S-parameter simulation and measurement results are shown in
Figure 6, and it illustrates the
larger than 15.5 dB and the
lower than −16 dB in the operational frequency band. The
,
, and
specifications are displayed in
Figure 7, revealing the
between 12.5 dB and 14.1 dB,
higher than 54%, and around 41 dBm
at p3dB.
Figure 8 shows the simulated and measurement
and
variation with the increasing output power at the center and corner frequencies.
High and low
,
, and
inter-modulation products (i.e.,
,
, and
) are depicted in
Figure 9. LTE supports six different signal bandwidth options: 1.4 MHz, 3 MHz, 5 MHz, 10 MHz, 15 MHz, and 20 MHz. We have tested the designed amplifier with the widest bandwidth signal of the LTE standard, 20 MHz, to evaluate its linearity.
Figure 10 presents the modulated signal response of the optimized HPA design with and without digital predistortion (DPD). In the measurement stage, a 20 MHz LTE signal with 10.7 dB peak-to-average power ratio (PAPR) is used for the analysis. For our optimized HPA, better than −50 dBc adjacent channel power ratio (ACPR) is achieved at around 32.1 dBm average output power.
As mentioned above, the HPAs being nonlinear circuits during their design, various parameters must be considered.
Table 2 presents the state-of-the-art situation on the employed optimization algorithms and methodologies for designing amplifiers. Our proposed method is concurrently optimizing various design specifications such as efficiency, output power, power gain, and various IMDs, automatically, while in the recently published literature, optimization is limited to a lower number of specifications.
5. Conclusions
In this work, we present an automated optimization process for solving high-dimensional modeling problems of HPA designs. In our technique, we apply coarse and fine modeling by using four DNNs based on transistor modeling with X-parameters (fine model), classification DNN, coarse modeling, and, finally, multi-objective optimization method for sizing the design parameters. The proposed method provides an advanced high-accuracy multi-objective optimization process for training data in a high-dimensional space with optimal hyperparameters. For validating the proposed multi-tone optimization method, we design and fabricate a 10 W HPA design; the results reveal high-performance outcomes in terms of power gain, efficiency, output power, and linearity.
The achieved results show that despite the complexity of the goals and large amount of the nonlinear parameters to be optimized, automated design techniques and optimization algorithms can be applied to nonlinear RF circuit design with regular workstations. Cross-relations of the optimization methods and device models have key roles for performance of automated RF circuit designs. Future studies can be focused on developing and integrating objective functions, considering the weighting algorithms and device model interpolations.