Next Article in Journal
Text-Enhanced Graph Attention Hashing for Cross-Modal Retrieval
Previous Article in Journal
Investigation of Laser Ablation Quality Based upon Entropy Analysis of Data Science
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Some Improvements on Good Lattice Point Sets

1
Research Center for Frontier Fundamental Studies, Zhejiang Lab, Kechuang Avenue, Zhongtai Sub-District, Yuhang District, Hangzhou 311121, China
2
Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science, BNU-HKBU United International College, Zhuhai 519087, China
3
School of Mathematics, Renmin University of China, No.59, Zhongguancun Street, Haidian District, Beijing 100872, China
4
Department of Statistics and Data Science, Faculty of Science and Technology, BNU-HKBU United International College, 2000 Jintong Road, Tangjiawan, Zhuhai 519087, China
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Submission received: 25 September 2024 / Revised: 24 October 2024 / Accepted: 25 October 2024 / Published: 27 October 2024
(This article belongs to the Section Information Theory, Probability and Statistics)

Abstract

:
Good lattice point (GLP) sets are a type of number-theoretic method widely utilized across various fields. Their space-filling property can be further improved, especially with large numbers of runs and factors. In this paper, Kullback-Leibler (KL) divergence is used to measure GLP sets. The generalized good lattice point (GGLP) sets obtained from linear-level permutations of GLP sets have demonstrated that the permutation does not reduce the criterion maximin distance. This paper confirms that linear-level permutation may lead to greater mixture discrepancy. Nevertheless, GGLP sets can still enhance the space-filling property of GLP sets under various criteria. For small-sized cases, the KL divergence from the uniform distribution of GGLP sets is lower than that of the initial GLP sets, and there is nearly no difference for large-sized points, indicating the similarity of their distributions. This paper incorporates a threshold-accepting algorithm in the construction of GGLP sets and adopts Frobenius distance as the space-filling criterion for large-sized cases. The initial GLP sets have been included in many monographs and are widely utilized. The corresponding GGLP sets are partially included in this paper and will be further calculated and posted online in the future. The performance of GGLP sets is evaluated in two applications: computer experiments and representative points, compared to the initial GLP sets. It shows that GGLP sets perform better in many cases.

1. Introduction

Number theory plays a significant role in information theory, providing fundamentals for many applications. Number-theoretic methods aim at deriving a point set uniformly scattered to replace the Monte Carlo (MC) method in the numerical evaluation of multiple integrals. Many methods have been proposed, including Korobov (1959) [1], Hua and Wang (1981) [2], Niederreiter (1992) [3], and Hlawka (1962) [4], to obtain a point set scattered as uniformly as possible over the unit hypercube C s = [ 0 , 1 ] s , called number-theoretic net (NT-net). This net has been widely utilized in experimental designs, high-dimensional integration, statistical inference, and computer experiments. Computer experiments involving a large number of runs and/or factors have become increasingly prevalent in various fields, driven by rapid developments in computational techniques. A key goal of these experiments is to approximate the true underlying system with a simpler surrogate metamodel. An adequate design of experiments can significantly reduce the bias between the metamodel and the true model. Space-filling designs are the most popular type of designs for computer experiments. A space-filling design method aims to uniformly scatter the design points across the experimental domain, similar to the generation of NT-net. To measure the space-filling properties of a point set or design, many criteria are used, including the maximin distance (Jonhson et al., 1990 [5]) and mixture discrepancy (Zhou et al., 2013 [6]). The Kullback–Leibler (KL) divergence (Kullback and Leibler 1951 [7]) is suitable for measuring the divergence between the distribution of a point set and the uniform distribution. KL divergence is widely used in many applications of information theory, and it can be used to measure space-filling properties.
There are many estimation methods used for computing the KL divergence or relative entropy between two continuous multivariate distributions based on samples. The KL divergence between a distribution and a uniform distribution can be simplified to the problem of estimation of entropy. Let K L ( f g ) denote the KL divergence between two multivariate densities: f ( x ) and g ( x ) . For an s-variate uniform distribution over a unit hypercube, denoted as C s = [ 0 , 1 ] s , the s-variate density g ( x ) = 1 for x C s . Therefore,
K L ( f g ) = f ( x ) l o g f ( x ) g ( x ) d x = f ( x ) l o g ( f ( x ) ) d x = H ( x ) ,
where H ( x ) represents the entropy of the continuous random variable x . Joe (1989) [8] provided the estimator of functionals of a multivariate density, including the entropy function based on a kernel density estimate, such that an empirical distribution and numerical integration are not necessary. Hence, in our case, Joe’s estimator of the KL divergence between the distribution of a point set over a unit hypercube X = { x 1 , , x n } and the uniform distribution is given by the following:
K L ^ ( f g ) = 1 n i = 1 n l o g ( f ^ ( x i ) ) ,
where f ^ ( x ) is the kernel density estimate of f ( x ) with bandwidth h > 0 , and
f ^ ( x ) = 1 n h s i = 1 n k ( x x i h ) , x R s ,
in which k is an s-variate density. Joe (1989) [8] provided some suggested choices of bandwidths and kernels for s 4 . In this paper, we chose the normal kernel and Scott’s rule for bandwidth (Scott 2015 [9]), in which h = n 1 s + 4 , and sometimes h is fixed with specifications in the tables of this paper.
The L p -distance of an n × s design D is defined by M i n d p ( D ) = m i n { d p ( x , y ) : x y , x , y D } , p 1 , where d p ( x , y ) = k = 1 s | x k y k | p as the L p -distance of any two rows x and y in D. Remark that the definition of the L p -distance between two rows of a design does not necessarily involve taking the 1 p power as in the Euclidean distance between two points and the design can be defined on C s or another finite field. The L p -distance of a design, also known as the separation distance, is the minimal distance between pairs of design points. The goal of a maximin distance design is to maximize this separation distance. On the other hand, discrepancy, which measures uniformity, should be minimized to achieve a uniform design. Discrepancy has been widely used in number-theoretic methods to evaluate the uniformity of a set of points. Within the context of reproducing kernel Hilbert spaces, Hickernell (1998) [10] proposed generalized L 2 -discrepancies including centered L 2 -discrepancy (CD) and wrap-around L 2 -discrepancy (WD). Zhou et al. (2013) [6] proposed mixture discrepancy (MD), and indicated that MD can overcome the shortcomings of CD and WD. For a point set in the domain of a unit hypercube C s = [ 0 , 1 ] s that can be expressed as a matrix X = ( x i k ) n × s , the square value of MD is defined by the following:
M D 2 ( X ) = 1 n 2 i = 1 n j = 1 n k = 1 s ( 15 8 1 4 | x i k 1 2 | 1 4 | x j k 1 2 | 3 4 | x i k x j k | + 1 2 | x i k x j k | 2 ) 2 n i = 1 n k = 1 s ( 5 3 1 4 | x i k 1 2 | 1 4 | x i k 1 2 | 2 ) + ( 19 12 ) s .
To evaluate the discrepancy of a design D = ( d i k ) in which the levels are labeled by integers, the elements can be linearly transformed to X = ( x i k ) C s by x i k = ( d i k 0.5 ) / n . However, the computational complexities of MD and L p -distance result in significant time consumption when n or s is too large. Instead, to evaluate the uniformity of the point set X , this paper adopts the Frobenius distance (FD); see Horn and Johnson (2012) [11]. A natural uniform condition on X is that the mean vector is 0 with a covariance matrix close to 1 12 I s , where I s is the identity matrix in R s . The uniformity of X can be measured by the Frobenius distance between the two matrices, the covariance matrix of X denoted by A = ( a i k ) and B = ( b i k ) = 1 12 I s , as follows:
F D ( X ) = A B F = t r a c e ( A B ) T ( A B ) = i = 1 n k = 1 s ( a i k b i k ) 2 1 / 2 .
Since the calculation of FD in this paper involves point sets with large n, it is not necessary to transform the design D into X for computational convenience.
The good lattice point (GLP) set proposed by Korobov (1959) [1] is the most popularly used method for generating an NT-net. Given a generating vector h = ( h 1 , , h s ) satisfying 1 h k < n , h j h k for j k , and the greatest common divisor g c d ( h k , n ) = 1 , so that each column is a permutation of { 1 , , n } , the design D = ( d i k ) with
d i k = i h k ( m o d n ) , i = 1 , , n , k = 1 , , s ,
is called the lattice point set of the generating vector h , where the multiplication operation modulo n is modified such that the result falls within the range [ 1 , n ] . Without loss of generality, h 1 is always set to be 1. If D has the minimal discrepancy among all possible generating vectors, it is called a good lattice point set. Hua and Wang (1981) [2] indicated that GLP sets tend to have a lower discrepancy. Searching for a GLP in large n and s requires long calculations. Saltykov (1963) [12] provided tables of suggested generating vectors for GLP sets for 3 s 10 and 100 < n 155,093 according to Korobov’s method (Korobov 1959 [1]). Saltykov’s tables have been widely utilized in numerical analyses. Hua and Wang (1981) [2] further extended these tables to include more values of n and s. Fang and Wang (1994) [13] attached the extended tables as appendices, and further developed the applications of GLP sets in statistics, including experimental designs and representative points. However, the space-filling properties of GLP sets can be further improved so that the tables of suggested generating vectors of GLP sets can be updated.
To study the space-filling properties of GLP sets, Zhou and Xu (2015) [14] treated the GLP set as a special class of regular designs. Therefore, the theorem by Zhou and Xu (2015) [14] stating that a linear-level permutation of a regular design does not decrease the L p -distance also applies to GLP sets. The linear-level permuting design is defined by the following:
D u = D + 1 n u = { x i + u , x i D } ( m o d n ) , i = 1 , , n ,
with a given permutation vector u = ( u 1 , , u s ) , and u k { 0 , 1 , , n 1 } , k = 1 , , s . Qi et al. (2018) [15] defined the best design under a certain space-filling criterion among the linear level permuting designs of a GLP set as the generalized good lattice point (GGLP) set. They noted that a specific type of permuting vector u ( i ) = ( i , , i ) , i { 1 , , n 1 } often results in better L p -distances. With a given acceptable number of searches, Qi et al. (2018) [15] considered n 1 u ( i ) vectors of this special type and then conducted a random search for all possible u vectors, fixing the first element u 1 as 0. Searching for the best permuting vector is a combinatorial optimization problem with n s 1 possibilities. When n and s are large, a simple random search is not efficient; instead, a stochastic optimization algorithm is adequate. The threshold accepting (TA) algorithm proposed by Dueck and Scheuer (1990) [16], as a variation of the simulated annealing algorithm, has solved many NP-hard optimization problems. It has been successfully applied in constructing uniform designs, as noted by Fang et al. (2000) [17]. There are many adjustments for TA settings, among which, we consider the version by Fang et al. (2017) [18].
To update the tables of generating vectors for GLP sets, this paper considers the best special permuting vector from n 1 searches as a benchmark and then searches for the optimal u in the form of ( 0 , u 2 , , u s ) using the TA algorithm. In Section 2, we provide a brief review of GGLP, including related basic theoretical evidence according to Zhou and Xu (2015) [14] and Qi et al. (2018) [15]. The details of conducting the TA algorithm to search for GGLP sets are elaborated in Section 3. With the given initial GLP sets provided in work by Fang and Wang (1994) [13], the resulting GGLP sets, searched under the Frobenius distance, are evaluated by mixture discrepancy, maximin distance, and KL divergence in Section 4. According to the KL divergence, the distributions of GGLP sets and the initial GLP sets are very close, as the KL divergence is nearly the same when n is large. For small n, most GGLP sets exhibit lower KL divergence. The mixture discrepancy of GGLP sets can be improved for all n. Moreover, we also compare the performances in several case studies, including representative points and computer experiments. For a computer experiment, we generate training data according to pairs of GLP and GGLP sets from the Wood function, and predict a randomly generated testing set with the Kriging modeling technique. As for the application of representative points, we utilize the Box–Muller method to transform the points generated with Monte Carlo, GLP, and GGLP methods, respectively, into quasi-Monte Carlo points of bivariate mixture normal distribution, and use them in bootstrap resampling to approximate parameters of the distribution. The performances are evaluated by the mean squared error. In many cases, GGLP sets outperform GLP sets.

2. Review on Generalized Good Lattice Point Sets

Zhou and Xu (2015) [14] specified that a regular q s r design D with q levels for each factor can be constructed by D = M G , where M is the q s r full factorial design over the finite ring Z q = { 0 , 1 , , s 1 } and G is a generating matrix. A regular q s r design defined in such a way has s r independent columns spanning an ( s r ) -dimensional linear space over the ring Z q and r-dependent columns specified in G. For any row, x = ( x 1 , , x s ) D , the L p -Lee weight is defined as L e e p ( x ) = k = 1 s { m i n ( x k , s x k ) } p , implying that L e e p ( | x y | ) d p ( x , y ) , where | x y | = ( | x 1 y 1 | , , | x s y s | ) . Similar to the L p -distance of D, the L p -Lee weight of a design D is defined as M i n L e e p ( D ) = m i n { L e e p ( x ) : x 0 , x D } . Zhou and Xu (2015) [14] have proved the following Lemmas 1 and 2.
Lemma 1.
The L p -distance of a regular q s r design D over Z q is M i n d p ( D ) = M i n L e e p ( D ) .
Lemma 2.
For a regular q s r design D over Z q , a linear level permutation does not decrease the L p -distance, i.e., M i n d p ( D u ) M i n d p ( D ) where D u is as defined in (4).
A GLP set can be considered a type of regular design featuring an independent column M = ( 0 , 1 , , n 1 ) T and the generating matrix G = h . The only difference between GLP sets and regular designs is the replacement of 0 with n in GLP sets. Zhou and Xu (2015) [14] indicated that the special regular design, i.e.,
D 0 = M h ( m o d n ) ,
as well as the corresponding GLP set D derived by replacing the first row ( 0 , , 0 ) of D 0 with row ( n , , n ) , have the same L p -distance.
Proposition 1.
The regular design  D 0  defined in (5) and the resulting good lattice point set D have the same L p -distance, and according to Lemma 1,
M i n d p ( D ) = M i n d p ( D 0 ) = m i n { L e e p ( x ) : x n 1 s T , x D } .
Considering a GLP set as a special case of regular designs, Qi et al. (2018) [15] provided the following corollary:
Corollary 1.
Given an n × s good lattice point set D, the linear level permutation of D does not decrease the L 2 -distance, i.e., d 2 ( D u ) d 2 ( D ) , where D u is defined as (4).
Zhou and Xu (2015) [14] also provided the upper bound of L p -distance for any p. For a given initial GLP set, the linear level permuting design with the maximal L 2 -distance or with the minimal MD is called the generalized good lattice point set (Qi et al., 2018 [15]). When exploring the best permuting vector u , it is sufficient to permute the s 1 dependent column while keeping the first independent column unchanged, allowing the first element of u to be fixed as 0. However, Qi et al., 2018 [15] found that the special permuting vector u ( i ) increases the L p -distance for p = 2 in most cases. Hence, when limiting the number of searches to K times with a given initial GLP set, they first considered the n 1 special u ( i ) vector and then conducted a random search for u of the form u = ( 0 , u 2 , , u s ) . We incorporate the threshold-accepting algorithm for searching the best permuting vector by taking the best special u ( i ) as a benchmark. Since the tables of GLPs in Fang and Wang (1994) [13] refer to large n and s, the time consumption of the MD or L 2 -distance is not admissible. Instead, when conducting the TA algorithm, this paper considers the Frobenius distance defined in (3). When n is large, the KL divergence of GGLP sets is almost the same as the initial GLP sets. This indicates that the distributions of GLP and GGLP sets are very close. Hence, the KL divergence defined in (1) is used only for comparing the results and is not considered a criterion of the TA algorithm.

3. Modified Threshold Accepting Algorithm

To find the best linear level permuting design of a given GLP set denoted as X 0 , the threshold accepting algorithm starts with a random initial permuting vector, say u 0 , as the current solution u c , and then randomly chooses a neighborhood of u c as a candidate new solution u n . The neighborhood of u c is defined by mutating one randomly chosen element u k c from u c , k = 2 , , s , satisfying u k n u k c and u k n { 0 , 1 , , n 1 } . Here, k starts from 2 since the first element of u is fixed as 0 and remains unchanged during iterations. We denote the corresponding permuting designs of X 0 as X u c and X u n , given by (4). Under the current threshold, T i 0 , from a predetermined sequence, T 1 > > T I = 0 , the current solution is updated by X n if f ( X n ) f ( X c ) T i in the minimization problem or f ( X c ) f ( X n ) T i in the maximization problem, where f ( · ) represents any space-filling criterion. After J iterations of searches, the threshold is updated, and the procedure is repeated. The final threshold is set to 0, which implies a transition to a greedy algorithm. In this paper, we adopt the diminishing manner of threshold sequence and the historical optimum reversion proposed by Fang et al. (2017) [18]. Moreover, we include a uniqueness mechanism to ensure that each candidate new solution, represented by the linear-level permuting vector u , only appears once.

3.1. Threshold Sequence and Historical Optimum Reversion

In the literature, the threshold sequence of TA is always data-driven and diminishes exponentially, ending at 0. With a specified space-filling criterion f ( · ) , J random linear level permuting vectors are generated, and the f ( · ) values of the J resulting permuting designs are measured. Let R denote the range of f ( · ) values. With a predetermined ratio 0 < α < 1 , the first threshold is given by T 1 = α R . If the sequence diminishes in an exponential manner, the thresholds are defined as follows:
T i = I i I T i 1 , i = 2 , , I ,
where I is the number of thresholds. Fang et al. (2017) [18] found that the exponential diminishing manner approaches 0 too early so that the last several thresholds are all 0s. Therefore, Fang et al. (2017) [18] adopted a mixture diminishing manner in which the sequence first diminishes exponentially, and then starts to diminish linearly if T i c T 1 , 0 < c < 1 , given by
T i = I i 1 I i T i 1 .
Moreover, in Fang et al. (2017) [18], the best solution is recorded and updated during the entire search process, and the current solution is replaced by the best solution before updating the threshold. This mechanism avoids the situation in which TA visits the best solution and then slips away in the following iterations.

3.2. Flowchart

With the initially specified GLP set X 0 , a space-filling criterion f ( · ) , threshold parameters α = 0.1 , c = 0.03 , I = 10 , and the number of total searches K = 10 4 , 5 × 10 4 , 10 5 depending on the case, the flowchart of the modified TA algorithm for GGLP is presented in Figure 1.
The n 1 searches are first conducted for the special type of permuting vectors u ( i ) . The best one among them under a specified space-filling criterion is recorded as a benchmark. With large n and s, the Frobenius distance in (3) is adopted as the criterion while the MD or maximin distance can be adopted for small-sized cases. The TA result is compared with the benchmark under the given criterion. The final GGLP set is also evaluated by the MD and L p -distance for p = 2 . Since we adopt a uniqueness mechanism, the number of searches in the TA algorithm, I × J , should be less than n s 1 , total possibilities for permuting vectors. The comparisons of the resulting GGLP sets with the initial GLP sets are illustrated in the next section.

4. Comparisons Between GLP and GGLP Sets in Space-Filling Criteria and Applications

In this section, we compare the space-filling properties in terms of MD, maximin distance, and KL divergence between the initial GLP sets and the resulting GGLP sets obtained from our approach, with respect to L 2 -distance M i n d 2 ( · ) , MD, and FD, respectively. We can see that MD can be improved by our algorithm with respect to MD or FD. In most cases of small-sized n, GGLP sets have lower KL divergence from the uniform distribution than the initial GLP sets. Frobenius distance is easy to calculate, especially for large n. The maximin distance M i n d 2 ( · ) is only efficient in cases with small-sized n. To evaluate the performances of GGLP sets in practice, we utilize pairs of initial GLP sets and the resulting GGLP sets in two applications of computer experiments and representative points, respectively.

4.1. Comparisons in Maximin Distance, Mixture Discrepancy, and KL Divergence

The findings of this subsection are listed below.
  • The initial GLP sets are good enough and even better than the GGLP sets in the literature.
  • For small-sized n, our algorithm can improve both MD and maximin distance. When using M i n d 2 ( · ) as the criterion of our algorithm, both criteria can be improved in most cases. However, when using MD as the criterion, M i n d 2 ( · ) cannot be improved in most cases, except for large s.
  • Most of the resulting GGLP sets for small-sized n have lower KL divergence from the uniform distribution than the initial GLP sets. For large-sized n, the KL divergence of GGLP sets and the initial GLP sets is nearly the same.
  • When n is large, M i n d 2 ( · ) is unchanged so that it is no longer suitable to be considered as a criterion.
  • For large n, using the Frobenius distance as the criterion can improve MD, which is not better than the MD when using MD as the criterion, but is better than the MD when using M i n d 2 ( · ) as the criterion.
  • The Corollary 1 does not hold for MD, as we find a contradictory case where the MD of linear-level permuting design is worse than the initial GLP sets.
  • The number of searches K hardly affects the result of our algorithm, and even larger K sometimes leads to worse results.
For small-sized n and s, the criterion of the TA algorithm can be set as MD or the maximin distance with p = 2 , i.e., the L 2 -distance M i n d 2 ( · ) . Table 1 shows the comparisons of M i n d 2 ( · ) , MD, and KL divergence among the initial GLP sets in the work by Fang and Wang (1994) [13], and the resulting GGLP sets with respect to M i n d 2 ( · ) and MD, respectively.
The K in Table 1 is 10 4 , except for the 29 × 4 setting K = 1.5 × 10 4 . The best u of the form [ i ] in Table 1 represents that the best result comes from the special type of permuting vector u ( i ) while the best u starting from 0 is produced by the TA algorithm. For n = 17 and s = 8 , Qi et al. (2018) [15] provided a GGLP set with M i n d 2 ( · ) = 204 and M D = 0.87 under either maximin distance or MD, using an initial GLP set with M i n d 2 ( · ) = 204 and M D = 0.89 . In our case, the initial 17 × 8 GLP set is already better than the resulting GGLP set of Qi et al. (2018) [15]. Our algorithm can further improve the two space-filling criteria. The KL divergence of the resulting GGLP sets is also improved in most cases. With respect to MD, our algorithm cannot improve the L p -distance most of the time but can provide better MD. If taking maximin distance as the criterion, our algorithm can simultaneously improve the L p -distance and MD in most cases, but sometimes, MD may be worse as in the case of the 13 × 5 GGLP set. It also indicates that Corollary 1 does not hold for MD. Similarly, the KL divergence is improved, except for the 13 × 5 GGLP set. It indicates the consistency between mixture discrepancy and the KL divergence from the uniform distribution. More GGLP sets with respect to maximin distance and MD are included in Appendix A. For large n, the calculation of L p -distance or MD is not admissible so we suggest adopting Frobenius distance in (3) instead. As for the KL divergence, the differences between GLP and GGLP sets are very small and can be considered computational errors. We first conduct our algorithm with respect to M i n d 2 ( · ) and MD on the three illustrative cases, 1069 × 5 , 3997 × 8 , 4661 × 10 in Table 2, and then compare them with the results (with respect to the Frobenius distance) in Table 3.
In Table 2, the three cases are calculated with K = 10 4 and 10 5 , respectively, and have the same results, except for 4661 × 10 with respect to MD. Among them, TA hardly beats the benchmarks. With 10 times larger K, the only TA result has improved slightly from 0.005262 to 0.005259. Hence, increasing K is not crucial in our case. The results of the three cases when using Frobenius distance (FD) as a criterion are listed in Table 3, where M i n d 2 ( · ) and the MD of the initial GLP sets are the same as in Table 2, and will be omitted. M i n d 2 ( · ) is hardly improved no matter the criterion. Even though the MD of GGLP sets under the Frobenius distance is no better than the one under MD, it is better than the MD of GGLP sets under maximin distance. Similarly, as shown in Table 2, taking a larger K = 10 5 for the case 3997 × 8 even leads to a worse result. More GGLP sets with large n under Frobenius distance are included in Appendix A, and calculations will continue to be updated and posted online in the future.

4.2. Case Study

We conducted two applications to evaluate the performances of generalized good lattice point sets obtained in our approach, compared to the initial GLP sets. For a computer experiment, the number of experimental runs can be large since the experimental cost is often affordable. In the first case study, we use the Wood function as the true underlying model and adopt the Kriging modeling technique to obtain the metamodel. The performance of designs is evaluated by the mean squared error (MSE) of prediction. It turns out that the GGLP sets perform better in most cases. In the second case study, we combined the Box–Muller method with MC, GLP, and GGLP methods to generate QMC points for approximating a bivariate mixed normal distribution. The approximation performance was measured using the MSE of bootstrap resampling. The results indicate that GLP and GGLP points significantly outperform MC points, with GGLP often outperforming GLP.

4.2.1. Computer Experiment of the Wood Function

The Wood function has been widely used to evaluate the performance of optimization algorithms, defined as follows:
y ( x ) = 100 ( x 1 2 x 2 ) 2 + ( 1 x 1 ) 2 + 90 ( x 4 x 3 2 ) 2 + ( 1 x 3 ) 2 + 10.1 ( ( x 2 1 ) 2 + ( x 4 1 ) 2 ) + 19.8 ( x 2 1 ) ( x 4 1 ) ,
where x = ( x 1 , x 2 , x 3 , x 4 ) [ 2 , 2 ] 4 . Suppose there are n design points x i , i = 1 , , n , and the Wood function gives outputs y i = y ( x i ) , the universal Kriging model is defined as follows:
y ( x ) = j = 0 L β j B j ( x ) + z ( x ) ,
where the set of B j is a predetermined basis, and z ( x ) is a Gaussian process. The ordinary Kriging model is the most commonly used one with the form y ( x ) = μ + z ( x ) , in which μ is the overall mean of y i . We first obtain six pairs of 4-factor GLP and GGLP sets with different numbers of design points and their corresponding responses. The Kriging modeling is conducted using the ooDACE toolbox in MATLAB to obtain the metamodel, in which three kinds of bases are tried, respectively, i.e., the ordinary Kriging model denoted by P o l y 0 , the polynomial basis with the first order, denoted by P o l y 1 , and with the second order, denoted by P o l y 2 . For each metamodel, the MSE between the predicted outputs of the same set of M random points and the true responses is calculated by the following:
M S E = 1 M m = 1 M ( y m y ^ m ) 2 ,
in which we take M = 1000 . The resulting MSE is listed in Table 4.
The prediction MSE does not simply decrease with a large number of runs, indicating the significance of the design in computer experiments. The observations are listed as follows:
  • The Kriging model with a second-order polynomial basis is more adequate for the Wood function.
  • The best result is produced by GGLP sets with n = 3001 and the Kriging model with a second-order polynomial basis.
  • For n = 562 , with the Kriging models P o l y 0 and P o l y 1 , GLP sets are better than GGLP sets. In all other cases, GGLP sets perform better.
  • The effectiveness of designs with n = 701 and n = 3001 is much better than others. The designs with n = 701 are more effective than the ones with n = 1019 , 2129 .

4.2.2. Quasi-Monte Carlo Points

Discrete approximations of continuous distributions are widely utilized, particularly when integration techniques fail to provide analytic solutions for complex multivariate distributions. Let x be a p-variate random vector following a continuous distribution F ( x ) . The MC method generates a random sample x 1 , , x k with an empirical distribution given by
F k ( x ) = 1 k j = 1 k I x j x ,
where I { A } is the indicator function of event A. Under regular assumptions, the weak law of large numbers implies that F k ( x ) converges to F ( x ) in probability as k . Therefore, F k ( x ) can be used to estimate F ( x ) .
There are three popular methods for representative points, namely, the MC method, the QMC method, and the MSE method. The convergence rate for numerical integration by the MC method is O k 1 / 2 , independent of dimension p, which is slow. The mean squared error method provides a set of representative points by minimizing the MSE of the discrete approximation to a given continuous distribution. Another approach involves a set of points of size k generated by QMC methods, called quasi-random F-numbers (or QMC F-numbers) with respect to F ( x ) with a convergence rate of O ( log k ) p / k , offering a faster alternative to the MC method when p is not large. Fang and Pan (2023) [19] provided evidence that representative points outperform random samples in approximating distributions. Simulation results indicate that QMC and MSE representative points perform better in most comparisons, suggesting that these representative points have significant potential in statistical inference.
Assume that the random vector x F ( x ) , x R p has a stochastic representation:
x = h ( y ) , y U C s , s p ,
where the random vector y is uniformly distributed over the unit cube C s , and h is a continuous function on C s . Utilizing number-theoretic methods to generate a uniformly scattered set of points c i , i = 1 , , n on C s , let
x i = h c i , i = 1 , , n .
The points of ( x 1 , x 2 , , x n ) are referred to as QMC points when F ( x ) is the uniform distribution on a s-dimensional unit cube C s . We consider using MC, GLP, and GGLP points to generate QMC points to approximate multivariate mixed distributions. Let the cumulative distribution function (CDF) of X be denoted by F x . Algorithms for generating quasi-random samples ( x 1 , x 2 , , x n ) usually rely on a sequence of uniform quasi-random samples ( u 1 , u 2 , , u n ) . If the CDF F x is continuous, then given a uniform sample u , the desired sample x may be obtained by the formula
x = F x 1 ( u ) ,
where F x 1 denotes the inverse function.
For the mixture of bivariate normal distributions, M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) , the CDF F x ( x ) is given by the following:
F x ( x ) = α · ϕ ( x | μ 1 , Σ 1 ) + ( 1 α ) · ϕ ( x | μ 2 , Σ 2 ) ,
where ϕ ( x | μ , Σ ) denotes the bivariate normal density function with mean vector μ and covariance matrix Σ .
If we use the same inverse transform method as before, namely, x = F x 1 ( u ) , it is important to note that the CDF F x ( x ) of M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) has no simple closed form. Therefore, the inverse operation requires advanced numerical methods.
Box–Muller (1958) [20] proposed a transformation method that constructs samples from a normal distribution using samples uniformly distributed. If ( U 1 , U 2 ) are a pair of uniform and independent random variables in ( 0 , 1 ) , then ( X 1 , X 2 ) are a pair of standard normal variables N ( 0 , 1 ) , also independent, with X 1 and X 2 given by the following:
X 1 = 2 ln U 1 cos 2 π U 2 , X 2 = 2 ln U 1 sin 2 π U 2 .
To obtain a pair of variables Y 1 , Y 2 that follow a N ( μ , σ 2 ) distribution, we use the following transformations:
Y 1 = μ + σ X 1 , Y 2 = μ + σ X 2 .
Therefore, it is possible to generate QMC points from the M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) distribution using the Box–Muller method by generating uniformly distributed samples over a unit hypercube C 5 = [ 0 , 1 ] 5 .

4.2.3. The Algorithm to Generate QMC Points from M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 )

Based on the Box–Muller method, the problem of generating samples from the distribution M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) is transformed into the problem of sampling over a unit hypercube C 5 = [ 0 , 1 ] 5 . We employ three different methods: MC, GLP, and GGLP methods to generate five-dimensional uniformly distributed random points over this unit hypercube. Consider the following algorithm to generate a random sample from M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) .
Step 1. 
Find the positive definite square root A i such that A i T A i = Σ i , i = 1 , 2 .
Step 2. 
Generate random numbers u = ( u 1 , u 2 , u 3 , u 4 , u 5 ) uniformly distributed over a five-dimensional unit hypercube using MC, GLP, and GGLP methods.
Step 3. 
Use the Box–Muller method to calculate the corresponding standard normal variates n 1 , n 2 , n 3 , n 4 .
r 1 = 2 log u 1 , r 2 = 2 log u 3 , n 1 = r 1 cos 2 π u 2 , n 2 = r 1 sin 2 π u 2 , n 3 = r 2 cos 2 π u 4 , n 4 = r 2 sin 2 π u 4 .
Step 4. 
Calculate
N 1 = μ 1 + A 1 n 1 n 2 , and N 2 = μ 2 + A 2 n 3 n 4 .
Step 5. 
Generate a two-dimensional random vector X . If u 5 α , take X = N 1 ; otherwise, take X = N 2 . Deliver X , which follows M N 2 α , μ 1 , Σ 1 , μ 2 , Σ 2 .
Step 6. 
Repeat steps 2 to 5 n times to obtain a set of QMC points X 1 , , X n of M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) .
For a univariate mixture of normal distributions, denoted as M i x N ( α , μ 1 , σ 1 2 , μ 2 , σ 2 2 ) , a MixN is a convex combination of two independent normal distributions with five parameters. The normal distribution belongs to the location-scale family. However, the convex combination of two normal distributions is no longer considered a location-scale distribution. There exist two notable subclasses that arise when the two normal components either share the same location parameter or possess identical scale parameters. Similarly, we can classify types of the bivariate mixture M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) based on these attributes.
  • Scale mixture: μ 1 = μ 2 = μ and Σ 1 Σ 2 ;
  • Location mixture: μ 1 μ 2 and Σ 1 = Σ 2 = Σ , where
    Σ = σ 11 σ 12 σ 21 σ 22 = σ 1 2 σ 1 σ 2 ρ σ 1 σ 2 ρ σ 2 2 ;
  • Special case: A special case is when both mixture components have symmetric means and identical covariance matrices, where μ 1 = μ 2 = μ and Σ 1 = Σ 2 = Σ ;
  • More specific case: A more specific case of the location mixture occurs when the correlation matrix is R ρ , where σ 1 = σ 2 = 1 and 1 < ρ < 1 . μ 1 = μ 2 = μ and Σ 1 = Σ 2 = R ρ ;
The parameter settings of distributions are given in Table 5.
The Bootstrap method, proposed by Efron (1994) [21], is a resampling technique widely used in statistical inference. Based on the four underlying distributions listed in Table 5, we use the Bootstrap method to evaluate the sampling effectiveness for three different methods and various types of M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) . Given X = x 1 , , x n , which represents a set of n two-dimensional QMC points drawn from M N 2 α , μ 1 , Σ 1 , μ 2 , Σ 2 , the Bootstrap method is used to estimate the mean vector μ , and the covariance matrix Σ involves the following steps:
Step 1. 
Initialize i = 1 . Generate a sample x i * = x 1 i , , x n i from X = x 1 , , x n .
Step 2. 
Calculate:
μ ^ i = 1 n j = 1 n x j i * , Σ ^ i = 1 n j = 1 n ( x j i * μ ^ i ) ( x j i * μ ^ i ) .
Step 3. 
Repeat Steps 1 and 2 for B iterations. Compute the estimated mean vector and covariance matrix of x as follows:
μ ^ = 1 B i = 1 B μ ^ i , Σ ^ = 1 B i = 1 B Σ ^ i .
Step 4. 
Define the true population mean and covariance matrix of M N 2 α , μ 1 , Σ 1 , μ 2 , Σ 2 as follows:
μ = α μ 1 + ( 1 α ) μ 2 , Σ = α Σ 1 + ( 1 α ) Σ 2 .
  • Evaluate the sampling effectiveness by calculating the MSE for the estimates:
M S E μ = 1 B i = 1 B ( μ ^ i μ ) 2 , M S E Σ = 1 B i = 1 B ( Σ ^ i Σ ) 2 .
In the following experiments, let B = 5000 . Use the calculated five-dimensional GLP and GGLP points to generate random samples of the corresponding sample size n, and the results are shown in Table 6 and Table 7.
As we can see in Table 6 and Table 7, when approximating the target distribution M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) using representative points generated by the MC, GLP, and GGLP methods, the GLP and GGLP methods show significant improvement in almost all cases compared to the MC method, particularly when the sample size n is small. In many instances, the GGLP method outperforms both the GLP and MC methods, particularly when the mixture type is a scale mixture. As n increases, the performance of the GGLP method is very similar to that of the GLP method, and both consistently outperform the MC method. Therefore, the sampling method based on the GGLP and Box–Muller techniques for M N 2 α , μ 1 , Σ 1 , μ 2 , Σ 2 can more efficiently generate samples that approximate the target distribution.

5. Discussion

Under the maximin distance measurement, linear level permutation does not decrease the L p -distance of GLP sets. Although this does not hold for mixture discrepancy, as we find some contradictory instances, GGLP sets can still improve the MD of GLP sets. According to the KL divergence from the uniform distribution, GGLP sets perform better than GLP sets in most small-sized cases and are nearly the same in large-sized cases. If maximin distance is used as a criterion, GGLP sets can simultaneously improve L p -distance and MD for small-sized n. This indicates that the two criteria are consistent. In most large-sized cases, the maximin distance of initial GLP sets is hard to improve, while MD can be improved in all cases. However, for large-sized cases, calculating MD or L p -distance is time-consuming. Instead, we adopt the Frobenius distance, which is easier to calculate, and the resulting GGLP sets also have lower MDs. Increasing the number of searches for large-sized GGLP sets is not necessary to improve results. The complete results with respect to the aforementioned three criteria are included in Appendix A. More results will be calculated and posted online in the future. To verify the performances of the resulting GGLP sets, we utilize the initial GLP sets and the corresponding GGLP sets in two applications of computer experiments and representative points. The advantage of GGLP sets in computer experiments is more significant. The initial GLP sets still maintain their effectiveness in the application of representative points, although in some cases, GGLP sets perform better.

Author Contributions

Conceptualization, K.-T.F.; formal analysis, T.-Y.Y.; investigation, T.-Y.Y.; methodology, Y.-X.L. and K.-T.F.; resources, K.-T.F.; software, Y.-X.L. and T.-Y.Y.; supervision, K.-T.F.; validation, Y.-X.L.; writing—original draft, Y.-X.L. and T.-Y.Y.; Writing—review and editing, K.-T.F. All authors have read and agreed to the published version of this manuscript.

Funding

This research was funded by the China Postdoctoral Science Foundation, grant number 2023TQ0326.

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The original contributions presented in the study are included in this article; further inquiries can be directed to the corresponding authors. The computer codes of this article are uploaded online in a GitHub repository, GGLP project https://github.com/YuxuanLin8/Improved-Generalized-Good-Lattice-Point-Sets (accessed on 1 October 2024).

Acknowledgments

Our work was supported in part by the Research Center for Frontier Fundamental Studies, Zhejiang Lab, and the Guangdong Provincial Key Laboratory of Interdisciplinary Research and Application for Data Science. The authors thank Hong Yin for her support.

Conflicts of Interest

The authors declare no conflicts of interest. The funders had no role in the design of the study; in the collection, analyses, or interpretation of data; in the writing of the manuscript; or in the decision to publish the results.

Abbreviations

The following abbreviations are used in this manuscript:
MCMonte Carlo
QMCquasi-Monte Carlo
CDcentered L 2 -discrepancy
WDwrap-around L 2 -discrepancy
MDmixture discrepancy
GLPgood lattice point
GGLPgeneralized good lattice point
TAthreshold accepting
FDFrobenius distance
MSEmean squared error
CDFcumulative distribution function

Appendix A

Table A1. GGLP sets with respect to d 2 ( · ) and MD, respectively.
Table A1. GGLP sets with respect to d 2 ( · ) and MD, respectively.
KnsInitial GLP SetsGGLP Sets w.r.t. d 2 ( · ) GGLP Sets w.r.t. MD
d 2 ( · ) MD d 2 ( · ) MDBest u d 2 ( · ) MDBest u
15,0002941470.01111470.0106[28]1470.0101[0 1 1 21]
3141500.01051500.0107[30]1500.0099[0 12 19 0]
10 4 115550.1216660.1190[10]550.1168[7]
135550.0943660.0971[0 6 11 9 11]550.0923[0 7 9 8 7]
155710.1077740.1048[0 9 0 8 6]710.0990[0 14 7 14 2]
175750.0739750.0668[16]750.0659[0 12 13 2 12]
195990.0606990.0587[18]990.0583[0 13 18 8 4]
2151100.05701100.0540[20]1100.0532[0 5 7 13 14]
2351340.05571340.0521[22]1340.0513[17]
2551250.03201250.0312[24]1250.0306[20]
2752060.05072060.0446[26]2060.0423[0 1 21 25 8]
2952540.04412540.0420[28]2540.0404[0 17 26 3 27]
50,0003153100.05553100.0478[30]3100.0446[8]
10 4 1069541,0100.00023741,0100.000220[1068]41,0100.000176[549]
10 4 1782040.81352040.7626[16]2040.7616[0 1 12 1 5 12 2 5]
1982040.71832400.6809[0 15 13 6 12 1 9 9]2040.6642[18]
2383000.60883350.5622[0 6 13 22 15 0 8 13]3000.5409[17]
2582600.54803650.5119[0 23 1 11 19 13 24 0]2600.5005[0 0 22 7 10 17 1 16]
2783800.50024280.4602[0 9 2 1 21 11 14 1]3800.4468[22]
2984430.49604670.4525[0 17 24 18 8 2 4 26]4430.4346[23]
3184370.43284640.4096[0 3 25 26 9 13 18 14]4370.3942[7]
399781,951,9440.0015731,951,9440.001558[3996]1,951,9440.001278[3085]
1792051.82372291.7667[0 6 8 3 14 15 1 6 13]2051.7051[0 2 12 9 5 8 2 12 12]
1992851.50202851.4023[18]2851.4023[18]
2393491.25103931.1682[0 22 8 8 6 5 16 19 14]3491.1264[7]
2593811.14634441.0929[0 14 5 17 5 23 5 20 12]3811.0622[0 17 21 11 9 7 4 14 18]
2795491.01095670.9546[26]5490.9479[0 26 9 23 2 7 1 9 9]
2994681.01375550.9484[0 18 5 2 3 8 13 24 1]4680.9030[23]
3195480.89395810.8395[0 26 23 8 15 15 2 8 5]5480.8192[25]
19102863.27353133.1116[0 9 17 3 12 11 0 15 3 13]2893.0548[2]
23103852.52764482.3573[0 12 20 7 19 15 7 3 15 1]3852.2997[17]
25105252.34415252.2001[24]5252.1927[0 9 18 7 21 17 3 23 21 20]
27105502.35725772.2038[0 23 14 8 19 24 9 11 10 25]5502.1252[0 9 21 17 24 21 22 9 21 24]
29105682.03796641.8879[0 5 8 0 19 23 8 25 13 4]5681.8399[23]
31105971.81336691.7315[0 20 16 10 9 10 20 0 27 13]5971.6732[25]
4661103,565,9980.0068463,565,9980.006722[4660]3,565,9980.005262[0 4341 3652 1461 1551 3665 3348 3082 167 3343]
2915101658.3406114455.4192[0 13 16 10 22 9 12 21 18 28 12 4 8 14 7]]104554.8724[23]
3115124052.0468142649.5877[22]133349.2098[7]
The blank cells are the same as the above ones.
Table A2. GGLP sets with respect to the Frobenius distance.
Table A2. GGLP sets with respect to the Frobenius distance.
KnsInitial GLP SetsGGLP Sets w.r.t. Frobenius Distance
d 2 ( · ) MDFDMDFDBest u
15,0002941470.0110525153.380.0100892144.92[0 5 4 24]
3141500.0104753169.980.0099667165.34[0 20 14 10]
50,000307442450.000319515,789.380.000284815,759.36[232 232 232 232]
562416,8600.000064052,744.840.000061052,734.18[551 551 551 551]
701413,9570.000047382,025.300.000045082,016.91[0 651 142 292]
1019423,0300.0001684173,573.640.0001335173,230.40[0 843 884 822]
2129452,6680.0000101755,801.980.0000101755,794.91[0 1770 1491 364]
30014138,0830.00000841,501,679.830.00000641,501,500.25[1720 1720 1720 1720]
10 4 115550.121552328.100.116778024.90[7 7 7 7 7]
135550.094340235.420.093212534.07[12 12 12 12 12]
155710.107716754.320.099207645.31[0 2 2 14 11]
175750.073868970.810.066359657.15[0 7 7 14 1]
195990.060590074.910.058683271.08[18 18 18 18 18]
2151100.056990994.570.053382786.37[0 2 1 16 4]
2351340.0556996114.340.0513021103.10[17 17 17 17 17]
2551250.0319601126.340.0306470121.08[20 20 20 20 20]
2752060.0506670173.660.0426908141.87[0 20 0 2 20]
2952540.0441466177.540.0403895162.70[0 6 7 15 25]
50,0003153100.0554706239.450.0445367185.67[0 25 15 3 9]
10 5 1069541,0100.0002368213,516.310.0001766213,140.01[557]
15435120,1970.0000706443,983.370.0000671443,932.86[1389]
5000015435120,1970.0000706443,983.370.0000671443,932.86[1389]
21295224,1460.0000895845,357.900.0000754845,006.63[1173 1173 1173 1173 1173]
30015276,7490.00002361,678,784.600.00002241,678,729.13[1558 1558 1558 1558 1558]
40015386,4910.00000992,983,681.300.00000962,983,660.35[2533 2533 2533 2533 2533]
10 6 21295224,1460.0000895845,357.900.0000754845,006.63[1173 1173 1173 1173 1173]
30015276,7490.00002361,678,784.600.00002261,678,728.62[0 95 863 1139 720]
40015386,4910.00000992,983,681.300.00000962,983,660.35[2533 2533 2533 2533 2533]
10 5 50035344,0490.00001414,665,155.640.00001324,665,001.28[0 4446 4815 4068 4393]
60075814,4400.00000636,725,006.060.00000626,724,984.97[5451 5451 5451 5451 5451]
819151,823,5670.000004612,503,683.850.000004012,503,472.17[6214 6214 6214 6214 6214]
10 5 399771,434,0020.00092103,528,607.040.00074233,523,303.69[0 1715 672 440 232 3237 1928]
50,00039977 0.00074603,523,306.21[0 2367 478 1948 819 3938 3095]
11,21575,820,2660.000136027,738,414.420.000115427,733,553.77[0 7173 10558 4080 3009 5621 2387]
15,019713,364,5830.000021449,737,519.290.000020249,736,916.77[0 8596 7317 11883 2266 12803 7990]
50,000399781,951,9440.00157333,771,719.970.00128003,766,576.88[0 2896 272 2747 2776 2399 3598 516]
10 5 39978 0.00128203,766,577.57[0 2888 3448 685 555 2157 560 3164]
10 5 4661103,565,9980.00684555,729,496.430.00532825,726,228.42[0 4647 3322 1819 3264 2552 3289 4447 3926 2730]
10 6 4661113,568,3020.01295696,009,019.340.01097506,005,710.12[0 3453 274 174 2376 1064 915 3375 3619 1790 1527]
50,000466111 0.01049036,005,723.50[0 1585 1490 4083 120 2797 754 3846 1690 759 759]
13,5871134,729,1420.004514351,038,555.630.004232151,026,473.39[0 6556 7402 4150 12,363 3013 1700 8317 10,104 6757 10,277]
24,0761191,049,5680.0036279160,274,172.880.0031430160,214,837.54[0 2924 9970 14,063 10,282 1365 17,773 7813 22,065 15,267 23,322]
30,00024,07611 0.0031374160,214,923.47[16,454 16,454 16,454 16,454 16,454 16,454 16,454 16,454 16,454 16,454 16,454]
d 2 ( · ) of GGLP is omitted since it is unchanged in all cases. The blank cells are the same as the above ones.

References

  1. Korobov, A.N. The approximate computation of multiple integrals. Dokl. Akad. Nauk SSSR 1959, 124, 1207–1210. [Google Scholar]
  2. Hua, L.K.; Wang, Y. Applications of Number Theory to Numerical Analysis; Springer: Berlin/Heidelberg, Germany, 1981; pp. 160–225. [Google Scholar]
  3. Niederreiter, H. Random Number Generation and Quasi-Monte Carlo Methods; Society for Industrial and Applied Mathematics (SIAM): Philadelphia, PA, USA, 1992. [Google Scholar]
  4. Hlawka, E. For approximate calculation of multiple integrals. Mon. Bull. Math. 1962, 66, 140–151. [Google Scholar] [CrossRef]
  5. Johnson, M.E.; Moore, L.M.; Ylvisaker, D. Minimax and maximin distance designs. J. Stat. Plan. Inference 1990, 26, 131–148. [Google Scholar] [CrossRef]
  6. Zhou, Y.D.; Fang, K.T.; Ning, J.H. Mixture discrepancy for quasi-random point sets. J. Complex. 2013, 29, 283–301. [Google Scholar] [CrossRef]
  7. Kullback, S.; Leibler, R.A. On information and sufficiency. Ann. Math. Stat. 1951, 22, 79–86. [Google Scholar] [CrossRef]
  8. Joe, H. Estimation of entropy and other functionals of a multivariate density. Ann. Inst. Stat. Math. 1989, 41, 683–697. [Google Scholar] [CrossRef]
  9. Scott, D.W. Multivariate Density Estimation: Theory, Practice, and Visualization; John Wiley & Sons: New York, NY, USA, 2015. [Google Scholar]
  10. Hickernell, F. A generalized discrepancy and quadrature error bound. Math. Comput. 1998, 67, 299–322. [Google Scholar] [CrossRef]
  11. Horn, R.A.; Johnson, C.R. Matrix Analysis, 2nd ed.; Cambridge University Press: New York, NY, USA, 2012; p. 321. [Google Scholar]
  12. Saltykov, A.I. Tables for the computation of multiple integrals using the method of optimal coefficients. USSR Comput. Math. Math. Phys. 1963, 3, 235–242. [Google Scholar] [CrossRef]
  13. Fang, K.T.; Wang, Y. Number-Theoretic Methods in Statistics; Chapman and Hall: London, UK, 1994; pp. 270–280. [Google Scholar]
  14. Zhou, Y.; Xu, H. Space-filling properties of good lattice point sets. Biometrika 2015, 102, 959–966. [Google Scholar] [CrossRef]
  15. Qi, Z.F.; Zhang, X.R.; Zhou, Y.D. Generalized good lattice point sets. Comput. Stat. 2018, 33, 887–901. [Google Scholar] [CrossRef]
  16. Dueck, G.; Scheuer, T. Threshold accepting: A general purpose optimization algorithm appearing superior to simulated annealing. J. Comput. Phys. 1990, 90, 161–175. [Google Scholar] [CrossRef]
  17. Fang, K.T.; Lin, D.K.; Winker, P.; Zhang, Y. Uniform design: Theory and application. Technometrics 2000, 42, 237–248. [Google Scholar] [CrossRef]
  18. Fang, K.T.; Ke, X.; Elsawah, A.M. Construction of uniform designs via an adjusted threshold accepting algorithm. J. Complex. 2017, 43, 28–37. [Google Scholar] [CrossRef]
  19. Fang, K.T.; Pan, J. A Review of Representative Points of Statistical Distributions and Their Applications. Mathematics 2023, 11, 2930. [Google Scholar] [CrossRef]
  20. Box, G.E.P.; Muller, M.E. A note on the generation of random normal deviates. Ann. Math. Stat. 1958, 29, 610–611. [Google Scholar] [CrossRef]
  21. Efron, B.; Tibshirani, R.J. An Introduction to the Bootstrap; Chapman and Hall/CRC: Boca Raton, FL, USA, 1994. [Google Scholar]
Figure 1. The flowchart of modified TA for GGLP.
Figure 1. The flowchart of modified TA for GGLP.
Entropy 26 00910 g001
Table 1. GGLP sets with respect to M i n d 2 ( · ) and MD, respectively.
Table 1. GGLP sets with respect to M i n d 2 ( · ) and MD, respectively.
nsInitial GLP SetsGGLP Sets w.r.t. Min d 2 GGLP Sets w.r.t. MD
M i n d 2 MDKL M i n d 2 MDKLBest u M i n d 2 MDKLBest u
2941470.010.13 (0.6)1470.010.09 (0.6)[28]1470.010.07 (0.6)[0 1 1 21]
135550.090.49660.100.79[0 6 11 9 11]550.090.74[0 7 9 8 7]
2952540.040.382540.040.22[28]2540.040.31[0 17 26 3 27]
1782040.813.112040.761.81[16]2040.762.13[0 1 12 1 5 12 2 5]
3184370.432.404640.411.68[0 3 25 26 9 13 18 14]4370.391.43[7]
2593811.153.344441.092.21[0 14 5 17 5 23 5 20 12]3811.062.20[0 17 21 11 9 7 4 14 18]
27105502.364.385772.202.74[0 23 14 8 19 24 9 11 10 25]5502.132.82[0 9 21 17 24 21 22 9 21 24]
2915101658.347.65114455.425.15[0 13 16 10 22 9 12 21 18 28 12 4 8 14 7]104554.874.57[23]
3115124052.057.44142649.594.50[22]133349.214.31[7]
For KL divergence, the values in the parentheses indicate the fixed bandwidth, and the bandwidth adopts Scott’s rule if no parenthesis.
Table 2. Three illustrative GGLP sets with respect to M i n d 2 ( · ) and MD, respectively.
Table 2. Three illustrative GGLP sets with respect to M i n d 2 ( · ) and MD, respectively.
KnsInitial GLP SetsGGLP Sets w.r.t. d 2 GGLP Sets w.r.t. MD
M i n d 2 MDKL M i n d 2 MDBest u M i n d 2 MDBest u
10 4 1069541,0100.00023680.81 (0.3)41,0100.0002196[1068]41,0100.0001757[549]
399781,951,9440.0015730.101,951,9440.001558[3996]1,951,9440.001278[3085]
4661103,565,9980.0068460.923,565,9980.006722[4660]3,565,9980.005262[0 4341 3652 1461 1551 3665 3348 3082 167 3343]
10 5 466110 3,565,9980.005259[0 3781 3492 598 4249 3434 3656 1060 2896 2627]
The blank cells are the same as the ones above. The KL divergence of the GGLP sets is almost the same as the GLP sets, so it will be omitted. For KL divergence, the values in parentheses indicate a fixed bandwidth, and the bandwidth adopts Scott’s rule if there are no parentheses.
Table 3. Three illustrative GGLP sets with respect to the Frobenius distance.
Table 3. Three illustrative GGLP sets with respect to the Frobenius distance.
KnsInitial GLP SetsGGLP Sets w.r.t. Frobenius Distance
FD M i n d 2 ( · ) MDFDKLBest u
10 5 10695213,516.3141,0100.0001767213,140.010.81 (0.3)[557]
5 × 10 4 399783,771,719.971,951,9440.0012803,766,576.880.10[0 2896 272 2747 2776 2399 3598 516]
10 5 39978 1,951,9440.0012823,766,577.570.10[0 2888 3448 685 555 2157 560 3164]
4661105,729,496.433,565,9980.0053285,726,228.420.91[0 4647 3322 1819 3264 2552 3289 4447 3926 2730]
For KL divergence, the values in the parentheses indicate the fixed bandwidth and the bandwidth adopts Scott’s rule if no parenthesis.
Table 4. The prediction MSE of Kriging models with different designs.
Table 4. The prediction MSE of Kriging models with different designs.
n307562701101921293001
DesignGLP1GGLP1GLP2GGLP2GLP3GGLP3GLP4GGLP4GLP5GGLP5GLP6GGLP6
P o l y 0 242.80139.74134.22166.0434.8820.96236.20138.00125.5997.2437.6919.75
P o l y 1 230.72125.03121.40232.3235.7420.33214.75124.83115.0688.6733.7817.68
P o l y 2 72.6041.9638.1129.789.045.4373.0242.1037.5232.258.845.15
Table 5. Parameters of underlying distributions for the simulation.
Table 5. Parameters of underlying distributions for the simulation.
Mixture Type α μ 1 Σ 1 μ 2 Σ 2
Scale0.7 [ 0 , 1 ] T [ 1 , 0.5 ; 0.5 , 2 ] [ 0 , 1 ] T [ 1 , 0.3 ; 0.3 , 1.2 ]
Location0.7 [ 0 , 1 ] T [ 1 , 0.5 ; 0.5 , 1 ] [ 3 , 4 ] T [ 1 , 0.5 ; 0.5 , 1 ]
Special Case0.7 [ 1 , 1 ] T [ 2 , 0.5 ; 0.5 , 1 ] [ 1 , 1 ] T [ 2 , 0.5 ; 0.5 , 1 ]
More Special0.7 [ 1 , 1 ] T [ 1 , 0.5 ; 0.5 , 1 ] [ 1 , 1 ] T [ 1 , 0.5 ; 0.5 , 1 ]
Table 6. The MSE error of sampling results for M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) of different methods.
Table 6. The MSE error of sampling results for M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) of different methods.
Mixture Typen μ Error Σ Error
GLPGGLPMCGLPGGLPMC
Scale Mixture110.127480.161690.233560.173510.250610.43120
130.107040.121900.202750.344700.321990.38910
150.155760.142040.176670.174650.388020.34706
170.073770.091420.157990.174870.164590.31083
190.097930.104820.135810.132540.147040.28491
210.088220.079070.128740.128510.155440.26249
230.075620.065260.122060.159740.123880.25402
250.077310.076140.107070.145480.124690.22479
270.084610.057060.100160.115760.164830.20820
290.072870.062880.089040.415620.185200.19639
310.086790.088240.086520.165820.142920.18767
10690.001390.001330.002520.003310.003000.00605
15430.001030.000920.001760.002900.002220.00411
21290.000640.000650.001310.001510.001560.00304
30010.000470.000470.000920.001180.001100.00218
40010.000350.000350.000690.000790.000790.00162
50030.000280.000280.000550.000660.000650.00128
60070.000230.000230.000450.000540.000550.00108
81910.000170.000170.000340.000390.000390.00079
Location Mixture110.259310.334280.522691.097382.684763.52069
130.207500.214030.424122.127402.060513.52895
150.329090.293430.364422.840735.413173.76739
170.166240.177350.339332.535763.831083.57281
190.154070.207590.295072.934984.269993.54642
210.145480.172020.267863.359145.318883.67118
230.133060.121500.248393.452162.325133.61538
250.106980.130390.230172.854253.428013.63253
270.105990.104590.208232.342472.772793.66441
290.113410.108960.197803.143582.557643.65183
310.098360.138880.183902.999494.553713.63388
10690.002730.002810.005363.645803.578663.57075
15430.001930.001970.003703.578723.567733.57322
21290.001350.001350.002703.554983.590953.58118
30010.000970.000920.001913.580233.603283.56851
40010.000720.000730.001443.566313.554253.57361
50030.000580.000580.001153.539043.578513.57453
60070.000490.000510.000973.564143.570533.58156
81910.000350.000360.000713.560463.571183.57603
The best performances with each method are highlighted in blue and bold.
Table 7. The MSE errors of sampling results for M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) of different methods.
Table 7. The MSE errors of sampling results for M N 2 ( α , μ 1 , Σ 1 , μ 2 , Σ 2 ) of different methods.
Mixture Typen μ Error Σ Error
GLPGGLPMCGLPGGLPMC
Special Case110.251650.299290.413740.931311.030961.43671
130.149740.166020.346520.583701.087171.36398
150.309210.175770.303131.003880.771471.23658
170.149900.144870.269700.901350.601541.20171
190.137190.184290.242690.572810.634721.14612
210.146130.152870.215790.869591.774261.12139
230.104890.119860.196080.443740.710081.08158
250.105180.090050.182040.494320.781701.06981
270.092360.099860.173130.663061.084871.06859
290.105700.092770.156241.296160.981711.03050
310.100990.168490.147940.850600.830341.03772
10690.002190.002350.004410.704150.714140.71624
15430.001610.001610.003080.680280.705360.71176
21290.001120.001120.002170.706240.721430.71126
30010.000790.000780.001600.706090.713310.70712
40010.000610.000600.001160.707760.708450.70659
50030.000470.000470.000930.699040.712850.70713
60070.000400.000400.000770.705670.702520.70727
81910.000290.000290.000580.703970.704240.70638
More Specific Case110.193220.227150.311400.882040.873511.02618
130.120600.128010.265120.504930.831280.98595
150.214950.129760.245140.897070.589710.97239
170.111660.103290.211240.839070.533030.92968
190.118370.128250.187460.616870.611670.90584
210.113450.109980.173280.801741.139890.91073
230.087710.092850.155960.500790.805910.88328
250.085870.070180.144210.552420.723460.88145
270.075200.079400.138090.628661.110450.86447
290.075480.071330.125590.823620.854410.84072
310.078110.108860.118130.771920.653580.84658
10690.001730.001790.003440.702370.710450.71045
15430.001240.001200.002380.694040.707600.70679
21290.000860.000860.001720.706500.712580.70838
30010.000610.000610.001240.704700.708810.70590
40010.000460.000450.000940.707010.706730.70691
50030.000380.000380.000730.702760.710450.70600
60070.000310.000310.000600.705170.703280.70642
81910.000230.000230.000450.704850.705290.70610
The best performances with each method are highlighted in blue and bold.
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Lin, Y.-X.; Yan, T.-Y.; Fang, K.-T. Some Improvements on Good Lattice Point Sets. Entropy 2024, 26, 910. https://doi.org/10.3390/e26110910

AMA Style

Lin Y-X, Yan T-Y, Fang K-T. Some Improvements on Good Lattice Point Sets. Entropy. 2024; 26(11):910. https://doi.org/10.3390/e26110910

Chicago/Turabian Style

Lin, Yu-Xuan, Tian-Yu Yan, and Kai-Tai Fang. 2024. "Some Improvements on Good Lattice Point Sets" Entropy 26, no. 11: 910. https://doi.org/10.3390/e26110910

APA Style

Lin, Y.-X., Yan, T.-Y., & Fang, K.-T. (2024). Some Improvements on Good Lattice Point Sets. Entropy, 26(11), 910. https://doi.org/10.3390/e26110910

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop