Mathematics
\degreeyear2024
\advisorYuliy Baryshnikov
\committeeProfessor Vadim Zharnitsky, Chair
Professor Yuliy Baryshnikov, Director of Research
Professor Richard Sowers
Professor Lee DeVille
Cyclicity Analysis of the Ornstein-Uhlenbeck Process
Abstract
In this thesis, we consider an -dimensional Ornstein-Uhlenbeck (OU) process satisfying the linear stochastic differential equation Here, is a fixed circulant friction matrix whose eigenvalues have positive real parts, is a fixed matrix for some and is the standard -dimensional Wiener process.
We consider a signal propagation model governed by this OU process. In this model, an underlying signal propagates throughout a network consisting of linked sensors located in space. For each we interpret the -th component of the OU process at time as the measurement of the propagating effect made by the -th sensor. The matrix represents the sensor network structure: if has first row where and then the magnitude of quantifies how receptive the -th sensor is to activity within the -th sensor, where is indexed mod Finally, the -th entry of the matrix is the covariance of the component noises injected into the -th and -th sensors.
For different choices of and we investigate whether Cyclicity Analysis enables us to recover the structure of network. Roughly speaking, Cyclicity Analysis studies the lead-lag dynamics pertaining to the components of a multivariate signal. We specifically consider an skew-symmetric matrix known as the lead matrix, in which the sign of its -th entry captures the lead-lag relationship between the -th and -th component OU processes. We investigate whether the structure of the leading eigenvector of the eigenvector corresponding to the largest eigenvalue of in modulus, reflects the network structure induced by
Acknowledgements.
Many people have supported me throughout my Ph.D. journey. I would like to acknowledge them here. Firstly, I would like to thank my advisor Professor Baryshnikov for his guidance. Professor Baryshnikov introduced me to the topic of Cyclicity Analysis and strongly encouraged me to explore its applications on real-world data. Later on, we formulated the topic for this thesis. Throughout my Ph.D. journey, Professor Baryshnikov instilled the meaning of a Ph.D. scholar: a scholar who is able to conduct independent research. Via his meticulous comments and feedback on my dissertation drafts, he taught me that being a good researcher does not just involve doing research but also involves articulating my findings in a clear and concise manner. Next, I would like to thank all the other thesis committee members. During my undergraduate career, Professor Sowers supervised my first data science project, which initiated my curiosity for the field. He also taught me Stochastic Calculus, a course that was essential to my thesis research. Professor DeVille strongly encouraged me to pursue a Ph.D. at the University of Illinois and taught me Dynamical Systems, another course that was essential to my thesis research. Although I did not take courses or do research with him, Professor Zharnitsky and I would communicate frequently especially in Russian. I thank him for kindly lending me a book on the Russian language. Furthermore, I would like to thank Professor Reznick and Professor D’Angelo from the University of Illinois and Professor Ritelli from the University of Bologna. All professors played a significant role during my early research career. I enrolled as an undergraduate freshman in Professor Reznick’s senior level research seminar, during which I learned how to conduct and share research. The work done in that seminar would later turn into joint work with Professor Ritelli during my undergraduate career. Professor Ritelli guided me in publishing my first undergraduate paper in the AMS Quarterly of Applied Mathematics. Lastly, Professor D’Angelo has always been supportive of my research and regularly encouraged me to share my research findings with him. He also shared with me his insights on the math department and on issues related to the broader mathematical community. Moreover, I thank all of my colleagues at eBay. During my Ph.D. journey, I had the opportunity to work at eBay as a data scientist and collaborate with many experts in the field. eBay gave me a unique platform to showcase my skills in solving business related problems. My colleagues were very supportive of me and helped me grow as a data scientist. The work I did at eBay helped shaped my decision to pursue a Ph.D. Finally, I would like to thank my loving family. Admittedly, I faced various challenges pertaining to the completion of the thesis. But my family has always been extremely supportive of me nonetheless.Contents
- 1 Introduction
- 2 Preliminaries
- 3 The Ornstein-Uhlenbeck Process
-
4 Cyclicity Analysis of the Cyclic Ornstein-Uhlenbeck Process
- 4.1 Lead Matrix Explicit Representation
- 4.2 Only One Nonzero Propagation Coefficient and Injection of Noise into Only One Sensor
- 4.3 Second Regime Special Case: First Propagation Coefficient Nonzero and Noise Injected into Last Sensor
- 4.4 Only One Nonzero Propagation Coefficient and Independent and Identical Noise Injected into Few Sensors
- 4.5 Independent and Identical Noise Injected into All Sensors
-
5 Experimental Results
- 5.1 OU Process Realization Time Series Simulation
- 5.2 OU Process Empirical Lead Matrix Generation
-
5.3 Cyclicity Analysis of the Cyclic OU Process
- 5.3.1 Injection of Noise into Only One Sensor
- 5.3.2 Observations
- 5.3.3 Independent, Identically Distributed Noise Injected into Few Sensors
- 5.3.4 Observations
- 5.3.5 Independent, Identically Distributed Noise Injected into a Large Number of Sensors
- 5.3.6 Observations
- 5.3.7 Independent, Identically Distributed Noise Injected into All Sensors
- 5.3.8 Observations
- 6 Conclusion
Chapter 1 Introduction
Throughout this chapter, we fix and let be an interval in which we regard as an interval of time. We let be a continuous function. Note that we can write in which is a real-valued continuous function. We refer to as the -th component of
In many practical applications, is a signal (wave/pulse) representing some observed phenomenon, in which we interpret as a measurement of the observed phenomenon recorded at the fixed time For example, if we are observing the behavior of the entire stock market, then we could consider an -dimensional signal where is the total number of companies in the market; here, the component could be the stock price for the -th company at the time
1.1 Leader-Follower Dynamics Questions
In this section, we pose two very general questions pertaining to the leader-follower dynamics amongst the components of relative to their evolution throughout time. These questions are central to the thesis as a whole.
First, fix two specific indices and assume that the corresponding component signals and evolve very similarly up to a time shift. Does lead (precede) or follow (lag) throughout time ? To illustrate the meaning of this question, we consider the signals defined by and We plot these signals in Figure 1.1. Note that is a horizontal translation of to the right by unit of time, and the number is the value of the time shift. As a result, throughout time, we can describe the evolution of and in relation to one another: when increases, later increases, and when decreases, later decreases. Thus, we say that leads or equivalently, follows
Next, assume all of the component signals of trace some underlying real-valued signal up to scaling constants and time shifts. What is the order in which the component signals evolve throughout time ? More specifically, what is the order of the time shifts ? To illustrate the meaning of this question, let be the time interval and consider the sinusoidal signal whose -th component signal is defined by
(1.1) |
for each In Figure 1.2, we fix and plot some of these component sinusoidal signals. Each is a time-shifted copy of the signal with being the value of the time shift. Since for each we interpret the -th component signal as leading the -th component signal. Thus, the order in which the component signals evolve throughout time is
1.2 Analysis of Periodic Signals
In this section, we focus on analyzing periodic signals. Recall that a signal is periodic if there exists a minimal constant such that for all The constant is known as the period of We specifically say is -periodic in order to emphasize has period . Equivalently, we can view a -periodic signal as a well-defined map between and where is the additive group equipped with addition modulo
To analyze periodic signals in general, one uses tools from harmonic analysis, which include Fourier series [1, Chapter 2] and the Fourier transform [1, Chapter 5]. Another common tool that is suitable for analyzing periodic signals is the cross-correlation function [2, Chapter 12]. Recall if are two -periodic signals, then the (periodic) cross-correlation of and is the -periodic signal defined by
(1.2) |
The cross-correlation function measures the similarity between the input signals and as a function of the displacement variable The displacement at which the cross-correlation attains its absolute maximum reveals information about and . Up to an integer multiple of the period, the maximum displacement coincides with the time shift value such that shifting by this value would result in the signals and being most aligned.
For example, let be the periodic signals and Explicitly computing the cross-correlation of and using (1.2), we obtain
We plot this cross-correlation function in Figure 1.3. On one hand, note that is a time-shifted copy of to the right by units of time. On the other hand, we observe the cross-correlation function attains its maximum at for any This means the displacement at which the cross-correlation function attains its maximum coincides with the time shift between and up to an integer.
1.2.1 The Chain of Offsets Model
We answer the leader follower dynamics questions that we posed in Section 1.1 for periodic signals. We assume that obeys a Chain of Offsets model (COOM), which was first introduced in [3]. This model stipulates the existence of a -periodic signal scaling constants and offsets such that
(1.3) |
for each In other words, under COOM, all component signals of trace an underlying real-valued periodic signal up to scaling constants and offsets.
Answering our posed leader follower dynamics questions amounts to determining the order of the offsets under COOM. However, we emphasize this order of offsets is a cyclic order; it corresponds to the order in which the points are traversed on the unit circle in the counterclockwise direction. We represent the cyclic order as a permutation of the indices satisfying the condition that comes before in the counterclockwise direction. In terms of the signals themselves, we interpret as (cyclically) preceding for each Furthermore, is unique up to cyclic shifts of indices. This means if is another permutation representing the cyclic order of the offsets, then there exists an integer such that
for all in which is indexed mod
We give an example of how a permutation representing the cyclic order is unique up to cyclic shifts. Reconsider the sinsoidal signal with -th component signal defined in (1.1) for each Note that satisfies COOM in (1.3) if we set and and for each On one hand, note that which means the points are traversed in that order on the unit circle in the counterclockwise direction beginning from the positive real axis. Hence, the identity permutation is one permutation that represents the cyclic order of the offsets. In fact, any permutation defined by where is a fixed integer and is indexed mod is a valid permutation representing the cyclic order of the offsets. Nevertheless, observe that
(1.4) |
which means the identity permutation is equal to up to cyclic shifts.
1.2.2 Recovery of the Cyclic Order under COOM
Given the -periodic signal we outline a procedure to recover the cyclic order of the offsets under COOM. This procedure involves the utilization of the cross-correlation function, which we defined previously in (1.7).
First, we write the underlying -periodic signal in COOM as a Fourier series:
(1.5) |
where is the -th Fourier coefficient of for each Then, for each the cross-correlation of the component signals and is explicitly
where is the Kronecker delta function. Differentiating the cross-correlation function with respect to , we obtain
Equating the derivative to we see that any critical point of is of the form
where This means that the critical points of the are equal to the difference between the offsets and up to integer multiples of the period
Now, we are ready to construct a permutation of the indices representing the cyclic order of the offsets under COOM. For each let
in which is specifically reduced modulo More explicitly, we choose an integer such that and set equal to for this choice of Define the permutation be a permutation as follows: let be arbitrary, and for each let be the smallest index not in such that is minimal i.e. we have for all Observe ensures that is the closest point on the unit circle to the point in the counterclockwise direction. Therefore, is a candidate permutation representing the cyclic order.
It remains to show that our constructed permutation from the previous paragraph is unique up to cyclic shifts. To this end, let be another permutation constructed according to the procedure in the previous paragraph. Note that there exists a unique index such that which follows from the fact that the composition is itself a permutation in We claim that for this chosen index we have
(1.6) |
for all in which is indexed mod We prove this by induction. The statement (1.6) already holds for by the property of Now, suppose the statement (1.6) holds for for some We show it must also hold for By the construction of we have that is the smallest index not in such that is minimal. On the other hand, by the induction hypothesis, is the smallest index not in such that is minimal. By the construction of we conclude Therefore, the statement (1.6) holds for Thus, we have shown our candidate permutation is unique up to cyclic shifts.
1.3 Analysis of Non-Periodic Signals
In the previous section, we utilized the cross-correlation function to answer our posed leader follower dynamics questions in Section 1.1 for periodic signals. In this section, we investigate whether cross-correlation can be used to answer our posed leader follower dynamics questions for signals that are not necessarily periodic. The cross-correlation of two generic signals is defined as the map where
(1.7) |
provided the integral on the right hand side exists.
1.3.1 Time Shifted Model
Suppose is a signal that is compactly supported in a closed interval i.e. the closure of the set of all points with is a closed interval. Let be the map defined by
(1.8) |
for some time shift We refer to the model (1.8) as a time shifted model. Determining the leader follower relationship between and under the time shifted model amounts to determining the sign of the time shift If then we interpret to lead If then we interpret to follow
Given the signals and we determine the value of the time shift under the time shifted model (1.8). In order to do this, we utilize the cross-correlation of and defined in (1.7). For each we have
(1.9) | ||||
(1.10) |
in which the first term of (1.10) follows from observing the first two integral terms in (1.9) are both equal to via the respective changes of variables and
Note that the integral after the minus sign in (1.9) is a non-negative quantity because it is the integral of a non-negative function. This implies the cross-correlation function has an upper bound
(1.11) |
for all Equality in (1.11) holds if and only if the integral term in (1.10) is for some Because is continuous and compactly supported in a closed interval, equality in (1.11) holds if and only if the integrand of the integral term in (1.10) is i.e.
for all But since is not periodic, we have if and only if Therefore, the unique displacement at which the cross-correlation attains its maximum is precisely the time shift under the model (1.8). Having recovered the time shift we can determine the leader follower relationship between the signals and upon inspecting the sign of
1.3.2 Time Reparameterization Invariance
Based on our analysis so far, it seems that the cross-correlation function is a useful signal processing tool that enables us to determine the leader follower relationship between any two input signals, provided these signals have nice enough properties. We wonder whether there are drawbacks to using the cross-correlation.
We focus on one drawback central to the thesis. The cross-correlation function suffers under time reparameterization. A time reparameterization of the time interval is a strictly increasing homeomorphism between and itself. This means that if we use the cross-correlation as a tool to analyze signals, then the results given by cross-correlation may not be the same across all time reparameterizations. In other words, the results we obtain via cross-correlation may vary according to how we observe the signals throughout time.
To illustrate with an example, let and consider the signals where
(1.12) |
and Consider the map defined by
(1.13) |
Note that is a valid time reparameterization of Indeed, we have and and since is strictly increasing on , the map is also strictly increasing on and thus strictly increasing in Therefore, Finally, the inverse map is continuous on all of and is thus continuous in Therefore, is a strictly increasing homeomorphism, which means it is a valid time reparameterization. However, in Figure 1.4, we plot the correlation functions and We see that the displacement maximum for the original cross-correlation and the displacement maximum for the reparameterized cross-correlation differ in sign. Hence, the cross-correlation function does not allow us to conclude whether leads or vice versa.
Thus, in order to perform any meaningful analysis of signals, we need to consider using tools that are time reparameterization invariant.
1.4 Cyclicity Analysis
In this section, we consider cyclic signals. A signal is cyclic if is periodic for some time reparameterization Cyclic signals generalize periodic signals. Roughly speaking, cyclic signals have repeating temporal patterns, but such patterns do not repeat in a predictable fashion, which makes them different from periodic signals. Many real world phenomena are cyclic in nature. For example, the economy undergoes expansion and recession, but not predictably due to various external factors that control the economy. Other examples include cardiac cycles and fMRI signals [4].
We illustrate the difference between a periodic and a cyclic signal with a theoretical example. Consider the signals and on In Figure 1.5, we plot and and the resulting signals we observe after shifting the time axis by unit. Since is -periodic, shifting the time axis by unit results in us observing the exact same signal that we started with. On the other hand, is not -periodic; shifting the time axis by unit results in us observing a completely different signal than the one we started with. In fact, is not -periodic for any constant However, is cyclic: if is defined by then note that is a valid time reparameterization satisfying
In this section, we describe the procedure of Cyclicity Analysis, first introduced in [3], which is the study of the leader-follower dynamics pertaining to cyclic signals. Cyclicity Analysis directly answers both of our posed leader follower questions in Section 1.1 in a time reparameterization invariant fashion.
1.4.1 Iterated Path Integrals
First, we assume is a generic signal in which is a closed interval. We recall the first question we posed in Section 1.1, which is to determine the pairwise leader follower relationship between the -th component and -th component signals of for fixed indices assuming such component signals evolve similarly throughout time.
In order to answer this question, Cyclicity Analysis invokes the theory of iterated path integrals [5, 6, 7, 8]. For each , a -th order iterated path integral of is a functional of the form
(1.14) |
where For example, a first order iterated path integral of is explicitly of the form
which is the increment of the trajectory parameterized by
Iterated path integrals date back to the times of Riemann and Picard. They were initially used as tools to construct and analyze solutions to ordinary differential equations. Later on, K.T. Chen [5, 6, 7, 8] discovered the relevance of iterated path integrals to algebraic topology, while Terry Lyons [9, 10] discovered the relevance to the analysis of rough paths, which are Holder trajectories.
Iterated path integrals of the signal obey several invariant properties [3], which make them a suitable tool for data analysis. Firstly, they are indeed time reparameterization invariant: for any time reparameterization we have
which follows directly from performing a change of variables in (1.14). Next, the iterated path integrals of are invariant under parallel translation: if is fixed and if is the signal defined by then
Moreover, the iterated path integrals are invariant under detours. If are oriented curves in such that the ending point of is the starting point of their concatenation possesses a detour if for some the curves and are the same but with opposite orientations. In other words, if we traverse in that order, then our traversal will involve backtracking. If and parameterize the same curve, in which traverses it with detours but traverses it without detours, then
Finally, K.T. Chen [5, 6, 7, 8] proved a stronger statement: two signals that parameterize their respective curves without detours are the same up to time reparameterization and parallel translation if and only if all of their corresponding iterated path integrals agree.
1.4.2 Oriented Areas
Now, assume parameterizes a closed curve i.e. We consider the second order iterated path integrals of For each we substitute and and into (1.14) to obtain the explicit form of the second order iterated path integral:
(1.15) | ||||
in which the second term of (1.15) vanishes due to the assumption parameterizes a closed curve.
For all we define
(1.16) |
By Green’s Theorem, the integral (1.16) is a line integral that is equal to the area of the region enclosed by the curve parameterized by the map We note, however, this area is actually an oriented (signed) area. In particular, is positive if traverses the aforementioned curve in the counterclockwise direction and is negative otherwise. We heuristically interpret the oriented area as an indicator of the lead-lag relationship between the signals and . We say that leads (precedes) if We say that follows (lags) if Otherwise, we say and are in sync.
We illustrate the oriented area with an example. Reconsider the signals with and with The curve is a closed curve oriented in the counterclockwise direction, which we plot in Figure 1.6. The oriented area of the region enclosed by this curve is explicitly
which is positive. Therefore, we interpret to lead based on the oriented area.
Recall our assumption parameterizes a closed curve. We can modify the definition of the oriented area in the case does not parameterize a closed curve. In this situation, we may consider any map that parameterizes a closed curve formed by concatenating the curve parameterized by the original signal and any oriented curve in with starting point and ending point Different choices of yield different values of For our purposes, however, we will choose a signal such that
(1.17) |
i.e. the -th oriented area of is the same as the -th line integral of over the curve parameterized by
Consider the signal defined by
(1.18) |
This signal parameterizes the curve formed by concatenating the curve parameterized by the original signal and the oriented line segment with starting point and ending point We show this signal satisfies our requirement (1.17). Explicitly computing we have
(1.19) | ||||
(1.20) |
in which (1.20) follows from performing the change of variables in (1.19). Thus, our choice of satisfies our specific requirement in (1.17).
So from henceforth, if the signal does not parameterize a closed curve, then we define where is the signal defined in (1.18).
1.4.3 The Lead Matrix
If is a signal with then we will write as the oriented area in place of the original notation In isolation, the oriented area is simply a number that heuristically indicates the pairwise leader follower relationship between any two specific components of For future purposes, however, we would like to consider all of these total oriented areas together. So we will arrange all the oriented areas into an matrix. Let be the matrix whose -th entry is the oriented area for each We refer to as the lead(-lag) matrix corresponding to . More succinctly, we can rewrite as an integral of a matrix-valued function:
(1.21) |
where is the derivative of with respect to
We state several properties pertaining to the lead matrix. Firstly, the matrix is skew-symmetric i.e. Equivalently, for all which follows immediately upon swapping the indices and in the equation (1.16). Next, each eigenvalue of is either or purely imaginary [11, Exercise 7B.4]. These purely imaginary eigenvalues of come in complex conjugate pairs: if is an eigenvalue of for some nonzero then is also an eigenvalue of Lastly, is diagonalizable [11, Exercise 7B.4]. In particular, there is an orthonormal basis of consisting of the eigenvectors of
For each we let be the -th largest eigenvalue of in modulus and consider the corresponding orthonormal basis of eigenvectors in which is the associated unit eigenvector with for each We refer to the eigenvector as the leading (dominant) eigenvector of If is nonzero, then our notation for the largest eigenvalue and leading eigenvector is actually ambiguous because would be the complex conjugate of and thus satisfy For our purposes, however, this ambiguity does not matter. By [11, Exercise 7B.4], we have in which is the vector whose -th component is equal to the complex conjugate of the -th component of Therefore, regardless of whether is assigned to have a positive or negative imaginary part, the leading eigenvector is unique up to the complex conjugation of its components.
Finally, we mention how to construct a lead matrix so that it is suitable for practical applications. In the real world, one does not directly observe a continuous signal. Rather, one observes a signal at finitely many different times. Given a signal with a time series is a finite collection of the form where is fixed, is an increasing sequence in with and and Typically, is a very large integer. We define the lead matrix for the time series to be the lead matrix for the signal parameterizing the oriented curve formed by concatenating the oriented line segments in which has starting point and ending point for each and has starting point and ending point Explicitly,
(1.22) |
If we write out then the -th entry of is the oriented area of the convex polygon with vertices computed via the Shoelace Formula [12].
1.4.4 Chain of Offsets Model
Now, we assume that is a cyclic signal. We recall our second leader-follower dynamics question that we posed in Section 1.1, which is to determine the order in which the component signals of evolve throughout time assuming all these signals trace some underlying signal up to scaling constants and time shifts. Formally, we assume that the cyclic signal satisfies the following Chain of Offsets Model (COOM) [3], which stipulates the existence of an underlying real-valued -periodic signal scaling constants offsets and a time reparameterization such that
(1.23) |
for each and each
Given the cyclic signal we determine the cyclic order of the offsets under COOM. But this time, we will utilize the lead matrix to do so. Without loss of generality, we may assume that is the identity map, which reduces the COOM in (1.23) to the COOM defined earlier in (1.3). If is not the identity map to begin with, then we can the consider the COOM in (1.3) for the -dimensional signal
As before, we begin by writing the underlying signal as the Fourier series in (1.5). Let be the lead matrix for over one period under COOM, that is, the lead matrix for the restricted signal Then, for each the -th entry of is explicitly
(1.24) |
1.4.5 One Harmonic
Assume that the underlying -periodic signal in (1.23) has only one harmonic. This means the Fourier Coefficient is nonzero for only one index We show that we can recover the cyclic order of the offsets via the leading eigenvector of the lead matrix defined in (1.24). Note for each the -th entry of the lead matrix is explicitly
(1.25) |
We show that if the lead matrix is nonzero, then it must have rank two. Recall if then the outer product of and is the complex matrix where is the conjugate transpose of A statement made in [3] is that a complex skew-symmetric matrix has rank two if and only if it is of the form for some linearly independent So we verify that our lead matrix indeed satisfies this condition stated in [3]. Define the vectors
Then, by (1.25) immediately implies
(1.26) |
Now, we need to show that and are linearly independent in Consider the matrix whose first row is and second row is The condition that and are linearly independent is equivalent to the condition has rank The latter condition is equivalent to the condition there exists a submatrix of with a nonzero determinant. Recall our assumption is nonzero. This means has a nonzero entry for some two indices On the other hand, consider the submatrix formed by extracting the -th and -th rows of for the same choices of and namely the matrix
By (1.25), the determinant of is precisely which shows has rank two.
We now determine the leading eigenvector of Since has rank two, it has exactly two nonzero eigenvalues coming in a purely imaginary pair, namely and in which we recall is the -th largest eigenvalue of in modulus. By [3],
is the largest eigenvalue of with associated eigenvector
where is the angle between the vectors and and is the Euclidean -norm of In particular, if is the -th component of then upon extracting real and imaginary parts of we obtain
(1.27) | ||||
(1.28) |
where
are the -th components of and respectively.
Through the equations (1.27) and (1.28), we see is the result of a certain linear transformation evaluated at the point Since the points lie on a circle, the components of the leading eigenvector lie on an ellipse. The cyclic order in which the points are traversed on the circle is the cyclic order in which the points are traversed on the ellipse. Thus, the cyclic order of where is the principal argument of the complex number matches the cyclic order of the offsets in COOM. We refer to the principal arguments of the components of the leading eigenvector as phases.
We show an example of how the cyclic order of the leading eigenvector component phases and the cyclic order of the offsets match when the underlying signal has only one harmonic. Reconsider the sinusoidal signal (1.1) for each On one hand, we already established the identity permutation in represents the cyclic order of the offsets. On the other hand, in Figure 1.7, for we plot the heat map of the lead matrix over one period, as well as the complex components, component moduli, and phases of the leading eigenvector Since the components of have equal moduli, they lie on a circle. Observe the permutation that sorts the leading eigenvector component phases in ascending order is given by in which is indexed mod This is the identity permutation up to a cyclic shift.
1.4.6 Multiple Harmonics
Suppose the underlying -periodic signal in (1.23) has multiple harmonics, meaning has multiple nonzero Fourier coefficients. Assume, however, one of these Fourier coefficients dominates: there exists exactly one such that
With these assumptions, it is still possible to approximately recover the cyclic order of the offsets under COOM via the leading eigenvector of the lead matrix over one period.
Let be the rank two skew-symmetric matrix, whose -th entry is equal to (1.25) for all . Then, the -th entry of will dominate the series expansion in (1.24). This means we well approximate the original lead matrix with the lower rank matrix
On the other hand, a rank two approximation of is the matrix in which is the matrix corresponding to the projection operator onto the subspace of spanned by the vectors and The smaller the Frobenius norm is, the better this low rank approximation is of . Moreover, since sends to itself, we have
which means the leading eigenvector of is Therefore, assuming the low rank matrix approximates well enough, we can still approximately recover the cyclic order of the offsets in COOM via the cyclic order of the component phases of
For practical purposes, one heuristic indicator [3] that measures the dominance of the low rank approximation of the lead matrix is the magnitude of the ratio of the first to third largest eigenvalue of This is because in the rank situation, this ratio would be infinite.
1.5 Ornstein-Uhlenbeck Process
In this section, we fix a probability space and We consider an (-dimensional) Ornstein Uhlenbeck (OU) process on this probability space, which is an -dimensional stochastic process satisfying the linear stochastic differential equation [13]:
(1.29) |
Here, is a fixed linear operator such that is Hurwitz (stable) i.e. all eigenvalues of have negative real parts, is a fixed linear operator for some and is the standard -dimensional Wiener process independent of the initial random vector . We refer to as the friction (drift) operator and as the volatility operator, and we collectively refer to and as the OU model parameters. We refer to the matrix
(1.30) |
as the diffusion operator.
The OU process is an ergodic Gaussian diffusion process [14] that prominently appears in Financial Mathematics and Biology [15, 16, 17]. In engineering disciplines, the OU Process is heuristically characterized as the solution to the Langevin equation [18]:
(1.31) |
in which is the -dimensional Gaussian “white noise” process whose autocovariance function satisfies
for all Here, is the Dirac measure. The OU process has a unique stationary distribution [14, 19], namely the Gaussian distribution with mean and covariance matrix in which satisfies the Lyapunov equation [20]:
(1.32) |
For this reason, we refer to as the stationary covariance operator. Any realization of the OU process will fluctuate around the origin in the long term [18]. These long term fluctuations vary according to the stationary Gaussian distribution. Finally, if is the Gaussian distribution with mean and covariance matrix then the OU process is stationary.
1.6 Problem Statement
In this thesis, we will investigate Cyclicity Analysis for the stationary OU process with model parameters and in which is the Gaussian distribution with mean and covariance matrix We define the auxiliary lead process, which is the matrix-valued stochastic process where
(1.33) |
for each The right hand side is a random matrix whose entries are one-dimensional Ito stochastic integrals [21] with respect to the component OU processes . In particular, the -th entry of is a one-dimensional Ito integral of the form
By definition, represents the lead matrix corresponding to a realization of the OU process over the interval
We will prove the lead process obeys the strong law of large numbers identity:
(1.34) |
almost surely. The identity (1.34) states we can infer the long term average pairwise leader follower dynamics amongst the component OU process through a time-averaged lead matrix corresponding to one single realization of the OU process. For this reason, we refer to the matrix
(1.35) |
as the lead matrix of the OU process.
Next, we consider a cyclic OU process, in which the friction matrix is specifically a circulant matrix: if is the first row of then the -th entry of is where is indexed mod We assume and is large enough so that is a valid friction matrix. We consider a signal propagation model governed by this cyclic OU process. In our signal propagation model, we assume that there is a network of sensors located in space and an underlying one-dimensional signal propagating throughout the network. Similar signal propagation models are considered in [22, 4, 23], for example. Letting be the OU process, we interpret the -th component of as a measurement made by the -th sensor of the propagating effect at time The friction matrix represents the cyclic sensor network structure. We refer to the constant as a propagation coefficient for each and we interpret its magnitude to represent how receptive the -th sensor is to the activity within the neighboring -th sensor for each in which is indexed mod We refer to the constant as a suppression coefficient, and we interpret its magnitude to represent how quickly the signal dissipates as it is propagating throughout the network. Meanwhile, the diffusion matrix is the covariance matrix whose -th entry is the covariance of the -th and -th component noises injected into the -th and -th sensors, respectively.
For large dimension and for different choices of cyclic OU model parameters and we derive the lead matrix of the cyclic OU process. Then, we investigate whether Cyclicity Analysis enables us to recover the network structure given by More specifically, we investigate whether the structure of the leading eigenvector recovers the structure of the network.
For example, let be the circulant friction matrix such that the suppression coefficient the propagation coefficient and all other propagation coefficients are The cyclic network structure of is such that the -th sensor is only linked to the -th sensor, the neighboring sensor to its right. Does the structure of the leading eigenvector reflect this ground truth network structure in any way ? For example, does the (cyclic) order of the leading eigenvector component phases match the expected (cyclic) order of the sensors receiving the signal as it propagates throughout the network ? In addition, do the leading eigenvector component moduli and the lead matrix eigenvalues indicate anything about the network structure ?
We will investigate our problem statement under two specific regimes: the first regime is when the suppression coefficient is large i.e. the propagating signal quickly dissipates, and the second regime is when is small i.e. the propagating signal does not quickly dissipate. In the former regime, the circulant friction matrix is perturbed significantly from an unstable matrix, while in the latter regime, is perturbed slightly away from an unstable matrix. We investigate these regimes because they yield different types of asymptotic structures pertaining to the lead matrix and its leading eigenvector
1.7 Literature Review
We conclude the introduction by conducting a literature review of the main topics of the thesis.
1.7.1 Lead Lag Dynamics
There are various tools to identify lead lag dynamics between two signals or between two time series. Such tools have been utilized to study financial markets [24] and fMRI signals [25]. We already discussed one tool, namely the cross-correlation function. Another algorithmic tool is Dynamic Time Warping (DTW) [26]. We briefly describe it. Consider two one-dimensional time series of the form and The goal of DTW is to find the optimal alignment of such time-series. To this end, one constructs an matrix whose -th entry corresponds to the optimal cost between the restricted time series and The optimal cost is defined via the formula
(1.36) |
where is a specified distance metric. Then, a warping path is any collection of indices satisfying and and for all where is a fixed length. Finally, the optimal alignment between the original time-series and is the warping path minimizing the quantity
DTW, however, has drawbacks. Seeking the warping path to guarantee optimal alignment is a computationally expensive problem. Moreover, DTW is prone to noise.
1.7.2 Cyclicity Analysis
We review the literature on Cyclicity Analysis. As mentioned earlier, Cyclicity Analysis was officially introduced in [3] as a means to study cyclic signals. Later on, a dissertation [23] was published demonstrating how Cyclicity Analysis of fMRI signals yields new insights into understanding the brain processes of patients with a neurological condition known as tinnitus. Moreover, the authors in [4] published an experimental article comparing the results given by Cyclicity Analysis on fMRI signals to results given by other standard tools.
We emphasize that in this thesis, we are applying Cyclicity Analysis in a completely different setting. We are using it to investigate signals generated via a stochastic process.
1.7.3 OU Process
As mentioned earlier, the OU process is one of the most common stochastic processes with applications in Financial Mathematics and Biology.
Firstly, many authors have studied the estimation of OU process model parameters. In [27], for example, the authors showed using MAP estimation how to recover OU model parameters and in which is an invertible matrix, given a time series that approximates a realization of an OU process. In [14], the authors showed how to approximate the stationary distribution of a generalized OU process with a modified drift coefficient. We emphasize that our problem statement is completely different from model parameter estimation. We are not trying to recover the friction matrix itself from realizations of the OU process; rather, we interpret as a weighted adjacency matrix associated with a network of sensors, and we are investigating whether we can recover the network structure induced by from any realization of the OU process.
Moreover, the lead matrix of the OU process defined in (3.40) has been studied in the literature as a means to characterize the time-irreversibility of the OU process [18, 28, 29]. A stationary stochastic process is said to be time-reversible [29] if for all for all times the probability distributions of and of are equal. The main result is the stationary OU process is time reversible if and only if is symmetric, where is the stationary covariance matrix of the OU process. Equivalently, the OU process is time reversible if and only if the matrix vanishes.
To study the degree to which the OU process is time irreversible, we briefly outline the approach described in [28]. Let be the probability density function of at time Since we are considering the OU stationary process, for each the random vector has a Gaussian distribution with mean and covariance matrix Explicitly, we have
(1.37) |
for each The density function satisfies the Fokker-Planck (Kolmogorov forward) equation [28], which is the parabolic partial differential equation
(1.38) |
where is the diffusion matrix, is the gradient of with respect to variable and is the divergence of the vector field Define the map by the expression within the divergence operator on the right hand side of (1.38), namely
(1.39) |
The map is known as the probability flux (current) [18, 28]. In general, the probability flux describes how probabilities associated with different states in the state space of a stochastic process change over time. Intuitively, when the probability flux is nonzero, the realizations follow paths that lead towards regions with higher probability density i.e. those states with higher probability of occurrence.
For each upon recognizing that we can rewrite the probability flux in the following way:
Thus, the stationary OU process is time-reversible if and only if the probability flux vanishes. If the flux does not vanish, then observe is involved in the measurement of the flux.
Chapter 2 Preliminaries
In this chapter, we will briefly review the needed notations, definitions, and standard results from Linear Algebra, Probability Theory, and Stochastic Processes that will be used throughout this thesis.
2.1 Linear Algebra
Throughout this section, we let and we fix We denote as the set of all matrices with entries in and we tacitly identify the -dimensional vector space with Recall if then there is a unique linear operator between and whose transformation matrix is Therefore, we will interchangeably refer to as either a matrix or as a linear operator between the appropriate vector spaces.
Let and We recall some standard linear algebra notation. For each and each we denote as the -th entry of and we denote as the -th component of We denote as the transpose of and denote as the conjugate-transpose of We denote as the identity matrix, in which the value of will be clear in context. We denote as either the zero matrix or the zero vector in in which the situation, along with the values of and , will be clear in context. Finally, we denote as the -th canonical basis vector having -th component equal to and all other components equal to
Now, let We denote and as the determinant, trace, and inverse of respectively. Recall the matrix exponential of is the power series
(2.1) |
We state the basic properties of the matrix exponential that we will be using later in the thesis. Proofs of these properties can be found in [30], for example.
Theorem 2.1 (Matrix Exponential Properties).
If then the following properties hold.
-
i)
commutes with under matrix multiplication.
-
ii)
is invertible with
-
iii)
We have
(2.2) -
iv)
The matrix-valued map is differentiable on . In particular,
(2.3)
Let The matrix is Hurwitz if all of its eigenvalues have negative real parts. Note if is Hurwitz, then The matrix is Hermitian if and skew-Hermitian (anti-Hermitian) if Note that Hermitian real matrices are the same as symmetric real matrices, while skew-Hermitian real matrices are the same as skew-symmetric real matrices. We state the important properties of Hermitian and skew-Hermitian matrices. One may consult [30] for proofs.
Theorem 2.2 (Hermitian and skew-Hermitian Matrix Properties).
If is either Hermitian or skew-Hermitian, then the following properties hold
-
i)
is skew-Hermitian if and only if is Hermitian.
-
ii)
If is Hermitian or skew-Hermitian, then is diagonalizable; there is an orthonormal basis of consisting of the eigenvectors of
-
iii)
If is Hermitian, then all its eigenvalues are real. If is skew-Hermitian, then all eigenvalues of are either or purely imaginary.
Finally, let The matrix is circulant if there exists a sequence of numbers such that where is indexed mod Circulant matrices obey several properties [31]. Circulant matrices form a subspace of They commute under matrix multiplication. The product of circulant matrices is circulant, and the transpose of a circulant matrix is circulant.
We state and prove the properties pertaining to the eigenvalues and eigenvectors of circulant matrices that are essential for this thesis.
Theorem 2.3 (Circulant Matrix Properties).
If are circulant matrices, then the following properties hold.
-
i)
is diagonalizable. In particular, if the first row of is of the form then for each the vector
(2.4) is an eigenvector of associated with the eigenvalue
(2.5) where is an -th root of unity. Furthermore, the collection of eigenvectors forms an orthonormal basis of and we have for each
-
ii)
For each we have
(2.6) -
iii)
For each the matrix exponential of is explicitly
(2.7) Moreover, is a circulant matrix.
Proof.
We prove the first statement. For each the -th component of is of the form
which is the -th component of This shows is an eigenvalue of with associated eigenvector Next, note that
(2.8) |
for each which implies Finally, for all with we use the finite geometric series identity
for all to see that
If then
Since is a set of orthonormal vectors in we deduce it is a basis for This proves the first statement.
We prove the second statement. Let be fixed. Because is an eigenvalue of note that is an eigenvalue of with associated eigenvector For each the orthonormality of the collection of eigenvectors yields
This shows that the linear operator and the linear operator agree on the basis of eigenvectors for each . Because every vector in is a unique linear combination of vectors from this basis, the operators agree on all of and thus are identical. This proves the second statement.
Now, we prove the last statement. By the definition of the matrix exponential in (2.1), we have
(2.9) | ||||
(2.10) |
in which we obtained (2.10) by simplifying the inner sum in (2.9) using the Taylor series of the ordinary exponential function:
The product of circulant matrices is circulant and the scalar multiple of a circulant matrix is also circulant. Therefore, for each the matrix is a circulant matrix. Since the sum of circulant matrices is circulant, the partial sum
is circulant for each Thus, upon taking the limit as we deduce is a circulant matrix. ∎
We will adopt the following notation in this thesis. If then we denote as the circulant matrix whose first row is
2.2 Probability Theory
Throughout this section, we let be a probability space, and we also fix Recall a random vector on is a measurable function between and in which is equipped with its Borel sigma algebra.
Let and be two -dimensional and -dimensional random vectors on respectively. We denote as the expectation (mean) of and recall
is the cross-covariance matrix between and The covariance matrix of is the positive semidefinite matrix
Let be an -dimensional random vector on Recall is (-dimensional) Gaussian if its probability density function satisfies
(2.11) |
for some and positive definite matrix Note that has mean and covariance matrix and we write Recall that any linear combination of independent Gaussian random vectors is also a Gaussian random vector. Moreover, any linear transformation of a Gaussian random vector is also a Gaussian random vector.
2.3 Stochastic Processes
Throughout this section, we let be probability space, be a filtration on and we fix Recall an event occurs ()-almost surely (with probability one) if
Recall an (-dimensional) stochastic process on is a map such that for each fixed the map is an -dimensional random vector on A realization (sample path/trajectory) of is a map of the form where is fixed. From henceforth, we adopt the notation in which we write as the random vector at the time The process is adapted to the filtration if is a random vector on specifically equipped with the sigma algebra for each In particular, the standard (natural) filtration of the process is the filtration such that is the sigma algebra generated by the collection of random vectors for each Note that is always adapted to its own standard filtration.
Let be a stochastic process. We recall the following time dependent statistical functions associated with the process. The mean function, autocovariance function, and the covariance function of are defined respectively as the maps and where
(2.12) | ||||
(2.13) | ||||
(2.14) |
The process is (strictly) stationary if the cumulative distribution functions of and are identical for all The process is weakly (wide sense) stationary if its mean function is identically constant and the map between and depends only on the variable Note that all stationary processes are weakly stationary.
Let be a stochastic process adapted to the filtration The process is Markov with respect to the filtration if for all and any measurable, bounded function we have
almost surely. We simply say is Markov if it is Markov with respect to its standard filtration. Moreover, the process is a diffusion process if it is a Markov process having continuous sample paths with probability one. The process is a martingale with respect to the filtration if for all the expectation exists and We simply say is a martingale if it is a martingale with respect to its standard filtration.
Let be a stochastic process. The process is said to be (-dimensional) Gaussian if for all and all times and all the linear combination is a Gaussian random vector. For Gaussian processes, the notions of stationarity and weak stationarity are equivalent. The process is the (-dimensional) standard Wiener process (Brownian Motion) if it is a diffusion process such that almost surely with independent, Gaussian increments: for all both the increments and are independent and where is the identity matrix. Note that the Wiener process is a Gaussian process and a martingale. Furthermore, its component processes are independent for each
Let be a Markov process adapted to the filtration and let the map defined between and be measurable, where is the Borel sigma algebra on Let be a collection of shift operators on such that for all For each and and let
A probability measure on equipped with the Borel sigma algebra is invariant with respect to if
for each and each A set is shift invariant if for all The Markov process is ergodic if it admits an invariant probability measure satisfying for every shift invariant set in which is the probability measure defined by
(2.15) |
The most important theorem pertaining to Markov ergodic processes is Birkhoff’s Ergodic Theorem [32].
Theorem 2.4 (Birkhoff’s Ergodic Theorem).
If is a Markov ergodic process, then for any and for any measurable satisfying the condition that exists, we have
(2.16) |
-almost surely, where
(2.17) |
2.4 Stochastic Differential Equations
As before, let be a probability space and fix Let be the standard -dimensional Wiener process. In this section, we briefly review the theory of stochastic differential equations.
Let be a map such that the -dimensional stochastic process is adapted to the standard filtration of the Wiener process, where vec is the vectorization operator that maps matrix to the -dimensional vector formed by stacking the columns of the matrix one after another. The time integral of and stochastic integral of with respect to the Wiener process are respectively defined as
(2.18) | ||||
(2.19) |
where is an increasing sequence in satisfying and Note the limits in (2.18) and (2.19) are taken in the sense of i.e. the mean-squared limit.
An Ito process is a process adapted to the standard filtration of the Wiener process satisfying
(2.20) |
where and are measurable functions. Informally, we write (2.20) as an equation of the form
(2.21) |
which we refer to as a stochastic differential equation (SDE). We interpret any Ito process in (2.20) as a solution to the SDE (2.21).
We state the existence and uniqueness criteria [21, Theorem 5.2.1] pertaining to the solution of the SDE (2.21).
Theorem 2.5 (SDE Solution Existence and Uniqueness Conditions).
Consider the SDE (2.21). Suppose there exist constants satisfying
(2.22) | ||||
(2.23) |
for all and all where is the Frobenius norm and is the Euclidean -norm. Suppose further that the initial random vector satisfies being finite and is independent of the Wiener process. Then, the SDE (2.21) has a unique solution for which is a diffusion process adapted to the standard filtration of the Wiener process. Moreover, the solution is square integrable i.e. is finite for each
Chapter 3 The Ornstein-Uhlenbeck Process
Let be a probability space. Recall in the introduction, we defined an -dimensional OU Process on this probability space, namely the -dimensional stochastic process satisfying the SDE
(3.1) |
The linear operator is called the friction operator which satisfies the condition is Hurwitz. The linear operator is called the volatility operator. The process is the -dimensional standard Wiener Process. Moreover, we assume that is independent of the Wiener process, in which we recall is the stationary covariance operator satisfying the Lyapunov equation
(3.2) |
and
(3.3) |
is the diffusion operator. We view the friction operator and volatility operator as OU model parameters.
In this thesis, we will utilize the matrix-valued function defined by
(3.4) |
which is commonly known as the Green’s function (propagator) in the literature [18]. We note that is the fundamental solution operator to the linear ordinary differential equation
The stationary covariance matrix has an explicit formula as a matrix-valued integral involving We will be using this explicit formula throughout this thesis.
Theorem 3.1 (OU Process Stationary Covariance Matrix Explicit Formula).
The stationary covariance matrix is explicitly
(3.5) |
Moreover, is positive semidefinite.
Proof.
First, we verify that (3.5) is indeed a solution to the Lyapunov equation (3.2). Substituting the right hand side of (3.5) for into the left hand side of the equation (3.2) and utilizing the property from Theorem 2.1, we have
Next, we verify our solution to the Lyapunov equation (3.2) is unique. To this end, we let and be two solutions to (3.2). Via the product rule, we obtain
which implies must be a constant function of Since is Hurwitz, we have
which forces
(3.6) |
for all Since and are invertible, we left multiply both sides of the equation by and right multiply both sides of the equation by to obtain the equality This shows which means the Lyapunov equation (3.2) has only one solution.
Finally, we show is positive semidefinite. Note that
(3.7) |
which implies solves the Lyapunov equation (3.2). But by uniqueness, we have which means is symmetric. Now, for all nonzero we have
which is non-negative because the integrand is non-negative for all Hence, is positive semidefinite. ∎
3.1 Statistical Properties
In this section, we will recall the important statistical properties of the OU Process. We first derive the explicit representation of the OU process i.e. the solution to the governing stochastic differential equation (3.1). The solution also reveals the essential statistics of the OU process.
Theorem 3.2 (OU Process Explicit Representation).
The unique solution of the governing OU process stochastic differential equation (3.1) is explicitly
(3.8) |
Moreover, the OU process is an Ito diffusion process that is square integrable.
Proof.
First, we show that the right hand side of (3.8) solves the stochastic differential equation (3.1). Let
Using Ito’s product rule and the Ito formalisms [21, Theorem 4.2.1], namely
we obtain
(3.9) |
Formally, (3.9) is the integral equation
(3.10) |
Upon recognizing that and recalling that is invertible with inverse by Theorem 2.1, we have
Thus, the right side of (3.8) is a solution to the governing stochastic differential equation (3.1).
Next, we show that (3.1) has a unique solution that is an Ito diffusion process that is square integrable. We need to verify the existence and uniqueness criteria listed in Theorem 2.5 for the OU process. Define and by
(3.11) | ||||
(3.12) |
Since is a linear operator that is independent of the variable it is continuous and is hence measurable. Since is constant function in both and it is automatically measurable. Furthermore, is bounded; by the definition of boundedness, there exists a constant directly satisfying (2.23). Now, let Then
Hence, our choice of satisfies (2.22). Moreover, recall by assumption that is independent of the Wiener process, and exists by virtue of having a finite covariance matrix. Hence, the SDE (3.1) satisfies the assumptions listed in Theorem 2.5. This completes the proof. ∎
Using the solution of the OU process, we derive its important statistical properties.
Theorem 3.3 (OU Process is Gaussian).
The OU process is a Gaussian process.
Proof.
Consider the stochastic processes and where
for each Note that every finite linear combination of random vectors from the OU process is the sum of a finite linear combination of random vectors from and a finite linear combination of random vectors from Furthermore, any linear combination of random vectors from is independent of any linear combination of random vectors from since is independent of the Wiener process.
We show that the process is Gaussian. Every finite linear combination of random vectors from is of the form
(3.13) |
for some and times and constants Observe that the right hand side of (3.13) is a deterministic linear transformation of the Gaussian random vector which means it is also Gaussian. Hence, is a Gaussian process.
Next, we show is a Gaussian process. Consider the map defined between and Since this map does not depend on the variable it is a deterministic matrix-valued function whose entry functions are deterministic real-valued functions. These entry functions are automatically adapted to the standard filtration of the Wiener process and are continuous in which means they are square integrable over any closed, finite interval. Hence, by [21, Theorem 3.2.1] , the process is a Gaussian process.
Finally, every finite linear combination of random vectors from the OU process is the sum of two independent Gaussian random vectors, one of which is a linear combination of random vectors from the process while the other is a linear combination of random vectors from the process Therefore, every linear combination of random vectors from the OU process is a Gaussian random vector. This proves the claim.
∎
Theorem 3.4 (OU Process Statistics).
The OU process is a Gaussian process whose mean, auto-covariance functions are
(3.14) | ||||
(3.15) |
respectively.
Proof.
First, we determine the mean function of the OU process. Let and be the processes as defined in the proof of the previous theorem. Since for each we use the linearity of the expectation operator to obtain
(3.16) | ||||
in which we recognized the expectation of the stochastic integral term in (3.16) vanishes due to the process being a martingale.
Next, we calculate the autocovariance function of the OU process. Using the definition (2.13) and the bilinearity of the covariance operator, we have
(3.17) | ||||
(3.18) | ||||
in which (3.18) is the result of expanding out the covariance expression in (3.17) using bilinearity and the fact and are independent and hence uncorrelated. We now determine For all we use the additivity of the stochastic integral and the bilinearity of the covariance operator to obtain
(3.19) | ||||
(3.20) | ||||
(3.21) | ||||
in which the second covariance term in (3.19) vanishes due to the Wiener process having independent increments, and (3.21) follows from evaluating the covariance term in (3.20) with Ito’s Isometry [21, Corollary 3.1.7]. Similar reasoning gives
for Upon combining the results corresponding to the cases and we have
for all Hence, we obtain the OU process autocovariance function. ∎
Based on the time-dependent statistics, we can deduce the OU process is stationary.
Theorem 3.5 (OU Process Stationary).
The OU process is stationary, and its covariance function is
(3.22) |
for all
Proof.
Since the OU Process is a Gaussian process, it suffices to show the process is weakly stationary. In particular, we show the map defined between and depends only on Differentiating the map with respect to we obtain
(3.23) | ||||
(3.24) | ||||
(3.25) | ||||
(3.26) |
where the second term of (3.24) follows from applying the change of variables on the integral term within the partial derivative expression of (3.23), and the first term of (3.25) follows from rewriting the first term of (3.24) using the the fact that and commute according to Theorem 2.1. Finally, (3.26) follows from the Lyapunov equation (3.2). Since the partial derivative of vanishes on the connected set we conclude that depends only on
Next, for each we have
∎
The previous theorem implies for each which trivially implies the stationary distribution of the OU process is the Gaussian distribution with mean and covariance matrix
We provide a sufficient condition for the OU process to be ergodic.
Theorem 3.6 (OU Process Ergodicity).
If the diffusion matrix is invertible, then the OU process is ergodic.
Proof.
In order to prove ergodicity, we need to show that the stationary distribution of the OU process, namely the Gaussian distribution with mean and covariance matrix is unique [32].
Consider a generic (time homogeneous) Ito diffusion process satisfying the stochastic differential equation
(3.27) |
with and being Borel measurable functions. A sufficient condition [33, Theorem 5.3.2] for this diffusion process to have a unique stationary distribution is that there is an open, bounded domain with smooth boundary such that
-
i)
For each the smallest eigenvalue of the diffusion coefficient is bounded away from
-
ii)
The process is recurrent relative to : for each the mean time at which a path issuing from reaches the set is finite and exists for each compact set in
Moreover, in [34, Theorem 3.9], it was shown that the condition ii) is equivalent to the Foster Lyapunov drift condition, which states there is a non-negative, twice continuously differentiable function for which is negative on Here, is the infinitesimal generator defined by
(3.28) |
for any twice continuously-differentiable where is the gradient of and is the Hessian matrix of
First, we verify statement i) holds for the OU process. Let
where is the unique positive definite solution to the Lyapunov equation
(3.29) |
where is the identity matrix. Since and are positive definite, their traces are positive. Hence, is nonempty. It is an open ball in centered around Note that the diffusion coefficient of the OU process is the constant matrix Since is positive definite, its eigenvalues are all positive. So automatically, its smallest eigenvalue is bounded away from on This means the OU process satisfies statement
Next, we verify the second statement ii) by verifying the Foster Lyapunov drift condition holds. For the OU process, the infinitesimal generator is explicitly
(3.30) |
for each twice-continuously differentiable Consider the Lyapunov function defined by
Then, we compute
For all by the Lyapunov equation (3.29), we have
(3.31) | ||||
in which the second term of (3.31) follows from the Cauchy Schwarz inequality applied to the trace of the product of two positive semidefinite matrices. Hence, the OU process satisfies statement and we conclude the process is ergodic. ∎
3.2 The Lead Process
In this section, we consider the OU process generated by model parameters and and we assume the process is ergodic. We recall the auxiliary lead process that we defined in the introduction, namely the matrix-valued stochastic process in which
(3.32) |
for each More explicitly, for each the -th entry of is an Ito stochastic integral of the form
(3.33) |
where is the -th component OU process. By definition, each is the lead matrix corresponding to a realization of the OU process over the finite time interval
We prove a strong law of large numbers identity pertaining to the lead process.
Theorem 3.7 (Lead Process Strong Law of Large Numbers Identity).
The lead process satisfies the following strong law of large numbers identity:
(3.34) |
almost surely.
Proof.
Using the governing stochastic differential equation (3.1), we write
(3.35) |
We need to determine the almost sure limits of both integral terms in (3.35) as
First, we show that the almost sure limit of the first term of (3.35) is as To this end, consider the map defined by
(3.36) |
This map has real-valued entry functions of the form defined by for Each is continuous and hence measurable between the spaces and equipped with their respective Borel sigma-algebras. Hence, itself is measurable and by Theorem 2.4, we obtain
almost surely.
Next, we show that the almost sure limit of the second term of (3.37) is To this end, consider the matrix-valued stochastic process defined by
for each We need to show that
almost surely. Equivalently, we need to show each entry process of namely the process in which
for each satisfies
(3.37) |
almost surely. Note that is a continuous time square integrable martingale with respect to the standard filtration of the Wiener process because the component OU process is square integrable and is adapted to the component Wiener process By Ito’s Isometry, for each we have
Moreover, the discrete process is a discrete time square integrable martingale.
We show that the discrete time martingale satisfies the strong law of large numbers identity
(3.38) |
almost surely, in which runs over the integers. We consider the martingale difference sequence where
for each Because is a martingale, we have
where is the standard filtration of the Wiener process. Therefore,
which converges as . By [35, Exercise 4.4.10], we deduce converges almost surely to some limit as Because the sequence is an increasing sequence diverging to infinity, we deduce by Kronecker’s Lemma [35, Theorem 2.5.9] that
almost surely, which proves the identity (3.38). Now, by Doob’s submartingale inequality [35, Theorem 4.4.2], we have
which we note converges as . Therefore, by the Borel Cantelli Lemma, we have
This means for large there is a unique integer with such that
(3.39) |
almost surely. Therefore, as we have which means (3.39) converges almost surely to Hence, we deduce the identity (3.37) and conclude the second integral in (3.35) almost surely converges to
Thus, we conclude
almost surely. ∎
Because of this strong law of large numbers identity, we use the matrix
(3.40) |
as our main tool for the Cyclicity Analysis of the OU process. We refer to as the lead matrix of the OU process. In the next chapter, we will investigate the structure of this matrix in depth.
Chapter 4 Cyclicity Analysis of the Cyclic Ornstein-Uhlenbeck Process
In this chapter, we consider the -dimensional cyclic OU process with model parameters and in which is a circulant friction matrix. We write in which for each and is large enough so that is a friction matrix. We let be the diffusion matrix corresponding to this OU process, as defined in (3.3). Recall by Theorem 2.3, for each
(4.1) |
is an eigenvalue of with associated eigenvector
(4.2) |
in which
Using the eigenvalues of , we can determine the explicit lower bound of to ensure that is a valid friction matrix.
Theorem 4.1 (Cyclic OU Process Friction Matrix Stability Condition).
is a valid friction matrix if and only if
Proof.
In order for to be a valid friction matrix, must be Hurwitz i.e. for all where is the real part of the complex number This is equivalent to the condition that
We determine explicitly. For each we extract the real part of to obtain
(4.3) | ||||
in which the inequality (4.3) follows from the fact that is non-positive and for each Hence, we see
which means if and only if
Rearranging the above inequality yields the claim. ∎
In this chapter, we consider the signal propagation model that we introduced Chapter 1 governed by the aforementioned cyclic OU process. In this model, there is a one-dimensional signal propagating throughout a network consisting of sensors. Writing out for each we interpret as the measurement made by the -th sensor of the propagating effect at time . The circulant friction matrix represents the sensor network structure. More specifically, for each the constant is a propagation coefficient, whose magnitude quantifies how receptive the -th sensor is to the activity within the -th sensor for each in which is indexed mod The constant as a suppression coefficient, whose magnitude quantifies the rate at which the propagating signal in our model dissipates. Finally, the diffusion matrix is the covariance matrix of the noise injected into the sensors. In particular, is the covariance between -th component noise injected into the -th sensor and the -th component noise injected into the -th sensor for each
For large dimension and different choices of cyclic OU model parameters and we compute the lead matrix defined in (3.40). We investigate whether Cyclicity Analysis enables us to recover the structure of the cyclic network structure induced by More specifically, we investigate whether the structure of the leading eigenvector of reflects the cyclic network structure. We investigate this problem under two regimes: the first regime is when is far from and the second regime is when is close to
4.1 Lead Matrix Explicit Representation
In this section, we derive the explicit formula of the lead matrix corresponding to the cyclic OU Process generated by the model parameters and defined at the beginning of the chapter.
Theorem 4.2 (Cyclic OU Process Lead Matrix Explicit Formula).
The lead matrix of the cyclic OU process is explicitly
(4.4) |
Proof.
First, we explicitly compute the stationary covariance matrix of the OU process in (3.5). By Theorem 2.3, the Green’s Function defined in (3.4), is explicitly
Substituting this expression for into the integrand of (3.5), we have
Now, upon substituting this explicit representation of into the lead matrix definition in (3.40), we obtain
This completes the proof. ∎
As mentioned in the beginning of the chapter, we consider two regimes for our signal propagation model: the first regime in which the propagating signal exhibits significant dissipation and the second regime in which the signal does not exhibit significant dissipation. In order to better distinguish these regimes and make it suitable for future calculations, we modify our setup. For each we consider the circulant friction matrix
(4.5) |
in place of the original friction matrix in which the constant is a perturbation. In this setup, the former regime occurs when is large i.e. the friction matrix is significantly perturbed from an unstable matrix, while the latter regime occurs when is small i.e. the friction matrix is slightly perturbed from an unstable matrix. So in future sections, we will consider the cyclic OU process with model parameters and with defined as in (4.5). Consequently, we let be the lead matrix of this cyclic OU process and be its leading eigenvector.
In future sections, we will also extend the big-Oh notation to matrices in the following way: if is a function, then we write to represent an matrix-valued function of such that there exists a for which all entry functions of are bounded above by We will specify in context as to whether tends to infinity or tends to
4.2 Only One Nonzero Propagation Coefficient and Injection of Noise into Only One Sensor
In this section, for each perturbation we consider the signal propagation model governed by the Cyclic OU process with model parameters and in which is defined as in (4.5). Here, we assume exactly one propagation coefficient is nonzero i.e. is nonzero for only one index The ground truth network structure is such that the -th sensor is only linked to the -th sensor for each where is indexed mod We further assume and are coprime. Note that this condition ensures the signal reaches every sensor in the network. To see this, implies there exist integers for which i.e. has an integer solution Hence, for each we have
In other words, if the signal is broadcast from the -th sensor, then it reaches the -th sensor after passing through intermediate sensors.
Moreover, we assume that is a volatility matrix such that is a diagonal diffusion matrix whose -th diagonal entry is nonzero for only one index This corresponds to the situation where noise is only injected into only one sensor, namely the -th sensor.
We derive the lead matrix corresponding to this particular cyclic OU process with the assumptions imposed on and
Theorem 4.3 (One Nonzero Propagation Coefficient, Noise in One Sensor Lead Matrix Explicit Formula).
For each with the assumptions imposed on the model parameters and the lead matrix is explicitly
(4.6) |
Proof.
Substituting and for all with into the general cyclic OU process lead matrix formula (4.6), we simplify the resulting summand using matrix multiplication:
This proves the claim. ∎
Now, under the two regimes, we determine whether the structure of the leading eigenvector of enables us to recover the expected network structure induced by .
4.2.1 First Regime
We start with the first regime, in which the perturbation is large. We derive the asymptotic expansion of the lead matrix under this regime.
Theorem 4.4 (One Nonzero Propagation Coefficient, Noise in One Sensor First Regime Lead Matrix Asymptotic Expansion).
If is the lead matrix in (4.6), then as we have
(4.7) |
where is the skew-symmetric matrix whose -th entry is
(4.8) |
Proof.
Upon dividing the numerator and denominator of the summand in (4.6) by we obtain
(4.9) |
Suppose satisfies
for all We can then rewrite the summand on the right hand side of (4.9) as a geometric series:
(4.10) | ||||
(4.11) |
in which (4.11) is the result of extracting the inner double sum corresponding to the term in (4.10) and replacing the remaining sum over all with Extracting the -th entry of (4.11), we get
(4.12) |
To complete the proof, we evaluate the double sum appearing on the right hand side of (4.12). In order to do so, we utilize the following geometric series identity for all :
(4.13) |
Using (4.13), we obtain
(4.14) |
Now, for consider the systems of congruence equalities appearing in (4.14), namely
(4.15) | ||||
(4.16) |
Observe the second congruence equation in (4.15) holds if and only if divides But since the second congruence equation in (4.15) holds if and only if Similarly, the second congruence equation in (4.16) holds if and only if Moreover, note that (4.15) and (4.16) cannot simultaneously hold if . Otherwise, we would have both and which would imply i.e. This would mean divides and if this would contradict the assumption Hence, we can simplify (4.14) in the following way:
which proves the claim. ∎
Knowing the asymptotic structure of the lead matrix under the first regime, we now investigate the asymptotic structure of its leading eigenvector.
Theorem 4.5 (One Nonzero Propagation Coefficient, Noise in One Sensor First Regime Leading Eigenvector Asymptotic Expansion).
The leading eigenvector of satisfies
(4.17) |
where is the -th canonical basis vector in and is indexed mod as before.
Proof.
Consider matrix whose -th entry is defined in (4.8) for all Note that is a skew-symmetric matrix that has only two nonzero rows, namely and Since these rows are orthogonal, we deduce has rank Moreover, the previous theorem states as we have
which implies a low rank approximation of as
We now determine the leading eigenvector of Define
Note that
(4.18) |
Because and are orthogonal, they are linearly independent, and the angle between them is Hence, by the formula for the eigenvalue and eigenvector of a rank skew-symmetric matrix stated in [3], the largest eigenvalue of is
(4.19) | ||||
Its associated eigenvector is the leading eigenvector of which is explicitly
∎
We interpret the result of this previous theorem. In the first regime, the leading eigenvector of the lead matrix converges to a vector with exactly two nonzero components with equal moduli, namely the -th and -th components. The -th component lies on the real axis, while the -th component lies on the imaginary axis. All other components are equal to Therefore, the structure of the asymptotic leading eigenvector under the first regime suggests that only the -th and -th sensors receive the dissipating signal that is propagating throughout the network. Thus, Cyclicity Analysis does not enable us to fully recover the expected network structure under the first regime.
4.2.2 Second Regime
We now investigate the second regime, in which the perturbation is small. We derive the asymptotic expansion of the lead matrix under this regime.
Theorem 4.6 (One Nonzero Propagation Coefficient and Noise in One Sensor Second Regime Lead Matrix Asymptotic Formula).
Consider the lead matrix in (4.6). Then, as the matrix has an asymptotic expansion of the form
(4.20) |
in which the summand inside the double sum is defined to be if
Proof.
We use the same technique that was used to prove Theorem 4.4. We divide the numerator and denominator of the summand in (4.6) by the quantity to obtain
(4.21) |
Suppose
for all Then, we can rewrite (4.21) as a geometric series. We have
(4.22) | ||||
(4.23) |
in which we obtained (4.23) by extracting the term in the outer sum of (4.22) and replacing the sum over all with
Now, we need to justify defining the summand in the double sum of (4.20) to be when Suppose are two indices satisfying
(4.24) |
Upon taking the moduli of both sides and using the triangle inequality, we obtain
Therefore, we have
(4.25) |
Recall if satisfy i.e. are such that equality is obtained in the triangle inequality for complex numbers, then either one of must be zero or (see [36]). Since are both nonzero, (4.25) implies i.e. and so Now, substituting into (4.24), we obtain which implies
Because divides we see the congruence equation has a solution for In particular, by [37, Section 2.6], we have
(4.26) |
for But since this forces This shows (4.21) implies Finally, note that
if But taking the limit of the left hand side as we obtain via L’Hospital’s rule:
(4.27) |
This completes the proof. ∎
4.3 Second Regime Special Case: First Propagation Coefficient Nonzero and Noise Injected into Last Sensor
At the current time of writing, we are not aware of any closed formulas for the eigenvalues and eigenvectors of the matrix defined on the right hand side of (4.20). Nevertheless, in this section, we investigate the second regime in the situation where and That is, the ground truth network structure is such that -th sensor is linked only to the -th sensor, and noise is injected only into the -th sensor. To simplify the analysis, we will also assume that and
We investigate the asymptotic structure of the lead matrix in the second regime.
Theorem 4.7 (First Nonzero Propagation Coefficient, Noise in Last Sensor Second Regime Lead Matrix).
Consider the lead matrix in (4.6). Then, as we have an alternate asymptotic expansion
(4.28) |
where is the skew-symmetric matrix whose -th entry is
(4.29) |
and
is the binomial coefficient.
Proof.
Substituting and into (4.20), we get
(4.30) |
Extracting the -th entry of (4.30), we obtain
(4.31) | ||||
(4.32) |
Upon expanding the expression within the summand of (4.32) with the binomial theorem, we get
(4.33) | ||||
(4.34) | ||||
where (4.34) follows from evaluating the inner double sum within (4.33) using the geometric series identity (4.13).
Now, consider the system of congruence equations
(4.35) | ||||
(4.36) |
where and The second congruence equation in (4.35) implies for some integer while the first implies for some integer Similarly, The congruence equations in (4.36) imply and for some integers and Hence, we can rewrite (4.34) as a double sum over all and in the following way:
(4.37) | ||||
(4.38) | ||||
(4.39) | ||||
in which (4.38) follows from extracting the term corresponding in (4.37) and replacing the remaining sum over all other with and (4.39) follows from rewriting (4.38) using the binomial coefficient identity [38]:
This completes the proof. ∎
4.3.1 Structure of Eigenvalues
We investigate the spectrum of whose -th entry is defined in (4.29), for large . To make things suitable for future calculations, we will consider the Hermitian matrix in place of the original Note that the eigenvalues of are the eigenvalues of multiplied by However, the eigenvectors of are the same as those of Furthermore, we parameterize by its size In the end, we define to be the Hermitian matrix whose -th entry is the quantity (4.29) multiplied by for all Throughout, we let be the -th largest eigenvalue of and be the orthonormal basis of eigenvectors such that is the associated unit eigenvector of with Finally, we denote as the -th entry of
Recall the -th Gershgorin disk of is the disk in the complex plane with center and radius equal to
(4.40) |
We concentrate on the largest eigenvalue of for large By Gershgorin’s Circle Theorem [39], all eigenvalues of lie in the union of the Gershgorin disks. But we conjecture an upper bound on the radii of these disks.
Conjecture 4.1 (Gershgorin Radii Upper Bound).
The Gershgorin radii of satisfy
(4.41) |
To see this conjecture holds for we evaluate
(4.42) | ||||
(4.43) | ||||
(4.44) | ||||
(4.45) |
in which we obtained (4.43) by rewriting (4.42) after extracting the term and used the identity
to obtain (4.44) from (4.43). Taking the limit of (4.45) as we see the conjecture holds true for the case
Assuming Conjecture 4.1 holds, we can show the sequence of largest eigenvalues converges.
Theorem 4.8 (Asymptotic Binomial Matrix Largest Eigenvalue Convergence).
The sequence of largest eigenvalues converges.
Proof.
To show convergence, we show that the sequence is increasing and bounded above in
First, we show the sequence is increasing. By induction on we can write in the block form
(4.46) |
where is the first column of after excluding its first component . Notice that is the first principal minor of that is, we obtain by deleting the first row and first column of By the Cauchy Interlacing Theorem [40], the eigenvalues of interlace those of which specifically means
(4.47) |
The above inequality directly implies is an increasing sequence.
Next, we show that is a sequence that is bounded above. Consider the -th Gershgorin disk radius of Assuming Conjecture 4.1 holds, for we have
Hence, as we see the Gershgorin radii of are bounded above by This implies the sequence is bounded above by ∎
Now, we make a conjecture about the limit of the sequence
Conjecture 4.2 (Asymptotic Binomial Matrix Largest Eigenvalue Convergence).
For small we have
(4.48) |
We provide numerical evidence supporting Conjecture 4.2. In Table 4.1, we tabulate the distance between and for We observe such distances are close to zero.
Now, we pose a conjecture about the largest eigenvalues corresponding to the principal minors of Recall the -th principal minor of is the matrix formed by deleting the -th row and -th column of
Conjecture 4.3 (Asymptotic Binomial Matrix Principal Minor Largest Eigenvalues).
For each we have
(4.49) |
where is the -th largest eigenvalue of
We provide some numerical evidence supporting the conjecture. In Table 4.2, we tabulate the largest eigenvalues of the principal minors of in the situation . We observe as gets larger, the largest eigenvalue of the -th principal minor precipitously decreases.
4.3.2 Structure of Leading Eigenvector
For large we now investigate the structure of the leading eigenvector of assuming all the previous conjectures we stated pertaining to the spectrum of hold.
First, we describe the behavior of the modulus of the -th component
Theorem 4.9 (Asymptotic Binomial Matrix Leading Eigenvector Component Moduli Structure).
For each we have
(4.50) |
Moreover, we have
(4.51) |
Proof.
We first prove the first statement. We utilize the eigenvalue-eigenvector identity [40] which states
(4.52) |
Assuming Conjecture 4.3 holds, we have
This means we can bound the numerator on the right hand side of (4.52) in the following way:
which proves the first statement.
Now, we prove the second statement, recall that by the Cauchy Interlacing inequality. We have
which approaches as by Theorem 4.8. This proves the second statement. ∎
We interpret the previous theorem. The previous theorem states for large , the component moduli decrease from right to left. We interpret this in the context of our signal propagation model. As the signal propagates throughout the network, the left most sensors are less receptive to the signal than the right most sensors.
Now, we pose a conjecture about the component phases of
Conjecture 4.4 (Asymptotic Binomial Matrix Leading Eigenvector Component Phase Structure).
For large there exists large for which
In Table 4.3, we show numerical evidence of Conjecture 4.4 when We observe the last component phases are in increasing order, supporting the conjecture. But at the current time of writing, we do not have a general conjecture pertaining to the behavior of the other phases.
We interpret the result of the conjecture. Suppose the signal in our propagation model were to be broadcast from the last sensor, namely the -th sensor. Then, Conjecture 4.4 states the order of the last few eigenvector component phases reflects the expected order of the next few sensors that receive the signal. Therefore, Cyclicity Analysis enables us to partially recover the expected network structure.
4.4 Only One Nonzero Propagation Coefficient and Independent and Identical Noise Injected into Few Sensors
Next, let be the friction matrix as defined in the beginning of Section 4.2. But this time, let be a volatility matrix such that the diffusion matrix is a diagonal diffusion matrix containing multiple nonzero diagonal entries that are equal. In particular, we let the last diagonal entries of be positive and equal i.e. we assume for some while all other diagonal entries are This corresponds to the situation we inject independent and identical noise into the last sensors.
We explicitly derive the lead matrix with the assumptions imposed on and
Theorem 4.10 (Cyclic OU Process One Nonzero Propagation Coefficient and Independent and Identical Noise in Multiple Sensors Lead Matrix Explicit Formula).
For each the lead matrix is explicitly
(4.53) |
Proof.
Upon substituting and for all into the right hand side of the Cyclic OU lead matrix formula in (4.6) and letting be the -th canonical basis vector in we have
This proves the claim. ∎
We now investigate the structure of the leading eigenvector of under the two regimes.
4.4.1 First Regime
We begin with the first regime when is large.
Theorem 4.11 (One Nonzero Propagation Coefficient and Independent and Identical Noise in Multiple Sensors First Regime Lead Matrix Asymptotic Expansion).
Consider the lead matrix defined in (4.53). Then, as we have the following asymptotic expansion:
(4.54) |
where is the skew symmetric matrix whose -th entry is of the form
(4.55) |
Proof.
Under the first regime, we investigate the leading eigenvector as
At the current time of writing, we are not aware of any explicit formulas for the eigenvalues and eigenvectors of (4.54) for general So we concentrate on the case i.e. only the propagation coefficient is nonzero. For simplicity, we let and
Theorem 4.12 (First Nonzero Propagation Coefficient and Independent and Identical Noise in Multiple Sensors First Regime Leading Eigenvector Asymptotic Structure).
If and then the lead matrix in (4.53) satisfies
(4.56) |
Proof.
Let be the skew-symmetric matrix whose -th entry is defined in (4.55) for each Since
(4.57) |
we see that is a low rank approximation of as
We seek the leading eigenvector of Note that can be written in the block form
(4.58) |
in which is the matrix such that for each and all other entries are
We note that is a tridiagonal Toeplitz matrix, whose eigenvalues and eigenvectors are known [41]. In particular, if for each and for all Then,
(4.59) |
is an eigenvalue of with associated eigenvector
(4.60) |
Therefore, upon substituting and and the eigenvalues of the bottom right submatrix are of the form
(4.61) |
for each with corresponding eigenvector
(4.62) |
Therefore, the eigenvalues of are the eigenvalues of and In particular, the largest eigenvalue of is with associated leading eigenvector being the vector formed by prepending with zeros. Hence, upon normalization, we have
(4.63) |
which proves the claim. ∎
We interpret the result given by Theorem 4.12. The last leading eigenvector components are nonzero, suggesting the propagating signal in our model is received by the sensors. However, because the last components either lie on the real or imaginary axes the leading eigenvector component phases are either or So we observe the cyclic order of the component phases does not match the expected cyclic order of sensors receiving the signal. Hence, Cyclicity Analysis does not enable us to fully recover the network structure.
However, we notice the last leading eigenvector component moduli form a sinusoidal pattern, which is interesting. This suggests if the signal were to be broadcast from the -th sensor, then one of the later sensors is most receptive to the signal.
4.4.2 Second Regime
We investigate the second regime, in which the perturbation is small.
Theorem 4.13 (One Nonzero Propagation Coefficient and Independent and Identical Noise in Multiple Sensors Second Regime Lead Matrix Asymptotic Expansion).
Consider the lead matrix in (4.53). Then, as the matrix has an asymptotic expansion of the form
(4.64) |
in which the summand inside the double sum is defined to be if
We omit the proof, as it involves the exact same reasoning that was used to prove Theorem 4.6. At the current time of writing, we are not aware of any closed formulas for the eigenvalues and eigenvectors of the matrix on the right hand side of (4.64). Nevertheless, we pose a conjecture in the situation where and
Conjecture 4.5 (First Nonzero Propagation Coefficient and Independent and Identical Noise in Multiple Sensors Second Regime Asymptotic Leading Eigenvector Structure).
Let be the matrix on the right hand side of (4.64). If is the leading eigenvector of then we have
(4.65) |
and for each and for each
We interpret this conjecture. In the situation where we inject independent and identical noise into nodes, the order of the few leading eigenvector component phases matches the expected order of those last few sensors receiving the signal in our propagation model. In addition, even though we inject noise into the last sensors, the -th sensor is most receptive to the propagating signal, which is interesting.
In Table 4.4, we provide evidence supporting Conjecture 4.5 in the situation where and In particular, we observe the -th component has the largest modulus. Moreover, we observe the last component phases are increasing.
4.5 Independent and Identical Noise Injected into All Sensors
We now investigate the signal propagation model under the assumptions the propagation coefficient is nonzero for exactly one index with and is a volatility matrix such that the diffusion matrix is proportional to the identity matrix. This choice of corresponds to the situation where we inject independent and identical noise into all sensors. Since the diagonal entries of are all the same, we may assume without loss of generality that is the identity matrix itself.
We first derive the lead matrix with the imposed assumptions on these cyclic OU model parameters.
Theorem 4.14 (Cyclic OU Process One Nonzero Propagation Coefficient and Independent and Identical Noise in All Sensors Lead Matrix Explicit Formula).
For each with the assumptions imposed on and the lead matrix is explicitly
(4.66) |
Furthermore, is a circulant matrix.
Proof.
We substitute the identity matrix for and and for all into the general Cyclic OU lead matrix formula (4.4). Recalling is an orthonormal basis of we can simplify the summand in (4.4) in the following way:
(4.67) |
Now, recall the Euler identities
Using the Euler identities, we rewrite the numerator and denominator of the summand in (4.67) to obtain
Next, since is circulant, recall is circulant for each by Theorem 2.3. Examining the explicit formula for the stationary covariance matrix in (3.5), we see that the integrand is a product of the circulant matrices and its transpose, which implies is circulant. Therefore, and are both circulant, and since is proportional to the difference between and we conclude is circulant. ∎
4.5.1 Second Regime
We now consider the second regime in which the perturbation constant is small.
Theorem 4.15 (Cyclic OU Process One Nonzero Propagation Coefficient and Independent and Identical Noise in All Sensors Second Regime Leading Eigenvector Explicit Formula).
Consider the lead matrix defined in (4.66). Then, as the leading eigenvector of is where satisfies
Proof.
First, we need to determine the eigenvalues of In order to do this, we need to compute the first row of Since is skew-symmetric, its first row is the opposite of its first column. To obtain its first column, we compute the product of and where is the first standard basis vector in This means the first row of is
(4.68) |
As a result, for each each eigenvalue of is of the form
Note that
(4.69) |
We seek the index maximizing Note that the map is strictly decreasing on and strictly increasing on Since for all the map restricted to the interval is symmetric about Finally, note that is -periodic. The integers maximizing specifically in the domain are and By periodicity, the integer maximizing on its entire domain satisfies
This means an index that maximizes the eigenvalue limit (4.66) must satisfy either the congruence equation or the congruence equation Both congruence equations have integer solutions because which is equal to by assumption, divides
Let be such that Then, the associated eigenvector of the eigenvalue is which is the leading eigenvector of ∎
Knowing the leading eigenvector of under the second regime, we investigate whether the structure of the leading eigenvector recovers the structure of the cyclic sensor network.
Theorem 4.16 (Cyclic OU Process One Nonzero Propagation Coefficient and Independent and Identical Noise in All Sensors Second Regime Leading Eigenvector Structure).
Let be the leading eigenvector of Then, the permutation representing the cyclic order of the component phases is defined by
Proof.
First, we note that To see this, recall the previous theorem states that By definition, this means
for some integer Hence, we can write as an integer linear combination of the numbers and which implies
Let be the permutation such that
(4.70) |
for each Firstly, we note is well-defined. For each because divides there is a unique solution to the congruence equation (4.70) for
For each we have
Therefore, we deduce
Because we may divide both sides of the congruence equation by to deduce that
(4.71) |
Finally, we claim is a cyclic order permutation corresponding to the component principal arguments of the eigenvector Recall the -th component of is explicitly
By (4.70), we have
which has principal argument Therefore, our permutation ensures the components of are sorted in increasing order by their corresponding principal arguments. Thus, is a valid cyclic order permutation. ∎
We interpret the result of the previous theorem. If we inject independent and identical noise into all sensors, then under the second regime, the structure of the leading eigenvector of enables us to recover the expected network structure induced by the friction matrix In particular, the cyclic order of the phases of the leading eigenvector matches the expected cyclic order of the sensors receiving the signal in our propagation model.
Chapter 5 Experimental Results
In this chapter, we discuss the experimental results pertaining to the main topic of our thesis. Throughout, we consider an -dimensional OU Process with fixed model parameters and We let be the diffusion matrix and be the lead matrix of the OU process. For the purposes of this chapter, we specifically refer to as the theoretical lead matrix of the OU process.
5.1 OU Process Realization Time Series Simulation
In this section, we describe how to generate a time series approximating a realization of the OU process over a time interval of the form where is a fixed, large constant. We employ the Euler-Maruyama method [42], which we describe in detail.
First, we fix a large integer representing the number of iterations, and a small constant representing the time step size between consecutive times. In the original governing SDE (3.1) of the OU process, we replace the time differential with the stochastic differential with the difference and the Wiener differential with the difference
Recall the increments of the standard Wiener process are independent and Gaussian; in particular, is an -dimensional Gaussian random vector with mean and covariance matrix where is the identity matrix. So we generate a sequence of vectors representing the Wiener increments, in which all vectors are randomly and independently chosen according to the -dimensional Gaussian distribution with mean and covariance matrix We let be an initial starting vector randomly chosen according to the Gaussian distribution with mean and covariance matrix where solves the Lyapunov equation (3.2).
We now construct the time series where for each Explicitly, we have
(5.1) |
for each This constructed time series is what we use to approximate a realization of the OU process over the specific time interval
5.2 OU Process Empirical Lead Matrix Generation
Throughout, we fix and Consider a time series produced according to our scheme in (5.1). We compute its empirical lead matrix, denoted via the shoelace formula mentioned in (1.22).
As a consequence of the Strong Law of Large numbers identity stated in Theorem 3.7, for each fixed we have
(5.2) |
for almost all realizations of the OU process, where is the theoretical lead matrix.
We show numerical evidence of the discrete law of large numbers identity (5.2) via an example. Consider the -dimensional OU process with model parameters and equal to the identity matrix. In Figure 5.2, for each and each we display a scatterplot, in which the first coordinate is the -th entry of the time-averaged empirical lead matrix and the second coordinate is the -th entry of the theoretical lead matrix We observe as increases, the scatterplot approaches the straight line which means the entries of the time-averaged empirical lead matrix approach the corresponding entries of the theoretical matrix . Furthermore, our example shows that even when decreasing the time step size from to , we still would need a large number of iterations if we want an accurate estimate of from a time-averaged empirical lead matrix.
5.3 Cyclicity Analysis of the Cyclic OU Process
We revisit the main problem statement of our thesis, in which we consider signal propagation model governed by the cyclic OU process with model parameters and for each where the circulant friction matrix is of the form
for some fixed propagation coefficients
Throughout this section, we fix the dimension assume the first propagation coefficient is equal to while all other propagation coefficients are This means the expected network structure is such that the -th sensor is receptive only to activity within the -th sensor, its immediate neighbor to the right, in which is indexed mod . We also assume is a volatility matrix such that the diffusion matrix is a diagonal matrix whose positive entries are equal. We let be the -th diagonal entry of Recall corresponds to the situation where noise is injected into the -th sensor. For different choices of and perturbation in which is some power of we produce a time-averaged empirical lead matrix corresponding to a realization of the -dimensional cyclic OU process with model parameters and Here, we fix iterations and as the time step size. We present numerical results as to whether the leading eigenvector of the empirical lead matrix enables us to recover the structure of the cyclic network structure induced by
5.3.1 Injection of Noise into Only One Sensor
Suppose we inject noise into only one sensor. Here, we consider the situation where we inject noise into the last sensor i.e. and for We let We chose these particular values of for a specific reason. For the friction matrix is considered unstable due to precision issues. For there was no change to the structure of the leading eigenvector. For the other listed values of we noticed intermediary changes within the structure of the leading eigenvector.
In Figure 5.3, for each such value of we plot the heatmap of a time-averaged empirical lead matrix and the logarithms of its eigenvalue moduli in descending order. We also plot the component phases and moduli of the leading eigenvector. In Table 5.1, we record the ratio of the largest to third largest eigenvalue for each such empirical lead matrix.
5.3.2 Observations
For small we observe the components of the leading eigenvector spiral around the origin. We observe that the last few component phases are decreasing. This means if the signal in our propagation model is broadcast from the last sensor, then the leading eigenvector correctly detects the next few sensors that receive the signal as it propagates. However, we do not see overall decreasing behavior within the component phases corresponding to other indices. This means the global order of the component phases does not reflect the expected order of sensors receiving the propagating signal. Therefore, the phases alone suggest we cannot fully recover the overall network structure. Moreover, we observe all but the last few leading eigenvector component moduli are close to which suggests as the signal is propagating, all but the last few sensors are receptive to the signal. Finally, we observe there is no clear separation between the largest eigenvalue and third largest eigenvalue of the empirical lead matrix, which suggests the largest eigenvalue does not actually dominate the spectrum of the lead matrix.
As gets larger, the spiraling pattern among the leading eigenvector components begins to disappear. The last leading eigenvector component stays put on the real axis. But the second to last leading eigenvector component gets closer to the imaginary axis, while all other leading eigenvector components get closer to the origin. We see chaotic behavior in all but the last two component phases. This is because while the last two leading eigenvector components approach the origin, they do so from different directions. This suggests that only the last two sensors are receptive to the propagating signal, which means we cannot fully recover the expected network structure. Moreover, there is a noticeable gap between the largest eigenvalue and the third largest eigenvalue of the empirical lead matrix, which means the empirical lead matrix can be well-approximated via a low rank matrix.
5.3.3 Independent, Identically Distributed Noise Injected into Few Sensors
Now, suppose we inject independent and identical noise into only a few sensors. Consider the situation where we inject noise into the last sensors i.e. for all and for all other We let In Figure 5.4, we plot the heatmap of a time-averaged empirical lead matrix and the logarithms of its eigenvalue moduli in descending order. We also plot the component phases and moduli of the leading eigenvector. In Table 5.2, we record the ratio of the largest to third largest eigenvalue of each empirical lead matrix.
5.3.4 Observations
For small the components of the leading eigenvector exhibit a visible spiral pattern. We notice, however, the last few leading eigenvector component phases are not decreasing. This suggests if the signal in our propagation model is broadcast from the -th sensor, then we are not able to recover the correct order of the next few sensors receiving the signal. Moreover, we observe that the -th leading eigenvector component modulus is the largest, which suggests the -th sensor is most receptive to the propagating signal. Recall that the index is the smallest index corresponding to the sensor we inject noise. Despite the fact we inject noise into the -th sensor for the corresponding leading eigenvector component moduli suggest the other sensors receiving noise are not as receptive to the propagating signal. Finally, we observe there is no clear separation between the largest eigenvalue of the empirical lead matrix and third largest eigenvalue.
As gets larger, we see the spiraling pattern within the leading eigenvector components disappears. The leading eigenvector components eventually lie on either the real axis, imaginary axis, or the origin.
5.3.5 Independent, Identically Distributed Noise Injected into a Large Number of Sensors
Suppose we inject noise into a larger number of sensors. In particular, suppose we inject noise into the last sensors i.e. for all and for all other We let In Figure 5.5, we plot the heatmap of a time-averaged empirical lead matrix and the logarithms of its eigenvalue moduli in descending order. We also plot the component phases and moduli of the leading eigenvector. In Table 5.3, we record the ratio of the largest to third largest eigenvalue for each such empirical lead matrix.
5.3.6 Observations
For small we observe the components of the leading eigenvector appear to form an elliptical pattern. The component principal arguments exhibit linear monotonically decreasing behavior. This suggests the leading eigenvector phases do approximately reflect the expected structure of the network. Moreover, the component moduli are nonzero and appear to exhibit a symmetric pattern. Finally, we observe there is no clear separation between the first and third largest eigenvalues of the lead matrix.
When gets larger, the elliptical pattern disappears, and the leading eigenvector components approach either the real axis, imaginary axis, or the origin. The last leading eigenvector component moduli are large, while the others are close to Therefore, Cyclicity Analysis suggests we cannot fully recover the structure of the network.
5.3.7 Independent, Identically Distributed Noise Injected into All Sensors
Now, suppose we inject independent and identical noise into all sensors. This corresponds to the situation where is a volatility matrix such that the diffusion matrix is the identity matrix.
In Figure 5.6, we plot the heatmap of a time-averaged empirical lead matrix for each and the logarithms of its eigenvalue moduli in descending order. We also plot the component phases and moduli of the leading eigenvector. In Table 5.3, we record the ratio of the largest to third largest eigenvalue for each such empirical lead matrix.
5.3.8 Observations
For small the components of the leading eigenvector lie on a circle, as indicated by their moduli being equal. Observe the cyclic order of the leading eigenvector component phases correctly matches the order of sensors receiving the signal in our model. Therefore, Cyclicity Analysis enables us to fully recover the network structure.
For large the components of the leading eigenvector still lie on a circle, but they lie on either the positive real, positive imaginary, negative real, or the negative imaginary axis. Therefore, the leading eigenvector component phases do not enable us to recover the network structure.
Chapter 6 Conclusion
In this thesis, we posed very general questions pertaining to the lead lag dynamics amongst the components of an -dimensional signal. Firstly, if two signals evolve similarly throughout time, which signal leads and which signal follows ? Secondly, if all signals evolve similarly throughout time, what is the order in which such signals evolve ?
We discussed Cyclicity Analysis, an established way of answering the aforementioned two questions for cyclic signals, which are repeating, yet aperiodic, signals. Cyclicity Analysis is different from many standard tools in that it is time reparameterization invariant; the results of the analysis do not depend on how one observes the signals throughout time. It involves two parts. The first part is to construct an lead matrix, in which the sign of the -th entry heuristically captures the pairwise leader follower relationship between the -th and -th component signals. It is a specific construction based on iterated path integrals, which are the functionals of the signal that are time reparameterization invariant. The second part of Cyclicity Analysis is to assume a model in which the component signals of trace some periodic signal up to time reparameterization, scaling constants, and time shifts. Under this model, one determines the order in which the component signals evolve throughout time, which is the cyclic order of those time shifts. Under certain baseline situations, the leading eigenvector of the lead matrix can approximately recover the order in which the component signals of evolve throughout time. More specifically, the cyclic order of the eigenvector component phases reflects the cyclic order of the offsets.
In this thesis, we investigated Cyclicity Analysis in a novel setting. We considered an -dimensional Ornstein-Uhlenbeck (OU) stochastic process with model parameters and in which is a friction matrix and is a volatility matrix. We considered a signal propagation model governed by the cyclic OU process, in which the model parameter is a circulant friction matrix. In this model, there is a signal propagating throughout a network of sensors. The matrix represents the cyclic network structure of the sensors, while the diffusion matrix is the covariance matrix whose entries are pairwise covariances of noises injected into any two sensors.
To make Cyclicity Analysis suitable for analyzing the OU process, we defined an auxiliary lead process, a matrix-valued stochastic process in which each random matrix in the collection represents a lead matrix corresponding to a realization of the OU process. The first main result of the thesis is that the lead process obeys a strong law of large numbers identity: its time average converges almost surely to a skew-symmetric matrix .
Then, with different choices of cyclic OU model parameters and we investigated whether Cyclicity Analysis could enable us to recover the network structure induced by under our signal propagation model. More specifically, we analyzed whether the structure of the leading eigenvector of could enable us to recover the network structure induced by
For our analysis, we assumed the ground truth network structure is such that every sensor receives the propagating signal from exactly one sensor to its right. In addition, we assumed independent and identical noise is injected into multiple sensors, which corresponds to the situation where is a diagonal diffusion matrix whose nonzero diagonal entries are equal.
With the imposed assumptions on and our experimental results showed that Cyclicity Analysis was in fact able to partially recover the structure of the network. For example, in the situation where we inject noise only into the last node, the cyclic order of the component phases of the leading eigenvector of partially reflected the expected cyclic order of the sensors receiving the signal in our model. As we increase the number of sensors receiving noise, we observed the cyclic order of the component phases approximately match the expected cyclic order of the sensors. In the ultimate case identical noise is injected into all sensors, the cyclic orders perfectly match.
There is future work that can be done. For example, what if we remove the assumption that has equal positive diagonal entries ? This would correspond to a situation where independent, but not identical, noise is injected into multiple sensors. What if represents a network structure such that each sensor does not receive the signal from only one other sensor, but multiple sensors ? Does the leading eigenvector of the lead matrix capture those phenomena ?
References
- [1] Elias M Stein and Rami Shakarchi. Fourier analysis: an introduction, volume 1. Princeton University Press, 2011.
- [2] Athanasios Papoulis. The Fourier integral and its applications. McGraw-Hill New York, 1967.
- [3] Yuliy Baryshnikov and Emily Schlafly. Cyclicity in multivariate time series and applications to functional mri data. In 2016 IEEE 55th conference on decision and control (CDC), pages 1625–1630. IEEE, 2016.
- [4] Ivan Abraham, Somayeh Shahsavarani, Benjamin Zimmerman, Fatima Husain, and Yuliy Baryshnikov. Slow cortical waves through cyclicity analysis. bioRxiv, 2021.
- [5] Kuo-Tsai Chen. Integration of paths–a faithful representation of paths by noncommutative formal power series. Transactions of the American Mathematical Society, 89(2):395–407, 1958.
- [6] Kuo-Tsai Chen. Algebras of iterated path integrals and fundamental groups. Transactions of the American Mathematical Society, 156:359–379, 1971.
- [7] Kuo-Tsai Chen. Iterated integrals of differential forms and loop space homology. Annals of Mathematics, 97(2):217–246, 1973.
- [8] Kuo-Tsai Chen. Iterated path integrals. Bulletin of the American Mathematical Society, 83(5):831–879, 1977.
- [9] Terry J Lyons. Differential equations driven by rough signals. Revista Matemática Iberoamericana, 14(2):215–310, 1998.
- [10] Terry J Lyons, Michael Caruana, and Thierry Lévy. Differential equations driven by rough paths. Springer, 2007.
- [11] Sheldon Axler. Linear algebra done right. Springer Nature, 2024.
- [12] Eugene L Allgower and Phillip H Schmidt. Computing volumes of polyhedra. mathematics of computation, 46(173):171–174, 1986.
- [13] George E Uhlenbeck and Leonard S Ornstein. On the theory of the brownian motion. Physical review, 36(5):823, 1930.
- [14] Emmanuel Gobet and Qihao She. Perturbation of ornstein-uhlenbeck stationary distributions: expansion and simulation. 2016.
- [15] Vicky Fasen. Statistical estimation of multivariate ornstein–uhlenbeck processes and applications to co-integration. Journal of Econometrics, 172(2):325–337, 2013.
- [16] Attilio Meucci. Review of statistical arbitrage, cointegration, and multivariate ornstein-uhlenbeck. 2009.
- [17] Krzysztof Bartoszek, Jesualdo Fuentes-González, Venelin Mitov, Jason Pienaar, Marcin Piwczyński, Radosław Puchałka, Krzysztof Spalik, and Kjetil Lysne Voje. Model Selection Performance in Phylogenetic Comparative Methods Under Multivariate Ornstein–Uhlenbeck Models of Trait Evolution. Systematic Biology, 72(2):275–293, 12 2022.
- [18] Claude Godrèche and Jean-Marc Luck. Characterising the nonequilibrium stationary states of ornstein–uhlenbeck processes. Journal of Physics A: Mathematical and Theoretical, 52(3):035002, 2018.
- [19] Valentin Courgeau and Almut ED Veraart. Likelihood theory for the graph ornstein-uhlenbeck process. Statistical Inference for Stochastic Processes, pages 1–34, 2022.
- [20] Alexander S. Poznyak. Chapter 9 - stable matrices and polynomials. In Alexander S. Poznyak, editor, Advanced Mathematical Tools for Automatic Control Engineers: Deterministic Techniques, pages 139–174. Elsevier, Oxford, 2008.
- [21] Bernt Øksendal. Stochastic Differential Equations, pages 65–84. Springer Berlin Heidelberg, Berlin, Heidelberg, 2003.
- [22] M. Gilson, N. E. Kouvaris, G. Deco, and G. Zamora-López. Framework based on communicability and flow to analyze complex network dynamics. Phys. Rev. E, 97:052301, May 2018.
- [23] Ivan T Abraham. On geometric & topological methods for analysis of biophysical time series data. PhD thesis, 2022.
- [24] Yongli Li, Tianchen Wang, Baiqing Sun, and Chao Liu. Detecting the lead–lag effect in stock markets: definition, patterns, and investment strategies. Financial Innovation, 8(1):51, 2022.
- [25] Anish Mitra, Abraham Z Snyder, Carl D Hacker, and Marcus E Raichle. Lag structure in resting-state fmri. Journal of neurophysiology, 111(11):2374–2391, 2014.
- [26] Hiroaki Sakoe and Seibi Chiba. Dynamic programming algorithm optimization for spoken word recognition. IEEE transactions on acoustics, speech, and signal processing, 26(1):43–49, 1978.
- [27] Rajesh Singh, Dipanjan Ghosh, and R Adhikari. Fast bayesian inference of the multivariate ornstein-uhlenbeck process. Physical Review E, 98(1):012136, 2018.
- [28] Matthieu Gilson, Enzo Tagliazucchi, and Rodrigo Cofré. Entropy production of multivariate ornstein-uhlenbeck processes correlates with consciousness levels in the human brain. Physical Review E, 107(2):024121, 2023.
- [29] Hong Qian. Mathematical formalism for isothermal linear irreversibility. Proceedings of the Royal Society of London. Series A: Mathematical, Physical and Engineering Sciences, 457(2011):1645–1655, 2001.
- [30] Roger A Horn and Charles R Johnson. Matrix analysis. Cambridge university press, 2012.
- [31] Robert M Gray et al. Toeplitz and circulant matrices: A review. Foundations and Trends® in Communications and Information Theory, 2(3):155–239, 2006.
- [32] Nikola Sandrić. A note on the birkhoff ergodic theorem. Results in mathematics, 72:715–730, 2017.
- [33] Vincenzo Capasso and Devaid Bakstein. Introduction to Continuous-Time Stochastic Processes. Springer, 2021.
- [34] Rafail Khasminskii. Stochastic stability of differential equations. Springer, 2012.
- [35] Rick Durrett. Probability: theory and examples, volume 49. Cambridge university press, 2019.
- [36] D Martin and LV Ahlfors. Complex analysis. New York: McGraw-Hill, 1966.
- [37] L-K Hua. Introduction to number theory. Springer Science & Business Media, 1982.
- [38] Ronald L. Graham, Donald E. Knuth, and Oren Patashnik. Concrete Mathematics: A Foundation for Computer Science. Addison-Wesley, Reading, 1989.
- [39] Richard S Varga. Geršgorin and his circles, volume 36. Springer Science & Business Media, 2011.
- [40] Peter Denton, Stephen Parke, Terence Tao, and Xining Zhang. Eigenvectors from eigenvalues: A survey of a basic identity in linear algebra. Bulletin of the American Mathematical Society, 59(1):31–58, 2022.
- [41] Silvia Noschese, Lionello Pasquini, and Lothar Reichel. Tridiagonal toeplitz matrices: properties and novel applications. Numerical linear algebra with applications, 20(2):302–326, 2013.
- [42] Peter E. Kloeden and Eckhard Platen. Introduction to Stochastic Time Discrete Approximation. Springer Berlin Heidelberg, Berlin, Heidelberg, 1992.