1. Introduction
Quantum mechanics is at the basis of all modern physics and fundamental for the understanding of the world that we live in. As a general theory, quantum mechanics should apply also to the measurement process. From the general experience of non-destructive measurements, we draw conclusions about the interaction between the observed system and the measurement apparatus and how this can be described within quantum mechanics.
We thus consider a quantum system , interacting with a measurement device. For simplicity we assume that is a two-level system that is not destroyed in the process. Then after the measurement, ends up in one of the eigenstates of the measured observable. If is prepared in one of these eigenstates, it remains in that state after the measurement. If is initially in a superposition of the two eigenstates, it still ends up in one of the eigenstates and the measurement result is the corresponding eigenvalue. The probability for a certain outcome is the squared modulus of the corresponding state component in the superposition (Born’s rule).
An essential question is whether the probabilistic nature of quantum measurement with Born’s rule is an inherent feature of quantum mechanics or whether it can be shown to hold as a result of a quantum-mechanical treatment of the measurement process combined with a statistical analysis. In the latter case, a single measurement would be a quantum-mechanical process in which the state of the measurement apparatus (possibly including its surroundings) determines the result. Born’s rule would then emerge from the statistics of the ensemble that describes the measurement apparatus in interaction with the system subject to measurement.
The requirement that
, if initially in an eigenstate of the observable, remains in that eigenstate after interacting with the apparatus, is usually considered to lead to a well-known dilemma: If applying the (linear) quantum mechanics of the 1930s to
in an initial superposition of those eigenstates, the result of the process appears to be a superposition of the two possible resulting states for
and the apparatus without any change in the proportions between the channels. This has been referred to as von Neumann’s dilemma [
1], and it has led to paradoxical conclusions such as Schrödinger’s cat.
Attempts to get around this problem include Everett’s relative-state formulation [
2] and its continuation in DeWitt’s many-worlds interpretation [
3] as well as non-linear modifications of quantum mechanics [
4,
5,
6,
7,
8]. In the non-linear modifications, one gets the bifurcation of the measurement process but the non-linear character of the basic theory introduces new conceptual difficulties. Mathematically our treatment can be seen to be very close to quantum diffusion [
5]; we have chosen to follow the same conventions in handling the statistics of stochastic variables as in Ref. [
5]. The ambition to understand quantum measurement as a deterministic process we share with the De Broglie-Bohm theory [
9], with the difference that we look for how details in the measurement device influence the process.
Bell pointed out that the Everett-DeWitt theory does not properly reflect the fact that the presence of inverse processes and interference are inherent features of quantum mechanics [
10]:
Thus, DeWitt seems to share our idea that the fundamental concepts of the theory should be meaningful on a microscopic level and not only on some ill-defined macroscopic level. However, at the microscopic level there is no such asymmetry in time as would be indicated by the existence of branching and the non-existence of debranching. [...] [I]t even seems reasonable to regard the coalescence of previously different branches, and the resulting interference phenomena, as the characteristic feature of quantum mechanics. In this respect an accurate picture, which does not have any tree-like character, is the ’sum over all possible paths’ of Feynman.
Therefore, as suggested by Bell, we investigate work of Feynman for a correct theory. We choose the scattering theory of quantum field theory, including Feynman diagrams, as a basis for our description of the measurement process. This theory contains inverse processes that result in a non-linear dependence on the initial state which removes the von Neumann dilemma.
In the field of investigation of the measurement process, a strong belief has been established that microscopic details of the measurement interaction cannot lead to the bifurcation determining the result of measurement (see, e.g., [
11,
12]). This belief is based on von Neumann’s way of handling linear quantum mechanics. In this situation, many have abandoned the ambition to understand the mechanism of a single measurement and concentrated on the full ensemble of measurements. There one has studied the irreversible decoherence process that takes the initial ensemble, with
in a pure state, into a mixed state for the final ensemble after measurement. Since 1970, the year of DeWitt’s many-worlds theory [
3] and Zeh’s paper on decoherence [
11], a new tradition has developed that includes different views on how to interpret quantum mechanics, see for instance Refs. [
13,
14]. Epistemological aspects play an important role in these interpretations.
Our idea is that the microscopic details of the measurement apparatus affect the process so that it takes into either of the eigenstates of the measured observable and initiates a recording of the corresponding measurement result. This is possible to see in a more developed form of linear deterministic quantum mechanics, namely the scattering theory of quantum field theory.
The development of relativistic quantum mechanics led to quantum field theory. For a situation where a quantum system meets a part A of a measurement device, interacts with it and then leaves it, scattering theory can be an adequate description. As we have pointed out already, the scattering theory of quantum field theory has the reversibility that was requested by Bell. These are our reasons for the choice of studying measurement in the scattering theory of quantum field theory.
In our approach, measurement is part of the physics studied, rather than a subject for epistemological analysis. We show how non-linearities can be generated within quantum theory. Our statistical study of measurement processes then shows that those states of the apparatus which are competitive in leading to a final state, also take into one of the eigenstates of the measured observable. Moreover, this bifurcation, leading to one of the two possible final states for , occurs with the frequencies given by Born’s rule.
In the following sections, we shall first give our scattering theory description and then make all possible processes subject to a statistical analysis.
2. The Initial Phase of Measurement as A Scattering Process
Here we study the interaction between the small system and a larger system A with a large number of degrees of freedom. The larger system is assumed to be characterized by an ensemble of possible initial microstates of A. We consider this interaction to be the first part of a measurement process.
Since we are dealing with a two-level system , the Pauli matrices provide a suitable formalism with representing the observable to be measured, with eigenstates and .
Let us investigate the characteristics of the interaction between and A in scattering theory for the case with A in a state with (unknown) microscopic details that are summarized in a variable . We then denote the normalized initial state of A by (with 0 indicating a state of preparedness). This means that we assume to represent one microstate in an ensemble of possible initial states.
A basic requirement is that if is initially in the state , after the interaction with A, its state remains the same. In this process A changes from the initial state to a final state , also normalized. The first j here indicates that A has been marked by the state of . All other characteristics of the final state of A are collected in . The interaction thus transforms the system A from an initial state of readiness, characterized by , to a final state, marked by and characterized by .
For a general normalized state of
,
(with
), the combined initial state of
is
A measurement of
on
leads to a certain result. Since two different results are possible, the
-interaction should in general result in a transition to one of the following states,
The conclusion is then that the outcome must depend on the initial state of A, i.e., on .
In scattering theory, the interaction between
and
A is characterized by a transition operator
M, and this leads to the (non-normalized) final state (see
Figure 1),
In general, the amplitudes,
and
, are not equal and therefore the proportions between + and − can change in a way that depends on the initial state
of
A. (Please note that
M must not to be confused with the unitary scattering operator
S, see
Appendix A).
The requirement of a statistically unbiased measurement means that , where denotes mean value over the ensemble of initial states of A.
Equation (
3) describes a mechanism of the measurement process in which von Neumann’s dilemma is not present. Relativistic quantum mechanics, in the form of scattering theory of quantum field theory, is a more correct theory than the non-relativistic Schrödinger equation, as used in the 1930s, and we choose to use Equation (
3) as our starting point.
In the Feynman-diagram language of quantum field theory, transitions between the two channels, + and −, are possible via returns to the initial state. This is a way to understand how the proportions of the channels can change as described by Equation (
3). A formulation based on perturbation theory to all orders, leads to an explicitly unitary description of the whole process. (This is shown in
Appendix B)
In scattering theory, transition rate (transition probability per unit time) is a central concept as we have reviewed in
Appendix A. The transition rate from the initial state (
1) to the final state (
3) is
, where
is the squared modulus of (
3),
Each term in (
4) represents the partial transition rate for the corresponding channel. Because of our assumption of common mean values for
, the mean value of (
4) is the same,
.
For equation (
3) to properly represent a measurement process, i.e., a bifurcation that leads to a final state with
in either of the eigenstates of
, it is necessary that the squared moduli of the amplitudes satisfy either
or
. If this holds for (almost) all microstates
in the resulting ensemble of final states, then it can function as a mechanism for the bifurcation of the measurement process.
Part of the basis for the von Neumann dilemma was the assumption that
A is in a given initial state
. Before the
-interaction,
A can be in any of the states of the available initial ensemble. These states are ready to influence the recording process in different ways. To reach a final state, given by Equation (
3), they compete with their transition rates,
, which can differ widely between different values of
. The competition can lead to a selection and to a statistical distribution over
of the final states that is very different from the distribution in the initial ensemble.
In the following section, we will construct a mathematical model of how A influences the -interaction, including a description of the ensemble of possible initial states . We then make a statistical analysis to show how both the bifurcation and the proper weights for the different measurement results can be understood in this simple setting.
3. Mathematical Model of the μA-interaction
We shall now model the amplitudes describing the
-interaction, leading to the final state (
3), and how it depends on the initial state
.
To have a generic model, we think of our system
A as consisting of
N independent subsystems, each interacting with
, resulting in amplitudes that are products of
N factors [
15,
16]. Since only factors resulting in differences between the two amplitudes are important, we assume
where
. Small deviations from unity in the factors are characterized by a zero mean,
, while the independence between the subsystems is expressed by
, and
. We have followed the convention, used in stochastic dynamics (as for instance in Ref. [
5]), to calculate to second order in
and then replace
by its mean
. This model reflects the unbiased character of the measurement device, and it guarantees that on average the squared moduli of the amplitudes are identical and equal to unity,
.
As an illustration, in
Appendix C, we describe a situation where
is a fast charged particle emerging from some process and then interacting electrically with the system
A. Here A consists of a chain of small cylinders of ionizable material along the track where
is passing in one of its state components. We show how the amplitude factorizes in this case.
We have chosen to keep each factor in the model close to unity in order to illustrate that even very small variations in how the subsystems interact with the system may result in one of the channels (one of the amplitudes), dominating over the other one, depending on the microstate of the device. This makes it necessary to have a very large number N of subsystems. The critical assumptions are (i) the unbiased character of the device, i.e., not favoring any of the channels; and (ii) the independence of the subsystems of the device. The statistics of the interaction is treated in the following section.
4. Statistical Theory of the Transition to the Final State
We are now ready to investigate the statistical consequences of the
-interaction modelled in the previous section. We introduce the total variance of the parameters in Equation (
5),
, and define an aggregate variable
of
A, suitably normalized,
to represent the overall degree of enhancement/suppression (so that
for net enhancement of + and
for net enhancement of −). It follows that
Y is characterized by mean and variance
and
. Then, for sufficiently small
, we can rewrite (
5) as
with
. Here, in the exponent, we have done the calculation to second order in
and we have replaced
by
.
Since all factors in the product (
5) are independent, the distribution
over the aggregate variable
, defined by Equation (
6), in the ensemble of initial states of
A, is well described by the Gaussian distribution,
centered around
with variance
.
Initial states differ in their efficiency in leading to a transition to a final state, since the total transition rate may depend strongly on
. The transition to the final state (
3) with
given by (
7) has the rate
where
with
. The terms in (
9) are the partial transition rates for the + and − channels.
The total transition rate (
9) depends strongly on
Y. We shall now go into the statistics of the final states which is strongly influenced by
. To get the distribution
over
Y for the
final states, corresponding to
for the initial states, we must multiply
by the transition rate (
9) which is normalized in the sense that its mean value is 1. This is the standard approach in scattering theory, see, e.g., Ref. [
17]. Here, it can be interpreted as a selection process, as previously discussed, that favors initial states which are efficient in leading to a transition, with a selective fitness being proportional to the transition rate (
9). Thus, the distribution over final states can be written (see
Figure 2)
The normalized partial distributions,
and
, also with variance
, are centered around
and
and refer to
ending up in the state
and
, respectively. The coefficients of
and
in
are
and
, expressing Born’s rule explicitly.
It is instructive to follow the distribution
with growing
. For small
(
) it is broad and unimodal; it then turns broad and bimodal with narrowing peaks. For large
, it is split into two well separated distributions with sharp peaks, weighted by the squared moduli of the state components of
,
and
, at
and
, respectively. They represent two different sub-ensembles of final states (see Equation (
2)). Other values of
Y correspond to non-competitive processes. The aggregate variable
Y is "hidden" in the fine unknown details of
A that can influence the
-interaction.
From Equation (
7) we see that the dominance of either channel is characterized by
being either very large or very small. For a small system, i.e., small
N and hence small
, this ratio does not deviate much from unity which means that we have an entangled superposition of the two final states. As
increases, the ratio deviates more strongly from unity and one of the channels starts to dominate. When
is of the order of 10, the sub-distributions,
and
, are essentially non-overlapping,
is concentrated around
, and the ratio of dominance between the channels is of the order of
.
The initial state for
in (
1) is a superposition, a ’
both-and state’, and it ends up in (
2) which is again a product state, with
in
either or . The initial states of
A vary widely in their efficiency to lead to a final state. When one transition-rate term in (
9) is large, the other one is small. The selection of a large transition rate therefore also leads to a bifurcation with one of the terms in (
3) totally dominating the final state.
5. Concluding Remarks
The main contribution in our work is that we have demonstrated that the initial stage of quantum measurement can be described within reversible quantum mechanics. The key components are (i) a scattering theory formulation with inverse processes that both guarantee unitarity and allow for a non-linear mechanism leading to the bifurcation; and (ii) a statistical analysis that reveals how initial states that are efficient in leading to a transition to a final state have a selective advantage and how this results in the correct probabilities of the measurement results as stated by Born’s rule.
In our description, we want the system
A to be big enough for a bifurcation to take place, i.e., for
to be sufficiently large. Our idea has been to follow the qualitative recipe given by Bell who formulated a principle concerning the position of the Heisenberg cut [
10], i.e., the boundary of the system
A, interacting with
according to quantum dynamics (Ref. [
10], p.124):
put sufficiently much into the quantum system that the inclusion of more would not significantly alter practical predictions
On the other hand, the system
A should not be so large that
cannot be described by deterministic quantum mechanics. In the model that we have described, the bifurcation of measurement takes place in the reversible stage of the interaction between
and
A before irreversibility sets in and fixes the result. In this respect, our analysis is very different from decoherence analysis [
11,
18].
Beyond
A, the measurement apparatus must be considered to be an open system with its dynamics described by a Lindblad equation [
19]. The starting point for the development here is one of the final states before the Heisenberg cut, i.e.,
or
, in one of the sub-ensembles described by
or
. Thus, the open dynamics continues only in the channel that happens to have been chosen, + or −.
For future work, a more detailed description is needed of a typical
-interaction, including the statistics of the initial states and the selection of one state
with a large transition amplitude, leading to a final state (
2) with
in one eigenstate,
or
. An important task is to construct a detailed physical model of a non-biased measurement apparatus. The model of
Appendix C is a beginning in this respect, but the mathematical assumptions in Equation (
3) should be directly tied to physical properties of
A. In particular, the non-bias property of
A should be analyzed.
The system A should be neither too small nor too large. Then it is reasonable to describe it as mesoscopic, but Bell’s principle that we have quoted above gives no indication of its actual size. In the development of realistic models, questions of limits and the accuracy of approximations will have to be handled in more detail.
In practical scientific research, there is a common working understanding of quantum mechanics. Physicists have a common reality concept for a quantum-mechanical system when it is not observed, a kind of pragmatic quantum ontology with the quantum-mechanical state of the studied system as the basic concept. Development of this state in time then constitutes the quantum dynamics. If quantum mechanics now can also be used to describe the measurement process, this pragmatic quantum ontology can have a wider validity than has been commonly expected.