Entanglement and coherence

I have come across a wonderful review of entanglement by Chris Drost in his answer to this post. One part that left me puzzled was: (This post is merely an attempt to understand a portion of Chris' answer, unfortunately I do not have enough reputation to ask this as a comment in his post, so I figured a new post wouldn't be a terrible idea as this is a rather important conceptual question for all beginners.)

Obviously, the product-states have a "quantum coherence" to both qubits: doing our double-slit experiment means that we see an interference pattern. Shockingly, entanglement weakens and sometimes eliminates this interference pattern. For example, the state 12|00+12|11 describes an entangled state. If you pass the first qubit of this through the double-slit experiment, normal rules of quantum mechanics give the distribution 12|f0(x)|2+12|f1(x)|2: classically overlapping bell curves!

  1. Unfortunately I fail to see how by entangling two particles they lose their coherence. But When I have a particle A in a superposition state ψA=a|0+b|1 and entangle it to another system B, in state ψB, my first particle still remains in a superposition, and its measurement is still random, is it not?

  2. So why do we say that entanglement destroys coherence? It would be great if one could elaborately show this for the simplest entangled pairs! Is the point maybe that if B is measured first, only then A loses its coherence? (assume here a complete correlation).

  3. Small digression if I may: if it is true that entangelemnt destroys coherence, does the converse mean that the concept of decoherence is tightly related to entanglement of a small system with its environment? Or in other words would decoherence happen at all without entanglement?

I find that a bit puzzling too. This means that the overlap terms vanish. This would make sense if, as you say in (2), one were measured before the other was sent through. Can someone explain?

Respuestas (4)

Okay, this is getting even more into depth, which is great stuff! I heartily recommend anyone who is this dedicated take a few courses on the subject, if you haven't already.

The state matrix formulation of Quantum Mechanics

Here's the most basic formulation of quantum mechanics which adequately shows all of these properties, called the density-matrix or state-matrix formulation. Take a wavefunction |ψ and identify the state-matrix ρ=|ψψ| with this state. The state matrix has all of the same information as the wavefunction but evolves according to the product rule,

i ρt=H^ρρH^.

As always, we predict expectation values of experiments by associating to their numerical parameters a Hermitian operator A^. Now, instead of calculating this as the usual A=ψ|A^|ψ we insert some orthonormal basis I=i|ii| into the middle of this expression as

A=iψ|A^|ii|ψ=ii|ψψ|A^|i=ii|ρ A^|i=TrρA^.
All expectation values are therefore traces of these matrix products. We can also insert a further identity inside these two between them, to find i|ρ|j=ρij,j|A^|i=Aji, and so we have a matrix expression A=ijρijAji, if you like. Any discrete basis of the Hilbert space will work even if it has no particular tie to our Hamiltonian.

How to generate an effective substate matrix

Now suppose we have an observable which only impacts one subsystem of the whole system. Here we simply convert the basis to one that spans both subsystems, |i,j and our observable has the form A^I in terms of its effect on the respective systems. Our expression for the expected value is therefore:

Trρ(A^I)=iji,j|ρ(A^I)|i,j
Inserting another identity I=mn|m,nm,n| we can look carefully to the second term:
A=ij mni,j|ρ|m,nm,n|(A^I)|i,j=ij mni,j|ρ|m,nAmiδnj.
We therefore find that there is an expression for something which acts precisely as an effective substate-matrix ρ~ for the subsystem: it reproduces all of the expectation values that you see above for any operator which only works in the substate. That substate matrix is:
ρ~ij=ni,n|ρ|j,n,
whence A=ijρ~ijAji.

We call the process which generates the substate matrix "tracing out" the rest of the superstate, because it has the same structure as a partial trace.

The difference between superposition and entanglement.

Let us calculate the state matrix for a|0+b|1. This is very simple: it is

ρ=aa|00|+ab|01|+ba|10|+bb|11|,
or, written as a bona-fide matrix,
ρ=[aaabbabb].

Now let us entangle it with another system. We will use the CNOT operation to entangle it with a constant |0, generating a|00+b|11. When we perform the above recipe to this system we find ourselves looking at a completely different density matrix:

ρ~=[aa00bb].
Now let me explain why I couldn't use wavefunctions to get this result: it is that this state matrix cannot be expressed as a wavefunction, unless either a=0 or b=0. The previous matrix ρ is actually as general as a single-particle wavefunction can be, and it has off-diagonal terms. This one does not, precisely because there is no way that the "tracing out" step can convert a |0011| term to anything "internal" to the substate matrix. It lives outside of the substate matrix and can only be measured by measuring both parts of the global state and comparing them!

The double-slit observable

The simplest observable is A^1=|11|, measuring the probability that a qubit is in state |1. Now suppose that we don't do this directly, but first evolve the state with a unitary matrix. This will correspond to a photon going through a slit corresponding to the qubit and then traveling to a photomultiplier tube at position y, which will "click" (transition from |0 to |1 with amplitudes f0,1(y) when only one of these is open. So the unitary transformation is, for some α0,1 that don't matter,

|0α0(y)|0+f0(y)|1|1α1(y)|0+f1(y)|1.
We measure the resulting qubit, which results in
A=Tr(UρU A^1)=Tr(ρ UA^1U).
The matrix UA^1U is therefore
[α0f0α1f1][0001][α0α1f0f1]=[f0f0f1f0f0f1f1f1].
This is our double-slit observable matrix.

From this you have enough to calculate the two cases, which are

Tr(ρA^)=aaf0f0+abf0f1+abf0f1+bbf1f1=|af0(y)+bf1(y)|2Tr(ρ~A^)=aaf0f0+bbf1f1=|af0(y)|2+|bf1(y)|2.
In fact in general the latter probability matrix, with no off-diagonal terms, behaves like a classical probabilistic mixture of classical bits 0 and 1. That is a very general result from the linearity of trace; in general if ρ=ipiρi then Tr(ρA^)=ipiTr(ρiA^), so the system behaves like a classical-probability-mixture of the different constituent ρi. (Caution: this basis is generally not unique. If you work it out, ρ=12|00|+12|11| is actually the same as ρ=12|++|+12||. I am telling you this because I have heard people who do not know this argue that this explains how Quantum Mechanics "chooses" a basis for its decoherence, hence why the world looks classical rather than quantum at a macro-scale... it doesn't really resolve that problem at all!

So that is how to easily understand entanglement as destroying coherence: the more you're entangled, the more the orthogonality of the other system kills your off-diagonal terms, and the more your substate looks like a classical probability mixture, transferring the cool quantum effects to the system-as-a-whole.

Wow Chris! Thank you so much for taking the time! Clarified many things for me! If I may ask some more questions: 1) Is a substate matrix same as reduced density matrix? Unfortunately I tend to get lost when ideas like "partial trace" or "trace out" are used, what do we mean? 2) I feel like your point about the density matrix of |00+|11 state was very crucial, but I didn't get why it was only obtainable via a substate matrix? 3) so in your last example you showed that using ρ the whole system is coherent, and using ρ~ either subsystem is not coherent anymore
4) I feel like if I understood better what is meant by tracing out ideas, I'd understand why off diagonal terms being 0 would be so important (and imply decoherence) but I still lack that :( 5) What happened here with state 0: |0α0(y)|0+f0(y)|1, did it go through a Hadamard for example? Is there any book that follows arguments similar to yours in such matters? I'd love to read up on all these things, very interesting!
1. Yes, the reduced density matrix is the effective state matrix for the substate. The meaning of "trace out" is given above (equation following "That substate matrix is..."); so the |ab| component of the effective substate matrix is a0|ρ|b0+a1|ρ|b1, summing over all terms which contain the same outside-substate (the part of the system that is not in the substate we're generating a reduced density matrix for), same on both the left and right sides. 3. Not quite; when I am saying ρ there, I mean it as defined in the previous section.
So, it is just the density matrix of the a|0+b|1 state, not the "overall" state. 4. That's not really about "tracing out". You just need those off-diagonal terms to get something that doesn't look like p1ρ1+p2ρ2, which acts thoroughly like a classical probabilistic mixture, as pointed out above. 5. I am not sure what you're asking; I am trying to produce a double-slit interference pattern |f0(y)+f1(y)|2 for the qubit |0+|1, so I am just formalizing what the observable matrix looks like. 6. Nielsen and Chuang is pretty standard.
Thanks a bunch Chris, your help is much appreciated! I need to read up on density matrices to fully grasp all your points, e.g. I don't understand what's the point of using identities such as I=mn|m,nm,n|...when we were already in a basis (i,j). Thanks again

I am posting these notes following a request for further information regarding this question. Should not affect the OP's choice of answer.

Notes added in proof:

On the meaning of quantum coherence:

Quantum coherence is a direct extension of the classical concept of wave coherence. Two classical waves are said to be coherent if they can produce a well-defined interference pattern. In order for this to happen, for instance with electromagnetic waves, the two waves must have the same frequency and a constant phase difference, such that when they add/superpose/overlap the resulting wave pattern remains well-defined. This is how coherent sources were first defined in optics.

In contrast, incoherent optical sources, even if monochromatic, produce an ensemble, or statistical superposition, of light waves with random relative phases (and polarizations, to be precise), which do not/cannot interfere which each other. To get an interference pattern one must first isolate a single coherent component and use it to set up coherent sources, such as the two slits in the famous double-slit example.

When electron interference patterns where first detected, it made sense to interpret them in the same terms as optical interference, and the concept of coherence transferred automatically to superpositions of wave functions and quantum states in general. So did the concept of incoherent statistical ensemble.

So, in general a coherent quantum state means a coherent superposition that can produce interference patterns (there is also a more specific notion of "coherent states", as in those of the harmonic oscillator, please do not confuse the concepts). For this to happen it must be a pure state |ψ. If such a |ψ is expressed as a superposition of two other states, say |ψ|0+|a|eiθ|1, then it implies a well-defined relative phase (or phase difference) between states |0 and |1, even if the superposition amplitude |a|eiθ changes in time. See some good explanations along these lines in answers to this related question.

On the other hand, the concept of incoherent superposition evolved into that of mixed state, described no longer by a state vector |ψ, but by a positive definite state operator ρ. A mixed quantum state ρ is understood in two distinct ways that are equivalent as long as the overall dynamics remains linear (yes, nonlinear dynamics would distinguish between the two):

1) Following the optics analogy: as an incoherent superposition of coherent states, or in quantum theory terms, as a statistical mixture of pure states. That is,

ρ=kpk|ψkψk|

where pk is the probability of pure state |ψk, 0pk1, and the states |ψk need not be mutually orthogonal (in which case they are not the eigenstates of ρ, those are different and always exist!). This sort of statistical mixture is equivalent to a physical ensemble of identical quantum systems (copies), each in some pure state |ψk. In this case pk represents the frequency of copies in the respective |ψk.

2) As the reduced state of a subsystem of a larger quantum system that is overall in a pure state. This definition gives an intrinsic quantum meaning to mixed states, and relies in turn on the concept of entanglement.

Formally, a joint pure state of two systems A and B is entangled if it is not a direct product of "local" pure states, that is, |ψAB|ψA|ψB. Conversely, if A and B are in a joint pure state, then they are disentangled if and only if each of them is in a pure state and |ψAB=|ψA|ψB. The latter is called a separable pure state.

The operational meaning of a separable pure state |ψAB is that measurements of any two "local" observables OA and OB are statistically_uncorrelated_, in the sense that the average of a product OAOB=OAOB equals the product of the averages,

ψAB|OAOB|ψAB=ψAB|OA|ψABψAB|OB|ψAB
or equivalently, that the statistical correlation of OA and OB is null,
ψAB|OAOB|ψABψAB|OA|ψABψAB|OB|ψAB=0

On entanglement and loss of coherence:

From the above it follows immediately that a joint pure state is entangled if and only if it produces non-vanishing correlations for at least one pair of "local" observables. In this case we know with certainty that neither A nor B can be in pure states, since otherwise the state would be separable!

But now we can also see an interesting relation between entanglement and coherence, which answers questions 1 & 2:

An entangled pure state is by all means a coherent state, generally a coherent superposition of separable pure states of two or more subsystems. Yet the individual subsystems can no longer be in coherent, pure states themselves. This is what Chris Drost pointed out when he wrote that entanglement is paradoxically responsible for loss of coherence. Coherence is necessarily lost within individual entangled subsystems because they cannot be in coherent states, but at the same time correlations between subsystems keep the total state coherent.

Things get somewhat more complicated as soon as we acknowledge that entangled states may also be mixed states themselves, but this is the general idea.

In order to give any simple example we need to complete the 2nd definition of a mixed state above and see what becomes of the "local", reduced state of an entangled subsystem. The following derivation hopefully emphasizes the connection to basic probability rules. Let the total entangled state be |ψAB, or equivalently ρAB=|ψABψAB|, and let OA be any arbitrary observable of A, with eigenbasis {|jA}j and corresponding eigenvalues ωj. Also let {|kB}k be an arbitrary orthonormal basis set of B. The average of OA in state |ψAB is

ψAB|OA|ψABψAB|OAIB|ψAB=j,kψAB|jAkBωjjAkB|ψAB==jωjkψAB|jAkBjAkB|ψAB
The meaning of the last expression is quite transparent, since the sum over k gives the total probability pj that subsystem A is in state |jA while B is in any of the basis states |kB. Let us rewrite this probability slightly differently:
pj=kψAB|jAkBjAkB|ψAB=kjAkB|ψABψAB|jAkB==jA|[kkB|ψABψAB|kB]|jA
Notice that this time the expression in the square brackets is independent of the eigenbasis {|jA}j and therefore of the choice of OA. If we denote it as
ρA=kkB|ψABψAB|kB
we obtain that the total probability to have subsystem A in any state |jA is given by
pj=jA|ρA|jA
and that the average of OA amounts to
ψAB|OA|ψAB=jωjjA|ρA|jA=jjA|[j|jAωjjA|]ρA|jA=TrA(OAρA)
It can be easily verified that the entity ρA is in fact a hermitian, positive definite operator on the Hilbert space of A. In addition, since the pj's must sum up to 1, jpj=jjA|ρA|jA=1, we also have that TrAρA=1, a property that is again independent of the basis {|jA}j. In other words, ρA is a density matrix that encapsulates all information about the statistics of subsystem A, regardless of the state of B. It is said that the information on B is averaged out.

Furthermore, we can rewrite ρA as

ρA=kkB|ψABψAB|kB=kkB|[|ψABψAB|]|kB=kkB|ρAB|kB
or
ρA=TrBρAB=TrB(|ψABψAB|)
The latter expression is the one we want to keep, since it can be shown that it is independent of the choice of basis {|kB}k.

The density matrix ρA describes the reduced state of subsystem A. Similarly, the density matrix ρB=TrAρAB=TrA(|ψABψAB|) describes the reduced state of subsystem B. Show as an exercise that the average of any observable OB of B is given by ψAB|OB|ψAB=TrB(OBρB) :)

The above is all that is needed for a basic understanding of various examples of coherence and entanglement. For instance:

  • Any pure state |ψA=α0|0A+α1|1A of system A is a coherent superposition showing interference between pure states |0A and |1A.

  • Same goes for states |ψB=β0|0B+β1|1B of B.

  • States |ψA|ψB, |ψA|0B, etc, are separable pure states such that both A and B are each individually in coherent superpositions of pure states. Interference experiments on A alone will show the same interference patterns as in the absence of B, and vice-versa.

  • Entangled states |ψAB=γ0|0A0B+γ1|1A1B of the joint system A-B are coherent with respect to joint pure (and separable) states |0A0B and |1A1B. That is, a joint interference experiment on A and B produces an interference pattern. But now the "local" state of A alone is described by the reduced density matrix

    ρA=TrB(|ψABψAB|)=TrB[(γ0|0A0B+γ1|1A1B)(γ00A0B|+γ11A1B|)]==|γ0|2|0A0A|+|γ1|2|1A1A|
    and it is an "incoherent" mixed state: it does not produce an interference pattern by itself ("locally"), or when the interference experiment erases all information on B. Notice that ρA is the intrinsic reduced (local) mixed state of A when the total entangled state is |ψAB. No additional measurement needs to be performed on either A or B to bring A in state ρA. Check as an exercise that the same goes for B.

Finally, a very brief answer to question 3: Yes, decoherence understood as loss of coherent superposition involves entanglement and/or a dissipative dynamics in the presence of another system (measurement apparatus, environment, etc). Sometimes though it may mean loss of phase coherence under internal interactions.

I just finished reading it for the 2nd time, and i have to say this is an absolute gem! I understood everything, such a nice feeling!! I finally grasped what is meant by partial trace, and if im not mistaken it s exactly analogous when one "integrates out" degrees of freedom in stat. mech, e.g. when we want the distribution function of a subspace of particles, so we write f(n)=f(N)drNndpNn meaning the probability that the particles of subspace n are in volume drndpn regardless of the state of the N-n ones, right?
Yes, that's it. Actually, talking analogy with probability distribution, you may want to take a look at this, just for fun: en.wikipedia.org/wiki/Wigner_quasiprobability_distribution.
Another question: in the entangled case, when we say "if we run interference experiments on a subspace only" then no interference pattern is observed. But experimentally, how does it go? I only send my system A through the slit? As opposed to sending AB together. (I mean what do we measure to collapse the system A to its mixed state, when AB is initially entangled) I ask because now following your explanations, i understand the theoretical part, namely that ρA is a mixed state thus lacks information on the exact state of A.
Thanks a lot for the link, in return you may be interested in this lecture note by Scott Aaronson, where he shows how all the weirdness in QM can be shown to arise from the idea of allowing for negative probabilities abd then taking the 2-norm instead of the usual probability theory choice of 1-norm. Let me know what you think of it. scottaaronson.com/democritus/lec9.html
On mixed states: yes, there is intrinsic uncertainty in mixed states. It is measured for instance by entropy, Trρlnρ>0, or by (lack of) purity, Trρ2<1. Pure states otoh have zero entropy/unit purity. On interference patterns: well-defined usually refers to the contrast between minima and maxima. The sharper the contrast, the higher the source coherence. A sharp maximum is somewhat ambiguous, but clear zeros I think are still a good sign of coherence.

When I have a particle A in a superposition state ψA=a|0+b|1 and entangle it to another system B, in state ψB, my first particle still remains in a superposition, and its measurement is still random, is it not?

When two particles are entangled then you simply do not have particle A in state A and particle B in state B. If the two particles had their own states then the joint state would be the product of the two states.

Go back and reread the first part where the author talks about what it means to be entangled, when you are not entangled you have the general state as the product of two single particle states. But entangled states don't have that (by definition). If you are rereading it note that a superposition of two eigenstates of a spin 1/2 direction is simply an eigenstate of a differently oriented eigenstate. A superposition of single particle states doesn't have to be any weirder than an eigenstate, so when the author says the single particle superpositions are weird and nonclassical this might not be the case. And the past about expectation values is wrong too, there are no functions of x after you take an expectation value. But the rest. The definition of entanglement seemed fine, though you seem to have not grasped it.

So why do we say that entanglement destroys coherence?

Don't focus on superposition, there is no physical meaning to the result of a superposition, what you get after a superposition could be what someone else starts with to make superpositions so it isn't the key to anything. Its real, but don't for instance think you can loom at something and tell whether it was a superposition. A superposition is like a sum. You might look at at 5 and say that it is 2+3 and so is a sum but someone else can look at 5+7 and say that 5 is a term. Term ... Sum. You can't necessarily tell.

Interference happens when you have two things overlap and not be orthogonal. It is possible for instance to entangle the spin and still get spatial interference as long as the spin dynamics don't couple to the spatial dynamics.

The reason the entanglement can destroy the interference is by making them not overlap. I said you can get interference even if you entangle the spins. A way to lose the interference is if you entangle going left for one particle with going left for the other particle.

You see the wave isn't a wave in space, people just fail to tell you that sometimes. When you have two (or more) particles the wave is in configuration space, which means you assign a complex number to a 6d space where the first three coordinates tell you where the first particle is and the next three tell you where the second is and so on. So knowing all the particles tells you the configuration and knowing the configuration tells you all the particles.

So when you entangle the positions of both particles then the wave is nonzero only for configurations where they are both left or both right. When you try to get an interference you need two waves to evaluated at the same point. In the post you read it was written as x but it should have been a point in 6d space like (x1,y1,z1,x2,y2,z2). So they don't interfere because at every (x1,y1,z1,x2,y2,z2) the one that went left still has the second part like on the left and the one that went right still the second particle on the right so the 6d x where the wave is simply doesn't have the |00 and the |11 overlap anywhere on the screen. In a sense it is just that the waves don't overlap.

It would be great if one could elaborately show this for the simplest entangled pairs!

It is 100% like of the left slit shot the beam upwards and the right beam shot it downwards. To the right and down you see a big spot and the the left and up you see a bug spot and there is no interference because the two paths didn't overlap.

It is lack of overlap that makes coherence irrelevant. And it seems deep only because you didn't get told all the details. Every alleged deep thing in quantum mechanics is just making a big deal about the words instead of looking at the details of the dynamics of the actual experimental setup.

Is the point maybe that if B is measured first, only then A loses its coherence?

The order of measurements on different particles does not change the frequency of the results you get.

does the converse mean that the concept of decoherence is tightly related to entanglement of a small system with its environment?

Yes. What you call measurement is the end result of a process of entangling the subject with the device and then the environment. Entanglement is natural.

Or in other words would decoherence happen at all without entanglement?

There is no "without entanglement" entanglement is a natural thing that happens all the time. There is no known way to not have it I guess if you had no interactions you might be able to avoid it.

Coherence and entanglement are opposite situations. Coherent electrons means they have the same quantum status, so they have same spin, while entangled electrons means they have opposite (antiparallel) spin, and they always act like a couple. Coherent photons means they have the same wavefunction (like all photons of a laser beam), while entangled photons means they have antisymmetric wavefunctions, adding each other like a pair. A laser beam can produce some few entangled photons under special conditions.