3

I'm having some trouble with understanding the derivation of the action of the $X$ operator. It seems to be a result of the notation used and not a property of itself.

The usual argument is to consider eigenfunctions of the $X$-operator: $X|x\rangle = x|x\rangle$ where $X$ is an operator, $|x\rangle$ is an eigenket of $X$ and $x$ is the corresponding eigenvalue. Then \begin{eqnarray*} \color{red}{\langle x'|}X|x\rangle &=& \color{red}{\langle x'|}x|x\rangle \\ \\ \langle x'|X|x\rangle &=& x\langle x'|x\rangle \\ \\ \langle x'|X|x\rangle &=& x\,\delta(x'-x) \end{eqnarray*} where $\delta$ is Dirac's $\delta$-"function". Then we ask what $X$ does to arbitrary kets like $|f\rangle$: \begin{eqnarray*} (Xf)(x) &=& \langle x | X | f\rangle \\ \\ &=&\int_{-\infty}^{+\infty} \langle x|X|x'\rangle\langle x'|f\rangle~\mathrm dx' \\ \\ &=& \int_{-\infty}^{+\infty}f(x')\color{red}{\langle x|X|x'\rangle}~\mathrm dx' \\ \\ &=& \int_{-\infty}^{+\infty} f(x')\,\color{red}{x'\,\delta(x-x')}~\mathrm dx' \end{eqnarray*} The defining property of the $\delta$-"function" is that $\int_{\mathbb R} f(y)\,\delta(x-y)~\mathrm dy=f(x)$, and so $$(Xf)(x) = x\,f(x)$$

However, if I do this with other symbols, then I can't get the same result. Let's say $X|x\rangle = \lambda |x\rangle$. Then following the same steps gives $\langle x'|X|x\rangle = \lambda \langle x'|x\rangle=\lambda \,\delta(x'-x)$, and whence $$(Xf)(x)=\lambda f(x)$$ This is to be expected: $Xf$ is just an eigenvalue multiple of $f$.

It seems that the property that $X : f \mapsto xf$ comes from the fact that we used $x$ to denote the eigenvalue of $X$. What am I missing here?

Perhaps because $x$ is a real number and the set of all kets $|x\rangle$ can be identified with the real line by $|x\rangle \mapsto x$, and that $x$ must be an eigenvalue of $|x\rangle$ under this construction?

DanielSank
  • 24,439
  • 5
    The important point is that you use the same letter for the eigenvalue and the ket, because you label the kets by their eigenvalue – Javier Jun 27 '17 at 17:10
  • @Javier. I understand the convention, but if something is a property of $X$ then it should be independent of labels. There must be something deeper. – Fly by Night Jun 27 '17 at 17:16
  • @FlybyNight The relationship $X|x⟩=x|x⟩$ isn't just a property of $X$: it's a property of $X$ and $|x⟩$, and it is independent of labels - change the labels (consistently!) and you get $X|\lambda⟩=\lambda|\lambda⟩$. The notation in QM is abridged (alternative less-ambiguous notations would be $|x⟩_X$ and $|k⟩_K$ or $|X,x⟩$ and $|K,k⟩$, but we don't use them because they're clunky) so that we're able to write $|x⟩$ and $|k⟩$ and still infer what quantities they're related to, but that notational convenience is the only thing that ties $|x⟩$ to the specific label $x$. – Emilio Pisanty Jun 27 '17 at 23:27

3 Answers3

10

By saying $X|x\rangle = \lambda |x\rangle$ and then integrating over $x$ without allowing for the fact that $\lambda$ depends on $x$, you're essentially saying that the action of $X$ on all $|x\rangle$ states is the same, so that $X$ is a constant: $$ X = X\int|x\rangle\langle x|\mathrm dx = \int X|x\rangle\langle x|\mathrm dx = \int \lambda|x\rangle\langle x|\mathrm dx = \lambda\int|x\rangle\langle x|\mathrm dx =\lambda\mathbb I. $$ Allow for $\lambda$ to depend on $x$ (i.e. allow for $X$ to have different eigenvalues for different eigenvectors), say, via notation like $\lambda_x$ or similar, and you'll recover the initial behaviour in the case $\lambda_x = x$.


That said, from what you've said in comments, the confusion is a bit more fundamental. To be clear: the property $$ (Xf)(x) = xf(x) $$ is the definition of the $X$ operator, at least when working on the explicit instantiation of the abstract Hilbert space in the position-based $L_2(\mathbb R)$ space; as a definition, it hardly needs any justification.

Shankar's treatment arises because he starts using an explicit coordinate-based construction of finite-dimensional Hilbert spaces and then decides to keep on using the same notation without showing how the Hilbert-space basis is built, or the theorems that guarantee its existence; this is fine for a first pass at the subject but there are better ways to do it once you're accustomed to the basics. (And that means that you shouldn't take that section as representative of the rigorous ways to construct these mathematical structures.)

Frankly, I find that part of Shankar's presentation to be rather confused (not necessarily wrong, as such, but I would encourage you to read other textbooks to get a better feeling for those topics). In particular, Shankar introduces the notation of the $X$ operator as the operator "responsible for the $|x⟩$ basis" - but this is not really necessary. You already have that basis, you don't really need an operator to generate it. (It exists, of course, but you don't need it, in the way Shankar implies.)

Either way, at some point you need to commit to one or another definition of $X$: the simplest way is to define it via $(Xf)(x) = xf(x)$, but you can also define it, as Shankar does, as the unique operator whose action on the basis states is $$ X|x⟩=x|x⟩, $$ i.e. to multiply each state by its label (and one can then show, as Shankar does, that the two definitions are equivalent). Now why would you define something like that? because you can, and because it works. Is this purely a trick of the notation? No, there's clear physical content to the assertion that $X|x⟩=x|x⟩$, and if you want to just "change the labels" then you need to do so consistently: you'd need to say something like $X|\lambda⟩=\lambda|\lambda⟩$, while making clear that $|\lambda⟩$ is the same position basis state as $|x⟩$ but with altered notation.

And if this was not enough to dig you out, then I would seriously recommend taking up a few more books for alternative perspectives on the topic: there's better ways to do it (or at least, ways that are better in this respect) but there's no space in a Q&A format to develop them fully. Read around. Seriously.

Emilio Pisanty
  • 132,859
  • 33
  • 351
  • 666
  • So let's say that $X|x\rangle = \lambda(x)|x\rangle$. Then what happens? – Fly by Night Jun 27 '17 at 17:19
  • @FlybyNight As indicated, you recover the initial behaviour. If you want the details, repeat the same workings with the new notation. – Emilio Pisanty Jun 27 '17 at 17:19
  • So I get $\langle x'|X|x\rangle =\lambda(x)\delta(x-x')$ and hence $(Xf)(x)=\lambda(x)f(x)$. That means that $X : f\mapsto \lambda f$ and not $xf$. What am I missing? Must $\lambda(x) \equiv x$. Otherwise $f \mapsto xf$ is just because we chose $x$ as the symbol for the eigenvalues – Fly by Night Jun 27 '17 at 17:24
  • @FlybyNight I honestly don't know what else you were expecting. If you change the eigenvalues, you change the operator. If you don't change the eigenvalues (so $\lambda(x) = x$) then the operator remains the same. – Emilio Pisanty Jun 27 '17 at 17:26
  • You seem to know something implicitly that I don't. There's something "obvious" to you that I can't see. I was hoping for some explanation. The problem seems to be around the eigenvalues. Can you explain what the $X$-operator is, and why it must satisfy $X : f \mapsto xf$. The proof that I have (Shanker, p68) does not define the $X$-operator and uses the argument I have given above to show that $X : f \mapsto xf$ but, mathematically, this is just because of the labels chosen. There is something deeper - perhaps a definition - missing. – Fly by Night Jun 27 '17 at 17:34
  • The $X$ operator satisfies $X:f\mapsto xf$ (or, more precisely, it takes $f$ to $g$ where $g(x)=xf(x)$) because that's its definition. I don't have Shankar's book available but I suspect you just glossed over the definition when it appeared. – Emilio Pisanty Jun 27 '17 at 18:40
  • Shankar's book assumes a lot of stuff without introduction, e.g. it says "recall from Fourier analysis" and then goes on to use the Fourier transform. The only index reference to the $X$-operator is page 68, and the only reference to the matrix of the $X$-operator is page 68. – Fly by Night Jun 27 '17 at 20:46
  • 1
    @FlybyNight I think Emilio's point is this: we can define a class of operators by the equation $\langle x' | \hat \lambda | x \rangle = \lambda (x) \delta(x-x')$, and we define $\hat{X}$ as the special case where $\lambda(x) = x$, i.e. $\langle x' | \hat{X} | x \rangle = x \delta(x-x')$. – DanielSank Jun 27 '17 at 21:35
  • @FlybyNight See edited answer for a slightly rambly update. If that's not enough to dig you out, I'm probably out of ideas, and you need to pick up other textbooks for alternate viewpoints. – Emilio Pisanty Jun 27 '17 at 23:20
  • 1
    I think one of the confusing here is that in a way, the statement $X|x\rangle=x|x\rangle$ defines both the operator $X$ and the ket $|x\rangle$. Without some sort of Hermitian operator to act on it and give observable values, a Hilbert space doesn't have any physical meaning. – Carmeister Jun 28 '17 at 03:05
  • @EmilioPisanty Thank you very much for your additional comments, and in general for the time you've invested trying to help. One final question: could you suggest any other good texts? – Fly by Night Jun 28 '17 at 15:29
  • @FlybyNight That depends on where you are and what you want with the text, but there is already a good recommendations thread for that purpose. It's been a while since I've looked at the intro section of a QM book but I'd encourage you to have a look at Cohen-Tannoudji, Sakurai, Messiah, Gasiorowicz or Griffiths, and see whether they fit your needs. – Emilio Pisanty Jun 28 '17 at 15:35
3

I think your difficulty lies in figuring out what aspects of the QM formalism are determined by nature and what are human-constructed. In this regard, it doesn't help that there are two equivalent but incompatible ways to make that judgement.

One way is to begin with state vectors. In this view, $|x\rangle$ is defined to be the state corresponding to position $x$, so that it is fixed by nature. Once you have done this, $X$ is constructed. This is the point of view adopted by @Emilio Pisanty in his answer.

The alternative, more advanced point of view is that $X$ is an aspect of nature, and $|x\rangle$ is human-constructed. You seem to intuitively prefer his description. From this perspective, we define $|x\rangle$ to mean "the eigenvector of $X$ with eigenvalue $x$." So, yes, by definition, $\lambda=x$. (This, of course, assumes that $X$ does not have multiple eigenvectors with eigenvalue $x$. But that seems to be the case.)

Urb
  • 2,608
  • 1
    This question has nothing to do with quantum mechanics, as far as I can tell. I think OP is confused because of a small misunderstanding in the mathematics. – DanielSank Jun 27 '17 at 21:52
  • @DanielSank If I asked the question over at the Math site, they would have sent me here. I needed to add a tag, the book I'm reading is called "Principals of Quantum Mechanics", so I put QM. Apologies for any confusion. It was unintentional. – Fly by Night Jun 28 '17 at 15:32
1

This may be a possible explanation.

The equation $X|x\rangle = x|x\rangle$ means that if we measure position of state $|x\rangle$ we will get the numerical value x in our measuring device. Now if you write $X|x\rangle = \lambda(x)|x\rangle$, the equation is valid as long as $\lambda(x)$ uniquely characterize each state. But the state $|x\rangle$ in second equation is not the same as the first one. Because, on measuring this $|x\rangle$ you will get numerical value $\lambda(x)$, not x. To avoid confusion, it will be useful to write the second equation using a different notation. $$X|x\rangle_\lambda = \lambda(x)|x\rangle_\lambda$$ Note that $|x\rangle \neq |x\rangle_\lambda$. So there is no reason that $\langle x|X|f\rangle$ should be equal to $_\lambda\langle x|X|f\rangle$.