Associated Legendre function/Proofs: Difference between revisions
imported>Paul Wormer (→Proof) |
imported>Dan Nessett (→Theorem (orthogonality relation 2): Some more math formula fixes) |
||
(22 intermediate revisions by 2 users not shown) | |||
Line 1: | Line 1: | ||
{{subpages}} | {{subpages}} | ||
It | It is demonstrated that the associated Legendre functions are orthogonal in two different ways and their normalization constant for each is derived. | ||
==Theorem== | ==Theorem (orthogonality relation 1)== | ||
:<math>\int\limits_{-1}^{1}P_{l}^{m} \left( x\right) P_{k}^{m} \left( x\right) | :<math>\int\limits_{-1}^{1}P_{l}^{m} \left( x\right) P_{k}^{m} \left( x\right) | ||
dx =\frac{2}{2l+1} \frac{\left( l+m\right) !}{\left( l-m\right) !} \delta | dx =\frac{2}{2l+1} \frac{\left( l+m\right) !}{\left( l-m\right) !} \delta | ||
Line 15: | Line 15: | ||
The associated Legendre functions are regular solutions to the associated Legendre differential equation given in the main article. The equation is an example of a more general class of equations | The associated Legendre functions are regular solutions to the associated Legendre differential equation given in the main article. The equation is an example of a more general class of equations | ||
known as the [[Sturm-Liouville theory | Sturm-Liouville equation]]s. Using Sturm-Liouville | known as the [[Sturm-Liouville theory | Sturm-Liouville equation]]s. Using Sturm-Liouville | ||
theory, one can show the orthogonality of functions with same superscript ''m'' and different subscripts: | theory, one can show the orthogonality of functions with the same superscript ''m'' and different subscripts: | ||
:<math> | :<math> | ||
K_{kl}^{m} =\int\limits_{-1}^{1}P_{k}^{m} \left( x\right) P_{l}^{m} | K_{kl}^{m} =\int\limits_{-1}^{1}P_{k}^{m} \left( x\right) P_{l}^{m} | ||
\left( x\right) dx = 0 \quad\hbox{if}\quad k \ne l . | \left( x\right) dx = 0 \quad\hbox{if}\quad k \ne l . | ||
</math> | </math> | ||
However, one can find | |||
<math>K_{kl}^{m} </math> | |||
K_{kl}^{m} = | directly from the above definition, whether or not | ||
</math> | <math>k=l</math>. | ||
This involves evaluating the overlap integral directly from the definition of the associated Legendre functions given in the main article. Indeed, inserting the definition of the function twice: | |||
:<math> | :<math> | ||
K_{kl}^{m} =\frac{1}{2^{k+l}\; k! \; l! } | K_{kl}^{m} =\frac{1}{2^{k+l}\; k! \; l! } | ||
Line 67: | Line 67: | ||
So: | So: | ||
:<math> | :<math> | ||
K_{kl}^{m} =\delta_{kl} \; \frac{(-1)^{l+m} }{2^{2l}\, (l!)^{2}} | K_{kl}^{m} =\ (-1)^{l}\ \delta_{kl} \; \frac{(-1)^{l+m} }{2^{2l}\, (l!)^{2}} | ||
\binom{l+m}{2m} | \binom{l+m}{2m} | ||
\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} \frac{d^{2m} | \int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} \frac{d^{2m} | ||
Line 73: | Line 73: | ||
}{dx^{2l} } \left[ \left( 1-x^{2} \right) ^{l} \right] dx, | }{dx^{2l} } \left[ \left( 1-x^{2} \right) ^{l} \right] dx, | ||
</math> | </math> | ||
where δ<sub>''kl''</sub> is the [[Kronecker delta]] that shows the orthogonality of functions with ''l'' ≠ ''k''. | where δ<sub>''kl''</sub> is the [[Kronecker delta]] that shows the orthogonality of functions with ''l'' ≠ ''k''. The factor <math>(-1)^{l}</math> at the front of <math>K_{kl}^{m}</math> comes from switching the sign of x<sup>2</sup>-1 inside (x<sup>2</sup>-1)<sup>l</sup>. | ||
To evaluate the differentiated factors, expand (1−x²)<sup>''k''</sup> | To evaluate the differentiated factors, expand (1−x²)<sup>''k''</sup> | ||
using the binomial theorem: | using the binomial theorem: | ||
Line 88: | Line 88: | ||
Therefore: | Therefore: | ||
:<math> | :<math> | ||
K_{kl}^{m} =\delta _{kl}\;\frac{1}{2^{2l}\; (l!) ^{2} } \frac{(2l)!\,(l+m)!}{(l-m)!} | K_{kl}^{m} =\ (-1)^{l}\delta _{kl}\; \; \frac{1}{2^{2l}\; (l!) ^{2} } \frac{(2l)!\,(l+m)!}{(l-m)!} | ||
\int\limits_{-1}^{1}(x^{2} -1)^{l} dx | \int\limits_{-1}^{1}(x^{2} -1)^{l} dx | ||
\qquad\qquad\qquad\qquad\qquad\qquad (1) | \qquad\qquad\qquad\qquad\qquad\qquad (1) | ||
Line 101: | Line 101: | ||
Thus, | Thus, | ||
:<math>\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx | :<math>\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx | ||
=(-1)^{ | =(-1)^{l+1}\int\limits_{\pi}^{0}\left( \sin \theta \right) ^{2l+1} d\theta, | ||
</math> | </math> | ||
where we recall that | where we recall that | ||
Line 109: | Line 108: | ||
The limits were switched from | The limits were switched from | ||
:<math> \pi \rightarrow 0 \; \quad\hbox{and}\quad 0 \rightarrow \pi </math> , | :<math> \pi \rightarrow 0 \; \quad\hbox{and}\quad 0 \rightarrow \pi </math> , | ||
which accounts for one minus sign | which accounts for one minus sign . | ||
Integration of | Integration of | ||
:<math> | :<math> | ||
Line 132: | Line 131: | ||
and changing the variable back to ''x'' | and changing the variable back to ''x'' | ||
yields: | yields: | ||
:<math>\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx =\frac{ | :<math>\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx =\ -\ \frac{2l}{2l+1} \int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l-1} dx | ||
</math> | </math> | ||
for ''l'' ≥ 1. | for ''l'' ≥ 1. | ||
Using this recursively: | Using this recursively: | ||
:<math>\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx =\frac{2\left( | :<math> | ||
l | \int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx =\ (-1)^{l}\ (\frac{2l }{2l+1} \frac{2\left( l-1\right) }{2l-1} \frac{2\left( l-2\right) | ||
}{2l- | }{2l-3} ...\frac{2}{3} )\ \int\limits_{-1}^{1}\ dx </math> | ||
l!} | |||
l!\right) ^{2} }{\left( 2l+1\right) !} | Noting: | ||
:<math>\int\limits_{-1}^{1}\ dx\ =\ 2</math> | |||
and | |||
:<math> \frac{2l }{2l+1} \frac{2\left( l-1\right) }{2l-1} \frac{2\left( l-2\right) | |||
}{2l-3} ...\frac{2}{3} \ =\ \frac{2^ll! }{(2l+1)(2l-1)(2l-3)...3}\ =\ \frac{2^ll! }{\frac{(2l+1)! }{2^ll!}}\ =\frac{2^{2l} \left( | |||
l!\right) ^{2} }{\left( 2l+1\right) !}</math> | |||
:<math> | |||
\int\limits_{-1}^{1}\left( x^{2} -1\right) ^{l} dx = \ (-1)^{l}\ \frac{2^{2l+1} \left( | |||
l!\right) ^{2} }{\left( 2l+1\right) !}</math> | |||
Applying this result to equation (1): | Applying this result to equation (1): | ||
Line 160: | Line 170: | ||
</math> | </math> | ||
that is, the new functions are normalized to unity. | that is, the new functions are normalized to unity. | ||
===Comments=== | ===Comments=== | ||
The orthogonality of the associated Legendre functions can be demonstrated in different ways. The proof presented above assumes only that the reader is familiar with basic calculus and it is therefore accessible to the widest possible audience. However, as mentioned, their orthogonality also follows from the fact that the associated Legendre equation belongs to a family known as the [[Sturm-Liouville theory|Sturm-Liouville equation]]s. | The orthogonality of the associated Legendre functions can be demonstrated in different ways. The proof presented above assumes only that the reader is familiar with basic calculus and it is therefore accessible to the widest possible audience. However, as mentioned, their orthogonality also follows from the fact that the associated Legendre equation belongs to a family known as the [[Sturm-Liouville theory|Sturm-Liouville equation]]s. | ||
Line 211: | Line 217: | ||
where it was used that ''m'' times differentiation of a polynomial lowers its order by ''m'' and that the order of a product of polynomials is the product of the orders. Since we assumed that ''k'' ≤ ''l'', the integral is non-zero only if ''k'' = ''l''. Hence it follows readily that the associated Legendre polynomials of equal superscripts and non-equal subscripts are orthogonal. | where it was used that ''m'' times differentiation of a polynomial lowers its order by ''m'' and that the order of a product of polynomials is the product of the orders. Since we assumed that ''k'' ≤ ''l'', the integral is non-zero only if ''k'' = ''l''. Hence it follows readily that the associated Legendre polynomials of equal superscripts and non-equal subscripts are orthogonal. | ||
However, the hard work (given above) of computing the normalization for the case ''k'' = ''l'' remains to be done. | However, the hard work (given above) of computing the normalization for the case ''k'' = ''l'' remains to be done. | ||
==Theorem (orthogonality relation 2)== | |||
<math>\int_{-1}^{1}\frac{P_{l}^{m}(x) P_{l}^{n} (x)}{(1-x^{2})} dx = \delta_{mn}\frac{(l+m)!}{m(l-m)!} | |||
, m \neq 0 </math>. | |||
where: | |||
<math>P_{l}^{m}(x) = \frac{(-1)^{m}}{(2^{l} l!)} (1-x^{2})^{m/2} \frac{d^{l+m}}{dx^{l+m}} (x^{2}-1)^{l} | |||
, 0 \leqq m \leqq l</math>. | |||
===Proof=== | |||
Let: | |||
<math>T_{l}^{mn} = \int_{-1}^{1}\frac{P_{l}^{m}(x) P_{l}^{n} (x)}{(1-x^{2})} \ dx | |||
= \frac{(-1)^{m+n}}{(2^{l} l!)^2} \int_{-1}^{1} \{(1-x^{2})^{\frac{(m+n)}{2}-1} | |||
\frac{d^{l+m}}{dx^{l+m}} (x^{2}-1)^{l}\} \{\frac{d^{l+n}}{dx^{l+n}} (x^{2}-1)^{l}\} dx</math>. | |||
The functions (x<sup>2</sup>-1)<sup>k</sup> are even functions; so their | |||
j<sup>th</sup> order derivatives are even or odd functions according as j is even or odd. | |||
Therefore, if m or n is even but the other is odd, then one of the two factors in curly braces in the | |||
preceding expression is an even function, and the other is an odd function. This makes the entire | |||
integrand an odd function. When integrated between the limits that are negatives of each other, | |||
it yields 0, as it should, since <math>\delta_{mn} = 0</math>. So we need to consider further | |||
only even integrands, for which m and n are either both even or both odd. In this case all exponents | |||
in the integrand are non-negative integers (except when m = n = 0). | |||
The integrand can be integrated by parts <math> \ell+n</math> times, the first | |||
factor in curly braces and its derivatives being identified as | |||
u and the second factor as v' in the formula: | |||
<math>\int_{-1}^{1} uv' dx = uv \Big|_{-1}^{1} - \int_{-1}^{1} vu' dx</math>. | |||
Since m and n occur symmetrically, we can assume without loss | |||
of generality that <math>n \leqq m</math>. For the first n-1 integrations | |||
by parts, the uv term vanishes at the limits because u includes | |||
at least one factor of (1-x<sup>2</sup>). The same also is true for the | |||
n<sup>th</sup> integration, unless n = m, in which case the uv term is: | |||
<math>S_{l}^{m} = \frac{(-1)^{m+1}}{(2^{l} l!)^{2}} \frac{d^{m-1}}{dx^{m-1}}\{(1-x^{2})^{m-1} | |||
\frac{d^{l+m}}{dx^{l+m}}(x^{2}-1)^{l}\} \{\frac{d^{l}}{dx^{l}}(x^{2}-1)^{l} \} \Big|_{-1}^{1}</math>. | |||
For the remaining <math>\ell</math> integrations the uv term vanishes at the | |||
limits because v includes at least one factor 1-x<sup>2</sup> (and perhaps u does also). The result is: | |||
<math>T_{l}^{mn} = \delta_{mn}S_{l}^{m} + \frac{(-1)^{l+m}}{(2^{l} l!)^{2}} \int_{-1}^{1} | |||
(x^{2}-1)^{l} \frac{d^{l+n}}{dx^{l+n}}\{(1-x^{2})^{\frac{(m+n)}{2}-1} \frac{d^{l+m}}{dx^{l+m}}(x^{2}-1)^{l}\} | |||
dx</math>. | |||
The highest power of x in the binomial expansion of (x<sup>2</sup>-1)<sup>l</sup> | |||
is <math>2\ell</math>; after <math>\ell+m</math> derivatives of it are taken, the highest power | |||
is <math>\ell-m</math>. The highest power of x in the expansion of <math>(1-x^{2})^{\frac{(m+n)}{2}-1}</math> | |||
is m+n-2; so the highest power in the expression in curly braces | |||
is <math>\ell+n-2</math>. After <math>\ell+n-2</math> derivatives of it are taken, the highest | |||
power is 0; that is, the expression is a constant. Taking the | |||
remaining 2 derivatives causes the integrand to vanish. Therefore: | |||
<math>T_{l}^{mn} = \delta_{mn}S_{l}^{m} | |||
= \delta_{mn}\frac{1}{(2^{l} l!)^{2}} \frac{d^{m-1}}{dx^{m-1}}\{(x^{2}-1)^{m-1} | |||
\frac{d^{l+m}}{dx^{l+m}} (x^{2}-1)^{l}\} \{\frac{d^{l}}{dx^{l}}(x^{2}-1)^{l}\} \Big|_{-1}^{1}</math>. | |||
Since this function is odd, we need evaluate it only at its upper | |||
limit x = 1 and double the result. | |||
An expression of the form <math>(x^{2}-1)^{k} = (x-1)^{k}(x+1)^{k}</math> can be differentiated using Leibnitz's rule. Only one term in the sum is of interest here, namely the one in which (x-1)<sup>k</sup> is differentiated exactly k times so that no factors of (x-1) remain. If it is differentiated fewer times, then it vanishes at x = 1. If it is differentiated more times, then it vanishes everywhere. So, ignoring terms with factors of (x-1), we have (from right to left in the preceding expression): | |||
<math>\frac{d^{l}}{dx^{l}}(x^{2}-1)^{l} = [\frac{d^{l}}{dx^{l}}(x-1)^{l}](x+1)^{l} = l! | |||
(x+1)^{l}</math> , which equals <math> 2^{l} \ell!</math> when x = 1: | |||
<math>\frac{d^{l+m}}{dx^{l+m}}(x^{2}-1)^{l} = \frac{(l+m)!}{l!m!} [\frac{d^{l}}{dx^{l}} (x-1)^{l}] | |||
[\frac{d^{m}}{dx^{m}}(x+1)^{l}]</math> | |||
::<math>= \frac{(l+m)!}{l!m!} [l!] [\frac{l!}{(l-m)!} (x+1)^{l-m}]</math> | |||
::<math>= (l+m)! \frac{l!}{m!(l-m)!} (x+1)^{l-m}</math> | |||
<math>\frac{d^{m-1}}{dx^{m-1}}\{(x^{2}-1)^{m-1} \frac{d^{l+m}}{dx^{l+m}}(x^{2}-1)^{l}\}</math> | |||
::<math>= (l+m)! \frac{l!}{m!(l-m)!} \frac{d^{m-1}}{dx^{m-1}}\{(x-1)^{m-1}(x+1)^{l-1}\}</math> | |||
::<math>= (l+m)! \frac{l!}{m!(l-m)!} (m-1)! (x+1)^{l-1}</math> | |||
::<math>= (l+m)! \frac{l!}{m(l-m)!} (x+1)^{l-1}</math> , which equals <math>2^{l-1}(l+m)! \frac{l!}{m(l-m)!} </math> when x = 1. | |||
<math>T_{l}^{mn} = 2\delta_{mn}\frac{1}{(2^{l} l!)^{2}} \{2^{l-1}(l+m)! \frac{l!}{m(l-m)!}\} | |||
\{2^{l} l!\} = \delta_{mn}\frac{(l+m)!}{m(l-m)!}</math> | |||
which has been doubled to include the lower limit x = -1. QED. | |||
When n = m = 0: | |||
<math>T_{l}^{00} = \int_{-1}^{1} \frac{P_{l}^{0}(x) P_{l}^{0} (x)}{(1-x^{2})} dx</math> | |||
::<math>= \frac{1}{(2^{l} l!)^{2}} \int_{-1}^{1} \{(1-x^{2})^{-1} \frac{d^{l}}{dx^{l}}(x^{2}-1)^{l}\} | |||
\{\frac{d^{l}}{dx^{l}}(x^{2}-1)^{l}\} dx </math> | |||
::<math>= \frac{1}{(2^{l} l!)^{2}} \int_{-1}^{1} \frac{1}{2}[(x+1)^{-1} - (x-1)^{-1}] | |||
\{\frac{d^{l}}{dx^{l}}(x^{2}-1)^{l}\}^{2} dx</math>. | |||
When expanded in powers of (x+1) and (x-1) by using Leibnitz's | |||
rule, the expression in curly braces includes exactly one term, | |||
(x-1)<sup>l</sup>, having no factor (x+1) and exactly one term, (x+1)<math>^{l}</math>, having no factor (x-1). Therefore in a Laurent series expansion of the integrand about either x = 1 or x = -1 all terms except | |||
one have non-negative exponents, and their integral is finite. | |||
The one remaining term in the integrand is a constant multiple | |||
of 1/(x+1) or 1/(x-1); so its integral logarithmically diverges | |||
at the limits, making T<math>_{l}^{00}</math> infinite. |
Latest revision as of 11:52, 22 March 2012
It is demonstrated that the associated Legendre functions are orthogonal in two different ways and their normalization constant for each is derived.
Theorem (orthogonality relation 1)
where:
Proof
The associated Legendre functions are regular solutions to the associated Legendre differential equation given in the main article. The equation is an example of a more general class of equations known as the Sturm-Liouville equations. Using Sturm-Liouville theory, one can show the orthogonality of functions with the same superscript m and different subscripts:
However, one can find directly from the above definition, whether or not . This involves evaluating the overlap integral directly from the definition of the associated Legendre functions given in the main article. Indeed, inserting the definition of the function twice:
Since k and l occur symmetrically, one can without loss of generality assume that l ≥ k. Use the well-known integration-by-parts equation
l + m times, where the curly brackets in the integral indicate the factors, the first being u and the second v’. For each of the first m integrations by parts, u in the term contains the factor (1−x2), so the term vanishes. For each of the remaining l integrations, v in that term contains the factor (x2−1) so the term also vanishes. This means:
Expand the second factor using Leibnitz' rule:
The leftmost derivative in the sum is non-zero only when r ≤ 2m (remembering that m ≤ l). The other derivative is non-zero only when k + l + 2m − r ≤ 2k, that is, when r ≥ 2m + l − k. Because l ≥ k these two conditions imply that the only non-zero term in the sum occurs when r = 2m and l = k. So:
where δkl is the Kronecker delta that shows the orthogonality of functions with l ≠ k. The factor at the front of comes from switching the sign of x2-1 inside (x2-1)l. To evaluate the differentiated factors, expand (1−x²)k using the binomial theorem:
The only term that survives differentiation 2k times is the x2k term, which after differentiation gives
Therefore:
Evaluate
by a change of variable:
Thus,
where we recall that
The limits were switched from
- ,
which accounts for one minus sign . Integration of
gives
Since
it follows that
for n > 1.
Applying this result to
and changing the variable back to x yields:
for l ≥ 1. Using this recursively:
Noting:
and
Applying this result to equation (1):
Clearly, if we define new associated Legendre functions by a constant times the old ones,
then the overlap integral becomes,
that is, the new functions are normalized to unity.
Comments
The orthogonality of the associated Legendre functions can be demonstrated in different ways. The proof presented above assumes only that the reader is familiar with basic calculus and it is therefore accessible to the widest possible audience. However, as mentioned, their orthogonality also follows from the fact that the associated Legendre equation belongs to a family known as the Sturm-Liouville equations.
It is possible to demonstrate their orthogonality using principles associated with operator calculus. Let us write
where
Clearly, the case m = 0 is,
The proof given above starts out by implicitly proving the anti-Hermiticity of ∇. Indeed, noting that w(x) is a function with w(1) = w(−1) = 0 for m ≠ 0, it follows from partial integration that
Hence
To demonstrate orthogonality of the associated Legendre polynomials, we use a result from the theory of orthogonal polynomials. Namely, a Legendre polynomial of order l is orthogonal to any polynomial Πp of order p lower than l. In bra-ket notation
Knowing this,
The bra is a polynomial of order k, because
where it was used that m times differentiation of a polynomial lowers its order by m and that the order of a product of polynomials is the product of the orders. Since we assumed that k ≤ l, the integral is non-zero only if k = l. Hence it follows readily that the associated Legendre polynomials of equal superscripts and non-equal subscripts are orthogonal. However, the hard work (given above) of computing the normalization for the case k = l remains to be done.
Theorem (orthogonality relation 2)
.
where:
.
Proof
Let:
.
The functions (x2-1)k are even functions; so their jth order derivatives are even or odd functions according as j is even or odd. Therefore, if m or n is even but the other is odd, then one of the two factors in curly braces in the preceding expression is an even function, and the other is an odd function. This makes the entire integrand an odd function. When integrated between the limits that are negatives of each other, it yields 0, as it should, since . So we need to consider further only even integrands, for which m and n are either both even or both odd. In this case all exponents in the integrand are non-negative integers (except when m = n = 0).
The integrand can be integrated by parts times, the first factor in curly braces and its derivatives being identified as u and the second factor as v' in the formula:
.
Since m and n occur symmetrically, we can assume without loss of generality that . For the first n-1 integrations by parts, the uv term vanishes at the limits because u includes at least one factor of (1-x2). The same also is true for the nth integration, unless n = m, in which case the uv term is:
.
For the remaining integrations the uv term vanishes at the limits because v includes at least one factor 1-x2 (and perhaps u does also). The result is:
.
The highest power of x in the binomial expansion of (x2-1)l is ; after derivatives of it are taken, the highest power is . The highest power of x in the expansion of is m+n-2; so the highest power in the expression in curly braces is . After derivatives of it are taken, the highest power is 0; that is, the expression is a constant. Taking the remaining 2 derivatives causes the integrand to vanish. Therefore:
.
Since this function is odd, we need evaluate it only at its upper limit x = 1 and double the result.
An expression of the form can be differentiated using Leibnitz's rule. Only one term in the sum is of interest here, namely the one in which (x-1)k is differentiated exactly k times so that no factors of (x-1) remain. If it is differentiated fewer times, then it vanishes at x = 1. If it is differentiated more times, then it vanishes everywhere. So, ignoring terms with factors of (x-1), we have (from right to left in the preceding expression):
, which equals when x = 1:
- , which equals when x = 1.
which has been doubled to include the lower limit x = -1. QED.
When n = m = 0:
- .
When expanded in powers of (x+1) and (x-1) by using Leibnitz's rule, the expression in curly braces includes exactly one term, (x-1)l, having no factor (x+1) and exactly one term, (x+1), having no factor (x-1). Therefore in a Laurent series expansion of the integrand about either x = 1 or x = -1 all terms except one have non-negative exponents, and their integral is finite. The one remaining term in the integrand is a constant multiple of 1/(x+1) or 1/(x-1); so its integral logarithmically diverges at the limits, making T infinite.