Complex number/Citable Version: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Michael Hardy
(Somone wrote "Of course, since the square root of any real number is positive...". That is nonsense. I changed it to say the SQUARE (NOT the square ROOT) of any real number is non-negative.)
imported>Michael Hardy
(Lots of formatting, punctuation, and style cleanups. "Displayed" TeX should be indented via a colon and should not be exempt from final punctuation. Too many capital letters.)
Line 1: Line 1:
The '''complex numbers''' <math>\mathbb{C}</math> are numbers of the form ''a''&nbsp;+&nbsp;''bi'',
The '''complex numbers''' <math>\mathbb{C}</math> are numbers of the form ''a+bi'',
obtained by adjoining the [[imaginary unit]] ''i'' to the [[real number]]s (here ''a'' and ''b'' are reals).<ref>This article follows the usual convention in [[mathematics]]  and [[physics]] of using <math>i</math> as the imaginary unit. Complex numbers are frequently used in [[electrical engineering]], but in that discipline it is usual to use <math>j</math> instead, reserving <math>i</math> for [[electrical current]]. This usage is found in some [[programming language]]s too, notably [[Python]].</ref> The number ''i'' can be thought of as a solution of the equation <math>x^2+1=0</math>. In other words, its basic property is <math>i^2=-1</math>. Of course, since the square of any real number is non-negative, ''i'' cannot be a real number. ''A priori'', it is not even clear whether such an object exists and that it can be called a number, i.e. whether we can associate with it some natural operations as addition or multiplication. Assuming, for a moment, that the answer is "yes", we may rewrite the first sentence above in symbols as
obtained by adjoining the [[imaginary unit]] ''i'' to the [[real number]]s (here ''a'' and ''b'' are reals). The number ''i'' can be thought of as a solution of the equation <math>x^2+1=0</math>. In other words, its basic property is <math>i^2=-1</math>. Of course, since the square root of any real number is positive, <math>i\notin \mathbb{R}</math>. ''A priori'', it is not even clear whether such an object exists and that it can be called "a number", i.e. whether we can associate with it some natural operations as addition or multiplication. Assuming, for a moment, that the answer is "yes", we may write


:<math>\mathbb{C} = \{ a + bi | a, b \in \mathbb{R} \}.</math>
:<math>\mathbb{C} = \{ a + bi | a, b \in \mathbb{R} \}</math>


==Historical example ==
Of course, we do not [[formal|formally]] define complex numbers this way but, rather, define them as [[ordered pair]]s of real numbers. The above notation is, however, traditional.


The need for complex numbers may have appeared for the first time during the sixteenth century, when Italian mathematicians like [[Scipione del Ferro]], [[Niccolò Fontana Tartaglia]], [[Gerolamo Cardano]] and [[Rafael Bombelli]] tried to solve [[cubic equation]]s. This is so even for equations with three [[real number|real]] solutions, as the method they used sometimes requires calculations with numbers which squares are negative. Here is such an example (with modern notation). Let us consider the equation
:'''Aside on notation:''' There is a well established tradition in mathematics of adopting notation that is suggestive, even if it is, in some ways, unnatural or awkward. For example, if complex numbers are ordered pairs of real numbers, why not represent them as pairs, i.e., use <math>(a,b)</math> rather than <math>a + bi</math>? Thee are several ways of answering this question. One is that our notation tends to guide our thinking, and writing <math>x = x +0i</math> emphasizes the idea that the real number ''x'' is a complex number, whereas writing <math>(x, 0)</math> for the same number suggests that, as a complex number, ''x'' is something fundamentally different (perhaps it is). A second, and rather different, reason for using the notation <math>a + bi</math> is that it suggests a parallel with another part of mathematics. In elementary number theory, we learn to perform arithmetic modulo a number base. for example, we may write


: <math>x^3=15x+4. \ </math>
::<math>4 + 5 \equiv 2 \pmod 7</math>


[[Cardano's method]] for solving it suggests looking for a solution by writing it as a sum <math>x=u+v</math>, where some other condition on <math>u</math> and <math>v</math> will be decided later. Reporting this in the equation, we get, once the left member is expanded,
:to indicate that when we add 4 and 5 and then divide the result by 7, the remainder is 2. We can do something similar with [[polynomial]]s in a single variable ''x''. We know that <math>(x + 1)(x +2) = x^2 + 3x + 2</math>, but <math>x^2 + 3x + 2 = 1\cdot(x^2 + 1) + (3x + 1)</math>, so when we divide by <math>x^2 + 1</math>, the remainder is <math>3x + 1</math>. And by the same token,


: <math>u^3+3u^2v+3uv^2+v^3=15(u+v)+4, \ </math>
::<math>(1 + i)(2 + i) = 2 + 3i + i^2 = 1 + 3i \ </math>


which can be written as,
:so, when we add or multiply complex numbers, we are just doing modular arithmetic! Of course, there are also times when we wish to focus on the geometric or analytic aspects of complex numbers rather than the algebraic ones, but there is a tendency to want to retain the same notation where possible, and there is no question but that mathematical notation also tends to be dictated by tradition and historical accident.


: <math>u^3+(3uv-15)(u+v)+v^3=4. \ </math>
==Philosophical matters==


Now we choose the second condition on <math>u</math> and <math>v</math>, that is <math>3uv-15=0</math>, or <math>uv=5</math>. This implies that <math>u^3</math> and <math>v^3</math> are numbers which sum and product are given by
==Working with complex numbers==


: <math>\begin{cases}
===Basic operations===
u^3v^3=125, \\
u^3+v^3=4.
\end{cases}</math>


Now, it is a well-known fact that if a [[second degree]] [[polynomial]] <math>x^2-sx+p</math> has two roots, their sum is <math>s</math> and their product is <math>p.</math><ref>To verify this, one just has to write <math>r_1</math> and <math>r_2</math> for the roots, to expand <math>\left(x-r_1\right)\left(x-r_2\right)</math> and to identify the coefficients.</ref>Hence we may find some values for <math>u^3</math> and <math>v^3</math> by solving the [[quadratic equation]]
We define addition and multiplication in the obvious way, using <math>i^2 = -1</math> to rewrite results in the form <math>a + bi</math>:


: <math>x^2-4x+125=0. \ </math>
: <math>(a + bi) + (c + di) = (a + c) + (b + d)i \ </math>


Its [[discriminant]] is <math>\Delta=(-4)^2-4\cdot 125=-484=-22^2</math>, which is ''negative'', so that the quadratic equation has ''no real solution'': the usual formulae giving the solutions require to take the [[square root]] of the discriminant, which is undefined here.
: <math>(a + bi)(c + di) = (ac - bd) + (bc + ad)i \ </math>


Well, let us be bold and write <math>\Delta=\left(22\sqrt{-1}\right)^2</math>. Here, the symbol <math>\sqrt{-1}</math> denotes an hypothetical number which square would be <math>-1.</math><ref> Please note that this notation is purely formal and usual properties of the arithmetic (real) square root do not apply. Consider e.g. the following computation <math>-1=\sqrt{-1}\times\sqrt{-1}=</math>
To handle division, we simply note that <math>(c + di)(c - di) = c^2 +d^2</math>, so


: <math>=\sqrt{(-1)\times(-1)}=\sqrt{1}=1</math>
: <math>\frac{1}{c + di} = \frac{c - di}{c^2 + d^2}</math>


and the contradiction follows. The point is that the second equality
and, in particular,


can not be applied. The meaning of the symbol can be understood and the admissible operations can be specified by giving a precise definition in terms of more elementary mathematical objects (see the formal description).</ref><ref>Observe also that the symbol <math>\sqrt{a}</math> (or <math>\sqrt[n]{a}</math>) with <math>a\in\mathbb{C}</math> is sometimes used to denote the set of ''complex roots'' of ''a'', i.e. the set of the solutions of the equation <math>x^2=a</math> (<math>x^n=a</math> respectively). The set contains 2 (''n'', respectively) "equally important" elements and there is no canonical way to distinguish a "representative". Consequently, no computations are performed using this symbol.</ref> At this stage, such a number has no meaning (square of real numbers are always nonnegative), but we use it in a purely formal way. Using this symbol, we can write the "solutions" to the quadratic equation as
: <math>\frac{a + bi}{c + di} = \frac{(ac + bd) + (bc - ad)i}{c^2 + d^2}.</math>


: <math>u^3=\frac{4+22\sqrt{-1}}{2}=2+11\sqrt{-1}</math>
It turns out that with addition and multiplication defined this way, <math>\mathbb{C}</math> satisfies the [[axiom]]s for a [[field]], and is called the field of complex numbers. If <math>c = a + bi</math> is a complex number, we call <math>a</math> the real part of <math>c</math> and write <math>a = Re (c)</math>. Similarly, <math>b</math> is called the imaginary part of <math>c</math> and we write <math>b = Im (c)</math>. If the imaginary part of a complex number is <math>0</math>, the number is said to be real, and we write <math>a</math> instead of <math>a + 0i</math>. We thus identify <math>\mathbb{R}</math> with a subset (and, in fact, a subfield) of <math>\mathbb{C}</math>.


and
Going a bit further, we can introduce the important operation of complex conjugation. Given an arbitrary complex number <math>z = x + iy</math>, we define its complex conjugate to be <math>\bar{z} = x - iy</math>. Using the identity <math>(a + b)(a - b) = a^2 - b^2</math> we derive the important formula


: <math>v^3=\frac{4-22\sqrt{-1}}{2}=2-11\sqrt{-1}.</math>
:<math>z \bar{z} = x^2 + y^2</math>


It remains to find cube roots of these "numbers". A straightforward calculation shows that <math>u=2+\sqrt{-1}</math> and <math>v=2-\sqrt{-1}</math> do the job. For instance, remembering the rule <math>\left(\sqrt{-1}\right)^2=-1</math>, we have
and we define the modulus of a complex number ''z'' to be


: <math>\left(2+\sqrt{-1}\right)^3=2^3+3\cdot 2^2\sqrt{-1}+3\cdot 2\left(\sqrt{-1}\right)^2+\left(\sqrt{-1}\right)^3;</math>
:<math>|z| = \sqrt{z \bar{z}}</math>


: <math>\left(2+\sqrt{-1}\right)^3=8+12\sqrt{-1}-6-\sqrt{-1}=2+11\sqrt{-1}.</math>
Note that the modulus of a complex number is always a ''real'' number.


But now, going back to the original cubic equation, we get the ''real'' solution <math>x=u+v=2+\sqrt{-1}+2-\sqrt{-1}=4</math>! One can verify it is indeed a solution, as <math>4^3=64=15\cdot 4+4</math> (and once this solution is found, it is easy to find the two other solutions, which are also real).
The modulus (also called absolute value) satisfies three important properties that are completely analogous to the properties of the absolute value of real numbers
 
The fact that the formal calculations managed to give a real solution suggests that the "number" <math>\sqrt{-1}</math> may have some sense. But to really give it a legitimate status, one has to construct a new set of numbers, containing the real numbers, but also other numbers whose squares may be negative real numbers. This will be the set of ''complex numbers''. A rigorous construction of this set was given much later by [[Carl Friedrich Gauss]] in 1831.
 
==Formal definition==
Formally, complex numbers are [[ordered pair]]s of real numbers<ref>We follow a popular approach. From another and perhaps more abstract point of view, complex numbers are defined in terms of polynomials, as the quotient <math>\mathbb{C}=\mathbb{R}[X]/\left(X^2+1\right).</math> Cf. also the section "Working with complex numbers" where an example of a quotient of polynomials is discussed.</ref>, i.e.
:<math>\mathbb{C}= \{ (a,b)~|~~ a,b\in \mathbb{R} \}.</math>
 
To call them 'numbers' we need to introduce some operations on such pairs. So we define
*addition (''a'', ''b'') + (''c'', ''d'') = (''a'' + ''c'', ''b'' + ''d'')
*multiplication (''a'', ''b'')(''c'', ''d'') = (''ac'' - ''bd'', ''bc'' + ''ad'')
While this definition can look arbitrary and artificial at the first sight, it turns out to be very natural one. In particular, the basic properties of the usual operations are preserved and we can employ many formulas from the elementary algebra we are accustomed to. More specifically, the sum (or the product) of two numbers does not depend on the order of terms;<ref>that is, the addition (multiplication) is [[commutativity|commutative]]</ref> the sum (product) of three or more elements does not depend on order of operations ('we can suppress the parentheses');<ref>This is called [[associativity]]</ref> the product of a complex number with a sum of two other numbers expands in the usual way.<ref>In other words, multiplication is [[distributivity|distributive]] over addition</ref> In mathematical language this means that with addition and multiplication defined this way, <math>\mathbb{C}</math> satisfies the [[axiom]]s for a [[field]], and is called the field of complex numbers.
 
Now we are ready to understand the 'real' meaning of <math>\sqrt{-1}</math> and its usage in the above historical example. Observe that the pairs of type (''a'',0) are identical<ref>i.e. [[isomorphism|isomorphic]], which basically means that the mapping <math>\mathbb{C}\ni (a,0)\mapsto a\in\mathbb{R},</math> which preserves the addition and multiplication.</ref> to the set of reals, so we write (''a'', 0)=''a''. Observe also that by definition (0,1)(0,1) = (-1,0)=-1. It follows that <math>\sqrt{-1}</math>, the hypothetical number whose square root gives -1, is well defined as (0,1). It is so characteristic that we usually denote it by ''i'' and call it the imaginary unit. Now, the historical example computation is fully justified: since the basic operations on complex numbers behave like the usual addition and multiplication, we can find the roots of a second degree polynomial with the well-known formulas.
 
==Beyond the notation==
Next step is to observe that any complex number (''a'',''b'') can be expressed as ''a''+''bi'' and, conversely, any sum of this type represents a complex number. The notation using the imaginary unit ''i'' is called the ''algebraic form'' or ''rectangular form'' or yet ''Cartesian form'' of a complex number. It is both traditional and commonly used to perform computations. The importance of such formally trivial rewriting is difficult to overestimate and we discuss it in more details.
 
There is a well established tradition in mathematics of adopting notation that is suggestive, even if it is, in some ways, unnatural or awkward. For example, if complex numbers are ordered pairs of real numbers, why not represent them simply as pairs, i.e., use <math>(a,b)</math> rather than <math>a + bi</math>? There are several ways of answering this question. One is that our notation tends to guide our thinking, and writing <math>x = x +0i</math> emphasizes the idea that the real number ''x'' is a complex number, whereas writing <math>(x, 0)</math> for the same number suggests that, as a complex number, ''x'' is something fundamentally different (perhaps it is). A second, and rather different, reason for using the notation <math>a + bi</math> is that it suggests a parallel with another part of mathematics. In elementary number theory, we learn to perform arithmetic modulo a number base. For example, we may write
 
:<math>4 + 5 \equiv 2 \pmod 7</math>


to indicate that when we add 4 and 5 and then divide the result by 7, the remainder is 2. We can do something similar with [[polynomial]]s in a single variable ''x''. We know that <math>(x + 1)(x +2) = x^2 + 3x + 2</math>, but <math>x^2 + 3x + 2 = 1\cdot(x^2 + 1) + (3x + 1)</math>, so when we divide by <math>x^2 + 1</math>, the remainder is <math>3x + 1</math>. And by the same token,
#<math>|z| \ge 0</math> and <math>|z| = 0</math> if and only if <math>z = 0</math>
#<math>|z_1 z_2| = |z_1| |z_2| \ </math>
#<math>|z_1 + z_2 | \le |z_1| + |z_2|</math>


:<math>(1 + i)(2 + i) = 2 + 3i + i^2 = 1 + 3i \ </math>
so, when we add or multiply complex numbers, we are just doing modular arithmetic!<ref>This example gives also a hint why the complex numbers are sometimes defined in terms of polynomials, as <math>\mathbb{C}=\mathbb{R}[X]/\left(X^2+1\right).</math> </ref>  Of course, there are also times when we wish to focus on the geometric or analytic aspects of complex numbers rather than the algebraic ones, but there is a tendency to want to retain the same notation where possible, and there is no question but that mathematical notation also tends to be dictated by tradition and historical accident.
Another reason, this time of practical nature, is elaborated in the next section.
==Working with complex numbers==
It is not very compelling to compute using the ordered pair definition, especially when it concerns the product of two complex numbers. It turns out that  the imaginary unit comes in handy with its property <math>i^2=-1</math>. Algebraic operations become as natural as for the reals.
===Basic operations===
Addition is straightforward, <math>(a + bi) + (c + di) = (a + c) + (b + d)i \ </math>. More notably, we can rewrite multiplication using <math>i^2 = -1</math> to obtain results in the form <math>a + bi</math>:
:<math>(a + bi)(c + di) = ac + adi + bci + bdi^2 = (ac - bd) + (bc + ad)i. \ </math>
To handle division, we simply note that <math>(c + di)(c - di) = c^2 +d^2</math>, so
:<math>\frac{1}{c + di} = \frac{c - di}{c^2 + d^2}, </math>
from which it follows that
:<math>\frac{a + bi}{c + di} = \frac{(ac + bd) + (bc - ad)i}{c^2 + d^2}.</math>
If <math>z = a + bi</math> is a complex number, we call <math>a</math> the real part of <math>z</math> and write <math>a = Re (z)</math>. Similarly, <math>b</math> is called the imaginary part of <math>z</math> and we write <math>b = Im (z)</math>. If the imaginary part of a complex number is <math>0</math>, the number is said to be real. As mentioned earlier, we write <math>a</math> instead of <math>a + 0i</math> and thus we identify <math>\mathbb{R}</math> with a subset of <math>\mathbb{C}</math>.
Going a bit further, we can introduce the important operation of complex conjugation. Given an arbitrary complex number <math>z = x + iy</math>, we define its complex conjugate to be <math>\bar{z} = x - iy</math>. Using the identity <math>(a + b)(a - b) = a^2 - b^2</math> we derive the important formula
:<math>z \bar{z} = x^2 + y^2</math>
and we define the modulus of a complex number z to be
:<math>|z| = \sqrt{z \bar{z}}</math>
Note that the modulus of a complex number is always a ''nonnegative real'' number.
The modulus (also called absolute value) satisfies three important properties that are completely analogous to the properties of the absolute value of real numbers
*<math>|z| \ge 0</math> and <math>|z| = 0</math> if and only if <math>z = 0</math>
*<math>|z_1 z_2| = |z_1| |z_2| \ </math>
*<math>|z_1 + z_2 | \le |z_1| + |z_2|</math>
The last inequality is known as the [[triangle inequality]].
The last inequality is known as the [[triangle inequality]].


===The complex exponential===
===The complex exponential===
Recall that in real analysis, the ordinary [[exponential]] function may be defined as
Recall that in real analysis, the ordinary [[exponential]] function may be defined as


:<math>\exp x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots</math>
: <math>\exp x = 1 + x + \frac{x^2}{2!} + \frac{x^3}{3!} + \cdots.</math>


The same series may be used to define the ''complex'' exponential function
The same series may be used to define the ''complex'' exponential function


:<math>\exp z = 1 + z + \frac{z^2}{2!} + \frac{z^3}{3!} + \cdots</math>
: <math>\exp z = 1 + z + \frac{z^2}{2!} + \frac{z^3}{3!} + \cdots</math>


(where, of course, convergence is defined in terms of the complex modulus, instead of the real absolute value).  
(where, of course, convergence is defined in terms of the complex modulus, instead of the real absolute value).  
Line 120: Line 70:
:'''Notation''': The expressions <math>\exp \ z</math> and <math>e^z \ </math> mean the same thing, and may be used interchangeably.
:'''Notation''': The expressions <math>\exp \ z</math> and <math>e^z \ </math> mean the same thing, and may be used interchangeably.


The complex exponential has the same multiplicative property that  holds for real numbers, namely
The complex expomential has the same multiplicative property that  holds for real numbers,namely


:<math>e^{z_1 z_2} = e^{z_1} e^{z_2}. \ </math>
: <math>e^{z_1 z_2} = e^{z_1} e^{z_2} \ </math>


The complex exponential function has the important property that
The complex exponential function has the important property that


:<math>e^{i\theta} = \cos \theta + i \sin \theta \ </math>
: <math>e^{i\theta} = \cos \theta + i \sin \theta \ </math>


as may be seen  immediately by  substituting <math>z = i\theta</math> and comparing terms with the usual power series expansions of <math>\sin \theta</math> and <math>\cos \theta</math>.
as may be seen  immediately by  substituting <math>z = i\theta</math> and comparing terms with the usual power series expansions of <math>\sin \theta</math> and <math>\cos \theta</math>.
Line 132: Line 82:
The familiar [[trigonometry|trigonometric]] identity
The familiar [[trigonometry|trigonometric]] identity


:<math>\sin^2 \theta + \cos^2 \theta = 1 \ </math>
: <math>\sin^2 \theta + \cos^2 \theta = 1 \ </math>


immediately implies the important formula
immediately implies the important formula


:<math>|e^{i\theta}| = 1</math>, for any <math>\theta \in \mathbb{R}.</math>
: <math>|e^{i\theta}| = 1</math>, for any <math>\theta \in \mathbb{R}</math>


Of course, there is no reason to assume this identity. We only need note that <math>\overline{e^{i\theta}} = e^{-i\theta}</math>, so
Of course, there is no reason to assume this identity. We only need note that <math>\bar{e^{i\theta}} = e^{-i\theta}</math>


:<math>|e^{i\theta}|^2 = e^{i\theta}e^{-i\theta} = e^0 = 1. \ </math>
so,
 
: <math>|e^{i\theta}|^2 = e^{i\theta}e^{-i\theta} = e^0 = 1. \ </math>


==Geometric interpretation==
==Geometric interpretation==
Line 146: Line 98:
Since a complex number <math>z = x + iy</math> corresponds (essentially by definition) to an ordered pair of real numbers <math>(x, y)</math>, it can be interpreted as a point in the plane (i.e., <math>\mathbb{R}^2)</math>. When complex numbers are represented as points in the plane, the resulting diagrams are known as [[Robert Argand|Argand]] diagrams, after [[Robert Argand]]. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers, but also as a tool in applying complex numbers to [[geometry|geometrical]] and [[physics|physical]] problems.
Since a complex number <math>z = x + iy</math> corresponds (essentially by definition) to an ordered pair of real numbers <math>(x, y)</math>, it can be interpreted as a point in the plane (i.e., <math>\mathbb{R}^2)</math>. When complex numbers are represented as points in the plane, the resulting diagrams are known as [[Robert Argand|Argand]] diagrams, after [[Robert Argand]]. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers, but also as a tool in applying complex numbers to [[geometry|geometrical]] and [[physics|physical]] problems.


There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of [[vector]]s in <math>\mathbb{R}^2</math>. Similarly, if <math>\alpha \in \mathbb{R}</math> is real, multiplication by <math>\alpha</math> is just scalar multiplication. In <math>\mathbb{C}</math> we have
There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of [[vector]]s in <math>\mathbb{R}^2</math>. Similarly, if <math>\alpha \in \mathbb{R}</math> is real, multiplication by </math>\alpha</math> is just scalar multiplication. In <math>\mathbb{C}</math> we have


: <math>z_1 + z_2 = (x_1 + iy_1) + (x_2 + iy_2) = (x_1 + x_2) + i(y_1 + y_2) \ </math>


:<math>z_1 + z_2 = (x_1 + iy_1) + (x_2 + iy_2) = (x_1 + x_2) + i(y_1 + y_2) \ </math>
and
and
:<math>\alpha z = \alpha(x + iy) = \alpha x + i\alpha y \ </math>
 
: <math>\alpha z = \alpha(x + iy) = \alpha x + i\alpha y. \ </math>


To put it succintly, <math>\mathbb{C}</math> is a 2-dimensional [[real number|real]] [[vector space]] with respect to the usual operations of addition of complex numbers and multiplication by a real number. There doesn't seem to be much more to say. But there ''is'' more to say, and that is that the multiplication of ''complex'' numbers has geometric significance. This is most easily seen if we take advantage of the complex exponential, and write complex numbers in [[polar coordinates|polar]] form
To put it succintly, <math>\mathbb{C}</math> is a 2-dimensional [[real number|real]] [[vector space]] with respect to the usual operations of addition of complex numbers and multiplication by a real number. There doesn't seem to be much more to say. But there ''is'' more to say, and that is that the multiplication of ''complex'' numbers has geometric significance. This is most easily seen if we take advantage of the complex exponential, and write complex numbers in [[polar coordinates|polar]] form


:<math>z = r e^{i\theta}</math>
: <math>z = r e^{i\theta}.</math>


Here, r is simply the modulus <math>\sqrt{x^2 + y^2}</math> or vector length. The number <math>\theta</math> is just the angle formed with the ''x''-axis, and is called the ''argument''. Now, when complex numbers are written in polar form, multiplication is very interesting
Here, r is simply the modulus <math>\sqrt{x^2 + y^2}</math> or vector length. The number <math>\theta</math> is just the angle formed with the ''x''-axis, and is called the ''argument''. Now, when complex numbers are written in polar form, multiplication is very interesting


:<math>z_1 z_2 = (r_1 e^{i\theta_1}) (r_2 e^{i\theta_2}) = r_1 r_2 e^{i(\theta_1 + \theta_2)}</math>
: <math>z_1 z_2 = (r_1 e^{i\theta_1}) (r_2 e^{i\theta_2}) = r_1 r_2 e^{i(\theta_1 + \theta_2)}.</math>


In other words, multiplication by a complex number ''z'' has the effect of effect of simultaneously scaling by the numbers' modulus and ''rotating'' by its argument. This is really astounding. [[Translation]] corresponds, to complex addition, [[scale|scaling]] to multiplication by a real number, and [[rotation]] to multiplication by a complex number of unit modulus. The one type of [[coordinate transformation]] that is missing from this list is [[reflection]]. On the other hand, there is an arithmetic operation we have not considered, and that is division. Recall that
In other words, multiplication by a complex number ''z'' has the effect of effect of simultaneously scaling by the numbers' modulus and ''rotating'' by its argument. This is really astounding. [[Translation]] corresponds, to complex addition, [[scale|scaling]] to multiplication by a real number, and [[rotation]] to multiplication by a complex number of unit modulus. The one type of [[coordinate transformation]] that is missing from this list is [[reflection]]. On the other hand, there is an arithmetic operation we have not considered, and that is division. Recall that


:<math>\frac{1}{z} = \frac{\bar{z}}{|z|^2}</math>
: <math>\frac{1}{z} = \frac{\bar{z}}{|z|^2}.</math>


In other words, up to a scaling factor, division by ''z'' is just complex conjugation. Returning to the representation of complex numbers in rectangular form, we note that complex conjugation is just th transformation (or map) <math>x + iy \mapsto x - iy</math> or, in vector notation, <math>(x, y) \mapsto (x, -y)</math>. This is nothing other than reflection in the ''x''-axis, and any other reflection may be obtained by combining that transformation with rotations and translations.
In other words, up to a scaling factor, division by ''z'' is just complex conjugation. Returning to the representation of complex numbers in rectangular form, we note that complex conjugation is just th transformation (or map) <math>x + iy \mapsto x - iy</math> or, in vector notation, <math>(x, y) \mapsto (x, -y)</math>. This is nothing other than reflection in the ''x''-axis, and any other reflection may be obtained by combining that transformation with rotations and translations.
Line 169: Line 122:
Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" [[Euclidean geometry]]. It turns out that there are such generalizations in dimensions 4 and 8, known as the [[quaternions]] and [[octonions]] (also known as [[Cayley numbers]]). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern [[differential geometry]] and [[physics|mathematical physics]]).
Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" [[Euclidean geometry]]. It turns out that there are such generalizations in dimensions 4 and 8, known as the [[quaternions]] and [[octonions]] (also known as [[Cayley numbers]]). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern [[differential geometry]] and [[physics|mathematical physics]]).


==Algebraic closure==
==What about calculus?==


An important property of <math>\mathbb{C}</math> is that it is [[algebraically closed]]. This means that any non-constant real [[polynomial]] must have a root in <math>\mathbb{C}</math>. This result is known as the [[fundamental theorem of algebra]]. There are many proofs of this theorem. Many of the simplest depend crucially on [[complex analysis]]. To illustrate, we consider a proof based on [[Liouville's theorem]]: If <math>p(z)</math> is a polynomial function of a complex variable then both <math>p(z)</math> and <math>1/p(z)</math> will be [[holomorphic]] in any domain where <math>p(z) \not= 0</math>. But, by the triangle inequality, we know that outside a neighborhood of the origin <math>|p(z)| > |p(0)|</math>, so if there is no <math>z_0 </math> such that <math>p(z_0) = 0</math>, we know that <math>1/p(z)</math> is a bounded entire (i.e., holomorphic in all of <math>\mathbb{C}</math>) function. By [[Liouville's theorem]], it must be constant, so <math>p(z)</math> must also be constant.
So far, with one notable exception, we have only made use of ''algebraic'' properties of complex numbers. That exception is, of course, the complex exponential, which is an example of a [[transcendental]] function. As it happens, we could have avoided the use of the exponential function here, but only  at the cost of more complicated algebra. (The more interesting question is ''why'' we would want to avoid using it!) But we now turn to a more general question: Is it possible to extend the methods of calculus to functions of a complex variable, and why might we want to do so? We recall the definition of one of the two fundamental operations of calculus, differentiation. Given a function <math>y = f(x)</math>, we say ''f'' is differentiable at <math>x_0</math> if the limit


There are also proofs that do not depend on [[complex analysis]], but they require more [[algebra|algebraic]] or [[topology|topological]] machinery. The starting point here is that <math>\mathbb{R}</math> is a [[real closed field]] (i.e., an ordered field containing positive square roots and in which odd degree polynomials always do posess a root). The starting point is to note that <math>\mathbb{C} = \mathbb{R}[i]</math> is the splitting field of <math>x^2 + 1</math>, so if we can show that <math>\mathbb{C}</math> has no finite extensions, then we are done. Suppose <math>K/\mathbb{C}</math> is a finite normal extension with Galois group ''G''. A Sylow 2-subgroup ''H'' must correspond to an intermediate field ''L'', such that ''L'' is an extension of <math>\mathbb{R}</math> of ''odd'' degree, but we know no such extensions exist. This contradiction establishes the theorem.
: <math>\lim_{h\to 0} \frac{f(x_0 + h) - f(x_0)}{h}</math>


As an aside, it is interesting to note that avoiding the methods of one branch of mathematics (complex analysis), requires the use of more advanced methods from another branch of mathematics (in this case, field theory).
exists, and we call the limiting value the derivative of ''f'' at <math>x_0</math>, and the function that assigns to each point x the derivative of ''f'' at ''x'' is called the derivative of ''f'', and is written <math>f'(x)</math> or <math>df/dx</math>.


==What about complex analysis?==
==Algebraic closure==


So far, with one notable exception, we have only made use of ''algebraic'' properties of complex numbers. That exception is, of course, the complex exponential, which is an example of a [[transcendental]] function. As it happens, we could have avoided the use of the exponential function here, but only  at the cost of more complicated algebra. (The more interesting question is ''why'' we would want to avoid using it!)  
An important property of <math>\mathbb{C}</math> is that it is [[algebraically closed]]. This means that any non-constant real [[polynomial]] must have a root in <math>\mathbb{C}</math>. This result is known as the [[fundamental theorem of algebra]]. There are many proofs of this theorem. Many of the simplest depend crucially on [[complex analysis]]. To illustrate, we consider a proof based on [[Liouville's theorem]]: If <math>p(z)</math> is a polynomial function of a complex variable then both <math>p(z)</math> and <math>1/p(z)</math> will be [[holomorphic]] in any domain where <math>p(z) \not= 0</math>. But, by the triangle inequality, we know that outside a neighborhood of the origin <math>|p(z)| > |p(0)|</math>, so if there is no <math>z_0 </math> such that <math>p(z_0) = 0</math>, we know that <math>1/p(z)</math> is a bounded entire (i.e., holomorphic in all of <math>\mathbb{C}</math>) function. By [[Liouville's theorem]], it must be constant, so <math>p(z)</math> must also be constant.


===Differentiation===
There are also proofs that do not depend on [[complex analysis]], but they require more [[algebra|algebraic]] or [[topology|topological]] machinery. The starting point here is that <math>\mathbb{R}</math> is a [[real closed field]] (i.e., an ordered field containing positive square roots and in which odd degree polynomials always do posess a root). The starting point is to note that <math>\mathbb{C} = \mathbb{R}[i]</math> is the splitting field of <math>x^2 + 1</math>, so if we can show that <math>\mathbb{C}</math> has no finite extensions. We are done. Suppose <math>K/\mathbb{C}</math> is a finite normal extension with Galois group ''G''. A Sylow 2-subgroup ''H'' must correspond to an intermeiate field ''L'', such that ''L'' is an extension of <math>\mathbb{R}</math> of ''odd'' degree, but we know no such extensions exist. This contradiction establishes the theorem.
 
But we now turn to a more general question: Is it possible to extend the methods of calculus to functions of a complex variable, and why might we want to do so? We recall the definition of one of the two fundamental operations of calculus, differentiation. Given a function <math>y = f(x)</math>, we say ''f'' is differentiable at <math>x_0</math> if the limit
 
:<math>\lim_{h\to 0} \frac{f(x_0 + h) - f(x_0)}{h}</math>
 
exists, and we call the limiting value the derivative of ''f'' at <math>x_0</math>, and the function that assigns to each point x the derivative of ''f'' at ''x'' is called the derivative of ''f'', and is written <math>f'(x)</math> or <math>df/dx</math>. Now, does this definition work for functions of a complex variable? The answser is yes, and to see why, we fix ''x'' and unravel the definition of limit. If the limit exists, say <math>c = f'(x)</math>, then for every (real) number <math>\epsilon > 0</math>, there is a (real) number <math>\delta</math> such that if <math>|h| < \delta</math>
 
:<math>\left | \frac{f(x + h) - f(x)}{h} - c \right | < \epsilon</math>
 
This makes perfect sense for functions of a complex variable, but we need to keep in mind that <math>| \cdot |</math> represents the modulus of a complex number, not the real absolute value.
 
This seemingly innocuous difference actually has far reaching implications. Recall that the complex plane has two real dimensions, so there are many ways that ''h'' can approach 0: successive values of ''h'' may be points on the ''x''-axis, points on the ''y''-axis, some other line through the origin, it may spiral in, or take any of a number of paths, but the definition requires that the limit be the ''same number'' in every case. This is a very strong requirement! Fortunately, it turns out to be sufficient to consider just two of the possible "approach paths": a sequence of values along the ''x''-axis and a sequence of values along the ''y''-axis. If we call the real and imaginary parts (respectively) of <math>w = f(z)</math> ''u'' and ''v'', (i.e., <math>w = f(z) = u + iv</math>), this requirement can be expressed in terms of the [[partial derivative]]s of ''u'' and ''v'' with respect to ''x'' and ''y'':
 
:<math>\frac{\partial u}{\partial x} = \frac{\partial v}{\partial y}</math>
and
:<math>\frac{\partial v}{\partial x} = - \frac{\partial u}{\partial y}</math>
 
These equations are known as the [[Cauchy-Riemann equations]].
 
:'''Note''': These equations are frequently written in the more compact form, <math>u_x = v_y</math> and <math>v_x = - u_y</math>.
 
They may be obtained by noting that if the approach path is on ''x''-axis, <math>\partial f / \partial y = 0</math>, so
 
:<math>\frac{df}{dz} = \frac{1}{2} \left ( \frac{\partial u}{\partial x} + i \frac{\partial v}{\partial x} \right )</math>
 
and that on the ''y''-axis, <math>\partial f / \partial x = 0</math>, so
 
:<math>\frac{df}{dz} = \frac{1}{2} \left (-i \frac {\partial u}{\partial y} + \frac{\partial v}{\partial y} \right )</math>


As an aside, it is interesting to note that avoiding the methods of one branch of mathematics (complex analysis), requires the use of more advanced methods from another branch of mathematics (in this case, field theory).


These equations have far-reaching implications. To get some idea if why this is so, consider that we can take second derivatives to obtain
==Notational variants==


:<math>u_{xx} + u_{yy} = 0</math>
This article follows the usual convention in [[mathematics]]  (and [[physics]]) of using <math>i</math> as the imaginary unit. Complex numbers are frequently used in [[electrical engineering]], but in that discipline it is usual to use <math>j</math> instead, reserving <math>i</math> for [[electrical current]]. This usage is found in some [[programming language]]s, notably [[Python]].
and
:<math>v_{xx} + v_{yy} = 0</math>


In other words, u and v satisfy [[Laplace's equation]] in 2 dimensions. These functions arise in [[physics|mathematical physics]] as [[scalar potential]]s in, for example, [[fluid dynamics]]. [[Laplace's equation]] is also basic to the study of [[partial differential equation]]s. This is but one indication of the reason for the ubiquity of complex functions in [[physics]].
==Further reading==
 
===Integration===
 
By contrast, the definition of [[integral|integration]] in complex analysis involves no surprises. Path integrals and integrals over regions are defined just as they are in the calculus of functions of two real variables. What is different is that the Cauchy-Riemann equations imply that integrals of complex functions have some very special properties. In particular, if a function ''f'' is differentiable (in the sense explained above) in a [[simply connected]] domain (intuitively, a domain having no "holes" in it), then for any closed curve <math>\gamma</math> defined in that domain
:<math>\int_{\gamma}\nolimits f dz = 0</math>
It is essential that the domain of definition be simply connected. For example, let
:<math>D = \{ z \mid \textstyle\frac 1 2 < |z| < \frac 3 2 \}</math>
and let <math>f(z) = 1/z</math>. Then if we define <math>\gamma (t) = e^{it}</math> where ''t'' ranges from 0 to <math>2 \pi</math> (i.e., we take <math>\gamma</math> to be the unit circle), then the integral will ''not'' be 0.
 
It follows that if <math>\gamma_1</math> and <math>\gamma_2</math> are two [[homotopic]] paths joining a pair of points <math>P, Q \in D</math> (intuitively, one can be deformed into the other), then
:<math>\int_{\gamma_1}\nolimits f dz = \int_{\gamma_2}\nolimits f dz</math>
This is commonly expressed by saying that the integrals are path independent, and this is just the condition for the existence of a scalar potential!
 
Finally, we note that integrals in domains containing singularities (such as 1/z in the above example) can be computed using [[Cauchy's integral formula]]
:<math>f(z) = \frac{1}{2\pi i} \int_{\gamma}\nolimits \frac{f(\zeta) d \zeta}{\zeta - z}</math>
This result lies at the heart of many applications of complex analysis to disciplines ranging from [[number theory]] to [[physics]]. Its importance would be difficult to overestimate.
 
 
==Further Reading==


*{{cite book
*{{cite book
Line 271: Line 176:
|isbn = 0-7167-0453-6 }}
|isbn = 0-7167-0453-6 }}


==Notes and references==
 
{{reflist|2}}


[[Category:Mathematics Workgroup]]
[[Category:Mathematics Workgroup]]
[[category:CZ Live]]
[[category:CZ Live]]

Revision as of 13:44, 16 April 2007

The complex numbers are numbers of the form a+bi, obtained by adjoining the imaginary unit i to the real numbers (here a and b are reals). The number i can be thought of as a solution of the equation . In other words, its basic property is . Of course, since the square root of any real number is positive, . A priori, it is not even clear whether such an object exists and that it can be called "a number", i.e. whether we can associate with it some natural operations as addition or multiplication. Assuming, for a moment, that the answer is "yes", we may write

Of course, we do not formally define complex numbers this way but, rather, define them as ordered pairs of real numbers. The above notation is, however, traditional.

Aside on notation: There is a well established tradition in mathematics of adopting notation that is suggestive, even if it is, in some ways, unnatural or awkward. For example, if complex numbers are ordered pairs of real numbers, why not represent them as pairs, i.e., use rather than ? Thee are several ways of answering this question. One is that our notation tends to guide our thinking, and writing emphasizes the idea that the real number x is a complex number, whereas writing for the same number suggests that, as a complex number, x is something fundamentally different (perhaps it is). A second, and rather different, reason for using the notation is that it suggests a parallel with another part of mathematics. In elementary number theory, we learn to perform arithmetic modulo a number base. for example, we may write
to indicate that when we add 4 and 5 and then divide the result by 7, the remainder is 2. We can do something similar with polynomials in a single variable x. We know that , but , so when we divide by , the remainder is . And by the same token,
so, when we add or multiply complex numbers, we are just doing modular arithmetic! Of course, there are also times when we wish to focus on the geometric or analytic aspects of complex numbers rather than the algebraic ones, but there is a tendency to want to retain the same notation where possible, and there is no question but that mathematical notation also tends to be dictated by tradition and historical accident.

Philosophical matters

Working with complex numbers

Basic operations

We define addition and multiplication in the obvious way, using to rewrite results in the form :

To handle division, we simply note that , so

and, in particular,

It turns out that with addition and multiplication defined this way, satisfies the axioms for a field, and is called the field of complex numbers. If is a complex number, we call the real part of and write . Similarly, is called the imaginary part of and we write . If the imaginary part of a complex number is , the number is said to be real, and we write instead of . We thus identify with a subset (and, in fact, a subfield) of .

Going a bit further, we can introduce the important operation of complex conjugation. Given an arbitrary complex number , we define its complex conjugate to be . Using the identity we derive the important formula

and we define the modulus of a complex number z to be

Note that the modulus of a complex number is always a real number.

The modulus (also called absolute value) satisfies three important properties that are completely analogous to the properties of the absolute value of real numbers

  1. and if and only if

The last inequality is known as the triangle inequality.

The complex exponential

Recall that in real analysis, the ordinary exponential function may be defined as

The same series may be used to define the complex exponential function

(where, of course, convergence is defined in terms of the complex modulus, instead of the real absolute value).

Notation: The expressions and mean the same thing, and may be used interchangeably.

The complex expomential has the same multiplicative property that holds for real numbers,namely

The complex exponential function has the important property that

as may be seen immediately by substituting and comparing terms with the usual power series expansions of and .

The familiar trigonometric identity

immediately implies the important formula

, for any

Of course, there is no reason to assume this identity. We only need note that

so,

Geometric interpretation

Since a complex number corresponds (essentially by definition) to an ordered pair of real numbers , it can be interpreted as a point in the plane (i.e., . When complex numbers are represented as points in the plane, the resulting diagrams are known as Argand diagrams, after Robert Argand. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers, but also as a tool in applying complex numbers to geometrical and physical problems.

There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of vectors in . Similarly, if is real, multiplication by </math>\alpha</math> is just scalar multiplication. In we have

and

To put it succintly, is a 2-dimensional real vector space with respect to the usual operations of addition of complex numbers and multiplication by a real number. There doesn't seem to be much more to say. But there is more to say, and that is that the multiplication of complex numbers has geometric significance. This is most easily seen if we take advantage of the complex exponential, and write complex numbers in polar form

Here, r is simply the modulus or vector length. The number is just the angle formed with the x-axis, and is called the argument. Now, when complex numbers are written in polar form, multiplication is very interesting

In other words, multiplication by a complex number z has the effect of effect of simultaneously scaling by the numbers' modulus and rotating by its argument. This is really astounding. Translation corresponds, to complex addition, scaling to multiplication by a real number, and rotation to multiplication by a complex number of unit modulus. The one type of coordinate transformation that is missing from this list is reflection. On the other hand, there is an arithmetic operation we have not considered, and that is division. Recall that

In other words, up to a scaling factor, division by z is just complex conjugation. Returning to the representation of complex numbers in rectangular form, we note that complex conjugation is just th transformation (or map) or, in vector notation, . This is nothing other than reflection in the x-axis, and any other reflection may be obtained by combining that transformation with rotations and translations.

Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" Euclidean geometry. It turns out that there are such generalizations in dimensions 4 and 8, known as the quaternions and octonions (also known as Cayley numbers). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern differential geometry and mathematical physics).

What about calculus?

So far, with one notable exception, we have only made use of algebraic properties of complex numbers. That exception is, of course, the complex exponential, which is an example of a transcendental function. As it happens, we could have avoided the use of the exponential function here, but only at the cost of more complicated algebra. (The more interesting question is why we would want to avoid using it!) But we now turn to a more general question: Is it possible to extend the methods of calculus to functions of a complex variable, and why might we want to do so? We recall the definition of one of the two fundamental operations of calculus, differentiation. Given a function , we say f is differentiable at if the limit

exists, and we call the limiting value the derivative of f at , and the function that assigns to each point x the derivative of f at x is called the derivative of f, and is written or .

Algebraic closure

An important property of is that it is algebraically closed. This means that any non-constant real polynomial must have a root in . This result is known as the fundamental theorem of algebra. There are many proofs of this theorem. Many of the simplest depend crucially on complex analysis. To illustrate, we consider a proof based on Liouville's theorem: If is a polynomial function of a complex variable then both and will be holomorphic in any domain where . But, by the triangle inequality, we know that outside a neighborhood of the origin , so if there is no such that , we know that is a bounded entire (i.e., holomorphic in all of ) function. By Liouville's theorem, it must be constant, so must also be constant.

There are also proofs that do not depend on complex analysis, but they require more algebraic or topological machinery. The starting point here is that is a real closed field (i.e., an ordered field containing positive square roots and in which odd degree polynomials always do posess a root). The starting point is to note that is the splitting field of , so if we can show that has no finite extensions. We are done. Suppose is a finite normal extension with Galois group G. A Sylow 2-subgroup H must correspond to an intermeiate field L, such that L is an extension of of odd degree, but we know no such extensions exist. This contradiction establishes the theorem.

As an aside, it is interesting to note that avoiding the methods of one branch of mathematics (complex analysis), requires the use of more advanced methods from another branch of mathematics (in this case, field theory).

Notational variants

This article follows the usual convention in mathematics (and physics) of using as the imaginary unit. Complex numbers are frequently used in electrical engineering, but in that discipline it is usual to use instead, reserving for electrical current. This usage is found in some programming languages, notably Python.

Further reading

  • Ahlfors, Lars V. (1979). Complex Analysis, 3rd edition. McGraw-Hill, Inc.. ISBN 0-07-000657-1. 
  • Apostol, Tom M. (1974). Mathematical Analysis, 2nd edition. Addison-Wesley. ISBN 0-201-00-288-4. 
  • Conway, John H.; Derek A. Smith (2003). On Quaternions and Octonions: Their Geometry, Arithmetic and Symmetry. A K Peters, Ltd.. ISBN 1-56881-134-9. 
  • Jacobson, Nathan (1974). Basic Algebra I. W.H. Freeman and Company. ISBN 0-7167-0453-6.