Complex number: Difference between revisions
imported>Catherine Woodgold (→The complex exponential: Reworded to "Another way to establish this identity is to note that" as I had proposed) |
mNo edit summary |
||
(43 intermediate revisions by 23 users not shown) | |||
Line 1: | Line 1: | ||
{{subpages}} | |||
'''Complex numbers''' are numbers of the form <math>\scriptstyle a+b\cdot i</math>, where <math>a</math> and <math>b</math> are [[real number]]s and <math>i</math> denotes a number satisfying <math>\scriptstyle i^{2}=-1</math>.<ref>This article follows the usual convention in [[mathematics]] and [[physics]] of using <math>i</math> as the imaginary unit. Complex numbers are frequently used in [[electrical engineering]], but in that discipline it is usual to use <math>j</math> instead, reserving <math>i</math> for [[electrical current]]. This usage is found in some [[programming language]]s too, notably [[Python]].</ref> Of course, since the square of any real number is nonnegative, <math>i</math> cannot be a real number. At first glance, it is not even clear whether such an object exists and can be reasonably called a number; for example, can we sensibly associate with <math>i</math> natural operations such as addition and multiplication? As it happens, we can define mathematical operations for these "complex numbers" in a consistent and sensible way and, perhaps more importantly, using complex numbers provides mathematicians, physicists, and engineers with an extremely powerful approach to expressing parts of these sciences in a convenient and natural way. | |||
== Historical Development == | |||
A common complaint among math students is why they must bother with complex numbers when real numbers almost always seem sufficient for applications. Indeed, many familiar real world quantities, such as distance, temperature, and time are best described using real numbers. In contrast, although there are many physical situations which are mostly aptly described with complex numbers, most of these situations require some relatively sophisticated knowledge of physics, and the phenomena which they describe are more abstruse than distance, temperature, or time. | |||
Many modern students first encounter complex numbers when solving [[quadratic equation]]s, which can have complex number solutions. This presentation is historically misleading — the quadratic formula precedes recognition of the utility of complex numbers by many centuries. The ancients could simply dismiss quadratic equations whose solutions involve complex numbers as having no real roots. | |||
The need for complex numbers might have appeared for the first time during the sixteenth century, when Italian mathematicians like [[Scipione del Ferro]], [[Niccolò Fontana Tartaglia]], [[Gerolamo Cardano]] and [[Rafael Bombelli]] tried to solve [[cubic equation|cubic equations]]. Even for equations with three [[real number|real]] solutions, the method they used sometimes required calculations with numbers whose squares are negative. A historical example of this can be found on the [[Complex number/Advanced|"advanced" subpage]] for this article. | |||
In modernity, complex numbers form the basis for the mathematical models of many physical phenomena, including electro-magnetism and quantum mechanics. Even so, it is still often difficult to find examples where an equivalent mathematical model cannot be formulated using pairs of real numbers instead. | |||
==Working with complex numbers== | |||
As a first step in giving some legitimacy to the "number" <math>\scriptstyle \sqrt{-1}</math>, we will explain how to compute with it. How do you add, multiply and divide expressions with this number? It turns out that this is not that difficult; the main rule to keep in mind is that the square of <math>\scriptstyle \sqrt{-1}</math> equals <math>\scriptstyle -1</math>. | |||
In the remainder of the article, we will use the letter <math>i</math> to denote one solution of the equation <math>\scriptstyle i^2 = -1</math>, where we previously used <math>\scriptstyle \sqrt{-1}</math>.<ref>Part of the reason for not using <math>\sqrt{-1}</math> is that the symbol <math>\sqrt{a}</math> (or <math>\sqrt[n]{a}</math>) with <math>a\in\mathbb{C}</math> is sometimes used to denote the set of ''complex roots'' of <math>a</math>, i.e., the set of the solutions of the equation <math>x^2=a</math> (<math>x^n=a</math> respectively). The set contains 2 (<math>n</math>, respectively) "equally important" elements and there is no canonical way to distinguish a "representative". Consequently, no computations are performed using this symbol.</ref> With this convention, all complex numbers can be written as <math>a + bi</math>, where <math>a</math> and <math>b</math> are ''real'' numbers. We call <math>a</math> the '''real part''' of the complex number and <math>b</math> the '''imaginary part'''. The complex number <math>a + 0\cdot i</math> whose imaginary part is zero is considered to be the same thing as the real number <math>a</math>. | |||
===Basic operations=== | |||
Addition of complex numbers is straightforward, <math>\scriptstyle (a + b\cdot i) + (c + d\cdot i) = (a + c) + (b + d) \cdot i.</math> The result is again a complex number. | |||
Multiplication is more interesting. Suppose we want to compute <math>\scriptstyle (a+b \cdot i)\cdot(c+d\cdot i)</math>. Using <math>\scriptstyle i^2 = -1</math>, we can rewrite this product in a form which clearly shows it to be another complex number: | |||
:<math>(a + b\cdot i)\cdot (c + d\cdot i) = a\cdot c + a\cdot d\cdot i + b\cdot c\cdot i + b\cdot d\cdot i^2 = (a\cdot c - b\cdot d) + (b\cdot c + a\cdot d)i. \ </math> | |||
To handle division, we simply note that <math>(c + d\cdot i)\cdot (c - d\cdot i) = c^2 +d^2</math>, so, provided that ''c'' and ''d'' are not simultaneously zero, | |||
:<math>\frac{1}{c + d\cdot i} = \frac{c - d\cdot i}{c^2 + d^2}, </math> | |||
from which it follows that | |||
:<math>\frac{a + b\cdot i}{c + d\cdot i} = \frac{(a\cdot c + b\cdot d) + (b\cdot c - a\cdot d)i}{c^2 + d^2}.</math> | |||
If <math>c = d = 0</math> then division by <math>c+d\cdot i</math> is not defined. | |||
Going a bit further, we can introduce the important operation of '''complex conjugation'''. Given an arbitrary complex number <math>z = x + y\cdot i</math>, we define its ''[[complex conjugate]]'' to be <math>\scriptstyle \bar{z} = x - y \cdot i</math>. Using the identity <math>(a + b)\cdot(a - b) = a^2 - b^2</math> we derive the important formula | |||
Going a bit further, we can introduce the important operation of complex conjugation. Given an arbitrary complex number <math>z = x + | |||
:<math>z \bar{z} = x^2 + y^2</math> | :<math>z \bar{z} = x^2 + y^2</math> | ||
and we define the modulus of a complex number z to be | and we define the modulus of a complex number z to be | ||
Line 87: | Line 58: | ||
The complex exponential has the same multiplicative property that holds for real numbers, namely | The complex exponential has the same multiplicative property that holds for real numbers, namely | ||
:<math>e^{z_1 z_2} = e^{z_1} e^{z_2} \ </math> | :<math>e^{z_1 + z_2} = e^{z_1} e^{z_2} \ </math> | ||
The complex exponential function has the important property that | The complex exponential function has the important property that | ||
Line 104: | Line 75: | ||
Another way to establish this identity is to note that | Another way to establish this identity is to note that | ||
<math>\overline{e^{i\theta}} = e^{-i\theta}</math>, so | <math>\scriptstyle \overline{e^{i\theta}} = e^{-i\theta}</math>, so | ||
:<math>|e^{i\theta}|^2 = e^{i\theta}e^{-i\theta} = e^0 = 1. \ </math> | :<math>|e^{i\theta}|^2 = e^{i\theta}e^{-i\theta} = e^0 = 1. \ </math> | ||
Line 110: | Line 81: | ||
===Geometric interpretation=== | ===Geometric interpretation=== | ||
{{Image|Complex_plane3.png|right|250px|Graphical representation of a complex number and its conjugate}} | |||
Since a complex number <math>z = x + iy</math> is specified by two real numbers, namely <math>x</math> and <math>y</math>, it can be interpreted as the point <math>(x,y)</math> in the plane. When complex numbers are represented as points in the plane, the resulting diagrams are known as [[Robert Argand|Argand]] diagrams, after [[Robert Argand]]. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers and as a tool in applying complex numbers to [[geometry|geometrical]] and [[physics|physical]] problems. | Since a complex number <math>z = x + iy</math> is specified by two real numbers, namely <math>x</math> and <math>y</math>, it can be interpreted as the point <math>(x,y)</math> in the plane. When complex numbers are represented as points in the plane, the resulting diagrams are known as [[Robert Argand|Argand]] diagrams, after [[Robert Argand]]. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers and as a tool in applying complex numbers to [[geometry|geometrical]] and [[physics|physical]] problems. | ||
There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of [[vector]]s in <math>\mathbb{R}^2</math>. Similarly, if <math>\alpha \in \mathbb{R}</math> is real, multiplication by <math>\alpha</math> is just scalar multiplication. In <math>\mathbb{C}</math> we have | There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of [[vector]]s in <math>\scriptstyle \mathbb{R}^2</math>. Similarly, if <math>\scriptstyle \alpha \in \mathbb{R}</math> is real, multiplication by <math>\alpha</math> is just scalar multiplication. In <math>\scriptstyle \mathbb{C}</math> we have | ||
:<math>z_1 + z_2 = (x_1 + iy_1) + (x_2 + iy_2) = (x_1 + x_2) + i(y_1 + y_2) \ </math> | :<math>z_1 + z_2 = (x_1 + iy_1) + (x_2 + iy_2) = (x_1 + x_2) + i(y_1 + y_2) \ </math> | ||
Line 119: | Line 90: | ||
and | and | ||
:<math>\alpha z = \alpha(x + | :<math>\alpha z = \alpha(x + i \cdot y) = \alpha x + i\alpha y. \, </math> | ||
To put it | To put it succinctly, <math>\scriptstyle \mathbb{C}</math> is a 2-dimensional [[real number|real]] [[vector space]] with respect to the usual operations of addition of complex numbers and multiplication by a real number. There doesn't seem to be much more to say. But there ''is'' more to say, and that is that the multiplication of ''complex'' numbers has geometric significance. This is most easily seen if we take advantage of the complex exponential, and write complex numbers in [[polar coordinates|polar]] form | ||
:<math>z = r e^{i\theta}.</math> | :<math>z = r\cdot e^{i\theta}.</math> | ||
Here, r is simply the modulus <math>|z| = \sqrt{x^2 + y^2}</math> or vector length. The number <math>\theta</math> is just the angle formed with the <math>x</math>-axis, and is called the ''argument''. Now, when complex numbers are written in polar form, multiplication is very interesting | Here, r is simply the modulus <math>\scriptstyle |z| = \sqrt{x^2 + y^2}</math> or vector length. The number <math>\theta</math> is just the angle formed with the <math>x</math>-axis, and is called the ''argument'' satisfying the condition <math> \rm tan \theta = y/x </math>. Now, when complex numbers are written in polar form, multiplication is very interesting | ||
:<math>z_1 z_2 = (r_1 e^{i\theta_1}) (r_2 e^{i\theta_2}) = r_1 r_2 e^{i(\theta_1 + \theta_2)}.</math> | :<math>z_1 z_2 = (r_1 e^{i\theta_1}) (r_2 e^{i\theta_2}) = r_1 r_2 e^{i(\theta_1 + \theta_2)}.</math> | ||
{{Image|Graphical_multiplication1.png|left|300px|Multiplication by <math>i</math> amounts to rotation by 90 degrees}} | |||
In other words, multiplication by a complex number <math>z</math> has the effect of simultaneously scaling by the number's modulus and ''rotating'' by its argument. This is really astounding. For example, to multiply a given complex number <math>z</math> by <math>i</math> we need only to rotate <math>z</math> by <math>\pi/2</math> (that is, 90 degrees). [[Translation]] corresponds to complex addition, [[ | In other words, multiplication by a complex number <math>z</math> has the effect of simultaneously scaling by the number's modulus and ''rotating'' by its argument. This is really astounding. For example, to multiply a given complex number <math>z</math> by <math>i</math> we need only to rotate <math>z</math> by <math>\pi/2</math> (that is, 90 degrees). [[Translation (geometry)|Translation]] corresponds to complex addition, [[scaling (geometry)|scaling]] to multiplication by a real number, and [[rotation]] to multiplication by a complex number of unit modulus. The one type of [[coordinate transformation]] that is missing from this list is [[reflection]]. On the other hand, there is an arithmetic operation we have not considered, and that is division. Recall that for non-zero <math>z</math> | ||
:<math>\frac{1}{z} = \frac{\bar{z}}{|z|^2}.</math> | :<math>\frac{1}{z} = \frac{\bar{z}}{|z|^2}.</math> | ||
Division of a complex number <math>z_1</math> by a non-zero complex number <math>z_2</math> can then be interpreted as multiplication of <math>z_1</math> by <math>\frac{1}{z_2}</math>. This in turn corresponds to scaling of the modulus of <math>z_1</math> by the inverse of the modulus of <math>z_2</math> and a ''rotation'' of its argument by the negative of the argument of <math>z_2</math>. That is, | |||
:<math>\frac{z_1}{z_2}=z_1 (\frac{1}{z_2})=\frac{1}{|z_2|^2}z_1 \overline{z_2}=\frac{|z_1|}{|z_2|}e^{i(\theta_1-\theta_2)},</math> | |||
where <math>\theta_1,\theta_2</math> are the arguments of <math>z_1,z_2</math>, respectively. | |||
Returning to the representation of complex numbers in rectangular form, we note that complex conjugation is just the transformation (or map) <math>\scriptstyle x + iy \;\mapsto\; x - iy</math> or, in vector notation, <math>\scriptstyle (x, y)\; \mapsto \;(x, -y)</math>. This is nothing other than reflection in the <math>x</math>-axis, and any other reflection may be obtained by combining that transformation with rotations and translations. | |||
Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" [[Euclidean geometry]]. It turns out that there are such generalizations in dimensions 4 and 8, known as the [[quaternions]] and [[octonions]] (also known as [[Cayley numbers]]). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern [[differential geometry]] and [[physics|mathematical physics]]). | Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" [[Euclidean geometry]]. It turns out that there are such generalizations in dimensions 4 and 8, known as the [[quaternions]] and [[octonions]] (also known as [[Cayley numbers]]). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern [[differential geometry]] and [[physics|mathematical physics]]). | ||
Line 144: | Line 121: | ||
There are many proofs of the Fundamental Theorem of Algebra. Many of the simplest depend crucially on [[complex analysis]]. But it is by no means necessary to rely on complex analysis here. A proof using [[field theory]] is alluded to at the very end of this article. | There are many proofs of the Fundamental Theorem of Algebra. Many of the simplest depend crucially on [[complex analysis]]. But it is by no means necessary to rely on complex analysis here. A proof using [[field theory]] is alluded to at the very end of this article. | ||
==Formal definition== | ==Formal definition== | ||
We have been treating complex numbers very much like real numbers and found that they can be very useful, but we have not yet proven that they exist or that they can be used without running into contradictions. In fact, it is quite easy to go wrong when using complex numbers. Consider for instance the following computation: | |||
:<math>-1=\sqrt{-1}\times\sqrt{-1}=\sqrt{(-1)\times(-1)}=\sqrt{1}=1.</math> | :<math>-1=\sqrt{-1}\times\sqrt{-1}=\sqrt{(-1)\times(-1)}=\sqrt{1}=1.</math> | ||
This computation seems to show that <math>-1</math> equals <math>1</math>, which is nonsense. The point is that the second equality can not be applied. Positive real numbers satisfy the identity | This computation seems to show that <math>-1</math> equals <math>1</math>, which is nonsense. The point is that the second equality can not be applied. Positive real numbers satisfy the identity | ||
:<math> \sqrt{a}\times\sqrt{b} = \sqrt{a \times b}, </math> | :<math> \sqrt{a}\times\sqrt{b} = \sqrt{a \times b}, </math> | ||
but this identity does not hold for negative real numbers, whose square roots are not real. | but this identity does not hold for negative real numbers, whose square roots are not real, because the square root symbol denotes only the positive solution to <math>x^2 = a</math>. | ||
One possibility to feel more secure when using complex numbers is to define them in terms of constructs which are better understood. This approach was taken by [[William Rowan Hamilton|Hamilton]], who defined complex numbers as [[ordered pair|ordered pairs]] of real numbers, that is, | One possibility to feel more secure when using complex numbers is to define them in terms of constructs which are better understood. This approach was taken by [[William Rowan Hamilton|Hamilton]], who defined complex numbers as [[ordered pair|ordered pairs]] of real numbers, that is, | ||
:<math>\mathbb{C}= \{ (a,b) \colon a,b\in \mathbb{R} \}.</math> | :<math>\mathbb{C}= \{ (a,b) \colon a,b\in \mathbb{R} \}.</math> | ||
Addition and multiplication of such pairs can be defined as follows: | |||
*addition: <math>(a, b) + (c, d) = (a + c, b + d)</math> | *addition: <math>(a, b) + (c, d) = (a + c, b + d)</math> | ||
*multiplication: <math>(a, b)(c, d) = (ac - bd, bc + ad)</math> | *multiplication: <math>(a, b)(c, d) = (ac - bd, bc + ad)</math> | ||
The multiplication may look artificial, but it is inspired by the formula | The multiplication may look artificial, but it is inspired by the formula | ||
:<math>(a + | :<math>(a + b\cdot i)(c + d\cdot i) = (a\cdot c - b\cdot d) + (b\cdot c + a\cdot d)\cdot i. \ </math> | ||
which we derived before. | which we derived before. | ||
These definitions satisfy most of the basic properties of addition and multiplication of real numbers, and we can employ many formulas from the elementary algebra we are accustomed to. More specifically, the sum (or the product) of two numbers does not depend on the order of terms;<ref>that is, the addition (multiplication) is [[commutativity|commutative]]</ref> the sum (product) of three or more elements does not depend on order of operations ('we can suppress the parentheses');<ref>This is called [[associativity]]</ref> the product of a complex number with a sum of two other numbers expands in the usual way.<ref>In other words, multiplication is [[distributivity|distributive]] over addition</ref> In mathematical language this means that with addition and multiplication defined this way, <math>\mathbb{C}</math> satisfies the [[axiom|axioms]] for a [[field]] and is called the field of complex numbers. | These definitions satisfy most of the basic properties of addition and multiplication of real numbers, and we can employ many formulas from the elementary algebra we are accustomed to. More specifically, it can easily be shown that addition and multiplication as defined above are commutative and associative, and that multiplication is distributive over addition; in other words, the sum (or the product) of two numbers does not depend on the order of terms;<ref>that is, the addition (multiplication) is [[commutativity|commutative]]</ref> the sum (product) of three or more elements does not depend on order of operations ('we can suppress the parentheses');<ref>This is called [[associativity]]</ref> the product of a complex number with a sum of two other numbers expands in the usual way.<ref>In other words, multiplication is [[distributivity|distributive]] over addition</ref> In mathematical language this means that with addition and multiplication defined this way, <math>\mathbb{C}</math> satisfies the [[axiom|axioms]] for a [[field]] and is called the field of complex numbers. | ||
Now we are ready to understand the 'real' meaning of <math>i</math>. Observe that the pairs of type (<math>a</math>,0) are identical<ref>i.e., [[isomorphism|isomorphic]], which basically means that the mapping <math> \mathbb{C}\ni (a,0)\mapsto a\in\mathbb{R},</math> preserves the addition and multiplication.</ref> to the set of reals, so we write <math>(a,0)=a</math>. Observe also that by definition <math>(0,1)(0,1) = (-1,0)=-1</math>. In other words, we can define <math>i</math>, the | Now we are ready to understand the 'real' meaning of <math>i</math>. Observe that the pairs of type (<math>a</math>,0) are identical<ref>i.e., [[isomorphism|isomorphic]], which basically means that the mapping <math> \mathbb{C}\ni (a,0)\mapsto a\in\mathbb{R},</math> preserves the addition and multiplication.</ref> to the set of reals, so we write <math>(a,0)=a</math>. Observe also that by definition <math>(0,1)(0,1) = (-1,0)=-1</math>. In other words, we can define <math>i</math>, the symbol we've been using, as the pair (0,1). In this way we have a way of indicating which one we mean of the two solutions of the equation <math>i^2=-1</math>; the other is now denoted (0,-1). | ||
Another way to define the complex numbers comes from [[field theory]]. Because <math>x^2+1</math> is [[irreducible polynomial|irreducible]] in the [[polynomial ring]] <math>\mathbb{R}[x]</math>, the [[ideal (ring theory)|ideal]] generated by <math>x^2+1</math> is a [[maximal ideal]].<ref>An ideal <math>I = \left(f(x)\right)</math> in a polynomial ring over a field is maximal if and only if <math>f(x)</math> is irreducible over the field.</ref> Therefore, the [[quotient ring]] <math> \mathbb{C}=\mathbb{R}[x]/\left(x^2+1\right)</math> is a [[field (mathematics)|field]]. We can choose the polynomials of degree at most 1 as the representatives for the equivalence classes in this quotient ring. So in a sense, we can imagine that the dummy variable <math>x</math> is the imaginary number <math>i</math>, and the elements of the quotient ring behave exactly the way we expect the complex numbers to behave. For example, <math>x^2</math> is in the same equivalence class as <math>-1</math>, and so <math>x^2=-1</math> in this quotient ring. (As a final comment in this analysis, we could next show that <math>\mathbb{C}</math> has no finite [[field extension|extension]] and must therefore be [[algebraic closure|algebraically closed]].) | Another way to define the complex numbers comes from [[field theory]]. Because <math>x^2+1</math> is [[irreducible polynomial|irreducible]] in the [[polynomial ring]] <math>\mathbb{R}[x]</math>, the [[ideal (ring theory)|ideal]] generated by <math>x^2+1</math> is a [[maximal ideal]].<ref>An ideal <math>I = \left(f(x)\right)</math> in a polynomial ring over a field is maximal if and only if <math>f(x)</math> is irreducible over the field.</ref> Therefore, the [[quotient ring]] <math> \mathbb{C}=\mathbb{R}[x]/\left(x^2+1\right)</math> is a [[field (mathematics)|field]]. We can choose the polynomials of degree at most 1 as the representatives for the equivalence classes in this quotient ring. So in a sense, we can imagine that the dummy variable <math>x</math> is the imaginary number <math>i</math>, and the elements of the quotient ring behave exactly the way we expect the complex numbers to behave. For example, <math>x^2</math> is in the same equivalence class as <math>-1</math>, and so <math>x^2=-1</math> in this quotient ring. (As a final comment in this analysis, we could next show that <math>\mathbb{C}</math> has no finite [[field extension|extension]] and must therefore be [[algebraic closure|algebraically closed]].) | ||
==Notes and references== | ==Notes and references== | ||
{{reflist|2}} | {{reflist|2}}[[Category:Suggestion Bot Tag]] | ||
[[Category: | |||
Latest revision as of 11:01, 31 July 2024
Complex numbers are numbers of the form , where and are real numbers and denotes a number satisfying .[1] Of course, since the square of any real number is nonnegative, cannot be a real number. At first glance, it is not even clear whether such an object exists and can be reasonably called a number; for example, can we sensibly associate with natural operations such as addition and multiplication? As it happens, we can define mathematical operations for these "complex numbers" in a consistent and sensible way and, perhaps more importantly, using complex numbers provides mathematicians, physicists, and engineers with an extremely powerful approach to expressing parts of these sciences in a convenient and natural way.
Historical Development
A common complaint among math students is why they must bother with complex numbers when real numbers almost always seem sufficient for applications. Indeed, many familiar real world quantities, such as distance, temperature, and time are best described using real numbers. In contrast, although there are many physical situations which are mostly aptly described with complex numbers, most of these situations require some relatively sophisticated knowledge of physics, and the phenomena which they describe are more abstruse than distance, temperature, or time.
Many modern students first encounter complex numbers when solving quadratic equations, which can have complex number solutions. This presentation is historically misleading — the quadratic formula precedes recognition of the utility of complex numbers by many centuries. The ancients could simply dismiss quadratic equations whose solutions involve complex numbers as having no real roots.
The need for complex numbers might have appeared for the first time during the sixteenth century, when Italian mathematicians like Scipione del Ferro, Niccolò Fontana Tartaglia, Gerolamo Cardano and Rafael Bombelli tried to solve cubic equations. Even for equations with three real solutions, the method they used sometimes required calculations with numbers whose squares are negative. A historical example of this can be found on the "advanced" subpage for this article.
In modernity, complex numbers form the basis for the mathematical models of many physical phenomena, including electro-magnetism and quantum mechanics. Even so, it is still often difficult to find examples where an equivalent mathematical model cannot be formulated using pairs of real numbers instead.
Working with complex numbers
As a first step in giving some legitimacy to the "number" , we will explain how to compute with it. How do you add, multiply and divide expressions with this number? It turns out that this is not that difficult; the main rule to keep in mind is that the square of equals .
In the remainder of the article, we will use the letter to denote one solution of the equation , where we previously used .[2] With this convention, all complex numbers can be written as , where and are real numbers. We call the real part of the complex number and the imaginary part. The complex number whose imaginary part is zero is considered to be the same thing as the real number .
Basic operations
Addition of complex numbers is straightforward, The result is again a complex number.
Multiplication is more interesting. Suppose we want to compute . Using , we can rewrite this product in a form which clearly shows it to be another complex number:
To handle division, we simply note that , so, provided that c and d are not simultaneously zero,
from which it follows that
If then division by is not defined.
Going a bit further, we can introduce the important operation of complex conjugation. Given an arbitrary complex number , we define its complex conjugate to be . Using the identity we derive the important formula
and we define the modulus of a complex number z to be
Note that the modulus of a complex number is always a nonnegative real number. The modulus (also called absolute value) satisfies three important properties that are completely analogous to the properties of the absolute value of real numbers
- ; furthermore, if and only if
The last inequality is known as the triangle inequality.
The complex exponential
Recall that in real analysis, the ordinary exponential function may be defined as
The same series may be used to define the complex exponential function
(where, of course, convergence is defined in terms of the complex modulus, instead of the real absolute value).
The complex exponential has the same multiplicative property that holds for real numbers, namely
The complex exponential function has the important property that
as may be seen immediately by substituting and comparing terms with the usual power series expansions of and .
The familiar trigonometric identity
immediately implies the important formula
- , for any
Another way to establish this identity is to note that , so
Geometric interpretation
Since a complex number is specified by two real numbers, namely and , it can be interpreted as the point in the plane. When complex numbers are represented as points in the plane, the resulting diagrams are known as Argand diagrams, after Robert Argand. The geometric representation of complex numbers turns out to be very useful, both as an aid to understanding the properties of complex numbers and as a tool in applying complex numbers to geometrical and physical problems.
There are no real surprises when we look at addition and subtraction in isolation: addition of complex numbers is not essentially different from addition of vectors in . Similarly, if is real, multiplication by is just scalar multiplication. In we have
and
To put it succinctly, is a 2-dimensional real vector space with respect to the usual operations of addition of complex numbers and multiplication by a real number. There doesn't seem to be much more to say. But there is more to say, and that is that the multiplication of complex numbers has geometric significance. This is most easily seen if we take advantage of the complex exponential, and write complex numbers in polar form
Here, r is simply the modulus or vector length. The number is just the angle formed with the -axis, and is called the argument satisfying the condition . Now, when complex numbers are written in polar form, multiplication is very interesting
In other words, multiplication by a complex number has the effect of simultaneously scaling by the number's modulus and rotating by its argument. This is really astounding. For example, to multiply a given complex number by we need only to rotate by (that is, 90 degrees). Translation corresponds to complex addition, scaling to multiplication by a real number, and rotation to multiplication by a complex number of unit modulus. The one type of coordinate transformation that is missing from this list is reflection. On the other hand, there is an arithmetic operation we have not considered, and that is division. Recall that for non-zero
Division of a complex number by a non-zero complex number can then be interpreted as multiplication of by . This in turn corresponds to scaling of the modulus of by the inverse of the modulus of and a rotation of its argument by the negative of the argument of . That is,
where are the arguments of , respectively.
Returning to the representation of complex numbers in rectangular form, we note that complex conjugation is just the transformation (or map) or, in vector notation, . This is nothing other than reflection in the -axis, and any other reflection may be obtained by combining that transformation with rotations and translations.
Historically, this observation was very important and led to the search for higher dimensional algebras that could "arithmetize" Euclidean geometry. It turns out that there are such generalizations in dimensions 4 and 8, known as the quaternions and octonions (also known as Cayley numbers). At that point, the process stops, but the ideas developed in this process have played an important role in the development of modern differential geometry and mathematical physics).
Algebraic closure
An important property of the set of complex numbers is that it is algebraically closed. This means that any non-constant polynomial with complex coefficients has a complex root. This result is known as the Fundamental Theorem of Algebra.
This is actually quite remarkable. We started out with the real numbers. There are many polynomials with real coefficients that do not have a real root. We took just one of these, the polynomial , and we introduced a new number, , which is defined to be a root of the polynomial. Suddenly, all non-constant polynomials have a root in this new setting where we allow complex numbers.
There are many proofs of the Fundamental Theorem of Algebra. Many of the simplest depend crucially on complex analysis. But it is by no means necessary to rely on complex analysis here. A proof using field theory is alluded to at the very end of this article.
Formal definition
We have been treating complex numbers very much like real numbers and found that they can be very useful, but we have not yet proven that they exist or that they can be used without running into contradictions. In fact, it is quite easy to go wrong when using complex numbers. Consider for instance the following computation:
This computation seems to show that equals , which is nonsense. The point is that the second equality can not be applied. Positive real numbers satisfy the identity
but this identity does not hold for negative real numbers, whose square roots are not real, because the square root symbol denotes only the positive solution to .
One possibility to feel more secure when using complex numbers is to define them in terms of constructs which are better understood. This approach was taken by Hamilton, who defined complex numbers as ordered pairs of real numbers, that is,
Addition and multiplication of such pairs can be defined as follows:
- addition:
- multiplication:
The multiplication may look artificial, but it is inspired by the formula
which we derived before.
These definitions satisfy most of the basic properties of addition and multiplication of real numbers, and we can employ many formulas from the elementary algebra we are accustomed to. More specifically, it can easily be shown that addition and multiplication as defined above are commutative and associative, and that multiplication is distributive over addition; in other words, the sum (or the product) of two numbers does not depend on the order of terms;[3] the sum (product) of three or more elements does not depend on order of operations ('we can suppress the parentheses');[4] the product of a complex number with a sum of two other numbers expands in the usual way.[5] In mathematical language this means that with addition and multiplication defined this way, satisfies the axioms for a field and is called the field of complex numbers.
Now we are ready to understand the 'real' meaning of . Observe that the pairs of type (,0) are identical[6] to the set of reals, so we write . Observe also that by definition . In other words, we can define , the symbol we've been using, as the pair (0,1). In this way we have a way of indicating which one we mean of the two solutions of the equation ; the other is now denoted (0,-1).
Another way to define the complex numbers comes from field theory. Because is irreducible in the polynomial ring , the ideal generated by is a maximal ideal.[7] Therefore, the quotient ring is a field. We can choose the polynomials of degree at most 1 as the representatives for the equivalence classes in this quotient ring. So in a sense, we can imagine that the dummy variable is the imaginary number , and the elements of the quotient ring behave exactly the way we expect the complex numbers to behave. For example, is in the same equivalence class as , and so in this quotient ring. (As a final comment in this analysis, we could next show that has no finite extension and must therefore be algebraically closed.)
Notes and references
- ↑ This article follows the usual convention in mathematics and physics of using as the imaginary unit. Complex numbers are frequently used in electrical engineering, but in that discipline it is usual to use instead, reserving for electrical current. This usage is found in some programming languages too, notably Python.
- ↑ Part of the reason for not using is that the symbol (or ) with is sometimes used to denote the set of complex roots of , i.e., the set of the solutions of the equation ( respectively). The set contains 2 (, respectively) "equally important" elements and there is no canonical way to distinguish a "representative". Consequently, no computations are performed using this symbol.
- ↑ that is, the addition (multiplication) is commutative
- ↑ This is called associativity
- ↑ In other words, multiplication is distributive over addition
- ↑ i.e., isomorphic, which basically means that the mapping preserves the addition and multiplication.
- ↑ An ideal in a polynomial ring over a field is maximal if and only if is irreducible over the field.