Polynomial: Difference between revisions

From Citizendium
Jump to navigation Jump to search
imported>Richard Pinch
(→‎Polynomials of one variable: definition of leading term, anchor for monic polynomial)
imported>Barry R. Smith
(moving to advanced subpage)
Line 16: Line 16:
The degree is an important identifier when working with polynomials.  For instance, many procedures for [[factor]]ing or solving [[polynomial equation]]s require identifying the degree of the polynomial first.  In the last example above, we had to scan through the polynomial from the left all the way through the right to determine that the degree is ''5''.  To facilitate identifying the degree of a polynomial, as well as manipulations of polynomials, they are usually written in '''standard form'''.  The standard form of a polynomial is obtained by [[combining like terms|combining terms]] of the same degree, and then writing the monomials so that the exponents decrease from left to right.  The degree of a polynomial in standard form is the degree of the first monomial appearing.  The term of highest degree is the '''leading term''' and its coefficient is the '''leading coefficient'''.  A polynomial with leading coefficient equal to 1 is '''monic'''.  We can put the last example above in standard form by rearranging the monomials to obtain <math>x^5-3x^4+x^2</math>.  It is of course just as easy to work with polynomials where the monomials are written so that the degrees increase from left to right.
The degree is an important identifier when working with polynomials.  For instance, many procedures for [[factor]]ing or solving [[polynomial equation]]s require identifying the degree of the polynomial first.  In the last example above, we had to scan through the polynomial from the left all the way through the right to determine that the degree is ''5''.  To facilitate identifying the degree of a polynomial, as well as manipulations of polynomials, they are usually written in '''standard form'''.  The standard form of a polynomial is obtained by [[combining like terms|combining terms]] of the same degree, and then writing the monomials so that the exponents decrease from left to right.  The degree of a polynomial in standard form is the degree of the first monomial appearing.  The term of highest degree is the '''leading term''' and its coefficient is the '''leading coefficient'''.  A polynomial with leading coefficient equal to 1 is '''monic'''.  We can put the last example above in standard form by rearranging the monomials to obtain <math>x^5-3x^4+x^2</math>.  It is of course just as easy to work with polynomials where the monomials are written so that the degrees increase from left to right.


=== Definition ===
Let us consider some expressions like <math>X^2-2X+1</math>, <math>\frac{1}{2}X^3+X-\sqrt{2}</math>, or <math>2X^5-3X^2+1</math>. We can write all of them as follows:
: <math>X^2-2X+1=1+(-2)X+1X^2+0X^3+0X^4+\cdots,</math>
: <math>\frac{1}{2}X^3+X-\sqrt{2}=-\sqrt{2}+1X+0X^2+\frac{1}{2}X^3+0X^4+\cdots,</math>
: <math>2X^5-3X^2+1=1+0X+(-3)X^2+0X^3+0X^4+2X^5+0X^6+\cdots.</math>
This suggests that a polynomial can be entirely defined by giving a sequence of numbers, which are called its ''coefficients'', all of them being zero from some rank. For instance the three polynomials above can be written respectively <math>(1,-2,1,0,0,\cdots)</math>, <math>\left(-\sqrt{2},1,0,\frac{1}{2},0,\cdots\right)</math>, and <math>(1,0,-3,0,0,2,0,\cdots)</math>, the dots meaning the sequence continues with an infinity of zeros. This leads to the definition below.
'''Definition.''' A ''polynomial'' <math>P</math>, over the ring <math>R</math> is a sequence <math>P=\left(a_0,a_1,a_2,\cdots,a_n,\cdots\right)</math> of elements of <math>R</math>, called the ''coefficients'' of <math>P</math>, this sequence containing only a finite number of nonzero terms. The rank of the last nonzero term is called the ''degree'' of the polynomial.
Hence, the degrees of the three polynomials given above are respectively 2, 3 and 5. By convention, the degree of <math>(0,0,\cdots)</math> is set to <math>-\infty</math>.
This definition may surprise the reader, because in reality, one thinks of a polynomial as an expression of the form <math>a_0+a_1X+a_2X^2+\cdots+a_nX^n</math> rather than <math>\left(a_0,a_1,a_2,\cdots ,a_n,\cdots\right)</math>. We will progressively show how to return to this usual way of writing a polynomial. First, we identify any element <math>a_0</math> of the ring to the polynomial <math>\left(a_0,0,0,\cdots\right)</math>. For instance, we write only <math>7</math> instead of the cumbersome <math>\left(7,0,0,\cdots\right)</math>, (or in the familiar fashion <math>7+0X+0X^2+\cdots</math>).
Secondly, we merely denote by <math>X</math> the polynomial
<center><math>X=\left(0,1,0,0,\cdots\right)</math>.</center>
This is natural, as in the familiar fashion this sequence corresponds to <math>0+1X+0X^2+0X^3+\cdots</math> It remains to give a sense to <math>X^2</math>, <math>X^3</math>, etc. This will be made in the next two subsections.
=== Calculation rules ===
We now define addition and multiplication of polynomials, beginning with addition, which is easy.
==== Addition ====
With the traditional notation, if we have <math>P=2X^5-3X^2+1</math> and <math>Q=-X^5+4X^4+2X^2-1</math>, we want to have <math>P+Q=(2-1)X^5+4X^4+(-3+2)X^2+1-1=X^5+4X^4-X^2</math>, that is, one wants to add coefficients separately for each degree. This leads to the formal definition below.
'''Definition.''' Given two polynomials <math>P=\left(a_0,a_1,a_2,\dots\right)</math> and <math>Q=\left(b_0,b_1,b_2,\dots\right)</math>, the sum <math>P+Q</math> is defined by <math>P+Q=\left(a_0+b_0,a_1+b_1,a_2+b_2,\dots\right)</math>.
==== Multiplication ====
Multiplication is harder to define. Let us begin with an example using traditional notation. For <math>P=X^2+X-2</math> and <math>Q=2X^2-3X+1</math>, we want to have
<center><math>PQ=X^2\left(2X^2-3X+1\right)+X\left(2X^2-3X+1\right)-2\left(2X^2-3X+1\right)</math>;</center>
<center><math>PQ=2X^4+(-3+2)X^3+(1-3-2\cdot 2)X^2+(1-2\cdot (-3))X-2</math>;</center>
<center><math>PQ=2X^4-X^3-6X^2+7X-2 \ </math>.</center>
One can observe that the coefficient of say, <math>X^2</math>, is obtained by adding <math>1\cdot 1</math>, <math>1\cdot (-3)</math> and <math>-2\cdot 2</math>, that is, by adding all the <math>a_ib_j</math> so that <math>i+j=2</math>, where the <math>a_i</math> denote the coefficients of <math>P</math> and the <math>b_j</math> those of <math>Q</math>. Those mechanics lead to give the definition below.
'''Definition.''' Given two polynomials <math>P=\left(a_0,a_1,a_2,\cdots\right)</math> and <math>Q=\left(b_0,b_1,b_2,\cdots\right)</math>, the product <math>PQ</math> is defined by <math>PQ=\left(c_0,c_1,c_2,\cdots\right)</math>, where for every index <math>k</math>, the coefficient <math>c_k</math> is given by <math>c_k=\sum_{i+j=k}a_ib_j</math>.
The reader which is upset by those cumbersome notations should just retain that this definition allows to multiply polynomials (considered as mere sequences of coefficients) as one is used to do in elementary algebra (using the traditional notation, as in the example). The only striking fact is that in our construction, <math>X</math> does not represent a number, but a pure abstract entity for which we have defined some rules of calculation.
==== The algebra <math>R[X]</math> ====
With the definition above, one can verify that the product of the polynomial <math>X=\left(0,1,0,0,\dots\right)</math> by itself, that is <math>X^2</math>, is the sequence <math>X^2=\left(0,0,1,0,0,\dots\right)</math>. More generally, for each [[natural number]] <math>n</math>, one can verify that the <math>n</math>-th power of <math>X</math> is given by
<math>X^n=\left(0,\dots,0,1,0,0,\dots\right)</math>, where the <math>1</math> is the coefficient of index <math>n</math> and all other coefficients are zeros. In particular, we have the usual convention <math>X^0=\left(1,0,0,\dots\right)</math>, which we identified to the constant <math>1</math>.
Now, any polynomial <math>P=\left(a_0,a_1,a_2,\dots,a_n,0,0,\dots\right)</math> is ''exactly'' equal to <math>a_0+a_1X+a_2X^2+\cdots+a_nX^n</math>, where the addition and the powers (which are mere repetitions of multiplications) are defined as in the preceding subsections. Our whole construction legitimates the traditional notation, and from now on, we will only use the later, with which calculations use natural rules of elementary algebra. It is however important to remember that the "variable" <math>X</math> did not denote some number in our construction, but a particular sequence of coefficients. We have succeeded in defining polynomials in a purely formal manner.
==== Operations and degree: the algebra <math>R_n[X]</math> ====


=== Polynomial function ===
=== Polynomial function ===

Revision as of 13:37, 22 December 2008

This article is developing and not approved.
Main Article
Discussion
Related Articles  [?]
Bibliography  [?]
External Links  [?]
Citable Version  [?]
Advanced [?]
 
This editable Main Article is under development and subject to a disclaimer.

In algebra, a polynomial is, roughly speaking, a formal expression obtained from constant numbers and one or several unspecified numbers called "variables", denoted by letters like , , etc., by making a finite number of additions, subtractions and multiplications. For instance, is a polynomial of one variable, , whereas is a polynomial of two variables, and . Expressions like or are not polynomials ; the first one is a rational function, and the second one is an irrational expression, due to the square root symbol. Such operations might be expressed within the constant numbers, as in the example , but this is only because and are elements of the set (e.g. real numbers) that are being used as coefficients of the polynomials.

It may be convenient to think of a polynomial as a function of its variables, that is, or . Such a function is called a polynomial function. But in reality, both concepts are different, the unspecified variables being purely formal entities when one thinks of an abstract polynomial, whereas they are meant to be replaced by any number when one thinks of a function. The distinction is important in abstract algebra, because what we have called "constant numbers" is more generally replaced by any ring, and for some rings the two concepts cannot be identified. There is not such a problem with polynomials over rings of usual numbers like integers, rational, real or complex numbers. Still it is important to understand that calculations with polynomials can be conceived in an only formal way, without giving any special ontological status to the variables. To make the distinction clear, it is common in algebra to denote the abstract variables with capital letters (, , etc.), while variables of functions are still denoted with lowercase letters. We will use this convention in what follows.

Polynomials of one variable

In this section we deal with the simplest case, that is, polynomials of only one variable, denoted . A polynomial can be written as a finite sum of terms, called monomials. Each monomial is either a constant, or a constant times a positive whole number power of x. For instance, 1, , and are monomials, and their sum, is a polynomial.

A coefficient equal to 1 in front of a positive power of x is typically dropped from the notation, so that represents the same polynomial as . It is sometimes useful to explicitly write a power of x in each monomial, even the constants. To accomplish this, you can write after the constant, so that 2 and are considered the same.

Degree

The power of the variable appearing in a monomial is the degree of the monomial. By the above convention, a constant c is the same as and has degree equal to 0. The degree of a polynomial is the largest of the degrees of the monomials appearing in the polynomial. The only exception is the constant polynomial 0, which typically is not assigned a degree (for reasons made clear below). As an example, 2 has degree 0, has degree 2, and has degree 5.

The degree is an important identifier when working with polynomials. For instance, many procedures for factoring or solving polynomial equations require identifying the degree of the polynomial first. In the last example above, we had to scan through the polynomial from the left all the way through the right to determine that the degree is 5. To facilitate identifying the degree of a polynomial, as well as manipulations of polynomials, they are usually written in standard form. The standard form of a polynomial is obtained by combining terms of the same degree, and then writing the monomials so that the exponents decrease from left to right. The degree of a polynomial in standard form is the degree of the first monomial appearing. The term of highest degree is the leading term and its coefficient is the leading coefficient. A polynomial with leading coefficient equal to 1 is monic. We can put the last example above in standard form by rearranging the monomials to obtain . It is of course just as easy to work with polynomials where the monomials are written so that the degrees increase from left to right.


Polynomial function

Arithmetics

Polynomials of several variables

Applications of polynomials