Main article: Factorization of polynomials
Two problems where the factor theorem is commonly applied are those of factoring a polynomial and finding the roots of a polynomial equation; it is a direct consequence of the theorem that these problems are essentially equivalent.
The factor theorem is also used to remove known zeros from a polynomial while leaving all unknown zeros intact, thus producing a lower degree polynomial whose zeros may be easier to find. Abstractly, the method is as follows:3
Continuing the process until the polynomial f {\displaystyle f} is factored completely, which all its factors is irreducible on R [ x ] {\displaystyle \mathbb {R} [x]} or C [ x ] {\displaystyle \mathbb {C} [x]} .
Find the factors of x 3 + 7 x 2 + 8 x + 2. {\displaystyle x^{3}+7x^{2}+8x+2.}
Solution: Let p ( x ) {\displaystyle p(x)} be the above polynomial
All possible factors of 2 are ± 1 {\displaystyle \pm 1} and ± 2 {\displaystyle \pm 2} . Substituting x = − 1 {\displaystyle x=-1} , we get:
So, ( x − ( − 1 ) ) {\displaystyle (x-(-1))} , i.e, ( x + 1 ) {\displaystyle (x+1)} is a factor of p ( x ) {\displaystyle p(x)} . On dividing p ( x ) {\displaystyle p(x)} by ( x + 1 ) {\displaystyle (x+1)} , we get
Hence, p ( x ) = ( x 2 + 6 x + 2 ) ( x + 1 ) {\displaystyle p(x)=(x^{2}+6x+2)(x+1)}
Out of these, the quadratic factor can be further factored using the quadratic formula, which gives as roots of the quadratic − 3 ± 7 . {\displaystyle -3\pm {\sqrt {7}}.} Thus the three irreducible factors of the original polynomial are x + 1 , {\displaystyle x+1,} x − ( − 3 + 7 ) , {\displaystyle x-(-3+{\sqrt {7}}),} and x − ( − 3 − 7 ) . {\displaystyle x-(-3-{\sqrt {7}}).}
Several proofs of the theorem are presented here.
If x − a {\displaystyle x-a} is a factor of f ( x ) , {\displaystyle f(x),} it is immediate that f ( a ) = 0. {\displaystyle f(a)=0.} So, only the converse will be proved in the following.
This proof begins by verifying the statement for a = 0 {\displaystyle a=0} . That is, it will show that for any polynomial f ( x ) {\displaystyle f(x)} for which f ( 0 ) = 0 {\displaystyle f(0)=0} , there exists a polynomial g ( x ) {\displaystyle g(x)} such that f ( x ) = x ⋅ g ( x ) {\displaystyle f(x)=x\cdot g(x)} . To that end, write f ( x ) {\displaystyle f(x)} explicitly as c 0 + c 1 x 1 + … + c n x n {\displaystyle c_{0}+c_{1}x^{1}+\dotsc +c_{n}x^{n}} . Now observe that 0 = f ( 0 ) = c 0 {\displaystyle 0=f(0)=c_{0}} , so c 0 = 0 {\displaystyle c_{0}=0} . Thus, f ( x ) = x ( c 1 + c 2 x 1 + … + c n x n − 1 ) = x ⋅ g ( x ) {\displaystyle f(x)=x(c_{1}+c_{2}x^{1}+\dotsc +c_{n}x^{n-1})=x\cdot g(x)} . This case is now proven.
What remains is to prove the theorem for general a {\displaystyle a} by reducing to the a = 0 {\displaystyle a=0} case. To that end, observe that f ( x + a ) {\displaystyle f(x+a)} is a polynomial with a root at x = 0 {\displaystyle x=0} . By what has been shown above, it follows that f ( x + a ) = x ⋅ g ( x ) {\displaystyle f(x+a)=x\cdot g(x)} for some polynomial g ( x ) {\displaystyle g(x)} . Finally, f ( x ) = f ( ( x − a ) + a ) = ( x − a ) ⋅ g ( x − a ) {\displaystyle f(x)=f((x-a)+a)=(x-a)\cdot g(x-a)} .
First, observe that whenever x {\displaystyle x} and y {\displaystyle y} belong to any commutative ring (the same one) then the identity x n − y n = ( x − y ) ( y n − 1 + x 1 y n − 2 + … + x n − 2 y 1 + x n − 1 ) {\displaystyle x^{n}-y^{n}=(x-y)(y^{n-1}+x^{1}y^{n-2}+\dotsc +x^{n-2}y^{1}+x^{n-1})} is true. This is shown by multiplying out the brackets.
Let f ( X ) ∈ R [ X ] {\displaystyle f(X)\in R\left[X\right]} where R {\displaystyle R} is any commutative ring. Write f ( X ) = ∑ i c i X i {\displaystyle f(X)=\sum _{i}c_{i}X^{i}} for a sequence of coefficients ( c i ) i {\displaystyle (c_{i})_{i}} . Assume f ( a ) = 0 {\displaystyle f(a)=0} for some a ∈ R {\displaystyle a\in R} . Observe then that f ( X ) = f ( X ) − f ( a ) = ∑ i c i ( X i − a i ) {\displaystyle f(X)=f(X)-f(a)=\sum _{i}c_{i}(X^{i}-a^{i})} . Observe that each summand has X − a {\displaystyle X-a} as a factor by the factorisation of expressions of the form x n − y n {\displaystyle x^{n}-y^{n}} that was discussed above. Thus, conclude that X − a {\displaystyle X-a} is a factor of f ( X ) {\displaystyle f(X)} .
The theorem may be proved using Euclidean division of polynomials: Perform a Euclidean division of f ( x ) {\displaystyle f(x)} by ( x − a ) {\displaystyle (x-a)} to obtain f ( x ) = ( x − a ) Q ( x ) + R ( x ) {\displaystyle f(x)=(x-a)Q(x)+R(x)} where deg ( R ) < deg ( x − a ) {\displaystyle \deg(R)<\deg(x-a)} . Since deg ( R ) < deg ( x − a ) {\displaystyle \deg(R)<\deg(x-a)} , it follows that R {\displaystyle R} is constant. Finally, observe that 0 = f ( a ) = R {\displaystyle 0=f(a)=R} . So f ( x ) = ( x − a ) Q ( x ) {\displaystyle f(x)=(x-a)Q(x)} .
The Euclidean division above is possible in every commutative ring since ( x − a ) {\displaystyle (x-a)} is a monic polynomial, and, therefore, the polynomial long division algorithm does not involve any division of coefficients.
It is also a corollary of the polynomial remainder theorem, but conversely can be used to show it.
When the polynomials are multivariate but the coefficients form an algebraically closed field, the Nullstellensatz is a significant and deep generalisation.
Sullivan, Michael (1996), Algebra and Trigonometry, Prentice Hall, p. 381, ISBN 0-13-370149-2 0-13-370149-2 ↩
Sehgal, V K; Gupta, Sonal (September 2009), Longman ICSE Mathematics Class 10, Dorling Kindersley (India), p. 119, ISBN 978-81-317-2816-1. 978-81-317-2816-1 ↩
Bansal, R. K., Comprehensive Mathematics IX, Laxmi Publications, p. 142, ISBN 81-7008-629-9. 81-7008-629-9 ↩