Bernstein polynomial

<h2 id="definition">Definition</h2>
<h3>Bernstein basis polynomials</h3>
The \(n+1\) Bernstein basis polynomials of degree \(n\) are defined as

\[ b_{\nu,n}(x) \equiv \binom{n}{\nu} x^{\nu} (1-x)^{n-\nu}, \quad \text{for } \nu = 0, \ldots, n, \]

where \(\tbinom{n}{\nu}\) is a <a href="/facts/Binomial_coefficient/Tsx7eY5h">binomial coefficient</a>.
So, for example, \(b_{2,5}(x) = \tbinom{5}{2} x^{2}(1-x)^{3} = 10x^{2}(1-x)^{3}\).
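The definition translates directly into code. The following is a minimal sketch, not a reference implementation; the function name `bernstein_basis` is chosen here for illustration, and returning zero for indices outside 0..n is an assumed convention.

```python
from math import comb

def bernstein_basis(nu: int, n: int, x: float) -> float:
    """Evaluate b_{nu,n}(x) = C(n, nu) * x**nu * (1 - x)**(n - nu)."""
    if nu < 0 or nu > n:
        return 0.0  # assumed convention: zero outside 0..n
    return comb(n, nu) * x**nu * (1 - x) ** (n - nu)

# b_{2,5}(x) = 10 x^2 (1 - x)^3, evaluated at x = 0.5
value = bernstein_basis(2, 5, 0.5)
print(value)  # 10 * 0.25 * 0.125 = 0.3125
```

Direct evaluation like this is adequate for small degrees; for large \(n\) the binomial coefficient becomes very large while the powers become very small, so a numerically careful implementation might prefer a recurrence.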

The first few Bernstein basis polynomials for blending 1, 2, 3 or 4 values together are:

\[
\begin{aligned}
b_{0,0}(x) &= 1, \\
b_{0,1}(x) &= 1-x, & b_{1,1}(x) &= x, \\
b_{0,2}(x) &= (1-x)^{2}, & b_{1,2}(x) &= 2x(1-x), & b_{2,2}(x) &= x^{2}, \\
b_{0,3}(x) &= (1-x)^{3}, & b_{1,3}(x) &= 3x(1-x)^{2}, & b_{2,3}(x) &= 3x^{2}(1-x), & b_{3,3}(x) &= x^{3}.
\end{aligned}
\]

The Bernstein basis polynomials of degree \(n\) form a <a href="/facts/Basis_(linear_algebra)/89IPoN6c">basis</a> for the <a href="/facts/Vector_space/M1pxMLx2">vector space</a> \(\Pi_{n}\) of polynomials of degree at most \(n\) with real coefficients.

<h3>Bernstein polynomials</h3>
A linear combination of Bernstein basis polynomials

\[ B_{n}(x) \equiv \sum_{\nu=0}^{n} \beta_{\nu}\, b_{\nu,n}(x) \]

is called a Bernstein polynomial or polynomial in Bernstein form of degree \(n\).<a class="footnote-ref" id="fnref:1" href="#fn:1">1</a> The coefficients \(\beta_{\nu}\) are called Bernstein coefficients or Bézier coefficients.
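As an illustration, a polynomial in Bernstein form can be evaluated by summing the weighted basis polynomials term by term. This is a sketch under stated assumptions: the function name `bernstein_polynomial` and the sample coefficients are hypothetical, and the degree is inferred from the length of the coefficient list.

```python
from math import comb

def bernstein_polynomial(beta, x):
    """Evaluate B_n(x) = sum over nu of beta[nu] * b_{nu,n}(x), with n = len(beta) - 1."""
    n = len(beta) - 1
    return sum(
        beta[nu] * comb(n, nu) * x**nu * (1 - x) ** (n - nu)
        for nu in range(n + 1)
    )

# Hypothetical coefficients: 0.5 * 2x(1 - x) + 1.0 * x^2 simplifies to x.
beta = [0.0, 0.5, 1.0]
print(bernstein_polynomial(beta, 0.3))
```

With equal coefficients the sum collapses to that common value, since the basis polynomials of a fixed degree add up to 1.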
The first few Bernstein basis polynomials from above in <a href="/facts/Monomial/6VAPCYXJ">monomial</a> form are:

\[
\begin{aligned}
b_{0,0}(x) &= 1, \\
b_{0,1}(x) &= 1 - 1x, & b_{1,1}(x) &= 0 + 1x, \\
b_{0,2}(x) &= 1 - 2x + 1x^{2}, & b_{1,2}(x) &= 0 + 2x - 2x^{2}, & b_{2,2}(x) &= 0 + 0x + 1x^{2}, \\
b_{0,3}(x) &= 1 - 3x + 3x^{2} - 1x^{3}, & b_{1,3}(x) &= 0 + 3x - 6x^{2} + 3x^{3}, & b_{2,3}(x) &= 0 + 0x + 3x^{2} - 3x^{3}, & b_{3,3}(x) &= 0 + 0x + 0x^{2} + 1x^{3}.
\end{aligned}
\]
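The monomial coefficients listed above follow from expanding \((1-x)^{n-\nu}\) with the binomial theorem, which gives the coefficient \(\binom{n}{\ell}\binom{\ell}{\nu}(-1)^{\ell-\nu}\) for \(x^{\ell}\) when \(\ell \geq \nu\) and zero otherwise. The sketch below reproduces the degree-3 row; the function name `monomial_coefficients` is an illustrative choice.

```python
from math import comb

def monomial_coefficients(nu: int, n: int) -> list[int]:
    """Return [c_0, ..., c_n] such that b_{nu,n}(x) = sum over l of c_l * x**l."""
    return [
        comb(n, l) * comb(l, nu) * (-1) ** (l - nu) if l >= nu else 0
        for l in range(n + 1)
    ]

for nu in range(4):
    print(nu, monomial_coefficients(nu, 3))
# 0 [1, -3, 3, -1]
# 1 [0, 3, -6, 3]
# 2 [0, 0, 3, -3]
# 3 [0, 0, 0, 1]
```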

<h2 id="properties">Properties</h2>
The Bernstein basis polynomials have the following properties:

<ul><li>\(b_{\nu,n}(x) \equiv 0\) if \(\nu < 0\) or if \(\nu > n\).
</li>
<li>\(b_{\nu,n}(x) \geq 0\) for \(x \in [0, 1]\).
</li>
<li>\(b_{\nu,n}(1-x) = b_{n-\nu,n}(x)\).
</li>
<li>\(b_{\nu,n}(0) = \delta_{\nu,0}\) and \(b_{\nu,n}(1) = \delta_{\nu,n}\), where \(\delta_{i,j}\) is the <a href="/facts/Kronecker_delta/gcGAy0bm">Kronecker delta</a> function: \(\delta_{i,j} \equiv \begin{cases} 0 & \text{if } i \neq j, \\ 1 & \text{if } i = j. \end{cases}\)
</li>
<li>\(b_{\nu,n}(x)\) has a root of multiplicity \(\nu\) at the point \(x = 0\) (note: when \(\nu = 0\), there is no root at 0).</li>
<li>\(b_{\nu,n}(x)\) has a root of multiplicity \(n - \nu\) at the point \(x = 1\) (note: when \(\nu = n\), there is no root at 1).</li>
<li>The <a href="/facts/Derivative/7Ito70Dh">derivative</a> can be written as a combination of two polynomials of lower degree: \(b_{\nu,n}'(x) = n\bigl[b_{\nu-1,n-1}(x) - b_{\nu,n-1}(x)\bigr]\).
</li>
<li>The \(k\)-th derivative at 0: \(b_{\nu,n}^{(k)}(0) = \frac{n!}{(n-k)!} \binom{k}{\nu} (-1)^{\nu+k}\).
</li>
<li>The \(k\)-th derivative at 1: \(b_{\nu,n}^{(k)}(1) = (-1)^{k}\, b_{n-\nu,n}^{(k)}(0)\).
</li>
<li>The transformation of a Bernstein basis polynomial to monomials is \[ b_{\nu,n}(x) = \binom{n}{\nu} \sum_{k=0}^{n-\nu} \binom{n-\nu}{k} (-1)^{k} x^{\nu+k} = \sum_{\ell=\nu}^{n} \binom{n}{\ell} \binom{\ell}{\nu} (-1)^{\ell-\nu} x^{\ell}, \] and by the <a href="/facts/Binomial_transform/Wu345jgq">inverse binomial transformation</a>, the reverse transformation is<a class="footnote-ref" id="fnref:2" href="#fn:2">2</a> \[ x^{k} = \sum_{i=0}^{n-k} \frac{\binom{n-k}{i}}{\binom{n}{i}}\, b_{n-i,n}(x) = \frac{1}{\binom{n}{k}} \sum_{j=k}^{n} \binom{j}{k}\, b_{j,n}(x). \]
</li>
<li>The indefinite <a href="/facts/Integral/4PeM0mJc">integral</a> is given by \(\int b_{\nu,n}(x)\,dx = \frac{1}{n+1} \sum_{j=\nu+1}^{n+1} b_{j,n+1}(x)\).
</li>
<li>The definite integral is constant for a given \(n\): \(\int_{0}^{1} b_{\nu,n}(x)\,dx = \frac{1}{n+1}\) for all \(\nu = 0, 1, \ldots, n\).
</li>
<li>If \(n \neq 0\), then \(b_{\nu,n}(x)\) has a unique local maximum on the interval \([0, 1]\) at \(x = \frac{\nu}{n}\). This maximum takes the value \(\nu^{\nu} n^{-n} (n-\nu)^{n-\nu} \binom{n}{\nu}\).
</li>
<li>The Bernstein basis polynomials of degree \(n\) form a <a href="/facts/Partition_of_unity/4byX17aR">partition of unity</a>: \[ \sum_{\nu=0}^{n} b_{\nu,n}(x) = \sum_{\nu=0}^{n} \binom{n}{\nu} x^{\nu} (1-x)^{n-\nu} = \bigl(x + (1-x)\bigr)^{n} = 1. \]
</li>
<li>By taking the first \(x\)-derivative of \((x+y)^{n}\), treating \(y\) as constant, then substituting the value \(y = 1-x\), it can be shown that \(\sum_{\nu=0}^{n} \nu\, b_{\nu,n}(x) = nx\).
</li>
<li>Similarly, taking the second \(x\)-derivative of \((x+y)^{n}\) and then again substituting \(y = 1-x\) shows that \(\sum_{\nu=1}^{n} \nu(\nu-1)\, b_{\nu,n}(x) = n(n-1)\, x^{2}\).
</li>
<li>A Bernstein polynomial can always be written as a linear combination of polynomials of higher degree: \[ b_{\nu,n-1}(x) = \left(\frac{n-\nu}{n}\right) b_{\nu,n}(x) + \left(\frac{\nu+1}{n}\right) b_{\nu+1,n}(x). \]
</li>
<li>The expansion of the <a href="/facts/Chebyshev_polynomials/HoBINqgC">Chebyshev Polynomials of the First Kind</a> into the Bernstein basis is<a class="footnote-ref" id="fnref:3" href="#fn:3">3</a> \[ T_{n}(u) = (2n-1)!! \sum_{k=0}^{n} \frac{(-1)^{n-k}}{(2k-1)!!\,(2n-2k-1)!!}\, b_{k,n}(u). \]
</li></ul>
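Several of the identities in the list above can be spot-checked with exact rational arithmetic. The sketch below is only a sanity check at one arbitrarily chosen degree and point (both hypothetical choices), not a proof; the helper name `b` is illustrative.

```python
from fractions import Fraction
from math import comb

def b(nu, n, x):
    """Bernstein basis polynomial b_{nu,n}(x), zero for nu outside 0..n."""
    if nu < 0 or nu > n:
        return Fraction(0)
    return comb(n, nu) * x**nu * (1 - x) ** (n - nu)

n, x = 6, Fraction(2, 7)  # arbitrary degree and evaluation point
basis = [b(nu, n, x) for nu in range(n + 1)]

# Partition of unity: the basis polynomials sum to 1.
partition_of_unity = sum(basis) == 1
# First and second moment identities.
first_moment = sum(nu * basis[nu] for nu in range(n + 1)) == n * x
second_moment = (
    sum(nu * (nu - 1) * basis[nu] for nu in range(n + 1)) == n * (n - 1) * x**2
)
# Symmetry: b_{nu,n}(1 - x) = b_{n-nu,n}(x).
symmetry = all(b(nu, n, 1 - x) == b(n - nu, n, x) for nu in range(n + 1))
# Degree elevation: b_{nu,n-1} as a combination of b_{nu,n} and b_{nu+1,n}.
degree_elevation = all(
    b(nu, n - 1, x)
    == Fraction(n - nu, n) * b(nu, n, x) + Fraction(nu + 1, n) * b(nu + 1, n, x)
    for nu in range(n)
)

print(partition_of_unity, first_moment, second_moment, symmetry, degree_elevation)
# True True True True True
```

Using `Fraction` avoids floating-point rounding, so each comparison tests the identity exactly at the chosen point.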
<h2 id="approximating-continuous-functions">Approximating continuous functions</h2>
Let ƒ be a <a href="/facts/Continuous_function/sKbl02pB">continuous function</a> on the interval [0, 1]. Consider the Bernstein polynomial

\[ B_{n}(f)(x) = \sum_{\nu=0}^{n} f\left(\frac{\nu}{n}\right) b_{\nu,n}(x). \]

It can be shown that

\[ \lim_{n\to\infty} B_{n}(f) = f \]

<a href="/facts/Uniform_convergence/ecz9eNWn">uniformly</a> on the interval [0, 1].<a class="footnote-ref" id="fnref:4" href="#fn:4">4</a><a class="footnote-ref" id="fnref:5" href="#fn:5">5</a><a class="footnote-ref" id="fnref:6" href="#fn:6">6</a><a class="footnote-ref" id="fnref:7" href="#fn:7">7</a>
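The convergence can be observed numerically. The following sketch evaluates \(B_{n}(f)\) on a grid and reports the largest deviation from \(f\); the test function \(\sin(\pi x)\), the grid size, and the chosen degrees are all assumptions made for illustration, and a grid maximum only approximates the true supremum.

```python
from math import comb, sin, pi

def bernstein_approx(f, n, x):
    """B_n(f)(x) = sum over nu of f(nu/n) * b_{nu,n}(x)."""
    return sum(
        f(nu / n) * comb(n, nu) * x**nu * (1 - x) ** (n - nu)
        for nu in range(n + 1)
    )

def max_error(f, n, samples=201):
    """Largest |f(x) - B_n(f)(x)| over an evenly spaced grid on [0, 1]."""
    grid = [i / (samples - 1) for i in range(samples)]
    return max(abs(f(x) - bernstein_approx(f, n, x)) for x in grid)

def f(x):
    return sin(pi * x)  # hypothetical test function

errors = {n: max_error(f, n) for n in (5, 20, 80)}
for n, err in errors.items():
    print(n, err)
```

For this smooth test function the printed errors shrink roughly in proportion to \(1/n\), which illustrates that the convergence is uniform but slow.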
Bernstein polynomials thus provide one way to prove the <a href="/facts/Stone%25E2%2580%2593Weierstrass_theorem/QB07Nwba">Weierstrass approximation theorem</a> that every real-valued continuous function on a real interval [a, b] can be uniformly approximated by polynomial functions over \(\mathbb{R}\).<a class="footnote-ref" id="fnref:8" href="#fn:8">8</a>
A more general statement for a function with continuous kth derivative is

\[ \left\| B_{n}(f)^{(k)} \right\|_{\infty} \leq \frac{(n)_{k}}{n^{k}} \left\| f^{(k)} \right\|_{\infty} \quad \text{and} \quad \left\| f^{(k)} - B_{n}(f)^{(k)} \right\|_{\infty} \to 0, \]

where additionally

\[ \frac{(n)_{k}}{n^{k}} = \left(1 - \frac{0}{n}\right)\left(1 - \frac{1}{n}\right) \cdots \left(1 - \frac{k-1}{n}\right) \]

is an <a href="/facts/Eigenvalue/8TjEoT8u">eigenvalue</a> of \(B_{n}\); the corresponding eigenfunction is a polynomial of degree \(k\).
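As a worked case, take \(k = 2\) and \(n \geq 2\). Using the identities \(\sum_{\nu} \nu\, b_{\nu,n}(x) = nx\) and \(\sum_{\nu} \nu(\nu-1)\, b_{\nu,n}(x) = n(n-1)x^{2}\) from the properties above, and writing \(t\) for the argument of the function being approximated (a notational choice made here), one finds

\[
\begin{aligned}
B_{n}(t)(x) &= \frac{1}{n} \sum_{\nu=0}^{n} \nu\, b_{\nu,n}(x) = x, \\
B_{n}(t^{2})(x) &= \frac{1}{n^{2}} \sum_{\nu=0}^{n} \bigl[\nu(\nu-1) + \nu\bigr] b_{\nu,n}(x) = \left(1 - \frac{1}{n}\right) x^{2} + \frac{x}{n}, \\
B_{n}(t^{2} - t)(x) &= \left(1 - \frac{1}{n}\right) \left(x^{2} - x\right).
\end{aligned}
\]

So \(x^{2} - x\) is an eigenfunction of degree 2 with eigenvalue \(1 - \tfrac{1}{n} = \tfrac{(n)_{2}}{n^{2}}\), in agreement with the product formula for \(k = 2\).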

<h3>Probabilistic proof</h3>
This proof follows Bernstein's original proof of 1912.<a class="footnote-ref" id="fnref:9" href="#fn:9">9</a> See also Feller (1966) or Koralov & Sinai (2007).<a class="footnote-ref" id="fnref:10" href="#fn:10">10</a><a class="footnote-ref" id="fnref:11" href="#fn:11">11</a>

<h4>Motivation</h4>
We will first give intuition for Bernstein's original proof. A continuous function on a compact interval must be uniformly continuous. Thus, the value of any continuous function can be uniformly approximated by its value on some finite net of points in the interval. This consideration renders the approximation theorem intuitive, given that polynomials should be flexible enough to match (or nearly match) a finite number of pairs \((x, f(x))\). To do so, we might (1) construct a function close to \(f\) on a lattice, and then (2) smooth out the function outside the lattice to make a polynomial.
The probabilistic proof below simply provides a constructive method to create a polynomial which is approximately equal to \(f\) on such a point lattice, given that "smoothing out" a function is not always trivial. Taking the expectation of a random variable with a simple distribution is a common way to smooth. Here, we take advantage of the fact that Bernstein polynomials look like binomial expectations. We split the interval into a lattice of \(n+1\) equally spaced points \(0, \tfrac{1}{n}, \ldots, 1\). Then, to evaluate any \(f(x)\), we evaluate \(f\) at one of these lattice points close to \(x\), randomly chosen by the binomial distribution. The expectation of this approximation technique is a polynomial in \(x\), as it is the expectation of a function of a binomial random variable. The proof below illustrates that this achieves a uniform approximation of \(f\). The crux of the proof is to (1) justify replacing an arbitrary point with a binomially chosen lattice point by concentration properties of a binomial distribution, and (2) justify the inference from \(x \approx X\) to \(f(x) \approx f(X)\) by uniform continuity.

<h4>Bernstein's proof</h4>
Suppose \(K\) is a <a href="/facts/Random_variable/TwTBXnLT">random variable</a> distributed as the number of successes in \(n\) independent <a href="/facts/Bernoulli_trial/7fm8snCf">Bernoulli trials</a> with probability \(x\) of success on each trial; in other words, \(K\) has a <a href="/facts/Binomial_distribution/UMoFMjDj">binomial distribution</a> with parameters \(n\) and \(x\). Then we have the <a href="/facts/Expected_value/1XV0JKL8">expected value</a> \(\operatorname{\mathcal{E}}\left[\frac{K}{n}\right] = x\) and

\[ p(K) = \binom{n}{K} x^{K} (1-x)^{n-K} = b_{K,n}(x). \]

By the <a href="/facts/Law_of_large_numbers/X3Bjcy3v">weak law of large numbers</a> of <a href="/facts/Probability_theory/mFBn51rE">probability theory</a>,

\[ \lim_{n\to\infty} P\left( \left| \frac{K}{n} - x \right| > \delta \right) = 0 \]

for every δ > 0. Moreover, this relation holds uniformly in x, which can be seen from its proof via <a href="/facts/Chebyshev%2527s_inequality/jf5ReDJv">Chebyshev's inequality</a>, taking into account that the variance of K/n, equal to x(1−x)/n, is bounded from above by 1/(4n) irrespective of x.
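The uniformity in x can be observed numerically. The sketch below computes the exact tail probability P(|K/n − x| > δ) on a coarse grid of x values and compares the worst case with the x-independent Chebyshev bound 1/(4nδ²). The grid, the value δ = 0.1, and the function names are assumptions made for illustration.

```python
from math import comb


def tail_probability(n, x, delta):
    """Exact P(|K/n - x| > delta) for K ~ binomial(n, x)."""
    return sum(
        comb(n, k) * x**k * (1 - x) ** (n - k)
        for k in range(n + 1)
        if abs(k / n - x) > delta
    )


def chebyshev_bound(n, delta):
    """Var(K/n) / delta**2 <= 1 / (4 n delta**2), valid for every x in [0, 1]."""
    return 1.0 / (4 * n * delta**2)


delta = 0.1
for n in (50, 200, 800):
    # Worst tail probability over x = 0, 0.05, ..., 1.
    worst = max(tail_probability(n, i / 20, delta) for i in range(21))
    print(n, worst, chebyshev_bound(n, delta))
```

The Chebyshev bound is crude (the true tails shrink much faster), but it is all the proof needs, since it tends to 0 at a rate that does not depend on x.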
Because ƒ, being continuous on a closed bounded interval, must be <a href="/facts/Uniform_continuity/hjvw7tII">uniformly continuous</a> on that interval, one infers a statement of the form

{\displaystyle \lim _{n\to \infty }{P\left(\left|f\left({\frac {K}{n}}\right)-f\left(x\right)\right|>\varepsilon \right)}=0}

uniformly in x for each {\displaystyle \epsilon >0}. Taking into account that ƒ is bounded (on the given interval) one finds that

{\displaystyle \lim _{n\to \infty }{\operatorname {\mathcal {E}} \left(\left|f\left({\frac {K}{n}}\right)-f\left(x\right)\right|\right)}=0}

uniformly in x. To justify this statement, we use a common method in probability theory for converting closeness in probability into closeness in expectation. One splits the expectation of {\displaystyle \left|f\left({\frac {K}{n}}\right)-f\left(x\right)\right|} into two parts, according to whether or not {\displaystyle \left|f\left({\frac {K}{n}}\right)-f\left(x\right)\right|<\epsilon }. On the event where the difference is less than ε, the contribution to the expectation clearly cannot exceed ε.
On the complementary event, the difference still cannot exceed 2M, where M is an upper bound for |ƒ(x)| (a continuous function on a closed bounded interval is bounded). However, by our 'closeness in probability' statement, for all sufficiently large n this event cannot have probability greater than ε. Thus, this part of the expectation contributes no more than 2M times ε. Then the total expectation is no more than {\displaystyle \epsilon +2M\epsilon }, which can be made arbitrarily small by choosing small ε.
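The splitting argument can be made concrete. The sketch below computes the two parts of the expectation of |f(K/n) − f(x)| separately, together with the probability of the event where the difference is at least ε. The test function f(t) = |t − 1/2| (so M = 1/2) and the values x = 0.3, ε = 0.05 are assumptions chosen for illustration.

```python
from math import comb


def split_expectation(f, n, x, eps):
    """Split E|f(K/n) - f(x)| by whether the difference is below eps.

    Returns (small part, large part, probability of the large-difference event).
    """
    small, large, p_large = 0.0, 0.0, 0.0
    for k in range(n + 1):
        p = comb(n, k) * x**k * (1 - x) ** (n - k)   # p(K = k)
        diff = abs(f(k / n) - f(x))
        if diff < eps:
            small += diff * p
        else:
            large += diff * p
            p_large += p
    return small, large, p_large


def f(t):
    return abs(t - 0.5)          # illustrative function with sup |f| = 0.5


for n in (100, 1000):
    print(n, split_expectation(f, n, 0.3, 0.05))
```

The first part never exceeds ε, and the second never exceeds 2M times the probability of the large-difference event, which is exactly how the bound ε + 2Mε arises once that probability has dropped below ε.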
Finally, one observes that the absolute value of the difference between expectations never exceeds the expectation of the absolute value of the difference, a consequence of Hölder's inequality. Thus, using the above expectation, we see that (uniformly in x)

{\displaystyle \lim _{n\to \infty }{\left|\operatorname {\mathcal {E}} f\left({\frac {K}{n}}\right)-\operatorname {\mathcal {E}} f\left(x\right)\right|}\leq \lim _{n\to \infty }{\operatorname {\mathcal {E}} \left(\left|f\left({\frac {K}{n}}\right)-f\left(x\right)\right|\right)}=0}

Since the randomness is over K while x is fixed, the expectation of f(x) is just f(x). We have therefore shown that {\displaystyle \operatorname {{\mathcal {E}}_{x}} f\left({\frac {K}{n}}\right)} converges to f(x) uniformly in x. The proof is complete once we check that {\displaystyle \operatorname {{\mathcal {E}}_{x}} f\left({\frac {K}{n}}\right)} is a polynomial in x (the subscript reminding us that x controls the distribution of K). Indeed it is:

{\displaystyle \operatorname {{\mathcal {E}}_{x}} \left[f\left({\frac {K}{n}}\right)\right]=\sum _{K=0}^{n}f\left({\frac {K}{n}}\right)p(K)=\sum _{K=0}^{n}f\left({\frac {K}{n}}\right)b_{K,n}(x)=B_{n}(f)(x)}
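The final formula translates directly into code. The following sketch evaluates B_n(f)(x) as the weighted sum above and estimates the uniform error on a grid for increasing n. The test function f(t) = |t − 1/2|, the grid size, and the function names are assumptions for illustration; the direct floating-point sum is only suitable for moderate n (roughly n up to 1000), since the binomial coefficients eventually exceed the floating-point range.

```python
from math import comb


def bernstein_polynomial(f, n, x):
    """B_n(f)(x) = sum_k f(k/n) * b_{k,n}(x)."""
    return sum(
        f(k / n) * comb(n, k) * x**k * (1 - x) ** (n - k)
        for k in range(n + 1)
    )


def sup_error(f, n, grid_size=200):
    """Largest |B_n(f)(x) - f(x)| over an evenly spaced grid on [0, 1]."""
    return max(
        abs(bernstein_polynomial(f, n, i / grid_size) - f(i / grid_size))
        for i in range(grid_size + 1)
    )


def f(t):
    return abs(t - 0.5)          # continuous, but not differentiable at 1/2


for n in (4, 16, 64, 256):
    print(n, sup_error(f, n))
```

The printed errors shrink as n grows, though slowly: convergence of Bernstein polynomials is uniform but not fast.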

<h4>Uniform convergence rates between functions</h4>
In the above proof, recall that convergence in each limit involving f depends on the uniform continuity of f, which implies a rate of convergence dependent on f's <a href="/facts/Modulus_of_continuity/31lThXhW">modulus of continuity</a> {\displaystyle \omega .} It also depends on M, the bound on the absolute value of the function, although this can be bypassed if one bounds {\displaystyle \omega } and the interval size. Thus, the approximation only holds uniformly across x for a fixed f, but one can readily extend the proof to uniformly approximate a set of functions with a set of Bernstein polynomials in the context of <a href="/facts/Equicontinuity/cKjFQWD6">equicontinuity</a>.
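One simple instance of this, offered as an illustration rather than as a statement from the text: if every function in a family is Lipschitz with the same constant L, then the Cauchy-Schwarz inequality and the variance x(1−x)/n of K/n give |B_n(f)(x) − f(x)| ≤ L/(2√n) for every member of the family and every x. The sketch below checks this for a hypothetical family of 1-Lipschitz functions f_c(t) = |t − c|.

```python
from math import comb, sqrt


def bernstein_polynomial(f, n, x):
    """B_n(f)(x) = sum_k f(k/n) * b_{k,n}(x)."""
    return sum(
        f(k / n) * comb(n, k) * x**k * (1 - x) ** (n - k)
        for k in range(n + 1)
    )


def family_sup_error(family, n, grid_size=100):
    """Largest grid error |B_n(f)(x) - f(x)| over all f in the family."""
    return max(
        abs(bernstein_polynomial(f, n, i / grid_size) - f(i / grid_size))
        for f in family
        for i in range(grid_size + 1)
    )


# Hypothetical equicontinuous family: each member is 1-Lipschitz.
family = [lambda t, c=c / 10: abs(t - c) for c in range(11)]

for n in (10, 40, 160):
    # Worst error across the family, next to the bound L / (2 sqrt(n)) with L = 1.
    print(n, family_sup_error(family, n), 1 / (2 * sqrt(n)))
```

A single bound on the modulus of continuity thus controls the whole family at once, which is the sense in which the approximation extends to equicontinuous sets.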

<h3>Elementary proof</h3>
The probabilistic proof can also be rephrased in an elementary way, using the underlying probabilistic ideas but proceeding by direct verification:<a class="footnote-ref" id="fnref:12" href="#fn:12">12</a><a class="footnote-ref" id="fnref:13" href="#fn:13">13</a><a class="footnote-ref" id="fnref:14" href="#fn:14">14</a><a class="footnote-ref" id="fnref:15" href="#fn:15">15</a><a class="footnote-ref" id="fnref:16" href="#fn:16">16</a>
The following identities can be verified:

<ol><li>{\displaystyle \sum _{k}{n \choose k}x^{k}(1-x)^{n-k}=1} ("probability")</li>
<li>{\displaystyle \sum _{k}{k \over n}{n \choose k}x^{k}(1-x)^{n-k}=x} ("mean")</li>
<li>{\displaystyle \sum _{k}\left(x-{k \over n}\right)^{2}{n \choose k}x^{k}(1-x)^{n-k}={x(1-x) \over n}} ("variance")</li></ol>
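The three identities can be checked exactly for particular values with rational arithmetic. This is a spot check, not a proof; the values n = 7, x = 2/5 and the function names are illustrative assumptions.

```python
from fractions import Fraction
from math import comb


def basis(k, n, x):
    """b_{k,n}(x) = C(n, k) * x**k * (1 - x)**(n - k)."""
    return comb(n, k) * x**k * (1 - x) ** (n - k)


def identity_sums(n, x):
    """Left-hand sides of identities (1), (2), (3) for the given n and x."""
    ks = range(n + 1)
    total = sum(basis(k, n, x) for k in ks)
    mean = sum(Fraction(k, n) * basis(k, n, x) for k in ks)
    variance = sum((x - Fraction(k, n)) ** 2 * basis(k, n, x) for k in ks)
    return total, mean, variance


n, x = 7, Fraction(2, 5)
total, mean, variance = identity_sums(n, x)
print(total == 1, mean == x, variance == x * (1 - x) / n)
```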
In fact, by the binomial theorem

{\displaystyle (1+t)^{n}=\sum _{k}{n \choose k}t^{k},}

and the operator {\displaystyle t{\frac {d}{dt}}} can be applied to this equation twice. The identities (1), (2), and (3) then follow easily by substituting {\displaystyle t=x/(1-x)} and multiplying through by {\displaystyle (1-x)^{n}}.
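For readers who want the intermediate steps, one way to carry out this computation is sketched below, writing b_{k,n}(x) for the basis polynomials. The arrangement of the steps is a reconstruction and not taken from the cited sources.

```latex
% Apply t d/dt to the binomial theorem once and twice:
\begin{align*}
t\frac{d}{dt}(1+t)^{n} &= nt(1+t)^{n-1} = \sum_{k} k\binom{n}{k}t^{k},\\
\left(t\frac{d}{dt}\right)^{2}(1+t)^{n} &= nt(1+t)^{n-1} + n(n-1)t^{2}(1+t)^{n-2}
  = \sum_{k} k^{2}\binom{n}{k}t^{k}.
\end{align*}
% Put t = x/(1-x), so that 1+t = 1/(1-x), and multiply by (1-x)^n:
\begin{align*}
\sum_{k} b_{k,n}(x) &= 1,\\
\sum_{k} k\,b_{k,n}(x) &= nx,\\
\sum_{k} k^{2}\,b_{k,n}(x) &= nx + n(n-1)x^{2}.
\end{align*}
% Dividing the second line by n gives identity (2); expanding the square gives (3):
\begin{align*}
\sum_{k}\left(x-\frac{k}{n}\right)^{2} b_{k,n}(x)
  &= x^{2} - \frac{2x}{n}\,nx + \frac{nx+n(n-1)x^{2}}{n^{2}}
   = \frac{x(1-x)}{n}.
\end{align*}
```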
Within these three identities, use the above basis polynomial notation

{\displaystyle b_{k,n}(x)={n \choose k}x^{k}(1-x)^{n-k},}

and let

{\displaystyle f_{n}(x)=\sum _{k}f(k/n)\,b_{k,n}(x).}

Thus, by identity (1)

{\displaystyle f_{n}(x)-f(x)=\sum _{k}[f(k/n)-f(x)]\,b_{k,n}(x),}

so that

{\displaystyle |f_{n}(x)-f(x)|\leq \sum _{k}|f(k/n)-f(x)|\,b_{k,n}(x).}

Since f is uniformly continuous, given {\displaystyle \varepsilon >0}, there is a {\displaystyle \delta >0} such that {\displaystyle |f(a)-f(b)|<\varepsilon } whenever {\displaystyle |a-b|<\delta }. Moreover, by continuity, {\displaystyle M=\sup |f|<\infty }. But then

{\displaystyle |f_{n}(x)-f(x)|\leq \sum _{|x-{k \over n}|<\delta }|f(k/n)-f(x)|\,b_{k,n}(x)+\sum _{|x-{k \over n}|\geq \delta }|f(k/n)-f(x)|\,b_{k,n}(x).}

The first sum is less than ε. On the other hand, by identity (3) above, and since {\displaystyle |x-k/n|\geq \delta }, the second sum is bounded by {\displaystyle 2M} times

{\displaystyle \sum _{|x-k/n|\geq \delta }b_{k,n}(x)\leq \sum _{k}\delta ^{-2}\left(x-{k \over n}\right)^{2}b_{k,n}(x)=\delta ^{-2}{x(1-x) \over n}\leq {1 \over 4}\delta ^{-2}n^{-1}.}

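This is <a href="/facts/Chebyshev%2527s_inequality/jf5ReDJv">Chebyshev's inequality</a>. Combining the two estimates gives {\displaystyle |f_{n}(x)-f(x)|<\varepsilon +{\tfrac {M}{2\delta ^{2}n}}}, where the right-hand side does not depend on x and is less than 2ε once n is large enough.
It follows that the polynomials f<sub>n</sub> tend to f uniformly.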
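The estimates in this proof are explicit enough to pick a degree n for a given tolerance. The sketch below assumes, beyond what the text states, that f is Lipschitz with a known constant, so that δ = ε / (Lipschitz constant) works, and chooses n large enough that 2M · 1/(4δ²n) is at most ε, which makes the total estimate less than 2ε. The helper name `degree_for_tolerance` and the test function are hypothetical.

```python
from math import ceil, comb


def bernstein_polynomial(f, n, x):
    """f_n(x) = sum_k f(k/n) * b_{k,n}(x)."""
    return sum(
        f(k / n) * comb(n, k) * x**k * (1 - x) ** (n - k)
        for k in range(n + 1)
    )


def degree_for_tolerance(eps, lipschitz, bound):
    """A degree n for which the proof's estimate gives |f_n - f| < 2 * eps.

    Assumes f is Lipschitz (so delta = eps / lipschitz works in the proof)
    and that |f| <= bound on [0, 1].
    """
    delta = eps / lipschitz
    return ceil(bound / (2 * delta**2 * eps))


def f(t):
    return abs(t - 0.5)          # Lipschitz constant 1, sup |f| = 0.5


eps = 0.1
n = degree_for_tolerance(eps, lipschitz=1.0, bound=0.5)
worst = max(
    abs(bernstein_polynomial(f, n, i / 200) - f(i / 200)) for i in range(201)
)
print(n, worst, 2 * eps)
```

The observed error is far below 2ε, which reflects how conservative the Chebyshev step is; the proof guarantees convergence but does not give a sharp rate.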

<h2 id="generalizations-to-higher-dimension">Generalizations to higher dimension</h2>
Bernstein polynomials can be generalized to k dimensions; the resulting polynomials have the form B<sub>i<sub>1</sub></sub>(x<sub>1</sub>) B<sub>i<sub>2</sub></sub>(x<sub>2</sub>) ... B<sub>i<sub>k</sub></sub>(x<sub>k</sub>).<a class="footnote-ref" id="fnref:17" href="#fn:17">17</a> In the simplest case only products of the unit interval [0,1] are considered; but, using <a href="/facts/Affine_transformation/BD7yDr10">affine transformations</a> of the line, Bernstein polynomials can also be defined for products [a<sub>1</sub>, b<sub>1</sub>] × [a<sub>2</sub>, b<sub>2</sub>] × ... × [a<sub>k</sub>, b<sub>k</sub>]. For a continuous function f on the k-fold product of the unit interval, the proof that f(x<sub>1</sub>, x<sub>2</sub>, ... , x<sub>k</sub>) can be uniformly approximated by

{\displaystyle \sum _{i_{1}}\sum _{i_{2}}\cdots \sum _{i_{k}}{n_{1} \choose i_{1}}{n_{2} \choose i_{2}}\cdots {n_{k} \choose i_{k}}f\left({i_{1} \over n_{1}},{i_{2} \over n_{2}},\dots ,{i_{k} \over n_{k}}\right)x_{1}^{i_{1}}(1-x_{1})^{n_{1}-i_{1}}x_{2}^{i_{2}}(1-x_{2})^{n_{2}-i_{2}}\cdots x_{k}^{i_{k}}(1-x_{k})^{n_{k}-i_{k}}}

is a straightforward extension of Bernstein's proof in one dimension.
<a class="footnote-ref" id="fnref:18" href="#fn:18">18</a>
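For k = 2 the multiple sum above can be written out as a short sketch. The function names, the degrees, and the test function are illustrative assumptions; the code evaluates the tensor-product formula directly on the unit square.

```python
from math import comb


def basis(i, n, x):
    """One-dimensional basis polynomial b_{i,n}(x)."""
    return comb(n, i) * x**i * (1 - x) ** (n - i)


def bernstein_2d(f, n1, n2, x1, x2):
    """Tensor-product Bernstein polynomial of f on the unit square."""
    return sum(
        f(i1 / n1, i2 / n2) * basis(i1, n1, x1) * basis(i2, n2, x2)
        for i1 in range(n1 + 1)
        for i2 in range(n2 + 1)
    )


def f(u, v):
    return u * v + abs(u - v)    # continuous on [0, 1] x [0, 1]


print(bernstein_2d(f, 20, 20, 0.3, 0.7), f(0.3, 0.7))
```

Because the weights factor into a product of one-dimensional basis polynomials, the probabilistic reading carries over: the sum is the expectation of f evaluated at two independent binomial proportions.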

<h2 id="see-also">See also</h2>
<ul><li><a href="/facts/Polynomial_interpolation/DmnuxaKT">Polynomial interpolation</a></li>
<li><a href="/facts/Newton_polynomial/2vRsjpA2">Newton form</a></li>
<li><a href="/facts/Lagrange_polynomial/MnpMmhtC">Lagrange form</a></li>
<li><a href="/facts/Binomial_QMF/BxQ2A4EW">Binomial QMF</a> (also known as <a href="/facts/Daubechies_wavelet/lhF1zmqB">Daubechies wavelet</a>)</li></ul>
<h2 id="notes">Notes</h2>

<ul><li><a href="/facts/S._Bernstein/m7aCQ2Y6">Bernstein, S.</a> (1912), <a href="https://www.mn.uio.no/math/english/people/aca/michaelf/translations/bernstein_english.pdf">"Démonstration du théorème de Weierstrass fondée sur le calcul des probabilités (Proof of the theorem of Weierstrass based on the calculus of probabilities)"</a> (PDF), Comm. Kharkov Math. Soc., 13: 1–2, English translation</li>
<li><a href="/facts/George_G._Lorentz/QHttNoXd">Lorentz, G. G.</a> (1953), Bernstein Polynomials, <a href="/facts/University_of_Toronto_Press/fSM2fkTh">University of Toronto Press</a></li>
<li><a href="/facts/Naum_Akhiezer/tFszwj15">Akhiezer, N. I.</a> (1956), <a href="https://archive.org/details/theoryofapproxim00akhi/page/30/mode/2up?q=bernstein">Theory of approximation</a> (in Russian), translated by Charles J. Hyman, Frederick Ungar, pp. 30–31, Russian edition first published in 1940</li>
<li><a href="/facts/J._C._Burkill/sv7N0Ury">Burkill, J. C.</a> (1959), <a href="http://www.math.tifr.res.in/~publ/ln/tifr16.pdf">Lectures On Approximation By Polynomials</a> (PDF), Bombay: <a href="/facts/Tata_Institute_of_Fundamental_Research/Xetanlpg">Tata Institute of Fundamental Research</a>, pp. 7–8</li>
<li>Goldberg, Richard R. (1964), <a href="https://archive.org/details/in.ernet.dli.2015.134296/page/n243/mode/2up?q=bernstein">Methods of real analysis</a>, John Wiley & Sons, pp. 263–265</li>
<li>Caglar, Hakan; Akansu, Ali N. (July 1993). "A generalized parametric PR-QMF design technique based on Bernstein polynomial approximation". IEEE Transactions on Signal Processing. 41 (7): 2314–2321. <a href="/facts/Bibcode_(identifier)/9HtdQSGB">Bibcode</a>:<a href="https://ui.adsabs.harvard.edu/abs/1993ITSP...41.2314C">1993ITSP...41.2314C</a>. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1109%2F78.224242">10.1109/78.224242</a>. <a href="/facts/Zbl_(identifier)/P6rFxKKx">Zbl</a> <a href="https://zbmath.org/?format=complete&q=an:0825.93863">0825.93863</a>.</li>
<li>Korovkin, P.P. (2001) [1994], <a href="https://www.encyclopediaofmath.org/index.php?title=Bernstein_polynomials">"Bernstein polynomials"</a>, <a href="/facts/Encyclopedia_of_Mathematics/WC6mGtPm">Encyclopedia of Mathematics</a>, <a href="/facts/European_Mathematical_Society/B3h7b672">EMS Press</a></li>
<li><a href="/facts/Isidor_Natanson/PsKYIsnK">Natanson, I.P.</a> (1964). Constructive function theory. Volume I: Uniform approximation. Translated by Alexis N. Obolensky. New York: Frederick Ungar. <a href="/facts/MR_(identifier)/uP137L11">MR</a> <a href="https://mathscinet.ams.org/mathscinet-getitem?mr=0196340">0196340</a>. <a href="/facts/Zbl_(identifier)/P6rFxKKx">Zbl</a> <a href="https://zbmath.org/?format=complete&q=an:0133.31101">0133.31101</a>.</li>
<li><a href="/facts/William_Feller/CeLs6QpB">Feller, William</a> (1966), An introduction to probability theory and its applications, Vol. II, John Wiley & Sons, pp. 149–150, 218–222</li>
<li><a href="/facts/Richard_Beals_(mathematician)/9xCWt6jL">Beals, Richard</a> (2004), Analysis. An introduction, <a href="/facts/Cambridge_University_Press/fgEBSSRq">Cambridge University Press</a>, pp. 95–98, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 0521600472</li></ul>
<h2 id="external-links">External links</h2>
<ul><li><a href="/facts/Mark_Kac/AcByIKRc">Kac, Mark</a> (1938). <a href="https://doi.org/10.4064%2Fsm-7-1-49-51">"Une remarque sur les polynomes de M. S. Bernstein"</a>. <a href="/facts/Studia_Mathematica/hmauxFK6">Studia Mathematica</a>. 7: 49–51. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.4064%2Fsm-7-1-49-51">10.4064/sm-7-1-49-51</a>.</li>
<li>Kelisky, Richard Paul; Rivlin, Theodore Joseph (1967). <a href="https://doi.org/10.2140%2Fpjm.1967.21.511">"Iteratives of Bernstein Polynomials"</a>. <a href="/facts/Pacific_Journal_of_Mathematics/fBmfJLky">Pacific Journal of Mathematics</a>. 21 (3): 511. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.2140%2Fpjm.1967.21.511">10.2140/pjm.1967.21.511</a>.</li>
<li>Stark, E. L. (1981). "Bernstein Polynome, 1912-1955". In Butzer, P.L. (ed.). ISNM60. pp. 443–461. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1007%2F978-3-0348-9369-5_40">10.1007/978-3-0348-9369-5_40</a>. <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 978-3-0348-9369-5.</li>
<li><a href="/facts/Sonia_Petrone/CnsK9acl">Petrone, Sonia</a> (1999). "Random Bernstein polynomials". Scand. J. Stat. 26 (3): 373–393. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1111%2F1467-9469.00155">10.1111/1467-9469.00155</a>. <a href="/facts/S2CID_(identifier)/ldJsHa2Y">S2CID</a> <a href="https://api.semanticscholar.org/CorpusID:122387975">122387975</a>.</li>
<li>Oruc, Halil; Phillips, George M. (1999). <a href="https://doi.org/10.1017%2FS0013091500020332">"A generalization of the Bernstein Polynomials"</a>. <a href="/facts/Edinburgh_Mathematical_Society/aVAsEhRL">Proceedings of the Edinburgh Mathematical Society</a>. 42 (2): 403–413. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1017%2FS0013091500020332">10.1017/S0013091500020332</a>.</li>
<li>Joy, Kenneth I. (2000). <a href="https://web.archive.org/web/20120220143625/http://www.idav.ucdavis.edu/education/CAGDNotes/Bernstein-Polynomials.pdf">"Bernstein Polynomials"</a> (PDF). Archived from <a href="http://www.idav.ucdavis.edu/education/CAGDNotes/Bernstein-Polynomials.pdf">the original</a> (PDF) on 2012-02-20. Retrieved 2009-02-28. from <a href="/facts/University_of_California%2c_Davis/LTCyhaP5">University of California, Davis</a>. Note the error in the summation limits in the first formula on page 9.</li>
<li>Idrees Bhatti, M.; Bracken, P. (2007). <a href="https://doi.org/10.1016%2Fj.cam.2006.05.002">"Solutions of differential equations in a Bernstein Polynomial basis"</a>. J. Comput. Appl. Math. 205 (1): 272–280. <a href="/facts/Bibcode_(identifier)/9HtdQSGB">Bibcode</a>:<a href="https://ui.adsabs.harvard.edu/abs/2007JCoAM.205..272I">2007JCoAM.205..272I</a>. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1016%2Fj.cam.2006.05.002">10.1016/j.cam.2006.05.002</a>.</li>
<li><a href="/facts/Bill_Casselman_(mathematician)/IvuklxDd">Casselman, Bill</a> (2008). <a href="https://www.ams.org/featurecolumn/archive/bezier.html">"From Bézier to Bernstein"</a>. Feature Column from <a href="/facts/American_Mathematical_Society/fr5gotCt">American Mathematical Society</a></li>
<li>Acikgoz, Mehmet; Araci, Serkan (2010). "On the generating function for Bernstein Polynomials". AIP Conf. Proc. AIP Conference Proceedings. 1281 (1): 1141. <a href="/facts/Bibcode_(identifier)/9HtdQSGB">Bibcode</a>:<a href="https://ui.adsabs.harvard.edu/abs/2010AIPC.1281.1141A">2010AIPC.1281.1141A</a>. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1063%2F1.3497855">10.1063/1.3497855</a>.</li>
<li>Doha, E. H.; Bhrawy, A. H.; Saker, M. A. (2011). <a href="https://doi.org/10.1016%2Fj.aml.2010.11.013">"Integrals of Bernstein polynomials: An application for the solution of high even-order differential equations"</a>. Appl. Math. Lett. 24 (4): 559–565. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1016%2Fj.aml.2010.11.013">10.1016/j.aml.2010.11.013</a>.</li>
<li>Farouki, Rida T. (2012). "The Bernstein polynomial basis: a centennial retrospective". Comp. Aid. Geom. Des. 29 (6): 379–419. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1016%2Fj.cagd.2012.03.001">10.1016/j.cagd.2012.03.001</a>.</li>
<li>Chen, Xiaoyan; Tan, Jieqing; Liu, Zhi; Xie, Jin (2017). <a href="https://doi.org/10.1016%2Fj.jmaa.2016.12.075">"Approximations of functions by a new family of generalized Bernstein operators"</a>. J. Math. Ann. Applic. 450: 244–261. <a href="/facts/Doi_(identifier)/muM9Etpq">doi</a>:<a href="https://doi.org/10.1016%2Fj.jmaa.2016.12.075">10.1016/j.jmaa.2016.12.075</a>.</li>
<li><a href="/facts/Eric_W._Weisstein/FbMuVDep">Weisstein, Eric W.</a> <a href="https://mathworld.wolfram.com/BernsteinPolynomial.html">"Bernstein Polynomial"</a>. <a href="/facts/MathWorld/yc1jodbq">MathWorld</a>.</li>
<li>This article incorporates material from <a href="https://planetmath.org/BernsteinPolynomial">properties of Bernstein polynomial</a> on <a href="/facts/PlanetMath/wphPCtLf">PlanetMath</a>, which is licensed under the Creative Commons Attribution/Share-Alike License.</li></ul>

<h2 id="references">References</h2>

<ol>
<li id="fn:1">Lorentz 1953 - Lorentz, G. G. (1953), Bernstein Polynomials, University of Toronto Press <a href="#fnref:1" class="footnote-back-ref">↩</a></li>
<li id="fn:2">Mathar, R.J. (2018). "Orthogonal basis function over the unit circle with the minimax property". Appendix B. arXiv:1802.09518 [math.NA]. <a href="#fnref:2" class="footnote-back-ref">↩</a></li>
<li id="fn:3">Rababah, Abedallah (2003). "Transformation of Chebyshev-Bernstein polynomial basis". Computational Methods in Applied Mathematics. 3 (4): 608–622. doi:10.2478/cmam-2003-0038. S2CID 120938358. <a href="https://doi.org/10.2478%2Fcmam-2003-0038" target="_blank">https://doi.org/10.2478%2Fcmam-2003-0038</a> <a href="#fnref:3" class="footnote-back-ref">↩</a></li>
<li id="fn:4">Natanson (1964) p. 6 <a href="#fnref:4" class="footnote-back-ref">↩</a></li>
<li id="fn:5">Lorentz 1953 - Lorentz, G. G. (1953), Bernstein Polynomials, University of Toronto Press <a href="#fnref:5" class="footnote-back-ref">↩</a></li>
<li id="fn:6">Feller 1966 - Feller, William (1966), An introduction to probability theory and its applications, Vol. II, John Wiley & Sons, pp. 149–150, 218–222 <a href="#fnref:6" class="footnote-back-ref">↩</a></li>
<li id="fn:7">Beals 2004 - Beals, Richard (2004), Analysis. An introduction, Cambridge University Press, pp. 95–98, ISBN 0521600472 <a href="#fnref:7" class="footnote-back-ref">↩</a></li>
<li id="fn:8">Natanson (1964) p. 3 <a href="#fnref:8" class="footnote-back-ref">↩</a></li>
<li id="fn:9">Bernstein 1912 - Bernstein, S. (1912), "Démonstration du théorème de Weierstrass fondée sur le calcul des probabilités (Proof of the theorem of Weierstrass based on the calculus of probabilities)" (PDF), Comm. Kharkov Math. Soc., 13: 1–2 <a href="https://www.mn.uio.no/math/english/people/aca/michaelf/translations/bernstein_english.pdf" target="_blank">https://www.mn.uio.no/math/english/people/aca/michaelf/translations/bernstein_english.pdf</a> <a href="#fnref:9" class="footnote-back-ref">↩</a></li>
<li id="fn:10">Koralov, L.; Sinai, Y. (2007). "Probabilistic proof of the Weierstrass theorem". Theory of probability and random processes (2nd ed.). Springer. p. 29. <a href="#fnref:10" class="footnote-back-ref">↩</a></li>
<li id="fn:11">Feller 1966 - Feller, William (1966), An introduction to probability theory and its applications, Vol. II, John Wiley & Sons, pp. 149–150, 218–222 <a href="#fnref:11" class="footnote-back-ref">↩</a></li>
<li id="fn:12">Lorentz 1953, pp. 5–6 - Lorentz, G. G. (1953), Bernstein Polynomials, University of Toronto Press <a href="#fnref:12" class="footnote-back-ref">↩</a></li>
<li id="fn:13">Beals 2004 - Beals, Richard (2004), Analysis. An introduction, Cambridge University Press, pp. 95–98, ISBN 0521600472 <a href="#fnref:13" class="footnote-back-ref">↩</a></li>
<li id="fn:14">Goldberg 1964 - Goldberg, Richard R. (1964), Methods of real analysis, John Wiley & Sons, pp. 263–265 <a href="https://archive.org/details/in.ernet.dli.2015.134296/page/n243/mode/2up?q=bernstein" target="_blank">https://archive.org/details/in.ernet.dli.2015.134296/page/n243/mode/2up?q=bernstein</a> <a href="#fnref:14" class="footnote-back-ref">↩</a></li>
<li id="fn:15">Akhiezer 1956 - Akhiezer, N. I. (1956), Theory of approximation (in Russian), translated by Charles J. Hyman, Frederick Ungar, pp. 30–31 <a href="https://archive.org/details/theoryofapproxim00akhi/page/30/mode/2up?q=bernstein" target="_blank">https://archive.org/details/theoryofapproxim00akhi/page/30/mode/2up?q=bernstein</a> <a href="#fnref:15" class="footnote-back-ref">↩</a></li>
<li id="fn:16">Burkill 1959 - Burkill, J. C. (1959), Lectures On Approximation By Polynomials (PDF), Bombay: Tata Institute of Fundamental Research, pp. 7–8 <a href="http://www.math.tifr.res.in/~publ/ln/tifr16.pdf" target="_blank">http://www.math.tifr.res.in/~publ/ln/tifr16.pdf</a> <a href="#fnref:16" class="footnote-back-ref">↩</a></li>
<li id="fn:17">Lorentz 1953 - Lorentz, G. G. (1953), Bernstein Polynomials, University of Toronto Press <a href="#fnref:17" class="footnote-back-ref">↩</a></li>
<li id="fn:18">Hildebrandt, T. H.; Schoenberg, I. J. (1933), "On linear functional operations and the moment problem for a finite interval in one or several dimensions", Annals of Mathematics, 34 (2): 327, doi:10.2307/1968205, JSTOR 1968205 <a href="#fnref:18" class="footnote-back-ref">↩</a></li>
</ol>
