Uniformly most powerful test

<h2 id="setting">Setting</h2>
<p>Let 
  
    
      
        X
      
    
    {\displaystyle X}
  
 denote a random vector (corresponding to the measurements), taken from a <a href="/facts/Parametrized_family/YoEWw0I8">parametrized family</a> of <a href="/facts/Probability_density_function/zvfybna4">probability density functions</a> or <a href="/facts/Probability_mass_function/LhurokRt">probability mass functions</a>  
  
    
      
        
          f
          
            θ
          
        
        (
        x
        )
      
    
    {\displaystyle f_{\theta }(x)}
  
, which depends on the unknown deterministic parameter 
  
    
      
        θ
        ∈
        Θ
      
    
    {\displaystyle \theta \in \Theta }
  
. The parameter space 
  
    
      
        Θ
      
    
    {\displaystyle \Theta }
  
 is partitioned into two disjoint sets 
  
    
      
        
          Θ
          
            0
          
        
      
    
    {\displaystyle \Theta _{0}}
  
 and 
  
    
      
        
          Θ
          
            1
          
        
      
    
    {\displaystyle \Theta _{1}}
  
. Let 
  
    
      
        
          H
          
            0
          
        
      
    
    {\displaystyle H_{0}}
  
 denote the hypothesis that 
  
    
      
        θ
        ∈
        
          Θ
          
            0
          
        
      
    
    {\displaystyle \theta \in \Theta _{0}}
  
, and let 
  
    
      
        
          H
          
            1
          
        
      
    
    {\displaystyle H_{1}}
  
 denote the hypothesis that 
  
    
      
        θ
        ∈
        
          Θ
          
            1
          
        
      
    
    {\displaystyle \theta \in \Theta _{1}}
  
.
The binary test of hypotheses is performed using a test function 
  
    
      
        φ
        (
        x
        )
      
    
    {\displaystyle \varphi (x)}
  
 with a reject region 
  
    
      
        R
      
    
    {\displaystyle R}
  
 (a subset of measurement space).
</p>

φ
        (
        x
        )
        =
        
          
            {
            
              
                
                  1
                
                
                  
                    if 
                  
                  x
                  ∈
                  R
                
              
              
                
                  0
                
                
                  
                    if 
                  
                  x
                  ∈
                  
                    R
                    
                      c
                    
                  
                
              
            
            
          
        
      
    
    {\displaystyle \varphi (x)={\begin{cases}1&{\text{if }}x\in R\\0&{\text{if }}x\in R^{c}\end{cases}}}

<p>meaning that 
  
    
      
        
          H
          
            1
          
        
      
    
    {\displaystyle H_{1}}
  
 is in force if the measurement 
  
    
      
        X
        ∈
        R
      
    
    {\displaystyle X\in R}
  
 and that 
  
    
      
        
          H
          
            0
          
        
      
    
    {\displaystyle H_{0}}
  
 is in force if the measurement 
  
    
      
        X
        ∈
        
          R
          
            c
          
        
      
    
    {\displaystyle X\in R^{c}}
  
.
Note that 
  
    
      
        R
        ∪
        
          R
          
            c
          
        
      
    
    {\displaystyle R\cup R^{c}}
  
 is a disjoint covering of the measurement space.
</p>
<h2 id="formal-definition">Formal definition</h2>
<p>A test function 
  
    
      
        φ
        (
        x
        )
      
    
    {\displaystyle \varphi (x)}
  
 is UMP of size 
  
    
      
        α
      
    
    {\displaystyle \alpha }
  
 if for any other test function 
  
    
      
        
          φ
          ′
        
        (
        x
        )
      
    
    {\displaystyle \varphi '(x)}
  
 satisfying
</p>

sup
          
            θ
            ∈
            
              Θ
              
                0
              
            
          
        
        
        E
        ⁡
        [
        
          φ
          ′
        
        (
        X
        )
        
          |
        
        θ
        ]
        =
        
          α
          ′
        
        ≤
        α
        =
        
          sup
          
            θ
            ∈
            
              Θ
              
                0
              
            
          
        
        
        E
        ⁡
        [
        φ
        (
        X
        )
        
          |
        
        θ
        ]
        
      
    
    {\displaystyle \sup _{\theta \in \Theta _{0}}\;\operatorname {E} [\varphi '(X)|\theta ]=\alpha '\leq \alpha =\sup _{\theta \in \Theta _{0}}\;\operatorname {E} [\varphi (X)|\theta ]\,}

∀
        θ
        ∈
        
          Θ
          
            1
          
        
        ,
        
        E
        ⁡
        [
        
          φ
          ′
        
        (
        X
        )
        
          |
        
        θ
        ]
        =
        1
        −
        
          β
          ′
        
        (
        θ
        )
        ≤
        1
        −
        β
        (
        θ
        )
        =
        E
        ⁡
        [
        φ
        (
        X
        )
        
          |
        
        θ
        ]
        .
      
    
    {\displaystyle \forall \theta \in \Theta _{1},\quad \operatorname {E} [\varphi '(X)|\theta ]=1-\beta '(\theta )\leq 1-\beta (\theta )=\operatorname {E} [\varphi (X)|\theta ].}

<h2 id="the-karlinrubin-theorem">The Karlin–Rubin theorem</h2>
<p>The Karlin–Rubin theorem can be regarded as an extension of the Neyman–Pearson lemma for composite hypotheses.<a class="footnote-ref" id="fnref:1" href="#fn:1"><sup>1</sup></a> Consider a scalar measurement having a probability density function parameterized by a scalar parameter <i>θ</i>, and define the likelihood ratio 
  
    
      
        l
        (
        x
        )
        =
        
          f
          
            
              θ
              
                1
              
            
          
        
        (
        x
        )
        
          /
        
        
          f
          
            
              θ
              
                0
              
            
          
        
        (
        x
        )
      
    
    {\displaystyle l(x)=f_{\theta _{1}}(x)/f_{\theta _{0}}(x)}
  
.
If 
  
    
      
        l
        (
        x
        )
      
    
    {\displaystyle l(x)}
  
 is monotone non-decreasing, in 
  
    
      
        x
      
    
    {\displaystyle x}
  
, for any pair 
  
    
      
        
          θ
          
            1
          
        
        ≥
        
          θ
          
            0
          
        
      
    
    {\displaystyle \theta _{1}\geq \theta _{0}}
  
 (meaning that the greater 
  
    
      
        x
      
    
    {\displaystyle x}
  
 is, the more likely 
  
    
      
        
          H
          
            1
          
        
      
    
    {\displaystyle H_{1}}
  
 is), then the threshold test:
</p>

φ
        (
        x
        )
        =
        
          
            {
            
              
                
                  1
                
                
                  
                    if 
                  
                  x
                  >
                  
                    x
                    
                      0
                    
                  
                
              
              
                
                  0
                
                
                  
                    if 
                  
                  x
                  <
                  
                    x
                    
                      0
                    
                  
                
              
            
            
          
        
      
    
    {\displaystyle \varphi (x)={\begin{cases}1&{\text{if }}x>x_{0}\\0&{\text{if }}x<x_{0}\end{cases}}}

where 
  
    
      
        
          x
          
            0
          
        
      
    
    {\displaystyle x_{0}}
  
 is chosen such that 
  
    
      
        
          E
          
            
              θ
              
                0
              
            
          
        
        ⁡
        φ
        (
        X
        )
        =
        α
      
    
    {\displaystyle \operatorname {E} _{\theta _{0}}\varphi (X)=\alpha }

<p>is the UMP test of size <i>α</i> for testing 
  
    
      
        
          H
          
            0
          
        
        :
        θ
        ≤
        
          θ
          
            0
          
        
        
           vs. 
        
        
          H
          
            1
          
        
        :
        θ
        >
        
          θ
          
            0
          
        
        .
      
    
    {\displaystyle H_{0}:\theta \leq \theta _{0}{\text{ vs. }}H_{1}:\theta >\theta _{0}.}

</p><p>Note that exactly the same test is also UMP for testing 
  
    
      
        
          H
          
            0
          
        
        :
        θ
        =
        
          θ
          
            0
          
        
        
           vs. 
        
        
          H
          
            1
          
        
        :
        θ
        >
        
          θ
          
            0
          
        
        .
      
    
    {\displaystyle H_{0}:\theta =\theta _{0}{\text{ vs. }}H_{1}:\theta >\theta _{0}.}

</p>
<h2 id="important-case-exponential-family">Important case: exponential family</h2>
<p>Although the Karlin-Rubin theorem may seem weak because of its restriction to scalar parameter and scalar measurement, it turns out that there exist a host of problems for which the theorem holds. In particular, the one-dimensional <a href="/facts/Exponential_family/1LkkqEIf">exponential family</a> of <a href="/facts/Probability_density_function/zvfybna4">probability density functions</a> or <a href="/facts/Probability_mass_function/LhurokRt">probability mass functions</a> with
</p>

f
          
            θ
          
        
        (
        x
        )
        =
        g
        (
        θ
        )
        h
        (
        x
        )
        exp
        ⁡
        (
        η
        (
        θ
        )
        T
        (
        x
        )
        )
      
    
    {\displaystyle f_{\theta }(x)=g(\theta )h(x)\exp(\eta (\theta )T(x))}

<p>has a monotone non-decreasing likelihood ratio in the <a href="/facts/Sufficiency_(statistics)/eClRXHpd">sufficient statistic</a> 
  
    
      
        T
        (
        x
        )
      
    
    {\displaystyle T(x)}
  
, provided that 
  
    
      
        η
        (
        θ
        )
      
    
    {\displaystyle \eta (\theta )}
  
 is non-decreasing.
</p>
<h2 id="example">Example</h2>
<p>Let 
  
    
      
        X
        =
        (
        
          X
          
            0
          
        
        ,
        …
        ,
        
          X
          
            M
            −
            1
          
        
        )
      
    
    {\displaystyle X=(X_{0},\ldots ,X_{M-1})}
  
 denote i.i.d. normally distributed 
  
    
      
        N
      
    
    {\displaystyle N}
  
-dimensional random vectors with mean 
  
    
      
        θ
        m
      
    
    {\displaystyle \theta m}
  
 and covariance matrix 
  
    
      
        R
      
    
    {\displaystyle R}
  
. We then have
</p>

f
                  
                    θ
                  
                
                (
                X
                )
                =

(
                2
                π
                
                  )
                  
                    −
                    M
                    N
                    
                      /
                    
                    2
                  
                
                
                  |
                
                R
                
                  
                    |
                  
                  
                    −
                    M
                    
                      /
                    
                    2
                  
                
                exp
                ⁡
                
                  {
                  
                    −
                    
                      
                        1
                        2
                      
                    
                    
                      ∑
                      
                        n
                        =
                        0
                      
                      
                        M
                        −
                        1
                      
                    
                    (
                    
                      X
                      
                        n
                      
                    
                    −
                    θ
                    m
                    
                      )
                      
                        T
                      
                    
                    
                      R
                      
                        −
                        1
                      
                    
                    (
                    
                      X
                      
                        n
                      
                    
                    −
                    θ
                    m
                    )
                  
                  }
                
              
            
            
              
                =

(
                2
                π
                
                  )
                  
                    −
                    M
                    N
                    
                      /
                    
                    2
                  
                
                
                  |
                
                R
                
                  
                    |
                  
                  
                    −
                    M
                    
                      /
                    
                    2
                  
                
                exp
                ⁡
                
                  {
                  
                    −
                    
                      
                        1
                        2
                      
                    
                    
                      ∑
                      
                        n
                        =
                        0
                      
                      
                        M
                        −
                        1
                      
                    
                    
                      (
                      
                        
                          θ
                          
                            2
                          
                        
                        
                          m
                          
                            T
                          
                        
                        
                          R
                          
                            −
                            1
                          
                        
                        m
                      
                      )
                    
                  
                  }
                
              
            
            
              
              
                exp
                ⁡
                
                  {
                  
                    −
                    
                      
                        1
                        2
                      
                    
                    
                      ∑
                      
                        n
                        =
                        0
                      
                      
                        M
                        −
                        1
                      
                    
                    
                      X
                      
                        n
                      
                      
                        T
                      
                    
                    
                      R
                      
                        −
                        1
                      
                    
                    
                      X
                      
                        n
                      
                    
                  
                  }
                
                exp
                ⁡
                
                  {
                  
                    θ
                    
                      m
                      
                        T
                      
                    
                    
                      R
                      
                        −
                        1
                      
                    
                    
                      ∑
                      
                        n
                        =
                        0
                      
                      
                        M
                        −
                        1
                      
                    
                    
                      X
                      
                        n
                      
                    
                  
                  }
                
              
            
          
        
      
    
    {\displaystyle {\begin{aligned}f_{\theta }(X)={}&(2\pi )^{-MN/2}|R|^{-M/2}\exp \left\{-{\frac {1}{2}}\sum _{n=0}^{M-1}(X_{n}-\theta m)^{T}R^{-1}(X_{n}-\theta m)\right\}\\[4pt]={}&(2\pi )^{-MN/2}|R|^{-M/2}\exp \left\{-{\frac {1}{2}}\sum _{n=0}^{M-1}\left(\theta ^{2}m^{T}R^{-1}m\right)\right\}\\[4pt]&\exp \left\{-{\frac {1}{2}}\sum _{n=0}^{M-1}X_{n}^{T}R^{-1}X_{n}\right\}\exp \left\{\theta m^{T}R^{-1}\sum _{n=0}^{M-1}X_{n}\right\}\end{aligned}}}

<p>which is exactly in the form of the exponential family shown in the previous section, with the sufficient statistic being
</p>

T
        (
        X
        )
        =
        
          m
          
            T
          
        
        
          R
          
            −
            1
          
        
        
          ∑
          
            n
            =
            0
          
          
            M
            −
            1
          
        
        
          X
          
            n
          
        
        .
      
    
    {\displaystyle T(X)=m^{T}R^{-1}\sum _{n=0}^{M-1}X_{n}.}

<p>Thus, we conclude that the test
</p>

φ
        (
        T
        )
        =
        
          
            {
            
              
                
                  1
                
                
                  T
                  >
                  
                    t
                    
                      0
                    
                  
                
              
              
                
                  0
                
                
                  T
                  <
                  
                    t
                    
                      0
                    
                  
                
              
            
            
          
        
        
        
          E
          
            
              θ
              
                0
              
            
          
        
        ⁡
        φ
        (
        T
        )
        =
        α
      
    
    {\displaystyle \varphi (T)={\begin{cases}1&T>t_{0}\\0&T<t_{0}\end{cases}}\qquad \operatorname {E} _{\theta _{0}}\varphi (T)=\alpha }

<p>is the UMP test of size 
  
    
      
        α
      
    
    {\displaystyle \alpha }
  
 for testing 
  
    
      
        
          H
          
            0
          
        
        :
        θ
        ⩽
        
          θ
          
            0
          
        
      
    
    {\displaystyle H_{0}:\theta \leqslant \theta _{0}}
  
 vs. 
  
    
      
        
          H
          
            1
          
        
        :
        θ
        >
        
          θ
          
            0
          
        
      
    
    {\displaystyle H_{1}:\theta >\theta _{0}}

</p>
<h2 id="further-discussion">Further discussion</h2>
<p>In general, UMP tests do not exist for vector parameters or for two-sided tests (a test in which one hypothesis lies on both sides of the alternative). The reason is that in these situations, the most powerful test of a given size for one possible value of the parameter (e.g. for 
  
    
      
        
          θ
          
            1
          
        
      
    
    {\displaystyle \theta _{1}}
  
 where 
  
    
      
        
          θ
          
            1
          
        
        >
        
          θ
          
            0
          
        
      
    
    {\displaystyle \theta _{1}>\theta _{0}}
  
) is different from the most powerful test of the same size for a different value of the parameter (e.g. for 
  
    
      
        
          θ
          
            2
          
        
      
    
    {\displaystyle \theta _{2}}
  
 where 
  
    
      
        
          θ
          
            2
          
        
        <
        
          θ
          
            0
          
        
      
    
    {\displaystyle \theta _{2}<\theta _{0}}
  
). As a result, no test is uniformly most powerful in these situations.
</p>

<h2 id="further-reading">Further reading</h2>
<ul><li><a href="/facts/Thomas_S._Ferguson/gpL3b7N5">Ferguson, T. S.</a> (1967). "Sec. 5.2: <i>Uniformly most powerful tests</i>". <i>Mathematical Statistics: A decision theoretic approach</i>. New York: Academic Press.</li>
<li>Mood, A. M.; Graybill, F. A.; Boes, D. C. (1974). "Sec. IX.3.2: <i>Uniformly most powerful tests</i>". <i>Introduction to the theory of statistics</i> (3rd ed.). New York: McGraw-Hill.</li>
<li>L. L. Scharf, <i>Statistical Signal Processing</i>, Addison-Wesley, 1991, section 4.7.</li></ul>

<h2 id="references">References</h2>

<ol>
<li id="fn:1"><p>Casella, G.; Berger, R.L. (2008), Statistical Inference, Brooks/Cole. ISBN 0-495-39187-5 (Theorem 8.3.17) <a href="/wiki/ISBN_(identifier)" target="_blank">/wiki/ISBN_(identifier)</a> <a href="#fnref:1" class="footnote-back-ref">↩</a></p></li>
</ol>

Uniformly most powerful test open-in-new

Uniformly most powerful test