Submodular set function

<h2 id="definition">Definition</h2>
<p>If 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
 is a finite <a href="/facts/Set_(mathematics)/BfucfHMq">set</a>, a submodular function is a set function 
  
    
      
        f
        :
        
          2
          
            Ω
          
        
        →
        
          R
        
      
    
    {\displaystyle f:2^{\Omega }\rightarrow \mathbb {R} }
  
, where 
  
    
      
        
          2
          
            Ω
          
        
      
    
    {\displaystyle 2^{\Omega }}
  
 denotes the <a href="/facts/Power_set/FVuf8PDD">power set</a> of 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
, which satisfies one of the following equivalent conditions.<a class="footnote-ref" id="fnref:5" href="#fn:5"><sup>5</sup></a>
</p>
<ol><li>For every 
  
    
      
        X
        ,
        Y
        ⊆
        Ω
      
    
    {\displaystyle X,Y\subseteq \Omega }
  
 with 
  
    
      
        X
        ⊆
        Y
      
    
    {\displaystyle X\subseteq Y}
  
 and every 
  
    
      
        x
        ∈
        Ω
        ∖
        Y
      
    
    {\displaystyle x\in \Omega \setminus Y}
  
 we have that 
  
    
      
        f
        (
        X
        ∪
        {
        x
        }
        )
        −
        f
        (
        X
        )
        ≥
        f
        (
        Y
        ∪
        {
        x
        }
        )
        −
        f
        (
        Y
        )
      
    
    {\displaystyle f(X\cup \{x\})-f(X)\geq f(Y\cup \{x\})-f(Y)}
  
.</li>
<li>For every 
  
    
      
        S
        ,
        T
        ⊆
        Ω
      
    
    {\displaystyle S,T\subseteq \Omega }
  
 we have that 
  
    
      
        f
        (
        S
        )
        +
        f
        (
        T
        )
        ≥
        f
        (
        S
        ∪
        T
        )
        +
        f
        (
        S
        ∩
        T
        )
      
    
    {\displaystyle f(S)+f(T)\geq f(S\cup T)+f(S\cap T)}
  
.</li>
<li>For every 
  
    
      
        X
        ⊆
        Ω
      
    
    {\displaystyle X\subseteq \Omega }
  
 and 
  
    
      
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        ∈
        Ω
        ∖
        X
      
    
    {\displaystyle x_{1},x_{2}\in \Omega \backslash X}
  
 such that 
  
    
      
        
          x
          
            1
          
        
        ≠
        
          x
          
            2
          
        
      
    
    {\displaystyle x_{1}\neq x_{2}}
  
 we have that 
  
    
      
        f
        (
        X
        ∪
        {
        
          x
          
            1
          
        
        }
        )
        +
        f
        (
        X
        ∪
        {
        
          x
          
            2
          
        
        }
        )
        ≥
        f
        (
        X
        ∪
        {
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        }
        )
        +
        f
        (
        X
        )
      
    
    {\displaystyle f(X\cup \{x_{1}\})+f(X\cup \{x_{2}\})\geq f(X\cup \{x_{1},x_{2}\})+f(X)}
  
.</li></ol>
<p>A nonnegative submodular function is also a <a href="/facts/Subadditive_set_function/xHkWC2UG">subadditive</a> function, but a subadditive function need not be submodular.
If 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
 is not assumed finite, then the above conditions are not equivalent.  In particular a function

f
      
    
    {\displaystyle f}
  
 defined by 
  
    
      
        f
        (
        S
        )
        =
        1
      
    
    {\displaystyle f(S)=1}
  
 if 
  
    
      
        S
      
    
    {\displaystyle S}
  
 is finite and 
  
    
      
        f
        (
        S
        )
        =
        0
      
    
    {\displaystyle f(S)=0}
  
 if 
  
    
      
        S
      
    
    {\displaystyle S}
  
 is infinite 
satisfies the first condition above, but the second condition fails when 
  
    
      
        S
      
    
    {\displaystyle S}
  
 and 
  
    
      
        T
      
    
    {\displaystyle T}
  
 are infinite sets with finite intersection.
</p>
<h2 id="types-and-examples-of-submodular-functions">Types and examples of submodular functions</h2>
<h3>Monotone</h3>
<p>A set function 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is <i>monotone</i> if for every 
  
    
      
        T
        ⊆
        S
      
    
    {\displaystyle T\subseteq S}
  
 we have that 
  
    
      
        f
        (
        T
        )
        ≤
        f
        (
        S
        )
      
    
    {\displaystyle f(T)\leq f(S)}
  
. Examples of monotone submodular functions include:
</p>
Linear (Modular) functions
Any function of the form 
  
    
      
        f
        (
        S
        )
        =
        
          ∑
          
            i
            ∈
            S
          
        
        
          w
          
            i
          
        
      
    
    {\displaystyle f(S)=\sum _{i\in S}w_{i}}
  
 is called a linear function. Additionally if 
  
    
      
        ∀
        i
        ,
        
          w
          
            i
          
        
        ≥
        0
      
    
    {\displaystyle \forall i,w_{i}\geq 0}
  
 then f is monotone.
<a href="/facts/Budget-additive_valuation/djdVEGm9">Budget-additive functions</a>
Any function of the form 
  
    
      
        f
        (
        S
        )
        =
        min
        
          {
          
            B
            ,
             
            
              ∑
              
                i
                ∈
                S
              
            
            
              w
              
                i
              
            
          
          }
        
      
    
    {\displaystyle f(S)=\min \left\{B,~\sum _{i\in S}w_{i}\right\}}
  
 for each 
  
    
      
        
          w
          
            i
          
        
        ≥
        0
      
    
    {\displaystyle w_{i}\geq 0}
  
 and 
  
    
      
        B
        ≥
        0
      
    
    {\displaystyle B\geq 0}
  
 is called budget additive.<a class="footnote-ref" id="fnref:6" href="#fn:6"><sup>6</sup></a>
Coverage functions
Let 
  
    
      
        Ω
        =
        {
        
          E
          
            1
          
        
        ,
        
          E
          
            2
          
        
        ,
        …
        ,
        
          E
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{E_{1},E_{2},\ldots ,E_{n}\}}
  
 be a collection of subsets of some <a href="/facts/Matroid/5bLcGlRF">ground set</a> 
  
    
      
        
          Ω
          ′
        
      
    
    {\displaystyle \Omega '}
  
. The function 
  
    
      
        f
        (
        S
        )
        =
        
          |
          
            
              ⋃
              
                
                  E
                  
                    i
                  
                
                ∈
                S
              
            
            
              E
              
                i
              
            
          
          |
        
      
    
    {\displaystyle f(S)=\left|\bigcup _{E_{i}\in S}E_{i}\right|}
  
 for 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 is called a coverage function. This can be generalized by adding non-negative weights to the elements.
<a href="/facts/Entropy_(information_theory)/NLg4NLvt">Entropy</a>
Let 
  
    
      
        Ω
        =
        {
        
          X
          
            1
          
        
        ,
        
          X
          
            2
          
        
        ,
        …
        ,
        
          X
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{X_{1},X_{2},\ldots ,X_{n}\}}
  
 be a set of <a href="/facts/Random_variables/TwTBXnLT">random variables</a>. Then for any 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 we have that 
  
    
      
        H
        (
        S
        )
      
    
    {\displaystyle H(S)}
  
 is a submodular function, where 
  
    
      
        H
        (
        S
        )
      
    
    {\displaystyle H(S)}
  
 is the entropy of the set of random variables 
  
    
      
        S
      
    
    {\displaystyle S}
  
, a fact known as <a href="/facts/Entropic_vector/H9JC4qBY">Shannon's inequality</a>.<a class="footnote-ref" id="fnref:7" href="#fn:7"><sup>7</sup></a> Further inequalities for the entropy function are known to hold, see <a href="/facts/Entropic_vector/H9JC4qBY">entropic vector</a>.
<a href="/facts/Matroid/5bLcGlRF">Matroid</a> <a href="/facts/Matroid_rank/PsbEdnUJ">rank functions</a>
Let 
  
    
      
        Ω
        =
        {
        
          e
          
            1
          
        
        ,
        
          e
          
            2
          
        
        ,
        …
        ,
        
          e
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{e_{1},e_{2},\dots ,e_{n}\}}
  
 be the ground set on which a matroid is defined. Then the rank function of the matroid is a submodular function.<a class="footnote-ref" id="fnref:8" href="#fn:8"><sup>8</sup></a>
<h3>Non-monotone</h3>
<p>A submodular function that is not monotone is called <i>non-monotone</i>. In particular, a function is called non-monotone if it has the property that adding more elements to a set can decrease the value of the function. More formally, the function 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is non-monotone if there are sets 
  
    
      
        S
        ,
        T
      
    
    {\displaystyle S,T}
  
 in its domain s.t. 
  
    
      
        S
        ⊂
        T
      
    
    {\displaystyle S\subset T}
  
 and 
  
    
      
        f
        (
        S
        )
        >
        f
        (
        T
        )
      
    
    {\displaystyle f(S)>f(T)}
  
. 
</p>
<h4>Symmetric</h4>
<p>A non-monotone submodular function 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is called <i>symmetric</i> if for every 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 we have that 
  
    
      
        f
        (
        S
        )
        =
        f
        (
        Ω
        −
        S
        )
      
    
    {\displaystyle f(S)=f(\Omega -S)}
  
.
Examples of symmetric non-monotone submodular functions include:
</p>
Graph cuts
Let 
  
    
      
        Ω
        =
        {
        
          v
          
            1
          
        
        ,
        
          v
          
            2
          
        
        ,
        …
        ,
        
          v
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{v_{1},v_{2},\dots ,v_{n}\}}
  
 be the vertices of a <a href="/facts/Graph_(discrete_mathematics)/kw3eIBUe">graph</a>. For any set of vertices 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 let 
  
    
      
        f
        (
        S
        )
      
    
    {\displaystyle f(S)}
  
 denote the number of edges 
  
    
      
        e
        =
        (
        u
        ,
        v
        )
      
    
    {\displaystyle e=(u,v)}
  
 such that 
  
    
      
        u
        ∈
        S
      
    
    {\displaystyle u\in S}
  
 and 
  
    
      
        v
        ∈
        Ω
        −
        S
      
    
    {\displaystyle v\in \Omega -S}
  
. This can be generalized by adding non-negative weights to the edges.
<a href="/facts/Mutual_information/HIUvsjvV">Mutual information</a>
Let 
  
    
      
        Ω
        =
        {
        
          X
          
            1
          
        
        ,
        
          X
          
            2
          
        
        ,
        …
        ,
        
          X
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{X_{1},X_{2},\ldots ,X_{n}\}}
  
 be a set of <a href="/facts/Random_variable/TwTBXnLT">random variables</a>. Then for any 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 we have that 
  
    
      
        f
        (
        S
        )
        =
        I
        (
        S
        ;
        Ω
        −
        S
        )
      
    
    {\displaystyle f(S)=I(S;\Omega -S)}
  
 is a submodular function, where 
  
    
      
        I
        (
        S
        ;
        Ω
        −
        S
        )
      
    
    {\displaystyle I(S;\Omega -S)}
  
 is the mutual information.
<h4>Asymmetric</h4>
<p>A non-monotone submodular function which is not symmetric is called asymmetric.
</p>
Directed cuts
Let 
  
    
      
        Ω
        =
        {
        
          v
          
            1
          
        
        ,
        
          v
          
            2
          
        
        ,
        …
        ,
        
          v
          
            n
          
        
        }
      
    
    {\displaystyle \Omega =\{v_{1},v_{2},\dots ,v_{n}\}}
  
 be the vertices of a <a href="/facts/Directed_graph/YBdfQ0um">directed graph</a>. For any set of vertices 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 let 
  
    
      
        f
        (
        S
        )
      
    
    {\displaystyle f(S)}
  
 denote the number of edges 
  
    
      
        e
        =
        (
        u
        ,
        v
        )
      
    
    {\displaystyle e=(u,v)}
  
 such that 
  
    
      
        u
        ∈
        S
      
    
    {\displaystyle u\in S}
  
 and 
  
    
      
        v
        ∈
        Ω
        −
        S
      
    
    {\displaystyle v\in \Omega -S}
  
. This can be generalized by adding non-negative weights to the directed edges.
<h2 id="continuous-extensions-of-submodular-set-functions">Continuous extensions of submodular set functions</h2>
<p>Often, given a submodular set function that describes the values of various sets, we need to compute the values of <i>fractional</i> sets. For example: we know that the value of receiving house A and house B is V, and we want to know the value of receiving 40% of house A and 60% of house B. To this end, we need a <i>continuous extension</i> of the submodular set function.
</p><p>Formally, a set function 
  
    
      
        f
        :
        
          2
          
            Ω
          
        
        →
        
          R
        
      
    
    {\displaystyle f:2^{\Omega }\rightarrow \mathbb {R} }
  
 with 
  
    
      
        
          |
        
        Ω
        
          |
        
        =
        n
      
    
    {\displaystyle |\Omega |=n}
  
 can be represented as a function on 
  
    
      
        {
        0
        ,
        1
        
          }
          
            n
          
        
      
    
    {\displaystyle \{0,1\}^{n}}
  
, by associating each 
  
    
      
        S
        ⊆
        Ω
      
    
    {\displaystyle S\subseteq \Omega }
  
 with a binary vector 
  
    
      
        
          x
          
            S
          
        
        ∈
        {
        0
        ,
        1
        
          }
          
            n
          
        
      
    
    {\displaystyle x^{S}\in \{0,1\}^{n}}
  
 such that 
  
    
      
        
          x
          
            i
          
          
            S
          
        
        =
        1
      
    
    {\displaystyle x_{i}^{S}=1}
  
 when 
  
    
      
        i
        ∈
        S
      
    
    {\displaystyle i\in S}
  
, and 
  
    
      
        
          x
          
            i
          
          
            S
          
        
        =
        0
      
    
    {\displaystyle x_{i}^{S}=0}
  
 otherwise. A <i>continuous <a href="/facts/Restriction_(mathematics)/JV59DhKy">extension</a></i> of 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is a continuous function 
  
    
      
        F
        :
        [
        0
        ,
        1
        
          ]
          
            n
          
        
        →
        
          R
        
      
    
    {\displaystyle F:[0,1]^{n}\rightarrow \mathbb {R} }
  
, that matches the value of 
  
    
      
        f
      
    
    {\displaystyle f}
  
 on 
  
    
      
        x
        ∈
        {
        0
        ,
        1
        
          }
          
            n
          
        
      
    
    {\displaystyle x\in \{0,1\}^{n}}
  
, i.e. 
  
    
      
        F
        (
        
          x
          
            S
          
        
        )
        =
        f
        (
        S
        )
      
    
    {\displaystyle F(x^{S})=f(S)}
  
.
</p><p>Several kinds of continuous extensions of submodular functions are commonly used, which are described below.
</p>
<h3>Lovász extension</h3>
<p>This extension is named after mathematician <a href="/facts/L%C3%A1szl%C3%B3_Lov%C3%A1sz/X0VB38Ee">László Lovász</a>.<a class="footnote-ref" id="fnref:9" href="#fn:9"><sup>9</sup></a> Consider any vector 
  
    
      
        
          x
        
        =
        {
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        ,
        …
        ,
        
          x
          
            n
          
        
        }
      
    
    {\displaystyle \mathbf {x} =\{x_{1},x_{2},\dots ,x_{n}\}}
  
 such that each 
  
    
      
        0
        ≤
        
          x
          
            i
          
        
        ≤
        1
      
    
    {\displaystyle 0\leq x_{i}\leq 1}
  
. Then the Lovász extension is defined as
</p><p>
  
    
      
        
          f
          
            L
          
        
        (
        
          x
        
        )
        =
        
          E
        
        (
        f
        (
        {
        i
        
          |
        
        
          x
          
            i
          
        
        ≥
        λ
        }
        )
        )
      
    
    {\displaystyle f^{L}(\mathbf {x} )=\mathbb {E} (f(\{i|x_{i}\geq \lambda \}))}

</p><p>where the expectation is over 
  
    
      
        λ
      
    
    {\displaystyle \lambda }
  
 chosen from the <a href="/facts/Uniform_distribution_(continuous)/XbnlVljT">uniform distribution</a> on the interval 
  
    
      
        [
        0
        ,
        1
        ]
      
    
    {\displaystyle [0,1]}
  
. The Lovász extension is a convex function if and only if 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is a submodular function.
</p>
<h3>Multilinear extension</h3>
<p>Consider any vector 
  
    
      
        
          x
        
        =
        {
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        ,
        …
        ,
        
          x
          
            n
          
        
        }
      
    
    {\displaystyle \mathbf {x} =\{x_{1},x_{2},\ldots ,x_{n}\}}
  
 such that each 
  
    
      
        0
        ≤
        
          x
          
            i
          
        
        ≤
        1
      
    
    {\displaystyle 0\leq x_{i}\leq 1}
  
. Then the multilinear extension is defined as <a class="footnote-ref" id="fnref:10" href="#fn:10"><sup>10</sup></a><a class="footnote-ref" id="fnref:11" href="#fn:11"><sup>11</sup></a>
  
    
      
        F
        (
        
          x
        
        )
        =
        
          ∑
          
            S
            ⊆
            Ω
          
        
        f
        (
        S
        )
        
          ∏
          
            i
            ∈
            S
          
        
        
          x
          
            i
          
        
        
          ∏
          
            i
            ∉
            S
          
        
        (
        1
        −
        
          x
          
            i
          
        
        )
      
    
    {\displaystyle F(\mathbf {x} )=\sum _{S\subseteq \Omega }f(S)\prod _{i\in S}x_{i}\prod _{i\notin S}(1-x_{i})}
  
.
</p><p>Intuitively, <i>xi</i> represents the probability that item <i>i</i> is chosen for the set. For every set <i>S</i>, the two inner products represent the probability that the chosen set is exactly <i>S</i>. Therefore, the sum represents the expected value of <i>f</i> for the set formed by choosing each item <i>i</i> at random with probability xi, independently of the other items.
</p>
<h3>Convex closure</h3>
<p>Consider any vector 
  
    
      
        
          x
        
        =
        {
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        ,
        …
        ,
        
          x
          
            n
          
        
        }
      
    
    {\displaystyle \mathbf {x} =\{x_{1},x_{2},\dots ,x_{n}\}}
  
 such that each 
  
    
      
        0
        ≤
        
          x
          
            i
          
        
        ≤
        1
      
    
    {\displaystyle 0\leq x_{i}\leq 1}
  
. Then the convex closure is defined as 
  
    
      
        
          f
          
            −
          
        
        (
        
          x
        
        )
        =
        min
        
          (
          
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            f
            (
            S
            )
            :
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            
              1
              
                S
              
            
            =
            
              x
            
            ,
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            =
            1
            ,
            
              α
              
                S
              
            
            ≥
            0
          
          )
        
      
    
    {\displaystyle f^{-}(\mathbf {x} )=\min \left(\sum _{S}\alpha _{S}f(S):\sum _{S}\alpha _{S}1_{S}=\mathbf {x} ,\sum _{S}\alpha _{S}=1,\alpha _{S}\geq 0\right)}
  
.
</p><p>The convex closure of any set function is convex over 
  
    
      
        [
        0
        ,
        1
        
          ]
          
            n
          
        
      
    
    {\displaystyle [0,1]^{n}}
  
.
</p>
<h3>Concave closure</h3>
<p>Consider any vector 
  
    
      
        
          x
        
        =
        {
        
          x
          
            1
          
        
        ,
        
          x
          
            2
          
        
        ,
        …
        ,
        
          x
          
            n
          
        
        }
      
    
    {\displaystyle \mathbf {x} =\{x_{1},x_{2},\dots ,x_{n}\}}
  
 such that each 
  
    
      
        0
        ≤
        
          x
          
            i
          
        
        ≤
        1
      
    
    {\displaystyle 0\leq x_{i}\leq 1}
  
. Then the concave closure is defined as 
  
    
      
        
          f
          
            +
          
        
        (
        
          x
        
        )
        =
        max
        
          (
          
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            f
            (
            S
            )
            :
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            
              1
              
                S
              
            
            =
            
              x
            
            ,
            
              ∑
              
                S
              
            
            
              α
              
                S
              
            
            =
            1
            ,
            
              α
              
                S
              
            
            ≥
            0
          
          )
        
      
    
    {\displaystyle f^{+}(\mathbf {x} )=\max \left(\sum _{S}\alpha _{S}f(S):\sum _{S}\alpha _{S}1_{S}=\mathbf {x} ,\sum _{S}\alpha _{S}=1,\alpha _{S}\geq 0\right)}
  
.
</p>
<h3>Relations between continuous extensions</h3>
<p>For the extensions discussed above, it can be shown that 
  
    
      
        
          f
          
            +
          
        
        (
        
          x
        
        )
        ≥
        F
        (
        
          x
        
        )
        ≥
        
          f
          
            −
          
        
        (
        
          x
        
        )
        =
        
          f
          
            L
          
        
        (
        
          x
        
        )
      
    
    {\displaystyle f^{+}(\mathbf {x} )\geq F(\mathbf {x} )\geq f^{-}(\mathbf {x} )=f^{L}(\mathbf {x} )}
  
 when 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is submodular.<a class="footnote-ref" id="fnref:12" href="#fn:12"><sup>12</sup></a>
</p>
<h2 id="properties">Properties</h2>
<ol><li>The class of submodular functions is <a href="/facts/Closure_(mathematics)/Ct6r2fJz">closed</a> under non-negative <a href="/facts/Linear_combination/CVjL6kuN">linear combinations</a>. Consider any submodular function 
  
    
      
        
          f
          
            1
          
        
        ,
        
          f
          
            2
          
        
        ,
        …
        ,
        
          f
          
            k
          
        
      
    
    {\displaystyle f_{1},f_{2},\ldots ,f_{k}}
  
 and non-negative numbers 
  
    
      
        
          α
          
            1
          
        
        ,
        
          α
          
            2
          
        
        ,
        …
        ,
        
          α
          
            k
          
        
      
    
    {\displaystyle \alpha _{1},\alpha _{2},\ldots ,\alpha _{k}}
  
. Then the function 
  
    
      
        g
      
    
    {\displaystyle g}
  
 defined by 
  
    
      
        g
        (
        S
        )
        =
        
          ∑
          
            i
            =
            1
          
          
            k
          
        
        
          α
          
            i
          
        
        
          f
          
            i
          
        
        (
        S
        )
      
    
    {\displaystyle g(S)=\sum _{i=1}^{k}\alpha _{i}f_{i}(S)}
  
 is submodular.</li>
<li>For any submodular function 
  
    
      
        f
      
    
    {\displaystyle f}
  
, the function defined by 
  
    
      
        g
        (
        S
        )
        =
        f
        (
        Ω
        ∖
        S
        )
      
    
    {\displaystyle g(S)=f(\Omega \setminus S)}
  
 is submodular.</li>
<li>The function 
  
    
      
        g
        (
        S
        )
        =
        min
        (
        f
        (
        S
        )
        ,
        c
        )
      
    
    {\displaystyle g(S)=\min(f(S),c)}
  
, where 
  
    
      
        c
      
    
    {\displaystyle c}
  
 is a real number, is submodular whenever 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is monotone submodular. More generally, 
  
    
      
        g
        (
        S
        )
        =
        h
        (
        f
        (
        S
        )
        )
      
    
    {\displaystyle g(S)=h(f(S))}
  
 is submodular, for any non decreasing concave function 
  
    
      
        h
      
    
    {\displaystyle h}
  
.</li>
<li>Consider a random process where a set 
  
    
      
        T
      
    
    {\displaystyle T}
  
 is chosen with each element in 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
 being included in 
  
    
      
        T
      
    
    {\displaystyle T}
  
 independently with probability 
  
    
      
        p
      
    
    {\displaystyle p}
  
. Then the following inequality is true 
  
    
      
        
          E
        
        [
        f
        (
        T
        )
        ]
        ≥
        p
        f
        (
        Ω
        )
        +
        (
        1
        −
        p
        )
        f
        (
        ∅
        )
      
    
    {\displaystyle \mathbb {E} [f(T)]\geq pf(\Omega )+(1-p)f(\varnothing )}
  
 where 
  
    
      
        ∅
      
    
    {\displaystyle \varnothing }
  
 is the empty set. More generally consider the following random process where a set 
  
    
      
        S
      
    
    {\displaystyle S}
  
 is constructed as follows. For each of 
  
    
      
        1
        ≤
        i
        ≤
        l
        ,
        
          A
          
            i
          
        
        ⊆
        Ω
      
    
    {\displaystyle 1\leq i\leq l,A_{i}\subseteq \Omega }
  
 construct 
  
    
      
        
          S
          
            i
          
        
      
    
    {\displaystyle S_{i}}
  
 by including each element in 
  
    
      
        
          A
          
            i
          
        
      
    
    {\displaystyle A_{i}}
  
 independently into 
  
    
      
        
          S
          
            i
          
        
      
    
    {\displaystyle S_{i}}
  
 with probability 
  
    
      
        
          p
          
            i
          
        
      
    
    {\displaystyle p_{i}}
  
. Furthermore let 
  
    
      
        S
        =
        
          ∪
          
            i
            =
            1
          
          
            l
          
        
        
          S
          
            i
          
        
      
    
    {\displaystyle S=\cup _{i=1}^{l}S_{i}}
  
. Then the following inequality is true 
  
    
      
        
          E
        
        [
        f
        (
        S
        )
        ]
        ≥
        
          ∑
          
            R
            ⊆
            [
            l
            ]
          
        
        
          Π
          
            i
            ∈
            R
          
        
        
          p
          
            i
          
        
        
          Π
          
            i
            ∉
            R
          
        
        (
        1
        −
        
          p
          
            i
          
        
        )
        f
        (
        
          ∪
          
            i
            ∈
            R
          
        
        
          A
          
            i
          
        
        )
      
    
    {\displaystyle \mathbb {E} [f(S)]\geq \sum _{R\subseteq [l]}\Pi _{i\in R}p_{i}\Pi _{i\notin R}(1-p_{i})f(\cup _{i\in R}A_{i})}
  
.</li></ol>
<h2 id="optimization-problems">Optimization problems</h2>
<p>Submodular functions have properties which are very similar to <a href="/facts/Convex_function/IbzG5SLF">convex</a> and <a href="/facts/Concave_function/HxVEBkce">concave functions</a>. For this reason, an <a href="/facts/Optimization_problem/QzwJfFEE">optimization problem</a> which concerns optimizing a convex or concave function can also be described as the problem of maximizing or minimizing a submodular function subject to some constraints.
</p>
<h3>Submodular set function minimization</h3>
<p>The hardness of minimizing a submodular set function depends on constraints imposed on the problem.
</p>
<ol><li>The unconstrained problem of minimizing a submodular function is computable in <a href="/facts/Polynomial_time/77T62gmf">polynomial time</a>,<a class="footnote-ref" id="fnref:13" href="#fn:13"><sup>13</sup></a><a class="footnote-ref" id="fnref:14" href="#fn:14"><sup>14</sup></a> and even in <a href="/facts/Strongly_polynomial/77T62gmf">strongly-polynomial</a> time.<a class="footnote-ref" id="fnref:15" href="#fn:15"><sup>15</sup></a><a class="footnote-ref" id="fnref:16" href="#fn:16"><sup>16</sup></a> Computing the <a href="/facts/Minimum_cut/xWTNp1hN">minimum cut</a> in a graph is a special case of this minimization problem.</li>
<li>The problem of minimizing a submodular function with a cardinality lower bound is <a href="/facts/NP-hard/mQeNPv4R">NP-hard</a>, with polynomial factor lower bounds on the approximation factor.<a class="footnote-ref" id="fnref:17" href="#fn:17"><sup>17</sup></a><a class="footnote-ref" id="fnref:18" href="#fn:18"><sup>18</sup></a></li></ol>
<h3>Submodular set function maximization</h3>
<p>Unlike the case of minimization, maximizing a generic submodular function is <a href="/facts/NP-hard/mQeNPv4R">NP-hard</a> even in the unconstrained setting. Thus, most of the works in this field are concerned with polynomial-time approximation algorithms, including <a href="/facts/Greedy_algorithm/alfsekco">greedy algorithms</a> or <a href="/facts/Local_search_(optimization)/G1D2aHBz">local search algorithms</a>.
</p>
<ol><li>The problem of maximizing a non-negative submodular function admits a 1/2 approximation algorithm.<a class="footnote-ref" id="fnref:19" href="#fn:19"><sup>19</sup></a><a class="footnote-ref" id="fnref:20" href="#fn:20"><sup>20</sup></a> Computing the <a href="/facts/Maximum_cut/9rulcdeW">maximum cut</a> of a graph is a special case of this problem.</li>
<li>The problem of maximizing a monotone submodular function subject to a cardinality constraint admits a 
  
    
      
        1
        −
        1
        
          /
        
        e
      
    
    {\displaystyle 1-1/e}
  
 approximation algorithm.<a class="footnote-ref" id="fnref:21" href="#fn:21"><sup>21</sup></a><a class="footnote-ref" id="fnref:22" href="#fn:22"><sup>22</sup></a> The <a href="/facts/Maximum_coverage_problem/oeTR6bxe">maximum coverage problem</a> is a special case of this problem.</li>
<li>The problem of maximizing a monotone submodular function subject to a <a href="/facts/Matroid/5bLcGlRF">matroid</a> constraint (which subsumes the case above) also admits a 
  
    
      
        1
        −
        1
        
          /
        
        e
      
    
    {\displaystyle 1-1/e}
  
 approximation algorithm.<a class="footnote-ref" id="fnref:23" href="#fn:23"><sup>23</sup></a><a class="footnote-ref" id="fnref:24" href="#fn:24"><sup>24</sup></a><a class="footnote-ref" id="fnref:25" href="#fn:25"><sup>25</sup></a></li></ol>
<p>Many of these algorithms can be unified within a semi-differential based framework of algorithms.<a class="footnote-ref" id="fnref:26" href="#fn:26"><sup>26</sup></a>
</p>
<h3>Related optimization problems</h3>
<p>Apart from submodular minimization and maximization, there are several other natural optimization problems related to submodular functions.
</p>
<ol><li>Minimizing the difference between two submodular functions<a class="footnote-ref" id="fnref:27" href="#fn:27"><sup>27</sup></a> is not only NP hard, but also inapproximable.<a class="footnote-ref" id="fnref:28" href="#fn:28"><sup>28</sup></a></li>
<li>Minimization/maximization of a submodular function subject to a submodular level set constraint (also known as submodular optimization subject to submodular cover or submodular knapsack constraint) admits bounded approximation guarantees.<a class="footnote-ref" id="fnref:29" href="#fn:29"><sup>29</sup></a></li>
<li>Partitioning data based on a submodular function to maximize the average welfare is known as the submodular welfare problem, which also admits bounded approximation guarantees (see <a href="/facts/Welfare_maximization/yaS2GTT2">welfare maximization</a>).</li></ol>
<h2 id="applications">Applications</h2>
<p>Submodular functions naturally occur in several real world applications, in <a href="/facts/Economics/HYctA2Dn">economics</a>, <a href="/facts/Game_theory/zkEIj1ya">game theory</a>, <a href="/facts/Machine_learning/e0w0XJTu">machine learning</a> and <a href="/facts/Computer_vision/Tl2Yyk66">computer vision</a>.<a class="footnote-ref" id="fnref:30" href="#fn:30"><sup>30</sup></a><a class="footnote-ref" id="fnref:31" href="#fn:31"><sup>31</sup></a> Owing to the diminishing returns property, submodular functions naturally model costs of items, since there is often a larger discount, with an increase in the items one buys. Submodular functions model notions of complexity, similarity and cooperation when they appear in minimization problems. In maximization problems, on the other hand, they model notions of diversity, information and coverage.
</p>
<h2 id="see-also">See also</h2>
<ul><li><a href="/facts/Supermodular_function/EkrRrU0f">Supermodular function</a></li>
<li><a href="/facts/Matroid/5bLcGlRF">Matroid</a>, <a href="/facts/Polymatroid/qtraTA2a">Polymatroid</a></li>
<li><a href="/facts/Utility_functions_on_indivisible_goods/fYTKPz7d">Utility functions on indivisible goods</a></li></ul>
<h2 id="citations">Citations</h2>

<ul><li><a href="/facts/Alexander_Schrijver/eJHJNRGz">Schrijver, Alexander</a> (2003), <i>Combinatorial Optimization</i>, <a href="/facts/Springer_Publishing/JfRev59b">Springer</a>, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 3-540-44389-4</li>
<li><a href="/facts/Jon_Lee_(mathematician)/hnfK1FQN">Lee, Jon</a> (2004), <i>A First Course in Combinatorial Optimization</i>, <a href="/facts/Cambridge_University_Press/fgEBSSRq">Cambridge University Press</a>, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 0-521-01012-8</li>
<li>Fujishige, Satoru (2005), <i>Submodular Functions and Optimization</i>, <a href="/facts/Elsevier/jkPSAEmR">Elsevier</a>, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 0-444-52086-4</li>
<li>Narayanan, H. (1997), <i>Submodular Functions and Electrical Networks</i>, Elsevier, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 0-444-82523-1</li>
<li>Oxley, James G. (1992), <i>Matroid theory</i>, Oxford Science Publications, Oxford: <a href="/facts/Oxford_University_Press/CYyTzzWG">Oxford University Press</a>, <a href="/facts/ISBN_(identifier)/15AdSPa9">ISBN</a> 0-19-853563-5, <a href="/facts/Zbl_(identifier)/P6rFxKKx">Zbl</a> <a href="https://zbmath.org/?format=complete&q=an:0784.05002">0784.05002</a></li></ul>
<h2 id="external-links">External links</h2>
<ul><li><a href="http://www.cs.berkeley.edu/~stefje/references.html">http://www.cs.berkeley.edu/~stefje/references.html</a> has a longer bibliography</li>
<li><a href="http://submodularity.org/">http://submodularity.org/</a> includes further material on the subject</li></ul>

<h2 id="references">References</h2>

<ol>
<li id="fn:1"><p>H. Lin and J. Bilmes, A Class of Submodular Functions for Document Summarization, ACL-2011. <a href="#fnref:1" class="footnote-back-ref">↩</a></p></li>
<li id="fn:2"><p>S. Tschiatschek, R. Iyer, H. Wei and J. Bilmes, Learning Mixtures of Submodular Functions for Image Collection Summarization, NIPS-2014. <a href="#fnref:2" class="footnote-back-ref">↩</a></p></li>
<li id="fn:3"><p>A. Krause and C. Guestrin, Near-optimal nonmyopic value of information in graphical models, UAI-2005. <a href="#fnref:3" class="footnote-back-ref">↩</a></p></li>
<li id="fn:4"><p>A. Krause and C. Guestrin, Beyond Convexity: Submodularity in Machine Learning, Tutorial at ICML-2008 <a href="#fnref:4" class="footnote-back-ref">↩</a></p></li>
<li id="fn:5"><p>(Schrijver 2003, §44, p. 766) - Schrijver, Alexander (2003), Combinatorial Optimization, Springer, ISBN 3-540-44389-4 <a href="#fnref:5" class="footnote-back-ref">↩</a></p></li>
<li id="fn:6"><p>Buchbinder, Niv; Feldman, Moran (2018). "Submodular Functions Maximization Problems". In Gonzalez, Teofilo F. (ed.). Handbook of Approximation Algorithms and Metaheuristics, Second Edition: Methodologies and Traditional Applications. Chapman and Hall/CRC. doi:10.1201/9781351236423. ISBN 9781351236423. <a href="9781351236423" target="_blank">9781351236423</a> <a href="#fnref:6" class="footnote-back-ref">↩</a></p></li>
<li id="fn:7"><p>"Information Processing and Learning" (PDF). cmu. <a href="https://www.cs.cmu.edu/~aarti/Class/10704_Spring15/lecs/lec3.pdf" target="_blank">https://www.cs.cmu.edu/~aarti/Class/10704_Spring15/lecs/lec3.pdf</a> <a href="#fnref:7" class="footnote-back-ref">↩</a></p></li>
<li id="fn:8"><p>Fujishige (2005) p.22 <a href="#fnref:8" class="footnote-back-ref">↩</a></p></li>
<li id="fn:9"><p>Lovász, L. (1983). "Submodular functions and convexity". Mathematical Programming the State of the Art. pp. 235–257. doi:10.1007/978-3-642-68874-4_10. ISBN 978-3-642-68876-8. S2CID 117358746. <a href="978-3-642-68876-8" target="_blank">978-3-642-68876-8</a> <a href="#fnref:9" class="footnote-back-ref">↩</a></p></li>
<li id="fn:10"><p>Vondrak, Jan (2008-05-17). "Optimal approximation for the submodular welfare problem in the value oracle model". Proceedings of the fortieth annual ACM symposium on Theory of computing. STOC '08. New York, NY, USA: Association for Computing Machinery. pp. 67–74. doi:10.1145/1374376.1374389. ISBN 978-1-60558-047-0. S2CID 170510. <a href="978-1-60558-047-0" target="_blank">978-1-60558-047-0</a> <a href="#fnref:10" class="footnote-back-ref">↩</a></p></li>
<li id="fn:11"><p>Calinescu, Gruia; Chekuri, Chandra; Pál, Martin; Vondrák, Jan (January 2011). "Maximizing a Monotone Submodular Function Subject to a Matroid Constraint". SIAM Journal on Computing. 40 (6): 1740–1766. doi:10.1137/080733991. ISSN 0097-5397. <a href="http://epubs.siam.org/doi/10.1137/080733991" target="_blank">http://epubs.siam.org/doi/10.1137/080733991</a> <a href="#fnref:11" class="footnote-back-ref">↩</a></p></li>
<li id="fn:12"><p>Vondrák, Jan. "Polyhedral techniques in combinatorial optimization: Lecture 17" (PDF). <a href="https://theory.stanford.edu/~jvondrak/CS369P/lec17.pdf" target="_blank">https://theory.stanford.edu/~jvondrak/CS369P/lec17.pdf</a> <a href="#fnref:12" class="footnote-back-ref">↩</a></p></li>
<li id="fn:13"><p>Grötschel, M.; Lovasz, L.; Schrijver, A. (1981). "The ellipsoid method and its consequences in combinatorial optimization". Combinatorica. 1 (2): 169–197. doi:10.1007/BF02579273. hdl:10068/182482. S2CID 43787103. <a href="/wiki/Martin_Gr%C3%B6tschel" target="_blank">/wiki/Martin_Gr%C3%B6tschel</a> <a href="#fnref:13" class="footnote-back-ref">↩</a></p></li>
<li id="fn:14"><p>Cunningham, W. H. (1985). "On submodular function minimization". Combinatorica. 5 (3): 185–192. doi:10.1007/BF02579361. S2CID 33192360. <a href="/wiki/Doi_(identifier)" target="_blank">/wiki/Doi_(identifier)</a> <a href="#fnref:14" class="footnote-back-ref">↩</a></p></li>
<li id="fn:15"><p>Iwata, S.; Fleischer, L.; Fujishige, S. (2001). "A combinatorial strongly polynomial algorithm for minimizing submodular functions". J. ACM. 48 (4): 761–777. doi:10.1145/502090.502096. S2CID 888513. <a href="/wiki/Doi_(identifier)" target="_blank">/wiki/Doi_(identifier)</a> <a href="#fnref:15" class="footnote-back-ref">↩</a></p></li>
<li id="fn:16"><p>Schrijver, A. (2000). "A combinatorial algorithm minimizing submodular functions in strongly polynomial time". J. Combin. Theory Ser. B. 80 (2): 346–355. doi:10.1006/jctb.2000.1989. <a href="/wiki/Alexander_Schrijver" target="_blank">/wiki/Alexander_Schrijver</a> <a href="#fnref:16" class="footnote-back-ref">↩</a></p></li>
<li id="fn:17"><p>Z. Svitkina and L. Fleischer, Submodular approximation: Sampling-based algorithms and lower bounds, SIAM Journal on Computing (2011). <a href="#fnref:17" class="footnote-back-ref">↩</a></p></li>
<li id="fn:18"><p>R. Iyer, S. Jegelka and J. Bilmes, Fast Semidifferential based submodular function optimization, Proc. ICML (2013). <a href="/wiki/Stefanie_Jegelka" target="_blank">/wiki/Stefanie_Jegelka</a> <a href="#fnref:18" class="footnote-back-ref">↩</a></p></li>
<li id="fn:19"><p>U. Feige, V. Mirrokni and J. Vondrák, Maximizing non-monotone submodular functions, Proc. of 48th FOCS (2007), pp. 461–471. <a href="/wiki/Uriel_Feige" target="_blank">/wiki/Uriel_Feige</a> <a href="#fnref:19" class="footnote-back-ref">↩</a></p></li>
<li id="fn:20"><p>N. Buchbinder, M. Feldman, J. Naor and R. Schwartz, A tight linear time (1/2)-approximation for unconstrained submodular maximization, Proc. of 53rd FOCS (2012), pp. 649-658. <a href="#fnref:20" class="footnote-back-ref">↩</a></p></li>
<li id="fn:21"><p>Nemhauser, George; Wolsey, L. A.; Fisher, M. L. (1978). "An analysis of approximations for maximizing submodular set functions I". Mathematical Programming. 14 (14): 265–294. doi:10.1007/BF01588971. S2CID 206800425. <a href="/wiki/George_Nemhauser" target="_blank">/wiki/George_Nemhauser</a> <a href="#fnref:21" class="footnote-back-ref">↩</a></p></li>
<li id="fn:22"><p>Williamson, David P. "Bridging Continuous and Discrete Optimization: Lecture 23" (PDF). <a href="https://people.orie.cornell.edu/dpw/orie6334/lecture23.pdf" target="_blank">https://people.orie.cornell.edu/dpw/orie6334/lecture23.pdf</a> <a href="#fnref:22" class="footnote-back-ref">↩</a></p></li>
<li id="fn:23"><p>G. Calinescu, C. Chekuri, M. Pál and J. Vondrák, Maximizing a submodular set function subject to a matroid constraint, SIAM J. Comp. 40:6 (2011), 1740-1766. <a href="#fnref:23" class="footnote-back-ref">↩</a></p></li>
<li id="fn:24"><p>M. Feldman, J. Naor and R. Schwartz, A unified continuous greedy algorithm for submodular maximization, Proc. of 52nd FOCS (2011). <a href="#fnref:24" class="footnote-back-ref">↩</a></p></li>
<li id="fn:25"><p>Y. Filmus, J. Ward, A tight combinatorial algorithm for submodular maximization subject to a matroid constraint, Proc. of 53rd FOCS (2012), pp. 659-668. <a href="#fnref:25" class="footnote-back-ref">↩</a></p></li>
<li id="fn:26"><p>R. Iyer, S. Jegelka and J. Bilmes, Fast Semidifferential based submodular function optimization, Proc. ICML (2013). <a href="/wiki/Stefanie_Jegelka" target="_blank">/wiki/Stefanie_Jegelka</a> <a href="#fnref:26" class="footnote-back-ref">↩</a></p></li>
<li id="fn:27"><p>M. Narasimhan and J. Bilmes, A submodular-supermodular procedure with applications to discriminative structure learning, In Proc. UAI (2005). <a href="#fnref:27" class="footnote-back-ref">↩</a></p></li>
<li id="fn:28"><p>R. Iyer and J. Bilmes, Algorithms for Approximate Minimization of the Difference between Submodular Functions, In Proc. UAI (2012). <a href="#fnref:28" class="footnote-back-ref">↩</a></p></li>
<li id="fn:29"><p>R. Iyer and J. Bilmes, Submodular Optimization Subject to Submodular Cover and Submodular Knapsack Constraints, In Advances of NIPS (2013). <a href="#fnref:29" class="footnote-back-ref">↩</a></p></li>
<li id="fn:30"><p>A. Krause and C. Guestrin, Beyond Convexity: Submodularity in Machine Learning, Tutorial at ICML-2008 <a href="#fnref:30" class="footnote-back-ref">↩</a></p></li>
<li id="fn:31"><p>J. Bilmes, Submodularity in Machine Learning Applications, Tutorial at AAAI-2015. <a href="#fnref:31" class="footnote-back-ref">↩</a></p></li>
</ol>

Submodular set function open-in-new

Submodular set function