Change of variables

<h2 id="simple-example">Simple example</h2>
<p>Consider the system of equations
</p>

x
        y
        +
        x
        +
        y
        =
        71
      
    
    {\displaystyle xy+x+y=71}

x
          
            2
          
        
        y
        +
        x
        
          y
          
            2
          
        
        =
        880
      
    
    {\displaystyle x^{2}y+xy^{2}=880}

<p>where 
  
    
      
        x
      
    
    {\displaystyle x}
  
 and 
  
    
      
        y
      
    
    {\displaystyle y}
  
 are positive integers with 
  
    
      
        x
        >
        y
      
    
    {\displaystyle x>y}
  
. (Source: 1991 <a href="/facts/American_Invitational_Mathematics_Examination/sSulQLjP">AIME</a>)
</p><p>Solving this normally is not very difficult, but it may get a little tedious. However, we can rewrite the second equation as 
  
    
      
        x
        y
        (
        x
        +
        y
        )
        =
        880
      
    
    {\displaystyle xy(x+y)=880}
  
. Making the substitutions 
  
    
      
        s
        =
        x
        +
        y
      
    
    {\displaystyle s=x+y}
  
 and 
  
    
      
        t
        =
        x
        y
      
    
    {\displaystyle t=xy}
  
 reduces the system to 
  
    
      
        s
        +
        t
        =
        71
        ,
        s
        t
        =
        880
      
    
    {\displaystyle s+t=71,st=880}
  
. Solving this gives 
  
    
      
        (
        s
        ,
        t
        )
        =
        (
        16
        ,
        55
        )
      
    
    {\displaystyle (s,t)=(16,55)}
  
 and 
  
    
      
        (
        s
        ,
        t
        )
        =
        (
        55
        ,
        16
        )
      
    
    {\displaystyle (s,t)=(55,16)}
  
. Back-substituting the first ordered pair gives us 
  
    
      
        x
        +
        y
        =
        16
        ,
        x
        y
        =
        55
        ,
        x
        >
        y
      
    
    {\displaystyle x+y=16,xy=55,x>y}
  
, which gives the solution 
  
    
      
        (
        x
        ,
        y
        )
        =
        (
        11
        ,
        5
        )
        .
      
    
    {\displaystyle (x,y)=(11,5).}
  
 Back-substituting the second ordered pair gives us 
  
    
      
        x
        +
        y
        =
        55
        ,
        x
        y
        =
        16
        ,
        x
        >
        y
      
    
    {\displaystyle x+y=55,xy=16,x>y}
  
, which gives no solutions. Hence the solution that solves the system is 
  
    
      
        (
        x
        ,
        y
        )
        =
        (
        11
        ,
        5
        )
      
    
    {\displaystyle (x,y)=(11,5)}
  
.
</p>
<h2 id="formal-introduction">Formal introduction</h2>
<p>Let 
  
    
      
        A
      
    
    {\displaystyle A}
  
, 
  
    
      
        B
      
    
    {\displaystyle B}
  
 be <a href="/facts/Smooth_manifold/aSnbXKcV">smooth manifolds</a> and let 
  
    
      
        Φ
        :
        A
        →
        B
      
    
    {\displaystyle \Phi :A\rightarrow B}
  
 be a 
  
    
      
        
          C
          
            r
          
        
      
    
    {\displaystyle C^{r}}
  
-<a href="/facts/Diffeomorphism/855N5YYj">diffeomorphism</a> between them, that is: 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 is a 
  
    
      
        r
      
    
    {\displaystyle r}
  
 times continuously differentiable, <a href="/facts/Bijective/j4gTuTmW">bijective</a> map from 
  
    
      
        A
      
    
    {\displaystyle A}
  
 to 
  
    
      
        B
      
    
    {\displaystyle B}
  
 with 
  
    
      
        r
      
    
    {\displaystyle r}
  
 times continuously differentiable inverse from 
  
    
      
        B
      
    
    {\displaystyle B}
  
 to 
  
    
      
        A
      
    
    {\displaystyle A}
  
. Here 
  
    
      
        r
      
    
    {\displaystyle r}
  
 may be any natural number (or zero), 
  
    
      
        ∞
      
    
    {\displaystyle \infty }
  
 (<a href="/facts/Smooth_function/mffL4ch2">smooth</a>) or 
  
    
      
        ω
      
    
    {\displaystyle \omega }
  
 (<a href="/facts/Analytic_function/JhL3z8FP">analytic</a>).
</p><p>The map 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 is called a <i>regular coordinate transformation</i> or <i>regular variable substitution</i>, where <i>regular</i> refers to the 
  
    
      
        
          C
          
            r
          
        
      
    
    {\displaystyle C^{r}}
  
-ness of 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
. Usually one will write 
  
    
      
        x
        =
        Φ
        (
        y
        )
      
    
    {\displaystyle x=\Phi (y)}
  
 to indicate the replacement of the variable 
  
    
      
        x
      
    
    {\displaystyle x}
  
 by the variable 
  
    
      
        y
      
    
    {\displaystyle y}
  
 by substituting the value of 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 in 
  
    
      
        y
      
    
    {\displaystyle y}
  
 for every occurrence of 
  
    
      
        x
      
    
    {\displaystyle x}
  
.
</p>
<h2 id="other-examples">Other examples</h2>
<h3>Coordinate transformation</h3>
<p>Some systems can be more easily solved when switching to <a href="/facts/Polar_coordinates/HBko4YoN">polar coordinates</a>. Consider for example the equation
</p>

U
        (
        x
        ,
        y
        )
        :=
        (
        
          x
          
            2
          
        
        +
        
          y
          
            2
          
        
        )
        
          
            1
            −
            
              
                
                  x
                  
                    2
                  
                
                
                  
                    x
                    
                      2
                    
                  
                  +
                  
                    y
                    
                      2
                    
                  
                
              
            
          
        
        =
        0.
      
    
    {\displaystyle U(x,y):=(x^{2}+y^{2}){\sqrt {1-{\frac {x^{2}}{x^{2}+y^{2}}}}}=0.}

<p>This may be a potential energy function for some physical problem. If one does not immediately see a solution, one might try the substitution
</p>

(
          x
          ,
          y
          )
          =
          Φ
          (
          r
          ,
          θ
          )
        
      
    
    {\displaystyle \displaystyle (x,y)=\Phi (r,\theta )}
  
 given by 
  
    
      
        
          Φ
          (
          r
          ,
          θ
          )
          =
          (
          r
          cos
          ⁡
          (
          θ
          )
          ,
          r
          sin
          ⁡
          (
          θ
          )
          )
          .
        
      
    
    {\displaystyle \displaystyle \Phi (r,\theta )=(r\cos(\theta ),r\sin(\theta )).}

<p>Note that if 
  
    
      
        θ
      
    
    {\displaystyle \theta }
  
 runs outside a 
  
    
      
        2
        π
      
    
    {\displaystyle 2\pi }
  
-length interval, for example, 
  
    
      
        [
        0
        ,
        2
        π
        ]
      
    
    {\displaystyle [0,2\pi ]}
  
, the map 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 is no longer bijective. Therefore, 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 should be limited to, for example 
  
    
      
        (
        0
        ,
        ∞
        ]
        ×
        [
        0
        ,
        2
        π
        )
      
    
    {\displaystyle (0,\infty ]\times [0,2\pi )}
  
. Notice how 
  
    
      
        r
        =
        0
      
    
    {\displaystyle r=0}
  
 is excluded, for 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 is not bijective in the origin (
  
    
      
        θ
      
    
    {\displaystyle \theta }
  
 can take any value, the point will be mapped to (0, 0)). Then, replacing all occurrences of the original variables by the new <a href="/facts/Expression_(mathematics)/MPrMlYbE">expressions</a> prescribed by 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 and using the identity 
  
    
      
        
          sin
          
            2
          
        
        ⁡
        x
        +
        
          cos
          
            2
          
        
        ⁡
        x
        =
        1
      
    
    {\displaystyle \sin ^{2}x+\cos ^{2}x=1}
  
, we get
</p>

V
        (
        r
        ,
        θ
        )
        =
        
          r
          
            2
          
        
        
          
            1
            −
            
              
                
                  
                    r
                    
                      2
                    
                  
                  
                    cos
                    
                      2
                    
                  
                  ⁡
                  θ
                
                
                  r
                  
                    2
                  
                
              
            
          
        
        =
        
          r
          
            2
          
        
        
          
            1
            −
            
              cos
              
                2
              
            
            ⁡
            θ
          
        
        =
        
          r
          
            2
          
        
        
          |
          
            sin
            ⁡
            θ
          
          |
        
        .
      
    
    {\displaystyle V(r,\theta )=r^{2}{\sqrt {1-{\frac {r^{2}\cos ^{2}\theta }{r^{2}}}}}=r^{2}{\sqrt {1-\cos ^{2}\theta }}=r^{2}\left|\sin \theta \right|.}

<p>Now the solutions can be readily found: 
  
    
      
        sin
        ⁡
        (
        θ
        )
        =
        0
      
    
    {\displaystyle \sin(\theta )=0}
  
, so 
  
    
      
        θ
        =
        0
      
    
    {\displaystyle \theta =0}
  
 or 
  
    
      
        θ
        =
        π
      
    
    {\displaystyle \theta =\pi }
  
. Applying the inverse of 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 shows that this is equivalent to 
  
    
      
        y
        =
        0
      
    
    {\displaystyle y=0}
  
 while 
  
    
      
        x
        ≠
        0
      
    
    {\displaystyle x\not =0}
  
. Indeed, we see that for 
  
    
      
        y
        =
        0
      
    
    {\displaystyle y=0}
  
 the function vanishes, except for the origin.
</p><p>Note that, had we allowed 
  
    
      
        r
        =
        0
      
    
    {\displaystyle r=0}
  
, the origin would also have been a solution, though it is not a solution to the original problem. Here the bijectivity of 
  
    
      
        Φ
      
    
    {\displaystyle \Phi }
  
 is crucial. The function is always positive (for 
  
    
      
        x
        ,
        y
        ∈
        
          R
        
      
    
    {\displaystyle x,y\in \mathbb {R} }
  
), hence the absolute values.
</p>
<h3>Differentiation</h3>
<p class="note">Main article: <a href="/facts/Chain_rule/aMLYxP0x">Chain rule</a></p>
<p>The <a href="/facts/Chain_rule/aMLYxP0x">chain rule</a> is used to simplify complicated differentiation. For example, consider the problem of calculating the derivative
</p>

d
            
              d
              x
            
          
        
        sin
        ⁡
        (
        
          x
          
            2
          
        
        )
        .
      
    
    {\displaystyle {\frac {d}{dx}}\sin(x^{2}).}

<p>Let 
  
    
      
        y
        =
        sin
        ⁡
        u
      
    
    {\displaystyle y=\sin u}
  
 with 
  
    
      
        u
        =
        
          x
          
            2
          
        
        .
      
    
    {\displaystyle u=x^{2}.}
  
 Then:
</p>

d
                    
                      d
                      x
                    
                  
                
                sin
                ⁡
                (
                
                  x
                  
                    2
                  
                
                )
              
              
                
                =
                
                  
                    
                      d
                      y
                    
                    
                      d
                      x
                    
                  
                
              
            
            
              
              
                
                =
                
                  
                    
                      d
                      y
                    
                    
                      d
                      u
                    
                  
                
                
                  
                    
                      d
                      u
                    
                    
                      d
                      x
                    
                  
                
              
              
              
                
                  This part is the chain rule.
                
              
            
            
              
              
                
                =
                
                  (
                  
                    
                      
                        d
                        
                          d
                          u
                        
                      
                    
                    sin
                    ⁡
                    u
                  
                  )
                
                
                  (
                  
                    
                      
                        d
                        
                          d
                          x
                        
                      
                    
                    
                      x
                      
                        2
                      
                    
                  
                  )
                
              
            
            
              
              
                
                =
                (
                cos
                ⁡
                u
                )
                (
                2
                x
                )
              
            
            
              
              
                
                =
                
                  (
                  
                    cos
                    ⁡
                    (
                    
                      x
                      
                        2
                      
                    
                    )
                  
                  )
                
                (
                2
                x
                )
              
            
            
              
              
                
                =
                2
                x
                cos
                ⁡
                (
                
                  x
                  
                    2
                  
                
                )
              
            
          
        
      
    
    {\displaystyle {\begin{aligned}{\frac {d}{dx}}\sin(x^{2})&={\frac {dy}{dx}}\\[6pt]&={\frac {dy}{du}}{\frac {du}{dx}}&&{\text{This part is the chain rule.}}\\[6pt]&=\left({\frac {d}{du}}\sin u\right)\left({\frac {d}{dx}}x^{2}\right)\\[6pt]&=(\cos u)(2x)\\&=\left(\cos(x^{2})\right)(2x)\\&=2x\cos(x^{2})\end{aligned}}}

<h3>Integration</h3>
<p class="note">Main article: <a href="/facts/Integration_by_substitution/NvXfTTaf">Integration by substitution</a></p>
<p>Difficult integrals may often be evaluated by changing variables; this is enabled by the <a href="/facts/Substitution_rule/NvXfTTaf">substitution rule</a> and is analogous to the use of the chain rule above. Difficult integrals may also be solved by simplifying the integral using a change of variables given by the corresponding <a href="/facts/Jacobian_matrix_and_determinant/0tNv6o8j">Jacobian matrix and determinant</a>.<a class="footnote-ref" id="fnref:1" href="#fn:1"><sup>1</sup></a> Using the Jacobian determinant and the corresponding change of variable that it gives is the basis of coordinate systems such as polar, cylindrical, and spherical coordinate systems.
</p>
<h4>Change of variables formula in terms of Lebesgue measure</h4><p>
The following theorem allows us to relate integrals with respect to Lebesgue measure to an equivalent integral with respect to the pullback measure under a parameterization G.<a class="footnote-ref" id="fnref:2" href="#fn:2"><sup>2</sup></a> The proof is due to approximations of the Jordan content. </p><blockquote><p>Suppose that 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
 is an open subset of 
  
    
      
        
          
            R
          
          
            n
          
        
      
    
    {\displaystyle \mathbb {R} ^{n}}
  
 and 
  
    
      
        G
        :
        Ω
        →
        
          
            R
          
          
            n
          
        
      
    
    {\displaystyle G:\Omega \to \mathbb {R} ^{n}}
  
 is a 
  
    
      
        
          C
          
            1
          
        
      
    
    {\displaystyle C^{1}}
  
 diffeomorphism.
</p><ul><li>If 
  
    
      
        f
      
    
    {\displaystyle f}
  
 is a Lebesgue measurable function on 
  
    
      
        G
        (
        Ω
        )
      
    
    {\displaystyle G(\Omega )}
  
, then 
  
    
      
        f
        ∘
        G
      
    
    {\displaystyle f\circ G}
  
 is Lebesgue measurable on 
  
    
      
        Ω
      
    
    {\displaystyle \Omega }
  
. If 
  
    
      
        f
        ≥
        0
      
    
    {\displaystyle f\geq 0}
  
 or 
  
    
      
        f
        ∈
        
          L
          
            1
          
        
        (
        G
        (
        Ω
        )
        ,
        m
        )
        ,
      
    
    {\displaystyle f\in L^{1}(G(\Omega ),m),}
  
 then 
  
    
      
        
          ∫
          
            G
            (
            Ω
            )
          
        
        f
        (
        x
        )
        d
        x
        =
        
          ∫
          
            Ω
          
        
        f
        ∘
        G
        (
        x
        )
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        G
        
          |
        
        d
        x
      
    
    {\displaystyle \int _{G(\Omega )}f(x)dx=\int _{\Omega }f\circ G(x)|{\text{det}}D_{x}G|dx}
  
.</li>
<li>If 
  
    
      
        E
        ⊂
        Ω
      
    
    {\displaystyle E\subset \Omega }
  
 and 
  
    
      
        E
      
    
    {\displaystyle E}
  
 is Lebesgue measurable, then 
  
    
      
        G
        (
        E
        )
      
    
    {\displaystyle G(E)}
  
 is Lebesgue measurable, then 
  
    
      
        m
        (
        G
        (
        E
        )
        )
        =
        
          ∫
          
            E
          
        
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        G
        
          |
        
        d
        x
      
    
    {\displaystyle m(G(E))=\int _{E}|{\text{det}}D_{x}G|dx}
  
.</li></ul>
</blockquote><p>As a corollary of this theorem, we may compute the Radon–Nikodym derivatives of both the pullback and pushforward measures of 
  
    
      
        m
      
    
    {\displaystyle m}
  
 under 
  
    
      
        T
      
    
    {\displaystyle T}
  
.
</p><h5>Pullback measure and transformation formula</h5>
<p>The pullback measure in terms of a transformation 
  
    
      
        T
      
    
    {\displaystyle T}
  
 is defined as 
  
    
      
        
          T
          
            ∗
          
        
        μ
        :=
        μ
        (
        T
        (
        A
        )
        )
      
    
    {\displaystyle T^{*}\mu :=\mu (T(A))}
  
. The change of variables formula for pullback measures is
</p><p>
  
    
      
        
          ∫
          
            T
            (
            Ω
            )
          
        
        g
        d
        μ
        =
        
          ∫
          
            Ω
          
        
        g
        ∘
        T
        d
        
          T
          
            ∗
          
        
        μ
      
    
    {\displaystyle \int _{T(\Omega )}gd\mu =\int _{\Omega }g\circ TdT^{*}\mu }
  
.
</p><p>Pushforward measure and transformation formula
</p><p>The pushforward measure in terms of a transformation 
  
    
      
        T
      
    
    {\displaystyle T}
  
, is defined as 
  
    
      
        
          T
          
            ∗
          
        
        μ
        :=
        μ
        (
        
          T
          
            −
            1
          
        
        (
        A
        )
        )
      
    
    {\displaystyle T_{*}\mu :=\mu (T^{-1}(A))}
  
. The change of variables formula for pushforward measures is
</p><p>
  
    
      
        
          ∫
          
            Ω
          
        
        g
        ∘
        T
        d
        μ
        =
        
          ∫
          
            T
            (
            Ω
            )
          
        
        g
        d
        
          T
          
            ∗
          
        
        μ
      
    
    {\displaystyle \int _{\Omega }g\circ Td\mu =\int _{T(\Omega )}gdT_{*}\mu }
  
.
</p><p>As a corollary of the change of variables formula for Lebesgue measure, we have that
</p>
<ul><li>Radon-Nikodym derivative of the pullback with respect to Lebesgue measure: 
  
    
      
        
          
            
              d
              
                T
                
                  ∗
                
              
              m
            
            
              d
              m
            
          
        
        (
        x
        )
        =
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        T
        
          |
        
      
    
    {\displaystyle {\frac {dT^{*}m}{dm}}(x)=|{\text{det}}D_{x}T|}
  
</li>
<li>Radon-Nikodym derivative of the pushforward with respect to Lebesgue measure: 
  
    
      
        
          
            
              d
              
                T
                
                  ∗
                
              
              m
            
            
              d
              m
            
          
        
        (
        x
        )
        =
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        
          T
          
            −
            1
          
        
        
          |
        
      
    
    {\displaystyle {\frac {dT_{*}m}{dm}}(x)=|{\text{det}}D_{x}T^{-1}|}
  
</li></ul>
<p>From which we may obtain
</p>
<ul><li>The change of variables formula for pullback measure: 
  
    
      
        
          ∫
          
            T
            (
            Ω
            )
          
        
        g
        d
        m
        =
        
          ∫
          
            Ω
          
        
        g
        ∘
        T
        d
        
          T
          
            ∗
          
        
        m
        =
        
          ∫
          
            Ω
          
        
        g
        ∘
        T
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        T
        
          |
        
        d
        m
        (
        x
        )
      
    
    {\displaystyle \int _{T(\Omega )}gdm=\int _{\Omega }g\circ TdT^{*}m=\int _{\Omega }g\circ T|{\text{det}}D_{x}T|dm(x)}
  
</li>
<li>The change of variables formula for pushforward measure:
  
    
      
        
          ∫
          
            Ω
          
        
        g
        d
        m
        =
        
          ∫
          
            T
            (
            Ω
            )
          
        
        g
        ∘
        
          T
          
            −
            1
          
        
        d
        
          T
          
            ∗
          
        
        m
        =
        
          ∫
          
            T
            (
            Ω
            )
          
        
        g
        ∘
        
          T
          
            −
            1
          
        
        
          |
        
        
          det
        
        
          D
          
            x
          
        
        
          T
          
            −
            1
          
        
        
          |
        
        d
        m
        (
        x
        )
      
    
    {\displaystyle \int _{\Omega }gdm=\int _{T(\Omega )}g\circ T^{-1}dT_{*}m=\int _{T(\Omega )}g\circ T^{-1}|{\text{det}}D_{x}T^{-1}|dm(x)}
  
</li></ul>
<h3>Differential equations</h3>
<p>Variable changes for differentiation and integration are taught in elementary <a href="/facts/Calculus/MEmUTYoz">calculus</a> and the steps are rarely carried out in full.
</p><p>The very broad use of variable changes is apparent when considering differential equations, where the independent variables may be changed using the <a href="/facts/Chain_rule/aMLYxP0x">chain rule</a> or the dependent variables are changed resulting in some differentiation to be carried out. Exotic changes, such as the mingling of dependent and independent variables in <a href="/facts/Point_transformation/J8vfkj37">point</a> and <a href="/facts/Contact_transformation/r3IswjXw">contact transformations</a>, can be very complicated but allow much freedom.
</p><p>Very often, a general form for a change is substituted into a problem and parameters picked along the way to best simplify the problem.
</p>
<h3>Scaling and shifting</h3>
<p>Probably the simplest change is the scaling and shifting of variables, that is replacing them with new variables that are "stretched" and "moved" by constant amounts. This is very common in practical applications to get physical parameters out of problems. For an <i>n</i>th order derivative, the change simply results in
</p>

d
                
                  n
                
              
              y
            
            
              d
              
                x
                
                  n
                
              
            
          
        
        =
        
          
            
              y
              
                scale
              
            
            
              x
              
                scale
              
              
                n
              
            
          
        
        
          
            
              
                d
                
                  n
                
              
              
                
                  
                    y
                    ^
                  
                
              
            
            
              d
              
                
                  
                    
                      x
                      ^
                    
                  
                
                
                  n
                
              
            
          
        
      
    
    {\displaystyle {\frac {d^{n}y}{dx^{n}}}={\frac {y_{\text{scale}}}{x_{\text{scale}}^{n}}}{\frac {d^{n}{\hat {y}}}{d{\hat {x}}^{n}}}}

<p>where
</p>

x
        =
        
          
            
              x
              ^
            
          
        
        
          x
          
            scale
          
        
        +
        
          x
          
            shift
          
        
      
    
    {\displaystyle x={\hat {x}}x_{\text{scale}}+x_{\text{shift}}}

y
        =
        
          
            
              y
              ^
            
          
        
        
          y
          
            scale
          
        
        +
        
          y
          
            shift
          
        
        .
      
    
    {\displaystyle y={\hat {y}}y_{\text{scale}}+y_{\text{shift}}.}

<p>This may be shown readily through the <a href="/facts/Chain_rule/aMLYxP0x">chain rule</a> and linearity of differentiation. This change is very common in practical applications to get physical parameters out of problems, for example, the <a href="/facts/Boundary_value_problem/QEfmUqmP">boundary value problem</a>
</p>

μ
        
          
            
              
                d
                
                  2
                
              
              u
            
            
              d
              
                y
                
                  2
                
              
            
          
        
        =
        
          
            
              d
              p
            
            
              d
              x
            
          
        
        
        ;
        
        u
        (
        0
        )
        =
        u
        (
        L
        )
        =
        0
      
    
    {\displaystyle \mu {\frac {d^{2}u}{dy^{2}}}={\frac {dp}{dx}}\quad ;\quad u(0)=u(L)=0}

<p>describes parallel fluid flow between flat solid walls separated by a distance δ; μ is the <a href="/facts/Viscosity/QLVfrdCz">viscosity</a> and 
  
    
      
        d
        p
        
          /
        
        d
        x
      
    
    {\displaystyle dp/dx}
  
 the <a href="/facts/Pressure_gradient/v7BmlAAS">pressure gradient</a>, both constants. By scaling the variables the problem becomes
</p>

d
                
                  2
                
              
              
                
                  
                    u
                    ^
                  
                
              
            
            
              d
              
                
                  
                    
                      y
                      ^
                    
                  
                
                
                  2
                
              
            
          
        
        =
        1
        
        ;
        
        
          
            
              u
              ^
            
          
        
        (
        0
        )
        =
        
          
            
              u
              ^
            
          
        
        (
        1
        )
        =
        0
      
    
    {\displaystyle {\frac {d^{2}{\hat {u}}}{d{\hat {y}}^{2}}}=1\quad ;\quad {\hat {u}}(0)={\hat {u}}(1)=0}

<p>where
</p>

y
        =
        
          
            
              y
              ^
            
          
        
        L
        
        
          and
        
        
        u
        =
        
          
            
              u
              ^
            
          
        
        
          
            
              L
              
                2
              
            
            μ
          
        
        
          
            
              d
              p
            
            
              d
              x
            
          
        
        .
      
    
    {\displaystyle y={\hat {y}}L\qquad {\text{and}}\qquad u={\hat {u}}{\frac {L^{2}}{\mu }}{\frac {dp}{dx}}.}

<p>Scaling is useful for many reasons. It simplifies analysis both by reducing the number of parameters and by simply making the problem neater. Proper scaling may <i>normalize</i> variables, that is make them have a sensible unitless range such as 0 to 1. Finally, if a problem mandates numeric solution, the fewer the parameters the fewer the number of computations.
</p>
<h3>Momentum vs. velocity</h3>
<p>Consider a system of equations
</p>

m
                
                  
                    
                      v
                      ˙
                    
                  
                
              
              
                
                =
                −
                
                  
                    
                      ∂
                      H
                    
                    
                      ∂
                      x
                    
                  
                
              
            
            
              
                m
                
                  
                    
                      x
                      ˙
                    
                  
                
              
              
                
                =
                
                  
                    
                      ∂
                      H
                    
                    
                      ∂
                      v
                    
                  
                
              
            
          
        
      
    
    {\displaystyle {\begin{aligned}m{\dot {v}}&=-{\frac {\partial H}{\partial x}}\\[5pt]m{\dot {x}}&={\frac {\partial H}{\partial v}}\end{aligned}}}

<p>for a given function 
  
    
      
        H
        (
        x
        ,
        v
        )
      
    
    {\displaystyle H(x,v)}
  
.
The mass can be eliminated by the (trivial) substitution 
  
    
      
        Φ
        (
        p
        )
        =
        1
        
          /
        
        m
        ⋅
        p
      
    
    {\displaystyle \Phi (p)=1/m\cdot p}
  
.
Clearly this is a bijective map from 
  
    
      
        
          R
        
      
    
    {\displaystyle \mathbb {R} }
  
 to 
  
    
      
        
          R
        
      
    
    {\displaystyle \mathbb {R} }
  
. Under the substitution 
  
    
      
        v
        =
        Φ
        (
        p
        )
      
    
    {\displaystyle v=\Phi (p)}
  
 the system becomes
</p>

p
                      ˙
                    
                  
                
              
              
                
                =
                −
                
                  
                    
                      ∂
                      H
                    
                    
                      ∂
                      x
                    
                  
                
              
            
            
              
                
                  
                    
                      x
                      ˙
                    
                  
                
              
              
                
                =
                
                  
                    
                      ∂
                      H
                    
                    
                      ∂
                      p
                    
                  
                
              
            
          
        
      
    
    {\displaystyle {\begin{aligned}{\dot {p}}&=-{\frac {\partial H}{\partial x}}\\[5pt]{\dot {x}}&={\frac {\partial H}{\partial p}}\end{aligned}}}

<h3>Lagrangian mechanics</h3>
<p class="note">Main article: <a href="/facts/Lagrangian_mechanics/sTxyqWz8">Lagrangian mechanics</a></p>
<p>Given a force field 
  
    
      
        φ
        (
        t
        ,
        x
        ,
        v
        )
      
    
    {\displaystyle \varphi (t,x,v)}
  
, <a href="/facts/Isaac_Newton/4L7gTncN">Newton</a>'s <a href="/facts/Equations_of_motion/25msr1vE">equations of motion</a> are
</p>

m
        
          
            
              x
              ¨
            
          
        
        =
        φ
        (
        t
        ,
        x
        ,
        v
        )
        .
      
    
    {\displaystyle m{\ddot {x}}=\varphi (t,x,v).}

<p>Lagrange examined how these equations of motion change under an arbitrary substitution of variables 
  
    
      
        x
        =
        Ψ
        (
        t
        ,
        y
        )
      
    
    {\displaystyle x=\Psi (t,y)}
  
, 
  
    
      
        v
        =
        
          
            
              ∂
              Ψ
              (
              t
              ,
              y
              )
            
            
              ∂
              t
            
          
        
        +
        
          
            
              ∂
              Ψ
              (
              t
              ,
              y
              )
            
            
              ∂
              y
            
          
        
        ⋅
        w
        .
      
    
    {\displaystyle v={\frac {\partial \Psi (t,y)}{\partial t}}+{\frac {\partial \Psi (t,y)}{\partial y}}\cdot w.}

</p><p>He found that the equations
</p>

∂
              
                L
              
            
            
              ∂
              y
            
          
        
        =
        
          
            
              d
            
            
              
                d
              
              t
            
          
        
        
          
            
              ∂
              
                L
              
            
            
              ∂
              
                w
              
            
          
        
      
    
    {\displaystyle {\frac {\partial {L}}{\partial y}}={\frac {\mathrm {d} }{\mathrm {d} t}}{\frac {\partial {L}}{\partial {w}}}}

<p>are equivalent to Newton's equations for the function 
  
    
      
        L
        =
        T
        −
        V
      
    
    {\displaystyle L=T-V}
  
,
where <i>T</i> is the kinetic, and <i>V</i> the potential energy.
</p><p>In fact, when the substitution is chosen well (exploiting for example symmetries and constraints of the system) these equations are much easier to solve than Newton's equations in Cartesian coordinates.
</p>
<h2 id="see-also">See also</h2>
<ul><li><a href="/facts/Change_of_variables_(PDE)/7NTwRBre">Change of variables (PDE)</a></li>
<li><a href="/facts/Probability_density_function/zvfybna4">Change of variables for probability densities</a></li>
<li><a href="/facts/Substitution_property_of_equality/AQCuumLg">Substitution property of equality</a></li>
<li><a href="/facts/Universal_instantiation/wr965NmP">Universal instantiation</a></li></ul>

<h2 id="references">References</h2>

<ol>
<li id="fn:1"><p>Kaplan, Wilfred (1973). "Change of Variables in Integrals". Advanced Calculus (Second ed.). Reading: Addison-Wesley. pp. 269–275. <a href="/wiki/Wilfred_Kaplan" target="_blank">/wiki/Wilfred_Kaplan</a> <a href="#fnref:1" class="footnote-back-ref">↩</a></p></li>
<li id="fn:2"><p>Folland, G. B. (1999). Real analysis : modern techniques and their applications (2nd ed.). New York: Wiley. pp. 74–75. ISBN 0-471-31716-0. OCLC 39849337. <a href="0-471-31716-0" target="_blank">0-471-31716-0</a> <a href="#fnref:2" class="footnote-back-ref">↩</a></p></li>
</ol>

Change of variables open-in-new

Change of variables