The idea of bounding probability has a very long tradition throughout the history of probability theory. Indeed, in 1854 George Boole used the notion of interval bounds on probability in his The Laws of Thought. Also dating from the latter half of the 19th century, the inequality attributed to Chebyshev described bounds on a distribution when only the mean and variance of the variable are known, and the related inequality attributed to Markov found bounds on a positive variable when only the mean is known. Kyburg reviewed the history of interval probabilities and traced the development of the critical ideas through the 20th century, including the important notion of incomparable probabilities favored by Keynes.
The methods of probability bounds analysis that could be routinely used in
risk assessments were developed in the 1980s. Hailperin described a computational scheme for bounding logical calculations extending the ideas of Boole. Yager described the elementary procedures by which bounds on convolutions can be computed under an assumption of independence. At about the same time, Makarov, and independently, Rüschendorf solved the problem, originally posed by Kolmogorov, of how to find the upper and lower bounds for the probability distribution of a sum of random variables whose marginal distributions, but not their joint distribution, are known. Frank et al. generalized the result of Makarov and expressed it in terms of copulas. Since that time, formulas and algorithms for sums have been generalized and extended to differences, products, quotients and other binary and unary functions under various dependence assumptions.
Arithmetic expressions involving operations such as additions, subtractions, multiplications, divisions, minima, maxima, powers, exponentials, logarithms, square roots, absolute values, etc., are commonly used in risk analyses and uncertainty modeling. Convolution is the operation of finding the probability distribution of a sum of independent random variables specified by probability distributions. We can extend the term to finding distributions of other mathematical functions (products, differences, quotients, and more complex functions) and other assumptions about the intervariable dependencies. There are convenient algorithms for computing these generalized convolutions under a variety of assumptions about the dependencies among the inputs.
Let
D
{\displaystyle \mathbb {D} }
denote the space of distribution functions on the real numbers
R
,
{\displaystyle \mathbb {R} ,}
i.e.,
D
=
{
D
|
D
:
R
→
[
0
,
1
]
,
D
(
x
)
≤
D
(
y
)
for all
x
<
y
}
.
{\displaystyle \mathbb {D} =\{D|D:\mathbb {R} \to [0,1],D(x)\leq D(y){\text{ for all }}x<y\}.}
where
F
¯
{\displaystyle {\overline {F}}}
and
F
_
∈
D
{\displaystyle {\underline {F}}\in \mathbb {D} }
,
m
{\displaystyle m}
and
v
{\displaystyle v}
are real intervals, and
F
⊂
D
{\displaystyle \mathbf {F} \subset \mathbb {D} }
. This quintuple denotes the set of distribution functions
F
∈
F
⊂
D
{\displaystyle F\in \mathbf {F} \subset \mathbb {D} }
such that:
∀
x
∈
R
:
F
¯
(
x
)
≤
F
(
x
)
≤
F
_
(
x
)
∫
R
x
d
F
(
x
)
∈
m
expectation condition
∫
R
x
2
d
F
(
x
)
−
(
∫
R
x
d
F
(
x
)
)
2
∈
v
variance condition
{\displaystyle {\begin{aligned}\forall x\in \mathbb {R} :\qquad &{\overline {F}}(x)\leq F(x)\leq {\underline {F}}(x)\\[6pt]&\int _{\mathbb {R} }xdF(x)\in m&&{\text{expectation condition}}\\&\int _{\mathbb {R} }x^{2}dF(x)-\left(\int _{\mathbb {R} }xdF(x)\right)^{2}\in v&&{\text{variance condition}}\end{aligned}}}
The notation
X
∼
F
{\displaystyle X\sim F}
denotes the fact that
X
∈
R
{\displaystyle X\in \mathbb {R} }
is a random variable governed by the distribution function F, that is,
{
F
:
R
→
[
0
,
1
]
x
↦
Pr
(
X
≤
x
)
{\displaystyle {\begin{cases}F:\mathbb {R} \to [0,1]\\x\mapsto \Pr(X\leq x)\end{cases}}}
Let us generalize the tilde notation for use with p-boxes. We will write X ~ B to mean that X is a random variable whose distribution function is unknown except that it is inside B. Thus, X ~ F ∈ B can be contracted to X ~ B without mentioning the distribution function explicitly.
Generalized convolutions for other operations such as subtraction, multiplication, division, etc., can be derived using transformations. For instance, p-box subtraction A − B can be defined as A + (−B), where the negative of a p-box B = [B1, B2] is [B2(−x), B1(−x)].
so long as A and B can be assumed to be independent events. If they are not independent, we can still bound the conjunction using the classical Fréchet inequality. In this case, we can infer at least that the probability of the joint event A & B is surely within the interval
P(A & B) = env(max(0, if A and B are independent events. If they are not independent, the Fréchet inequality bounds the disjunction
P(A v B) = env(max(It is also possible to compute interval bounds on the conjunction or disjunction under other assumptions about the dependence between A and B. For instance, one might assume they are positively dependent, in which case the resulting interval is not as tight as the answer assuming independence but tighter than the answer given by the Fréchet inequality. Comparable calculations are used for other logical functions such as negation, exclusive disjunction, etc. When the Boolean expression to be evaluated becomes complex, it may be necessary to evaluate it using the methods of mathematical programming to get best-possible bounds on the expression. A similar problem one presents in the case of probabilistic logic (see for example Gerla 1994). If the probabilities of the events are characterized by probability distributions or p-boxes rather than intervals, then analogous calculations can be done to obtain distributional or p-box results characterizing the probability of the top event.
Like arithmetic and logical operations, these magnitude comparisons generally depend on the stochastic dependence between A and B, and the subtraction in the encoding should reflect that dependence. If their dependence is unknown, the difference can be computed without making any assumption using the Fréchet operation.
Some analysts use sampling-based approaches to computing probability bounds, including Monte Carlo simulation, Latin hypercube methods or importance sampling. These approaches cannot assure mathematical rigor in the result because such simulation methods are approximations, although their performance can generally be improved simply by increasing the number of replications in the simulation. Thus, unlike the analytical theorems or methods based on mathematical programming, sampling-based calculations usually cannot produce verified computations. However, sampling-based methods can be very useful in addressing a variety of problems which are computationally difficult to solve analytically or even to rigorously bound. One important example is the use of Cauchy-deviate sampling to avoid the curse of dimensionality in propagating interval uncertainty through high-dimensional problems.
Boole, George (1854). An Investigation of the Laws of Thought on which are Founded the Mathematical Theories of Logic and Probabilities. London: Walton and Maberly. https://www.gutenberg.org/ebooks/15114
Hailperin, Theodore (1986). Boole's Logic and Probability. Amsterdam: North-Holland. ISBN 978-0-444-11037-4. 978-0-444-11037-4
Kyburg, H.E., Jr. (1999). Interval valued probabilities. SIPTA Documentation on Imprecise Probability. https://sipta.org/documentation/interval_prob/kyburgnew.pdf
Walley, Peter (1991). Statistical Reasoning with Imprecise Probabilities. London: Chapman and Hall. ISBN 978-0-412-28660-5. 978-0-412-28660-5
Hailperin, Theodore (1986). Boole's Logic and Probability. Amsterdam: North-Holland. ISBN 978-0-444-11037-4. 978-0-444-11037-4
Yager, R.R. (1986). Arithmetic and other operations on Dempster–Shafer structures. International Journal of Man-machine Studies 25: 357–366.
Makarov, G.D. (1981). Estimates for the distribution function of a sum of two random variables when the marginal distributions are fixed. Theory of Probability and Its Applications 26: 803–806.
Rüschendorf, L. (1982). Random variables with maximum sums. Advances in Applied Probability 14: 623–632.
Frank, M.J., R.B. Nelsen and B. Schweizer (1987). Best-possible bounds for the distribution of a sum—a problem of Kolmogorov. Probability Theory and Related Fields 74: 199–211.
Williamson, R.C., and T. Downs (1990). Probabilistic arithmetic I: Numerical methods for calculating convolutions and dependency bounds. International Journal of Approximate Reasoning 4: 89–158.
Ferson, S., V. Kreinovich, L. Ginzburg, D.S. Myers, and K. Sentz. (2003). Constructing Probability Boxes and Dempster–Shafer Structures Archived 22 July 2011 at the Wayback Machine. SAND2002-4015. Sandia National Laboratories, Albuquerque, NM. http://www.ramas.com/unabridged.zip
Berleant, D. (1993). Automatically verified reasoning with both intervals and probability density functions. Interval Computations 1993 (2) : 48–70.
Berleant, D., G. Anderson, and C. Goodman-Strauss (2008). Arithmetic on bounded families of distributions: a DEnv algorithm tutorial. Pages 183–210 in Knowledge Processing with Interval and Soft Computing, edited by C. Hu, R.B. Kearfott, A. de Korvin and V. Kreinovich, Springer (ISBN 978-1-84800-325-5). /wiki/ISBN_(identifier)
Berleant, D., and C. Goodman-Strauss (1998). Bounding the results of arithmetic operations on random variables of unknown dependency using intervals. Reliable Computing 4: 147–165.
Ferson, S., R. Nelsen, J. Hajagos, D. Berleant, J. Zhang, W.T. Tucker, L. Ginzburg and W.L. Oberkampf (2004). Dependence in Probabilistic Modeling, Dempster–Shafer Theory, and Probability Bounds Analysis. Sandia National Laboratories, SAND2004-3072, Albuquerque, NM. http://www.ramas.com/depend.pdf
Yager, R.R. (1986). Arithmetic and other operations on Dempster–Shafer structures. International Journal of Man-machine Studies 25: 357–366.
Williamson, R.C., and T. Downs (1990). Probabilistic arithmetic I: Numerical methods for calculating convolutions and dependency bounds. International Journal of Approximate Reasoning 4: 89–158.
Ferson, S., V. Kreinovich, L. Ginzburg, D.S. Myers, and K. Sentz. (2003). Constructing Probability Boxes and Dempster–Shafer Structures Archived 22 July 2011 at the Wayback Machine. SAND2002-4015. Sandia National Laboratories, Albuquerque, NM. http://www.ramas.com/unabridged.zip
Ferson, S., R. Nelsen, J. Hajagos, D. Berleant, J. Zhang, W.T. Tucker, L. Ginzburg and W.L. Oberkampf (2004). Dependence in Probabilistic Modeling, Dempster–Shafer Theory, and Probability Bounds Analysis. Sandia National Laboratories, SAND2004-3072, Albuquerque, NM. http://www.ramas.com/depend.pdf
Makarov, G.D. (1981). Estimates for the distribution function of a sum of two random variables when the marginal distributions are fixed. Theory of Probability and Its Applications 26: 803–806.
Frank, M.J., R.B. Nelsen and B. Schweizer (1987). Best-possible bounds for the distribution of a sum—a problem of Kolmogorov. Probability Theory and Related Fields 74: 199–211.
Williamson, R.C., and T. Downs (1990). Probabilistic arithmetic I: Numerical methods for calculating convolutions and dependency bounds. International Journal of Approximate Reasoning 4: 89–158.
Berleant, D., and C. Goodman-Strauss (1998). Bounding the results of arithmetic operations on random variables of unknown dependency using intervals. Reliable Computing 4: 147–165.
Ferson, S., R. Nelsen, J. Hajagos, D. Berleant, J. Zhang, W.T. Tucker, L. Ginzburg and W.L. Oberkampf (2004). Dependence in Probabilistic Modeling, Dempster–Shafer Theory, and Probability Bounds Analysis. Sandia National Laboratories, SAND2004-3072, Albuquerque, NM. http://www.ramas.com/depend.pdf
Boole, George (1854). An Investigation of the Laws of Thought on which are Founded the Mathematical Theories of Logic and Probabilities. London: Walton and Maberly. https://www.gutenberg.org/ebooks/15114
Kyburg, H.E., Jr. (1999). Interval valued probabilities. SIPTA Documentation on Imprecise Probability. https://sipta.org/documentation/interval_prob/kyburgnew.pdf
Hailperin, Theodore (1986). Boole's Logic and Probability. Amsterdam: North-Holland. ISBN 978-0-444-11037-4. 978-0-444-11037-4
Alvarez, D. A., 2006. On the calculation of the bounds of probability of events using infinite random sets. International Journal of Approximate Reasoning 43: 241–267.
Baraldi, P., Popescu, I. C., Zio, E., 2008. Predicting the time to failure of a randomly degrading component by a hybrid Monte Carlo and possibilistic method. IEEE Proc. International Conference on Prognostics and Health Management.
Batarseh, O. G., Wang, Y., 2008. Reliable simulation with input uncertainties using an interval-based approach. IEEE Proc. Winter Simulation Conference.
Roy, Christopher J., and Michael S. Balch (2012). A holistic approach to uncertainty quantification with application to supersonic nozzle thrust. International Journal for Uncertainty Quantification 2 (4): 363–81 doi:10.1615/Int.J.UncertaintyQuantification.2012003562. /wiki/Doi_(identifier)
Zhang, H., Mullen, R. L., Muhanna, R. L. (2010). Interval Monte Carlo methods for structural reliability. Structural Safety 32: 183–190.
Zhang, H., Dai, H., Beer, M., Wang, W. (2012). Structural reliability analysis on the basis of small samples: an interval quasi-Monte Carlo method. Mechanical Systems and Signal Processing 37 (1–2): 137–51 doi:10.1016/j.ymssp.2012.03.001. /wiki/Doi_(identifier)
Trejo, R., Kreinovich, V. (2001). Error estimations for indirect measurements: randomized vs. deterministic algorithms for ‘black-box’ programs. Handbook on Randomized Computing, S. Rajasekaran, P. Pardalos, J. Reif, and J. Rolim (eds.), Kluwer, 673–729. http://www.cs.utep.edu/vladik/2000/tr00-17.pdf
Aughenbaugh, J. M., and C.J.J. Paredis (2007). Probability bounds analysis as a general approach to sensitivity analysis in decision making under uncertainty Archived 2012-03-21 at the Wayback Machine. SAE 2007 Transactions Journal of Passenger Cars: Mechanical Systems, (Section 6) 116: 1325–1339, SAE International, Warrendale, Pennsylvania. http://www.srl.gatech.edu/Members/jaughenbaugh/papers_presentations/2007-01-1480.pdf
Flander, L., W. Dixon, M. McBride, and M. Burgman. (2012). Facilitated expert judgment of environmental risks: acquiring and analysing imprecise data. International Journal of Risk Assessment and Management 16: 199–212.
Dixon, W.J. (2007). The use of Probability Bounds Analysis for Characterising and Propagating Uncertainty in Species Sensitivity Distributions. Technical Report Series No. 163, Arthur Rylah Institute for Environmental Research, Department of Sustainability and Environment. Heidelberg, Victoria, Australia. http://www.depi.vic.gov.au/__data/assets/pdf_file/0013/333112/ARI-Technical-Report-163-The-use-of-probability-bounds-analysis-for-characterising-and-propagating-uncertainty-in-species-sensitivity-distributions.pdf
Oberguggenberger, M., J. King and B. Schmelzer (2007). Imprecise probability methods for sensitivity analysis in engineering. Proceedings of the 5th International Symposium on Imprecise Probability: Theories and Applications, Prague, Czech Republic. http://www.sipta.org/isipta07/proceedings/papers/s032.pdf
Enszer, J.A., Y. Lin, S. Ferson, G.F. Corliss and M.A. Stadtherr (2011). Probability bounds analysis for nonlinear dynamic process models. AIChE Journal 57: 404–422.
Enszer, Joshua Alan, (2010). Verified Probability Bound Analysis for Dynamic Nonlinear Systems. Dissertation, University of Notre Dame.
Nong, A., and K. Krishnan (2007). Estimation of interindividual pharmacokinetic variability factor for inhaled volatile organic chemicals using a probability-bounds approach. Regulatory Toxicology and Pharmacology 48: 93–101. /wiki/Regulatory_Toxicology_and_Pharmacology
Guyonnet, D., F. Blanchard, C. Harpet, Y. Ménard, B. Côme and C. Baudrit (2005). Projet IREA—Traitement des incertitudes en évaluation des risques d'exposition, Annexe B, Cas «Eaux souterraines». Rapport BRGM/RP-54099-FR, Bureau de Recherches Géologiques et Minières, France. Archived 2012-03-11 at the Wayback Machine http://www.brgm.fr/publication/pubDetailRapportSP.jsp?id=RSP-BRGM/RP-54099-FR
Fetz, Thomas; Tonon, Fulvio (2008). "Probability bounds for series systems with variables constrained by sets of probability measures". International Journal of Reliability and Safety. 2 (4): 309. doi:10.1504/IJRS.2008.022079. /wiki/Doi_(identifier)
Augustsson, A., M. Filipsson, T. Öberg, B. Bergbäck (2011). Climate change—an uncertainty factor in risk analysis of contaminated land. Science of the Total Environment 409: 4693–4700.
Baudrit, C., D. Guyonnet, H. Baroudi, S. Denys and P. Begassat (2005). Assessment of child exposure to lead on an ironworks brownfield: uncertainty analysis. 9th International FZK/TNO Conference on Contaminated Soil – ConSoil2005, Bordeaux, France, pages 1071–1080. http://www.univ-orleans.fr/mapmo/membres/baudrit/consoil05.pdf
Dixon, W.J. (2007). Uncertainty Propagation in Population Level Salinity Risk Models. Technical Report Technical Report Series No. 164, Arthur Rylah Institute for Environmental Research. Heidelberg, Victoria, Australia http://www.depi.vic.gov.au/__data/assets/pdf_file/0017/333116/ARI-Technical-Report-164-Uncertainty-propagation-in-population-level-salinity-risk-models.pdf
Karanki, D.R., H.S. Kushwaha, A.K. Verma, and S. Ajit. (2009). Uncertainty analysis based on probability bounds (p-box) approach in probabilistic safety assessment. Risk Analysis 29: 662–75.
Sander, P., B. Bergbäck and T. Öberg (2006). Uncertain numbers and uncertainty in the selection of input distributions—Consequences for a probabilistic risk assessment of contaminated land. Risk Analysis 26: 1363–1375. /wiki/Probabilistic_risk_assessment
Minnery, J.G., J.G. Jacangelo, L.I. Boden, D.J. Vorhees and W. Heiger-Bernays (2009). Sensitivity analysis of the pressure-based direct integrity test for membranes used in drinking water treatment. Environmental Science and Technology 43(24): 9419–9424.
Regan, H.M., B.E. Sample and S. Ferson (2002). Comparison of deterministic and probabilistic calculation of ecological soil screening levels. Environmental Toxicology and Chemistry 21: 882–890.
U.S. Environmental Protection Agency (Region I), GE/Housatonic River Site in New England http://www.epa.gov/ne/ge/
Moore, Dwayne R.J.; Breton, Roger L.; Delong, Tod R.; Ferson, Scott; Lortie, John P.; MacDonald, Drew B.; McGrath, Richard; Pawlisz, Andrzej; Svirsky, Susan C.; Teed, R. Scott; Thompson, Ryan P.; Whitfield Aslund, Melissa (2016). "Ecological risk assessment for mink and short-tailed shrew exposed to PCBS, dioxins, and furans in the Housatonic River area". Integrated Environmental Assessment and Management. 12 (1): 174–184. Bibcode:2016IEAM...12..174M. doi:10.1002/ieam.1661. PMID 25976918. /wiki/Bibcode_(identifier)
U.S. Environmental Protection Agency (Region 6 Superfund Program), Calcasieu Estuary Remedial Investigation Archived January 20, 2011, at the Wayback Machine http://www.epa.gov/region6/6sf/louisiana/calcasieu/la_calcasieu_calcri.html
Roy, C.J., and M.S. Balch (2012). A holistic approach to uncertainty quantification with application to supersonic nozzle thrust. International Journal for Uncertainty Quantification 2: 363-381.
doi:10.1615/Int.J.UncertaintyQuantification.2012003562. http://www.dl.begellhouse.com/download/article/340d23ed5c6d1633/IJUQ-3562.pdf
Oberkampf, W.L., and C. J. Roy. (2010). Verification and Validation in Scientific Computing. Cambridge University Press.
Regan, H.M., B.K. Hope, and S. Ferson (2002). Analysis and portrayal of uncertainty in a food web exposure model. Human and Ecological Risk Assessment 8: 1757–1777.
Ferson, S., and W.T. Tucker (2004). Reliability of risk analyses for contaminated groundwater. Groundwater Quality Modeling and Management under Uncertainty, edited by S. Mishra, American Society of Civil Engineers Reston, VA.
Crespo, Luis G.; Kenny, Sean P.; Giesy, Daniel P. (2013). "Reliability analysis of polynomial systems subject to p-box uncertainties". Mechanical Systems and Signal Processing. 37 (1–2): 121–136. Bibcode:2013MSSP...37..121C. doi:10.1016/j.ymssp.2012.08.012. /wiki/Bibcode_(identifier)
Ferson, Scott; Burgman, Mark A. (1995). "Correlations, dependency bounds and extinction risks". Biological Conservation. 73 (2): 101–105. Bibcode:1995BCons..73..101F. doi:10.1016/0006-3207(95)90031-4. https://dx.doi.org/10.1016/0006-3207%2895%2990031-4
Ferson, S., D.R.J. Moore, P.J. Van den Brink, T.L. Estes, K. Gallagher, R. O'Connor and F. Verdonck. (2010). Bounding uncertainty analyses. Pages 89–122 in Application of Uncertainty Analysis to Ecological Risks of Pesticides, edited by W. J. Warren-Hicks and A. Hart. CRC Press, Boca Raton, Florida.
Augustsson, A., M. Filipsson, T. Öberg, B. Bergbäck (2011). Climate change—an uncertainty factor in risk analysis of contaminated land. Science of the Total Environment 409: 4693–4700.
Kriegler, E., and H. Held (2005). Utilizing belief functions for the estimation of future climate change. International Journal of Approximate Reasoning 39: 185–209. http://authors.elsevier.com/sd/article/S0888613X04001112
Kriegler, E. (2005). Imprecise probability analysis for integrated assessment of climate change, Ph.D. dissertation, Universität Potsdam, Germany. http://opus.kobv.de/ubp/volltexte/2005/561/
Batarseh, O.G.Y., (2010). An Interval Based Approach to Model Input Uncertainty in Discrete-event Simulation. Ph.D. dissertation, University of Central Florida.
Goldwasser, L., L. Ginzburg and S. Ferson (2000). Variability and measurement error in extinction risk analysis: the northern spotted owl on the Olympic Peninsula. Pages 169–187 in Quantitative Methods for Conservation Biology, edited by S. Ferson and M. Burgman, Springer-Verlag, New York.
Hayes, K.R. (2011). Uncertainty and uncertainty analysis methods: Issues in quantitative and qualitative risk modeling with application to import risk assessment ACERA project (0705). Report Number: EP102467, CSIRO, Hobart, Australia. /wiki/CSIRO
Zhang, H., R.L. Mullen, and R.L. Muhanna (2010). Finite element structural analysis using imprecise probabilities based on p-box representation. Proceedings of the 4th International Workshop on Reliable Engineering Computing (REC 2010). http://www.eng.nus.edu.sg/civil/REC2010/documents/papers/013.pdf
Zhang, H., R. Mullen, R. Muhanna (2012). Safety Structural Analysis with Probability-Boxes.
International Journal of Reliability and Safety 6: 110–129. http://www.inderscience.com/search/index.php?action=record&rec_id=44292&prevQuery=&ps=10&m=or
Patelli, E; de Angelis, M (2015). "Line sampling approach for extreme case analysis in presence of aleatory and epistemic uncertainties". Safety and Reliability of Complex Engineered Systems. pp. 2585–2593. doi:10.1201/b19094-339 (inactive 11 November 2024). ISBN 978-1-138-02879-1.{{cite book}}: CS1 maint: DOI inactive as of November 2024 (link) 978-1-138-02879-1
Mehl, Christopher H. (2013). "P-boxes for cost uncertainty analysis". Mechanical Systems and Signal Processing. 37 (1–2): 253–263. Bibcode:2013MSSP...37..253M. doi:10.1016/j.ymssp.2012.03.014. /wiki/Bibcode_(identifier)
Sentz, K., and S. Ferson (2011). Probabilistic bounding analysis in the quantification of margins and uncertainties. Reliability Engineering and System Safety 96: 1126–1136.
Rozell, Daniel J., and Sheldon J. Reaven (2012). Water pollution risk associated with natural gas extraction from the Marcellus Shale. Risk Analysis 32: 1382–1393.