A typical scoring method is composed of 3 components:1
Items 1 & 2 can be achieved by using some form of regression, that will provide both the risk estimation and the formula to calculate the score. Item 3 requires setting an arbitrary set of thresholds and will usually involve expert opinion.
Risk score are designed to represent an underlying probability of an adverse event denoted { Y = 1 } {\displaystyle \lbrace Y=1\rbrace } given a vector of P {\displaystyle P} explanatory variables X {\displaystyle \mathbf {X} } containing measurements of the relevant risk factors. In order to establish the connection between the risk factors and the probability, a set of weights β {\displaystyle \beta } is estimated using a generalized linear model:
Where g − 1 : R → [ 0 , 1 ] {\displaystyle g^{-1}:\mathbb {R} \rightarrow [0,1]} is a real-valued, monotonically increasing function that maps the values of the linear predictor X β {\displaystyle \mathbf {X} \beta } to the interval [ 0 , 1 ] {\displaystyle [0,1]} . GLM methods typically uses the logit or probit as the link function.
While it's possible to estimate P ( Y = 1 | X ) {\displaystyle \mathbf {P} (\mathbf {Y} =1|\mathbf {X} )} using other statistical or machine learning methods, the requirements of simplicity and easy interpretation (and monotonicity per risk factor) make most of these methods difficult to use for scoring in this context:
When using GLM, the set of estimated weights β {\displaystyle \beta } can be used to assign different values (or "points") to different values of the risk factors in X {\displaystyle \mathbf {X} } (continuous or nominal as indicators). The score can then be expressed as a weighted sum:
Let A = { a 1 , . . . , a m } {\displaystyle \mathbf {A} =\lbrace \mathbf {a} _{1},...,\mathbf {a} _{m}\rbrace } denote a set of m ≥ 2 {\displaystyle m\geq 2} "escalating" actions available for the decision maker (e.g. for credit risk decisions: a 1 {\displaystyle \mathbf {a} _{1}} = "approve automatically", a 2 {\displaystyle \mathbf {a} _{2}} = "require more documentation and check manually", a 3 {\displaystyle \mathbf {a} _{3}} = "decline automatically"). In order to define a decision rule, we want to define a map between different values of the score and the possible decisions in A {\displaystyle \mathbf {A} } . Let τ = { τ 1 , . . . τ m − 1 } {\displaystyle \tau =\lbrace \tau _{1},...\tau _{m-1}\rbrace } be a partition of R {\displaystyle \mathbb {R} } into m {\displaystyle m} consecutive, non-overlapping intervals, such that τ 1 < τ 2 < … < τ m − 1 {\displaystyle \tau _{1}<\tau _{2}<\ldots <\tau _{m-1}} .
The map is defined as follows:
(see more examples on the category page Category:Medical scoring system)
The primary use of scores in the financial sector is for Credit scorecards, or credit scores:
Toren, Yizhar (2011). "Ordinal Risk-Group Classification". arXiv:1012.5487 [stat.ML]. /wiki/ArXiv_(identifier) ↩
Le Gall, JR; Lemeshow, S; Saulnier, F (1993). "A new Simplified Acute Physiology Score (SAPS II) based on a European/North American multicenter study". JAMA. 270 (24): 2957–63. doi:10.1001/jama.1993.03510240069035. PMID 8254858. /wiki/Doi_(identifier) ↩
"Simplified Acute Physiology Score (SAPS II) Calculator - ClinCalc.com". clincalc.com. Retrieved August 20, 2018. http://clincalc.com/IcuMortality/SAPSII.aspx ↩
Johnston SC; Rothwell PM; Nguyen-Huynh MN; Giles MF; Elkins JS; Bernstein AL; Sidney S. "Validation and refinement of scores to predict very early stroke risk after transient ischaemic attack" Lancet (2007): 369(9558):283-292 ↩
"ABCD² Score for TIA". www.mdcalc.com. Retrieved December 16, 2018. https://www.mdcalc.com/abcd2-score-tia ↩
"ISM7 (NI) Scorecard, Allstate Property & Casualty Company" (PDF). Retrieved December 16, 2018. http://infoportal.ncdoi.net/getfile.jsp?sfp=/PC/PC095000/PC095470A815823.PDF ↩
"How We Analyzed the COMPAS Recidivism Algorithm". Retrieved December 16, 2018. https://www.propublica.org/article/how-we-analyzed-the-compas-recidivism-algorithm ↩