Symmetric mean absolute percentage error

On this page

Statistical accuracy measure

The symmetric mean absolute percentage error (SMAPE) is an accuracy measure based on percentage errors that compares forecast values to actual data. It is calculated by taking the absolute difference between each actual value A_t and forecast value F_t, dividing by the average of their absolute values, and expressing this as a percentage. This ratio is summed over all data points t and then averaged by the number of points n, giving a symmetric measure that treats over- and under-forecasting equally. SMAPE is widely used in forecasting to assess the relative accuracy of predictive models, balancing errors relative to the scale of the values involved for reliable performance evaluation.

We don't have any images related to Symmetric mean absolute percentage error yet.

You can add one yourself here.

We don't have any YouTube videos related to Symmetric mean absolute percentage error yet.

You can add one yourself here.

We don't have any PDF documents related to Symmetric mean absolute percentage error yet.

You can add one yourself here.

We don't have any Books related to Symmetric mean absolute percentage error yet.

You can add one yourself here.

We don't have any archived web articles related to Symmetric mean absolute percentage error yet.

You can submit a link to a page to archive here.

History

The earliest reference to a similar formula appears to be Armstrong (1985, p. 348), where it is called "adjusted MAPE" and is defined without the absolute values in the denominator. It was later discussed, modified, and re-proposed by Flores (1986).

Armstrong's original definition is as follows:

SMAPE = 1 n ∑ t = 1 n | F t − A t | ( A t + F t ) / 2 {\displaystyle {\text{SMAPE}}={\frac {1}{n}}\sum _{t=1}^{n}{\frac {\left|F_{t}-A_{t}\right|}{(A_{t}+F_{t})/2}}}

The problem is that it can be negative (if A t + F t < 0 {\displaystyle A_{t}+F_{t}<0} ) or even undefined (if A t + F t = 0 {\displaystyle A_{t}+F_{t}=0} ). Therefore, the currently accepted version of SMAPE assumes the absolute values in the denominator.

Discussion

In contrast to the mean absolute percentage error, SMAPE has both a lower and an upper bound. Indeed, the formula above provides a result between 0% and 200%. However, a percentage error between 0% and 100% is much easier to interpret. That is the reason why the formula below is often used in practice (i.e., no factor 0.5 in the denominator):

SMAPE = 100 n ∑ t = 1 n | F t − A t | | A t | + | F t | {\displaystyle {\text{SMAPE}}={\frac {100}{n}}\sum _{t=1}^{n}{\frac {|F_{t}-A_{t}|}{|A_{t}|+|F_{t}|}}}

In the above formula, if A t = F t = 0 {\displaystyle A_{t}=F_{t}=0} , then the t'th term in the summation is 0 since the percent error between the two is 0 and the value of | 0 − 0 | | 0 | + | 0 | {\displaystyle {\frac {|0-0|}{|0|+|0|}}} is undefined.

One supposed problem with SMAPE is that it is not symmetric with respect to the sign of the error term since over- and under-forecasts are not treated equally. The following example illustrates this by applying the second SMAPE formula:

Over-forecasting: At = 100 and Ft = 110 give SMAPE = 4.76%
Under-forecasting: At = 100 and Ft = 90 give SMAPE = 5.26%.

However, one should only expect this type of symmetry for measures which are entirely difference-based and not relative (such as mean squared error and mean absolute deviation).

There is a third version of SMAPE, which allows measuring the direction of the bias in the data by generating a positive and a negative error on line item level. Furthermore, it is better protected against outliers and the bias effect mentioned in the previous paragraph than the two other formulas. The formula is:

SMAPE = ∑ t = 1 n | F t − A t | ∑ t = 1 n ( A t + F t ) {\displaystyle {\text{SMAPE}}={\frac {\sum _{t=1}^{n}\left|F_{t}-A_{t}\right|}{\sum _{t=1}^{n}(A_{t}+F_{t})}}}

A limitation of SMAPE is that if the actual value or forecast value is 0, the value of error will boom up to the upper-limit of error. (200% for the first and 100% for the second and third formula).

Alternatives

Provided the data are strictly positive, a better measure of relative accuracy can be obtained based on the log of the accuracy ratio: log(Ft / At) This measure is easier to analyze statistically and has valuable symmetry and unbiasedness properties. When used in constructing forecasting models, the resulting prediction corresponds to the geometric mean (Tofallis, 2015).

External links

Rob J. Hyndman: Errors on Percentage Errors

History

Discussion

Alternatives

See also

External links