Stealing pages from the server...

Skewness and Kurtosis


Introduction

Statistics is a discipline of applied mathematics that deals with the gathering, describing, analysing, and inferring conclusions from numerical data. Differential and integral calculus, linear algebra, and probability theory are all used substantially in statistics’ mathematical theories. Descriptive statistics, which explains the qualities of sample and population data, and inferential statistics, which uses those qualities to test hypotheses and make conclusions, are the two major disciplines of statistics. Skewness and Kurtosis are two common statistics tools for the study of manipulation of data.

Skewness

Skewness is a distortion or asymmetry in a set of data that deviates from the symmetrical bell curve, or normal distribution. The curve is said to be skewed if it is displaced to the left or right. Skewness can be expressed as a measure of how far a given distribution deviates from a normal distribution. The skewness of a random variable is the third standardized moment , defined as:

Skewness is a descriptive statistic that can be used in conjunction with the histogram and the normal quantile plot to characterise the data or distribution. Many models assume normal distribution; i.e., data are symmetric about the mean. The normal distribution has a skewness of zero. But in reality, data points may not be perfectly symmetric. So, an understanding of the skewness of the dataset indicates whether deviations from the mean are going to be positive or negative.

  • If skewness is less than -1 or greater than 1, the distribution is highly skewed.
  • If skewness is between -1 and -0.5 or between 0.5 and 1, the distribution is moderately skewed.
  • If skewness is between -0.5 and 0.5, the distribution is approximately symmetric.

The skewness for a normal distribution is zero, and any symmetric data should have a skewness near zero. Negative values for the skewness indicate data that are skewed left and positive values for the skewness indicate data that are skewed right. By skewed left, we mean that the left tail is long relative to the right tail. Similarly, skewed right means that the right tail is long relative to the left tail. If the data are multi-modal, then this may affect the sign of the skewness.

Take asset prices for example, beginning with the internet bubble of the late 1990s, deviations from “normal” returns have been more common in the recent two decades. In fact, asset returns are becoming increasingly skewed to the right. The terrorist acts of September 11, 2001, the housing bubble burst and following financial crisis, and the years of quantitative easing (QE) all contributed to this volatility.

Kurtosis

Kurtosis, like skewness, is a statistical term used to quantify distribution. Unlike skewness, which distinguishes extreme values in one tail from those in the other, kurtosis assesses extreme values in both tails.

This heaviness or lightness in the tails usually means that your data looks flatter (or less flat) compared to the normal distribution. The standard normal distribution has a kurtosis of 3, so if your values are close to that then your graph’s tails are nearly normal.

Also an example for asset prices. For investors, a high kurtosis of the return distribution means that they will see more extreme returns (either positive or negative) than the normal + or - three standard deviations from the mean that the normal distribution of returns predicts. Kurtosis risk is the name given to this phenomenon.

Conclusion

In this blog, we discuss the concept of skewness and kurtosis and their application in understanding the risk profiles of financial securities. In addition, we also glance over some common misconceptions regarding the calculation and interpretation of skewness and kurtosis.

References

  1. https://community.gooddata.com/metrics-and-maql-kb-articles-43/normality-testing-skewness-and-kurtosis-241
  2. https://www.itl.nist.gov/div898/handbook/eda/section3/eda35b.htm
  3. https://codeburst.io/2-important-statistics-terms-you-need-to-know-in-data-science-skewness-and-kurtosis-388fef94eeaa

Author: Yang Wang
Reprint policy: All articles in this blog are used except for special statements CC BY 4.0 reprint polocy. If reproduced, please indicate source Yang Wang !
 Previous
Introduction to Ordinary Least Squares Introduction to Ordinary Least Squares
In empirical finance and many other domains, linear regression and the closely related linear prediction theory are commonly used statistical methods. Because of the wide range of applications, basic linear regression courses normally concentrate on the mathematically simplest scenario, which can be used in a variety of other applications.
2021-05-26
Next 
Tensorflow Speech Recognition Challenge Tensorflow Speech Recognition Challenge
The goal of this article is to create an algorithm that can understand simple spoken commands using the Speech Commands Dataset. We can improve product effectiveness and accessibility by improving the recognition accuracy of open-source voice interface tools.
2021-05-26
  TOC