Measures of Dispersion

Range, Quantile, Quartile and Percentile:

  • Range and quartiles are measures of the dispersion (spread) of numerical data in a set.
  • They are useful for finding outliers in a data set.
  • Range:
    • The range of a data set is the difference between its largest and smallest values.
    • First sort the set, then subtract the first element from the last.
    • Example:
      • 1. Data: 20, 30, 12, 8, 50, 70, 100, 2, 40, 90
        • Sort the data:
          • 2, 8, 12, 20, 30, 40, 50, 70, 90, 100
        • Take the difference to get the range: 100 - 2 = 98
      • 2. Data: 20, 30, -12, 8, 50, 70, 100, -2, 40, 90
        • Sort the data:
          • -12, -2, 8, 20, 30, 40, 50, 70, 90, 100
        • Take the difference to get the range: 100 - (-12) = 112
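The two range examples above can be reproduced with a few lines of Python. This is only a minimal sketch: the helper name `data_range` is made up for illustration, and it follows the sort-then-subtract steps described above rather than any particular library routine.

```python
def data_range(values):
    """Return the range of a data set: largest value minus smallest value."""
    ordered = sorted(values)          # step 1: sort the data
    return ordered[-1] - ordered[0]   # step 2: last element minus first element


example_1 = [20, 30, 12, 8, 50, 70, 100, 2, 40, 90]
example_2 = [20, 30, -12, 8, 50, 70, 100, -2, 40, 90]

print(data_range(example_1))  # 98
print(data_range(example_2))  # 112
```

Sorting is kept here only to mirror the manual steps; `max(values) - min(values)` gives the same result without sorting.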
  • Quantile:
      • Quantiles are data points that split the data distribution into consecutive sets of equal size.

      (Figure: a distribution split at three quantile points, Q1, Q2 and Q3.)

      • More precisely, quantiles are points taken at regular intervals of a data distribution, dividing it into consecutive sets of essentially equal size. Note the word "essentially": the sets need not be exactly equal in size.

2-Quantiles:

    • A data set divided into two halves gives the 2-quantiles.
    • Here, the number of quantile points = 1 (one data point).


4-Quantiles (Quartile):

    • A data set divided into four essentially equal parts gives the 4-quantiles.
    • Here, the number of quantile points = 3 (three data points).

            



    • Each part represents one quarter of the data distribution. These parts are commonly known as quartiles.

Quartile:

    • Quartiles divide the data distribution into four equal parts, so each part represents a quarter of the distribution. Hence the name quartile.

Quartile = 4-Quantiles = 3 quantile points


100-Quantiles (Percentile):

    • A data set divided into one hundred essentially equal parts gives the 100-quantiles, also called percentiles.

Percentile = 100-Quantiles = 99 quantile points.

So, in terms of percentiles, the quartiles are:

    • Q1 = 25th Percentile
    • Q2 = 50th Percentile = Median
    • Q3 = 75th Percentile
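
The relationships above (one cut point for 2-quantiles, three for quartiles, 99 for percentiles, and Q1, Q2, Q3 matching the 25th, 50th and 75th percentiles) can be checked with Python's standard `statistics.quantiles` function (Python 3.8+). This is a sketch only: it reuses the sorted data from the first range example, and it relies on the function's default "exclusive" interpolation method. Other tools or methods may return slightly different cut-point values for the same data.

```python
import math
import statistics

# Sample data reused from the first range example (an illustrative choice).
data = [2, 8, 12, 20, 30, 40, 50, 70, 90, 100]

# statistics.quantiles(data, n=k) returns the k - 1 cut points
# that divide the data into k essentially equal-sized groups.
two_quantiles = statistics.quantiles(data, n=2)    # 1 cut point
quartiles = statistics.quantiles(data, n=4)        # 3 cut points
percentiles = statistics.quantiles(data, n=100)    # 99 cut points

print(len(two_quantiles), len(quartiles), len(percentiles))  # 1 3 99

q1, q2, q3 = quartiles

# Q2 is the median, which is also the single 2-quantile cut point.
print(math.isclose(q2, statistics.median(data)))
print(math.isclose(q2, two_quantiles[0]))

# percentiles[0] is the 1st percentile, so the p-th percentile is percentiles[p - 1].
print(math.isclose(q1, percentiles[24]))  # Q1 vs 25th percentile
print(math.isclose(q2, percentiles[49]))  # Q2 vs 50th percentile
print(math.isclose(q3, percentiles[74]))  # Q3 vs 75th percentile
```

Each comparison is expected to print `True` for this data set.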
                  

