Pandas: Find percentile stats of a given column

Given a Pandas DataFrame, we have to find percentile stats of a given column. By Pranit Sharma Last updated : September 24, 2023

Pandas is a special tool that allows us to perform complex manipulations of data effectively and efficiently. Inside pandas, we mostly deal with a dataset in the form of DataFrame. DataFrames are 2-dimensional data structures in pandas. DataFrames consist of rows, columns, and data.

Pandas allows us to perform almost every kind of mathematical operations including statistical operations like mean, median, and mode.

To find the percentile stats of a given column, we will use methods like mean(), median(), and mode().

By using pandas.DataFrame.mean() Method

This method returns the mean of the values over the requested axis.

DataFrame.mean(
    axis=NoDefault.no_default, 
    skipna=True, 
    level=None, 
    numeric_only=None, 
    **kwargs
    )

By using pandas.DataFrame.median() Method

This method returns the median of the values over the requested axis.

DataFrame.median(
    axis=NoDefault.no_default, 
    skipna=True, 
    level=None, 
    numeric_only=None, 
    **kwargs
    )

By using pandas.DataFrame.mode() Method

This method returns the mode(s) of each element along the selected axis.

DataFrame.mode(
    axis=0, 
    numeric_only=False, 
    dropna=True
    )
Note

To work with pandas, we need to import pandas package first, below is the syntax:

import pandas as pd

Let us understand with the help of an example,

Python program to find percentile stats of a given column

# Importing pandas package
import pandas as pd

# Creating a dictionary
d = {
    'A':[90,72,56,76,82,34],
    'B':[50,56,72,80,53,78]
}

# Creating a DataFrame
df = pd.DataFrame(d,index=['a','b','c','d','e','f'])

# Display original DataFrame
print("Original DataFrame:\n",df,"\n")

# Get multiple statistical result

print("MEAN:\n",df['A'].mean(),"\n")

print("MEAN:\n",df['A'].median(),"\n")

# Calculating percentile using quantile

print("10th Percentile:\n",df['A'].quantile(0.1),"\n")

print("50th Percentile:\n",df['A'].quantile(0.5),"\n")

print("90th Percentile:\n",df['A'].quantile(0.9),"\n")

Output

The output of the above program is:

Example: Find percentile stats of a given column

Python Pandas Programs »


Comments and Discussions!

Load comments ↻






Copyright © 2024 www.includehelp.com. All rights reserved.