How to plot a correlation matrix using Pandas?

Learn how to compute and plot a correlation matrix using Pandas.
By Pranit Sharma Last updated : September 20, 2023

Pandas is a Python library for manipulating and analysing data effectively and efficiently. In pandas, we mostly work with a dataset in the form of a DataFrame. A DataFrame is a 2-dimensional data structure made up of rows, columns, and the data they hold.

Problem statement

Given a Pandas DataFrame, we have to compute its correlation matrix and plot it.

Plotting a correlation matrix using Pandas

The columns of a pandas DataFrame are often related to one another: as the values in one column increase, the values in another may tend to increase or decrease. The strength and direction of this relationship is measured by the correlation coefficient, which ranges from -1 to 1. To compute it in pandas, we use the pandas.DataFrame.corr() method, which returns the pair-wise correlation of the columns as a new DataFrame. Null values are excluded automatically. Non-numeric columns need more care: older pandas versions ignored them silently, but since pandas 2.0 the numeric_only parameter defaults to False, so non-numeric columns must either be dropped first or excluded by passing numeric_only=True.
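The handling of null values and non-numeric columns described above can be checked with a short sketch. The column names and values here are made up for illustration, and the numeric_only argument assumes pandas 1.5 or later.

```python
import numpy as np
import pandas as pd

# Hypothetical data: column A has one missing value, and "label" holds text
df = pd.DataFrame({
    "A": [1.0, 2.0, np.nan, 4.0, 5.0],
    "B": [2.0, 4.0, 6.0, 8.0, 10.0],
    "label": ["p", "q", "r", "s", "t"],
})

# numeric_only=True restricts the calculation to the numeric columns
# (assumption: pandas 1.5 or later, where this parameter exists)
result = df.corr(numeric_only=True)

# The row where A is missing is skipped when correlating A with B
print(result)
```

Only A and B appear in the result, and the row with the missing value is left out of the A/B calculation rather than causing an error.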

Syntax:

DataFrame.corr(method='pearson', min_periods=1, numeric_only=False)
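The method and min_periods parameters can be tried out with a small sketch like the one below. It reuses the sample values from this article; the variable names are only illustrative, and the min_periods value of 10 is chosen simply because it is larger than the number of rows.

```python
import pandas as pd

df = pd.DataFrame({
    'A': [39, 40, 32, 45, 89, 102293],
    'B': [40, 39, 22, 54, 22, 0],
    'C': [42, 44, 20, 49, 30, 110],
})

# Pearson (the default) measures linear correlation
pearson = df.corr(method='pearson')

# Spearman works on ranks, so it is less affected by the outlier in column A
spearman = df.corr(method='spearman')

# min_periods sets the minimum number of observations required per pair;
# with only 6 rows, asking for 10 leaves the pair-wise results as NaN
strict = df.corr(min_periods=10)

print(pearson.round(3))
print(spearman.round(3))
print(strict)
```

Comparing the Pearson and Spearman results is a quick way to see how much a single extreme value influences the matrix.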

Let us understand with the help of an example.

Python program to plot a correlation matrix using Pandas

# Importing pandas package
import pandas as pd

# Importing seaborn package
import seaborn as sn

# Import matplotlib package
import matplotlib.pyplot as plt

# Create a DataFrame
df = pd.DataFrame({
    'A':[39,40,32,45,89,102293],
    'B':[40,39,22,54,22,0],
    'C':[42,44,20,49,30,110]}
    )

# Display original DataFrame
print("Original DataFrame:\n",df,"\n")

# Finding correlation
result = df.corr(method='pearson')

# Display result
print("Correlation in DataFrame is:\n",result,"\n")

Output

The output of the above program is:

[Figure: Example 1, printed output showing the original DataFrame and its correlation matrix]

Visualising the correlation matrix

To visualise the correlation matrix, pass it to seaborn's heatmap() function and display the figure with matplotlib:

# Visualising Correlation
sn.heatmap(result, annot=True)

# Display the heatmap (plt.show() returns None, so it should not be printed)
plt.show()

Output

The output of the above program is:

[Figure: Example 2, annotated heatmap of the correlation matrix]
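If seaborn is not available, a similar heatmap can be drawn with matplotlib alone. The sketch below is an alternative to the seaborn call above, not part of the original program: it uses imshow() with the colour scale fixed to the range -1 to 1, writes the cell values by hand, and saves the figure to a file. The Agg backend and the file name correlation_heatmap.png are assumptions chosen so that the script runs without a display.

```python
import matplotlib
matplotlib.use("Agg")  # assumption: non-interactive backend, no display needed
import matplotlib.pyplot as plt
import pandas as pd

df = pd.DataFrame({
    'A': [39, 40, 32, 45, 89, 102293],
    'B': [40, 39, 22, 54, 22, 0],
    'C': [42, 44, 20, 49, 30, 110],
})
corr = df.corr()

fig, ax = plt.subplots()

# Fix the colour scale to the full correlation range
im = ax.imshow(corr.to_numpy(), vmin=-1, vmax=1, cmap="coolwarm")

# Label both axes with the column names
ax.set_xticks(range(len(corr.columns)))
ax.set_xticklabels(corr.columns)
ax.set_yticks(range(len(corr.index)))
ax.set_yticklabels(corr.index)

# Write each coefficient inside its cell
for i in range(corr.shape[0]):
    for j in range(corr.shape[1]):
        ax.text(j, i, f"{corr.iloc[i, j]:.2f}", ha="center", va="center")

fig.colorbar(im, ax=ax)

# Hypothetical output file name
fig.savefig("correlation_heatmap.png")
```

Fixing vmin and vmax keeps the colours comparable between different datasets, since the same colour always corresponds to the same coefficient.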

Copyright © 2024 www.includehelp.com. All rights reserved.