All Automated EDA Libraries

Visualize dataset of any size with just one line of code

Shubham Bagade
3 min readApr 30, 2021

Exploratory Data Analysis is a process where we tend to analyze the dataset and summarize the main characteristics of the dataset often using visual methods. EDA is really important because if you are not familiar with the dataset you are working on, then you won’t be able to infer something from that data. However, EDA generally takes a lot of time.

In this article, we will work on Automating EDA using 5 libraries . It is a python library that generates beautiful, high-density visualizations to start your EDA. Let us explore it.

1. D-Tale for interactive data exploration

Installing

Like any other python library, we can install D-Tale by using the pip install command given below.

pip install dtale

Analyzing Dataset

import seaborn as sns
df=sns.load_dataset('titanic')
import dtale
dtale.show(df)

2. Exploratory Data Analysis with Pandas Profiling

Installing

Like any other python library, we can install Pandas Profiling by using the pip install command given below.

pip install pandas-profiling

Analyzing Dataset

import seaborn as sns 
df1 = sns.load_dataset('tips')
from pandas_profiling import ProfileReport
profile = ProfileReport(df1,explorative=True)
profile.to_file('output.html')

3. Sweetviz: Automated EDA in Python

Installing

Like any other python library, we can install Sweetviz by using the pip install command given below.

pip install sweetviz

Analyzing Dataset

import sweetviz as sv
report = sv.analyze(df)
report.show_html('sweet.html')

4. Autoviz: Automatically Visualize any Dataset

Installing

Like any other python library, we can install Autoviz by using the pip install command given below.

pip install autoviz

Analyzing Dataset

from autoviz.AutoViz_Class import AutoViz_Class
AV = AutoViz_Class()
df = AV.AutoViz('car_design.csv')

5. Dataprep.EdA: Accelerate your EDA

Installing

Like any other python library, we can install Dataprep by using the pip install command given below.

pip install dataprep

Analyzing Dataset

from dataprep.datasets import load_dataset
from dataprep.eda import create_report
df = load_dataset("titanic")
create_report(df)

In this article, we saw that we can visualize datasets with just one line of code and we can find the patterns in the dataset accordingly.

You can view the code and data I have used here in my GITHUB

--

--

Shubham Bagade

Bachelor of Engineering in Computer Science Engineering | Artificial intelligence Enthusiast