لَآ إِلَـٰهَ إِلَّا هُوَ
LA ILAHA ILLA HU
Allah, Your Lord There Is No Deity Except Him.

Exploratory Data Analysis EDA In Python

Exploratory data analysis is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods.

What is Exploratory Data Analysis used for?

Exploratory Data Analysis refers to the critical process of performing initial investigations on data so as to discover patterns,to spot anomalies,to test hypothesis and to check assumptions with the help of summary statistics and graphical representations.

The goals of Exploratory Data Analysis?

Exploratory data analysis (EDA) involves using graphics and visualizations to explore and analyze a data set. The goal is to explore, investigate and learn, as opposed to confirming statistical hypotheses.

Importance of EDA

EDA helps in identify errors in data sets. Gives a better understanding of the data set. It helps detect outliers or anomalous events. Helps understand data set variables and the relationship among them.

What are the components of EDA?

There are four basic parts of every EDA tool. The pattern editor, component editor, schematic layout, and PCB layout program. The names of the modules might vary from program to program, and they might be used differently, but they all exist in one form or another.

Skills needed for exploratory data analysis.

This includes practical expertise, such as knowing how to scrape and store data. It also requires more nuanced problem-solving abilities, such as how to analyze data and draw conclusions from it. As a statistical approach, exploratory data analysis (or EDA) is vital for learning more about a new dataset.

Difference between data visualization and exploratory data analysis.

Exploratory data analysis is a way to better understand your data which helps in further Data preprocessing. And data visualization is key, making the exploratory data analysis process streamline and easily analyzing data using wonderful plots and charts.

Methods used in Exploratory Data Analysis.

The main methods of analysis used are histogram, stem and leaf plot, and box plots. Fairly simple and easy to create some of the most valuable types includes two charts you can generate when doing EDA are Histograms and Scatter plots.

A histogram allows us to see the distribution of a particular variable.

A scatter plot allows us to see a relationship between two or more variables.

A Simple Example of EDA By Pairplot image:

Exploratory Data Analysis

Some of the most common plots used for Exploratory Data Analysis are as under.

1. Histograms.

2. Scatter plots.

3. Pair plots.

4. Box plots.

5. Violin plots.

6. Distribution Plots.

What are the Steps inoled while doing EDA in numerical data?

Steps for EDA

1, Look at the whole data overview. train.head()

2. Look at missing data.

3. Split the dataset into numerical and categorical(String).

4. Look at data distribution for numerical columns.

5. Look at Data Characteristics of text variables.

Difference between EDA and IDA.

EDA is different from initial data analysis (IDA), which focuses more narrowly on checking assumptions required for model fitting and hypothesis testing, and handling missing values and making transformations of variables as needed. EDA encompasses IDA.

Example of EDA

EDA on Abalone Dataset has been performed. This Dataset has 9 features with 1 Dependent and 8 Independent Features. You can Download the pdf of the EDA just by clicking on the download button of the embedded pdf in your PC as well as Mobile

Note: For Downloading on Mobile Phones, just click on the 'Open' Button. The pdf file will be automatically downloaded. You just have to wait for a few seconds to see the download option available on your mobile screen.