Dr3bdo

Dr3bdo

Home Top Ad

This project was the final project for the udacity's course : Intro to Data Analysis The main task of the project is to...


This project was the final project for the udacity's course : Intro to Data Analysis The main task of the project is to analyze a dataset and then communicate the findings about it. and I should use the Python libraries NumPy, Pandas, and Matplotlib to make the analysis easier.

After finishing Udacity’s statistics courses ( Intro to Descriptive Statistics , Intro to Inferential Statistics ) the final project w...


After finishing Udacity’s statistics courses (Intro to Descriptive Statistics, Intro to Inferential Statistics) the final project was to pick a data set explore it and ask a question/hypothesis and try to answer it with statistics.
One of the suggested datasets was the Italian Football Data so I picked a similar dataset for the premiere league since I’m more familiar with it
I got the data for the season 2017/2018 from football-data.co.uk. after exploring the dataset I found out it had many columns (65) describing the events and results of each match (380 matchs), after reading this note to figure out what each column represents a simple question came to my mind : Does home team have more chance to win ?. there is already alot of studies about home matches and its effect on the home team performance and the factors causing this effect.


10 rows of the data set