This project involves an exploratory data analysis (EDA) of game sales to answer several key questions about the industry trends and preferences.
- Global Bestsellers: Which games have the highest global sales?
- Yearly Sales: Which year recorded the highest global sales?
- Genre Popularity: What are the most popular game genres globally and in each region (North America, Europe, Japan, Rest of the World)?
- Changing Preferences: How have genre preferences evolved over the years?
The project is structured in a Jupyter Notebook format. Below is an outline of the notebook:
-
Introduction
- An overview of the objectives and questions to be answered.
-
Data Loading and Initial Exploration
- Importing necessary libraries (
pandas
,matplotlib.pyplot
). - Loading the dataset and displaying the first few rows for an initial understanding.
import pandas as pd import matplotlib.pyplot as plt df = pd.read_csv('data/XboxOne_GameSales.csv', encoding='latin1') df.head()
- Importing necessary libraries (
-
Global Bestsellers
- Analysis to identify games with the highest global sales.
-
Yearly Sales
- Analysis to determine which year recorded the highest global sales.
-
Genre Popularity
- Analysis of the most popular game genres globally and in each region.
-
Changing Preferences
- Analysis of how genre preferences have evolved over the years.
- Python 3.x
- Jupyter Notebook
- Pandas library
- Matplotlib library
-
Clone the repository:
git clone https://github.com/LeoIvin/Game-Sales-Exploratory-Data-Analysis.git
-
Navigate to the project directory:
cd Game-Sales-Exploratory-Data-Analysis
-
Install the required libraries:
pip install pandas matplotlib
-
Run the Jupyter Notebook:
jupyter notebook
Open the xbox-analysis.ipynb
notebook in Jupyter and run the cells to perform the analysis. Each section of the notebook includes comments and explanations to guide you through the analysis process.
If you would like to contribute to this project, please fork the repository and submit a pull request with your changes.