The video game industry is dynamic and highly competitive, with countless titles released each year across various platforms and genres. To succeed in this space, it's essential for publishers and developers to have a deep understanding of market dynamics, including sales trends, popular genres, and effective platforms for distribution.
Project Objectives
- Analyze global video game sales data (from Kaggle) to identify key trends and patterns.
- Determine the most influential factors affecting game sales, such as genre, platform, and region.
- Explore market share among game publishers to understand the competitive landscape.
- Segment games into different sales categories for targeted analysis.
- Create predictive models to estimate global game sales based on available data.
Project Challenge
Navigating the vast and intricate landscape of video game sales data presented a significant challenge. Managing a dataset with missing values and diverse attributes required meticulous data preprocessing. Additionally, interpreting complex correlations and high cardinality features while building predictive models demanded a combination of statistical expertise and machine learning proficiency.
- Dealing with a large and diverse dataset with missing values.
- Identifying meaningful correlations between variables to derive valuable insights.
- Handling high cardinality features like genres and publishers.
- Creating accurate predictive models to estimate sales.
- Extracting actionable insights from complex sales data.
Initial insights
- The data set contains mostly nominal and ordinal values, which can be challenging for certain machine learning algorithms.
- Unsupervised learning was chosen for this project.
- The data set used in this project is relatively small, and it only includes video games that have sold over 100,000 copies. This means that the results of the analysis may not be generalizable to all video games.