Investigate TMDb Dataset
In this project, I investigated and visualized a dataset from TMDb to analyze the evolution of movies, with a focus on profitability and average budgets over time. The goal was to provide insights into the most and least profitable genres each year and examine the correlation between average budget and runtime.
Data
This dataset contains information about 10,000 movies, including user ratings and revenue. The data was cleaned from the original dataset on Kaggle. Certain columns, like ‘cast’ and ‘genres’, contain multiple values separated by pipe (|) characters, but no further cleaning was necessary. The final two columns show budget and revenue adjusted for inflation to 2010 dollars.
Original uncleaned data can be found at: TMDb Movie Metadata.
GitHub