Prof Mo Saraee M.Saraee@salford.ac.uk
Professor
A data mining approach to analysis and prediction of movie ratings
Saraee, MH; White, S; Eccleston, J
Authors
S White
J Eccleston
Abstract
This paper details our analysis of the Internet Movie Database (IMDb), a free, user-maintained online resource of production details for over 390,000 movies, television series and video games, which contains information such as title, genre, box-office takings, cast credits and user ratings. We gather a series of interesting facts and relationships using a variety of data mining techniques.
In particular, we concentrate on attributes relevant to the user ratings of movies: whether big-budget films are more popular than their low-budget counterparts, whether any relationship can be shown between movies produced during the "golden age" (e.g. Citizen Kane, It’s A Wonderful Life), and whether any particular actors or actresses are likely to help a movie succeed.
The paper also reports on the techniques used, describing their implementation and usefulness. We found that the IMDb is difficult to mine because of the format of its source data. We also found some interesting facts: the budget of a film is no indication of how well rated it will be, there is a downward trend in the quality of films over time, and the director and actors/actresses involved in a film are the most important factors in its success or lack thereof. The data used in this paper is not freely distributable and remains the copyright of the Internet Movie Database Inc.
Citation
Saraee, M., White, S., & Eccleston, J. (2004, September). A data mining approach to analysis and prediction of movie ratings. Presented at The Fifth International Conference on Data Mining, Text Mining and their Business Applications, Malaga, Spain.
Presentation Conference Type | Other |
---|---|
Conference Name | The Fifth International Conference on Data Mining, Text Mining and their Business Applications |
Conference Location | Malaga, Spain |
Start Date | Sep 15, 2004 |
End Date | Sep 17, 2004 |
Publication Date | Jan 1, 2004 |
Deposit Date | Nov 3, 2011 |
Publicly Available Date | Apr 5, 2016 |
Publisher URL | http://library.witpress.com/pages/PaperInfo.asp?PaperID=14248 |
Additional Information | Event Type : Conference |
Files
Wessex_movie.pdf (366 Kb, PDF)