Choosing the optimal approach to building a recommender system based on movie data
DOI: 10.31673/2412-4338.2022.028590
Abstract
Recommender systems have become increasingly popular recently because they can address the problem of information overload by suggesting items of interest to the users. The volume of information with each year increases and more opportunities for Internet business, so it becomes easier to get anything through the internet, such as goods, books, movies, news. Work in the field of creating recommendation systems can be both commercial and research in nature, and need to address a number of issues. The paper studies the problem of choosing the optimal approach to building a recommendation system by repulsive from the available data. It is not possible to create a recommendation system without data. The data is usually available in an explicit or implicit way. Data that is called explicit can be collected by finding reviews and sharing of user views about different products. However, the implicit data is related to the search magazine and the data history available on the system. Choosing an optimal approach to building a recommendation system will allow the machine learning algorithm to be used. Simulated recommendation system architecture that helps to understand how to build a user-defined recommendation process. Recommender systems are classified in the manner of sampling the necessary material for the user system. Two basic approaches are primarily applied: collaborative filtering and content-oriented filtering. There is also a hybrid filtering that combines both collaborative and content-oriented filtering. The work carried out analysis and comparison of recommendation systems by types. Comparing types of recommendation system based on basic characteristics: implementation complexity, implementation accuracy, job speed, dependency on system users. Analysis of the core content-oriented filtering machine learning algorithms have been conducted.
Key words: recommendation system, collaborative filtering, content-oriented filtering, hybrid filtering, algorithm.
References
- Yadav N., Kumar R., Singh A., Pal S. Diversity in Recommendation System: Cluster Based Approach. Hybrid Intelligent Systems. 2020. 113–122с.
- John S. Breese, David Heckerman, and Carl Myers Kadie. Empirical analysis of predictive algorithms for collaborative filtering. CoRR, abs/1301.7363, 2013. 69с.
- Liu Na, Ming-Xia Li, Qiu Hai-yang, and Hao-Long Su. A hybrid user-based collaborative filtering algorithm with topic model. Appl. Intell., 51(11):7946–7959, 2021.
- Soumen Chakrabarti. Mining the web - discovering knowledge from hypertext data. Morgan Kaufmann, 2003.
- Christopher D. Manning, Prabhakar Raghavan, and Hinrich Schütze. Introduction to information retrieval. Cambridge University Press, 2008.
- Robin D. Burke. Hybrid recommender systems: Survey and experiments. User Model. User Adapt. Interact., 12(4):331–370, 2002.
- Cosine similarity [Electronic resource] – Online: https://www.wikiwand.com/en/Cosine_similarity
- Tf–idf [Electronic resource] – Online: https://en.wikipedia.org/wiki/Tf–idf
- Roshan Bharti and Deepak Gupta. Recommending Top N Movies Using Content-Based Filtering and Collaborative Filtering with Hadoop and Hive Framework,2019.102-112с.