User 1893 | 5/6/2015, 10:28:12 PM
Im given a file where each row corresponds to a review. Each row has User,Item,Rating,Categories. (IMPORTANT - The number of items is very low, but almost each User has reviewed only 1-2 items)
Im also given a test file, where each row is a User-Item pair. I have to predict whether a User will review/buy the specified Item. (1 if he will, 0 otherwise)
For this I made a recommender (automatically chose a Ranking Factorization model), and gave the item_data the categories for each item. Then i try to predict the rating for each User-Item pair, and if the rating >= 3 or something, I say he will but it.
It seems that the rating is very high for totally non-related items as well. How do I fix this?