Best Method?

User 199 | 4/4/2014, 4:36:38 PM

I ran most of the algorithms on the netflix_small dataset (results attached). Looking at the validation RMSE, RBM seems to be performing best. Any comments on this? On different datasets, surely one can get different results. I was expecting better results from SVD however.


User 6 | 4/4/2014, 7:21:18 PM

Hi, There are many algorithms and no silver bullet - the performance of the algorithm also depends on the provided dataset. RBM is known to traditionally work well with Netflix data see Netflix blog: When Netflix talks about SVD they do not mean Singular Value Decomposition, but SVD++ which is also implemented in GraphChi.

You should note that the performance of the algorithm depends on many tunable parameters, and you should optimize the number of iterations, feature vector width (D), regularization and special parameters for each algorithm for example step sizes for SGD. I believe the results you sent are not optimized yet but present a baseline.