User 4603 | 4/11/2016, 9:31:21 AM
I want some help on applying linear regression model.
I have a data file with 12k training examples and 2k test examples.
Each training example have 2 features. Out of which, 1 feature have float values and other have categorical data with around 1400 unique categories. I am getting a very bad rmse(16000) and correlation coefficients(84%).
Kindly help me in deciding what to do next to improve my model.