User 3229 | 2/19/2016, 9:34:52 PM
Is it possible to use a list of diagnosis codes as a feature to create a regression model? I have a column in my data set that contains data like so: [121095,118654,119466,118814,119528,119467] 
I am getting this error: Dataset mismatch between training and prediction. Numeric feature 'ProceduresList' must contain lists of consistent size. (Found lists/arrays of sizes 1 and 0).
Do I need to create a column for each possible Diagnosis code and then flag True of False if the patient has that diagnosis? That would be an exhaustive list of features if I wanted to track all possibilities. Or is there a way to put them all in one column? I feel the "array" option for a model feature is not going to help save me here?