User 2459 | 10/21/2015, 2:08:20 PM
Hey, 1. I have .txt files from PubMed and I would like to convert this data into, potentially, a table to use with GraphLab Create. 2. Each .txt file is an article, and I would like to extract information from these articles, for example a treatment or a drug that is discussed in the article. I was thinking of using a mixture of NLTK, deep learning, and classification. Any advice or thoughts about this?
Here is an extract from one .txt file:
"Selenium can bioaccumulate in aquatic organisms resulting in adverse effects when it exceeds threshold levels. In fish, these effects can include reduced production of viable eggs, post-hatch mortality, deformities in growing stages, and various pathological effects in the kidneys, liver, heart, and ovaries (Hamilton , ; Lemly ). In severe cases, these effects may lead to population declines (Lemly )."