IndexError when reading Avro file

User 2110 | 7/27/2015, 7:01:02 PM

I'm attempting to read in an Avro file, but I'm receiving this error:

    <ipython-input-29-c7e6b5f53359> in <module>()
          1 requestFile = "req/part-00000.avro"
    ----> 2 requests = gl.SArray.from_avro(requestFile)

    /anaconda/envs/dato-env/lib/python2.7/site-packages/graphlab/datastructures/sarray.pyc in fromavro(cls, filename)
        483 mt.getmetrictracker().track('sarray.fromavro')
        484 proxy = UnitySArrayProxy(glconnect.getclient())
    --> 485 proxy.loadfromavro(filename)
        486 return cls(_proxy = proxy)
        487

    graphlab/cython/cysarray.pyx in graphlab.cython.cysarray.UnitySArrayProxy.loadfromavro()

    graphlab/cython/cysarray.pyx in graphlab.cython.cysarray.UnitySArrayProxy.loadfromavro()

    IndexError: vector

I've been able to read this specific Avro file using other packages (e.g. spark-avro), and the code is very simple, which makes me wonder whether the Avro reader is missing an edge case:

    import graphlab as gl
    requestFile = "req/part-00000.avro"
    requests = gl.SArray.from_avro(requestFile)

Any thoughts?

Comments

User 19 | 7/28/2015, 6:28:26 PM

Hi bplaster,

Is it possible for you to share a small snippet of data that reproduces the issue? This will help us find and fix the issue.
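If sharing the data itself is difficult, the file header alone may help narrow this down, since it records the writer schema and the compression codec. Below is a minimal, standard-library-only sketch that reads that header as laid out in the Avro object container file specification. It does not use GraphLab at all, and the helper names (`read_long`, `read_bytes`, `read_avro_header`, `describe_avro_file`) are made up for this illustration.

```python
import json


def read_long(stream):
    """Decode one Avro long (zigzag, variable-length) from a binary stream."""
    shift = 0
    accum = 0
    while True:
        raw = stream.read(1)
        if not raw:
            raise EOFError("unexpected end of data inside a varint")
        byte = ord(raw)  # works on Python 2 and 3
        accum |= (byte & 0x7F) << shift
        if not byte & 0x80:  # high bit clear: last byte of this number
            break
        shift += 7
    return (accum >> 1) ^ -(accum & 1)  # undo zigzag encoding


def read_bytes(stream):
    """Read a length-prefixed Avro bytes/string value."""
    length = read_long(stream)
    if length < 0:
        raise ValueError("negative length in header")
    data = stream.read(length)
    if len(data) != length:
        raise EOFError("unexpected end of data inside a value")
    return data


def read_avro_header(stream):
    """Return (metadata dict, 16-byte sync marker) from a container file."""
    if stream.read(4) != b"Obj\x01":
        raise ValueError("not an Avro object container file")
    meta = {}
    while True:
        count = read_long(stream)
        if count == 0:  # a zero-count block ends the metadata map
            break
        if count < 0:  # negative count is followed by the block size in bytes
            count = -count
            read_long(stream)
        for _ in range(count):
            key = read_bytes(stream).decode("utf-8")
            meta[key] = read_bytes(stream)
    return meta, stream.read(16)


def describe_avro_file(path):
    """Return (schema as parsed JSON, codec name) for the file at `path`."""
    with open(path, "rb") as f:
        meta, _sync = read_avro_header(f)
    schema = json.loads(meta["avro.schema"].decode("utf-8"))
    # The codec entry is optional; the spec treats a missing one as "null".
    codec = meta.get("avro.codec", b"null").decode("utf-8")
    return schema, codec
```

Calling `describe_avro_file("req/part-00000.avro")` and posting the returned schema and codec would show whether the file uses, for example, a compressed codec or a nested or union schema. Which of those, if any, triggers the error is not known at this point.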

Thanks, Chris


User 2110 | 7/29/2015, 2:58:52 PM

Chris,

Here is a small sample of the data.

Thank you, Brandon