One hundred and fifty pages of data – OCR help

Sample of my data courtesy of Rita Wagner at the Prince George Tree Improvement Station

My research depends on data that many, many people have collected over many, many years. A lot of it is still on data collection sheets used in the field and has been sitting ignored in filing cabinets. It is absolutely fantastic that people have been willing to dig up and share this stuff with me. Hopefully by the end of my project I’ll have a great big data package to publish on dryad! Then whenever anyone needs this kind of data, no one has to waste time digging through decades of old files.

Now that I’ve got the data, I need to analyze it. And to do that I need to get these handwritten data into something I can feed R. Before I talk some poor undergrad into helping me out, I thought I’d look into some kind of automated solution. My knowledge of OCR at this point is “sometimes some program can read text in an image.” Any advice? You can see a sample of what I’m working with in the image above.

4 thoughts on “One hundred and fifty pages of data – OCR help”

Automating digitizing your data | On the other hand says:

2013.2.4 at 4:28 pm

[…] month I asked for advice on automating a huge bunch of data transcription. At that point, my knowledge of OCR was […]

On the other hand

Internet home of C. Susannah Tysor

One hundred and fifty pages of data – OCR help

4 thoughts on “One hundred and fifty pages of data – OCR help”

Leave a Reply Cancel reply