Zoobooks Data

My group’s project fortunately enough is on track. We all are responsible for a set of years of the Zoobooks to transcribe the data from. The years that I am responsible for are 1955 and 1960-1962. Downloading the pdfs of the Zoobooks was very much the easiest part. Using the terminal and the script to change the pdfs to txt files was an easy process as well. The challenge for me came when trying to convert the txt files to csv files.

Screen Shot 2015-10-27 at 10.35.15 AM

 

 

 

 

 

 

Where I was going wrong came from saving more than one python doc in pdfminer and not restarting the script in the terminal page for each file that I was converting. The task ultimately got done and the data for all my years is nicely showcased for our story map.

Screen Shot 2015-10-27 at 11.12.21 AM

xgen

2 Comments

  1. Interesting! Now that your group project blog has delivered more details regarding the project, I am more intrigued into seeing he final result. I am also impressed by the pace of your and your group’s progress. Moreover, the pictures here also help me to understand how cool and thrilling this project will be and I appreciate the fact that you have clearly labeled the difficulties you have already encountered at this phase. Regardless, I wish you good luck and I look forward to your final project! 🙂

  2. I agree that using python scripts can sometimes be confusing to use. The hardest part is finding out the exact syntax to use for the script. Other than that little problem, it looks like your group is making great progress. I look forward to seeing the final product after analyzing the Zoobooks. It will be interesting to see what information and conclusions can be made from looking at the Zoobooks.

Leave a Reply

Your email address will not be published. Required fields are marked *