A BUSY WEEK
This week was one of the busiest weeks in my internship so far. I had to finalize my script really fast so that I could present it to my supervisor. When I finally presented my work, we had to consult a librarian to verify its accuracy. I am happy that the librarian approved my work as the only best way to gather the data from the Library of Congress. However, when we discussed the next step with my supervisors, we happened to find out an even better step to gather the most accurate data. This time it does not involve collecting data from a third-party website or scraping any website. Instead, my supervisors have a list of authors along with all their information, including their era. The era we are looking for is the Romantic era which goes from 1780 to 1838. So, this time, what we will do is work with the authors, and collect those who match the era we are interested in. When we get all the authors, we match them with our project Gutenberg metadata and only collect the work of art of those authors. This way we do not need the date of publication as we already know the era during which these authors worked.
While this week was stressful in terms of the amount of work I had to complete, it was also a very resourceful week as I got to learn so many things. I worked with python, the language that I know best, I combined Python with a query language called CQL, and I got more insight into how librarians work with old works of art through that meeting with the librarian, and now I am working with SQL query. I know that the knowledge that I acquired will be valuable for me in the future. About SQL in particular, I was nervous about it before the start of the internship because I tried to get into a database class last semester but I could not. So, I had to learn the language on my own so that I can use it. I am still in the beginning phase when it comes to SQL queries, but the more I work with the language, the more I learn, and I am happy because that is the way I have always learned new things.
Comments
Post a Comment