IMPORTANCE OF VARIOUS SKILLSETS

     In the second part of my internship, we are using Python to process the data, mainly for topic modeling. This will be the very first time that I use a tool I learned in college. We will be using Gensim for topic modeling, and I learned about Gensim in my Text Analysis class right before my internship. I am happy that some of the skills I learned will be useful in an actual project. However, we will not be using Gensim exclusively; it is only one of several libraries.

    Here again, we will be using more than one tool for the same task in order to get the best result, just as we did during data collection. It made me realize how important it is to always compare different options and to know which one to choose. I used to be the type of person who would master a single tool and do my whole job with it. For example, I only mastered Python for data science and Pandas for data processing. Now, I am thinking of broadening my horizons and adding more skills to my toolbox. I am planning to learn R as a second option for data science and analysis, and PySpark for big data. We are already using PySpark in our current project, so I should be able to learn it by the end of my internship, and I can learn R by taking a data analytics class in Berea.
