.
In this post, we are going to be looking at the task of performing topic modelling on twitter data to figure out what people are tweeting about. We would first load our data, perform simple EDA on the data to explore popular hashtags and users, lastly we would apply a machine learning algorithm, LDA ( Latent Dirichlet Allocation), to explore the topics in the tweets.
The code for this blog post can be found here.
To follow this post, you should be comfortable with using basic python
, thepandas
and numpy
python package. You will need to have the following packages installed: numpy
, pandas
, seaborn
, matplotlib
, sklearn
, nltk
.
https://medium.com/@osas.usen/topic-extraction-from-tweets-using-lda-a997e4eb0985
.
0 Comments