Twitter is full of posts tagged #BigData and #DataScience. Which ones do people pay the most attention to? In a project for Synergic Partners (now integrated within LUCA, the Telefonica Data Unit), this team used network science and text-mining techniques to identify Twitter influencers in data science. They built a projected network by merging the “retweet” and “mention” layers into a single layer, then detected communities using the K-Clique, Modularity, Random Walk and Mixed Membership Blockmodel algorithms. They identified each community’s influencers using centrality metrics and characterized users and communities using topic modeling with LDA. Working with a limited dataset of fewer than 200,000 tweets, they found that the modularity and random walk techniques produced the most coherent communities, judged by user demographics and influencers. An interactive visualization showed each community’s network and user demographics.
Students: Casey Huang, Claire Liu, Jordan Rosenblum and Steven Royce.
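
The layer-merging and influencer-ranking steps described above can be illustrated with a minimal sketch. It is not the team's implementation: the toy edge lists, the function names `project_layers` and `weighted_in_degree`, the equal weighting of retweets and mentions, and the choice of weighted in-degree as the centrality metric are all assumptions made for illustration, since the summary does not specify them.

```python
from collections import Counter

# Hypothetical toy data: (source_user, target_user) pairs, one per interaction.
retweets = [("ana", "bob"), ("cy", "bob"), ("dee", "bob"), ("bob", "eve")]
mentions = [("ana", "bob"), ("ana", "eve"), ("cy", "eve")]


def project_layers(*layers):
    """Merge several directed edge lists into one weighted edge map.

    Assumption: every interaction counts equally, so an edge's weight is
    the number of times it appears across all layers.
    """
    weights = Counter()
    for layer in layers:
        weights.update(layer)
    return weights


def weighted_in_degree(weights):
    """Sum the incoming edge weight per user (one simple centrality score)."""
    score = Counter()
    for (_source, target), weight in weights.items():
        score[target] += weight
    return score


projected = project_layers(retweets, mentions)
ranking = weighted_in_degree(projected).most_common()
print(ranking)  # [('bob', 4), ('eve', 3)]
```

In this toy network "bob" ranks first because he receives the most retweets and mentions combined. A fuller analysis would run a community detection step on the projected network before ranking users within each community.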