Social network discovery

Source: allthingsgraphed.com

Project explanation:

The author explores a combination of sentiment analysis and network discovery methods to extract knowledge relevant for the company. The dataset consists of 15 million mixed-language Twitter messages purchased by the company. Messages are embedded via a word2vec model and a CNN sentiment classifier is trained on Google-translated messages from SemEval competition. It performs with 85% accuracy on our test set. A network of 3.5 million users is constructed and analysed using Louvain community detection to be subdivided into 81 communities. For each community representative users are identified by centrality measures, notably PageRank. The 3 biggest communities are formed around the topics of: team sports, teenage life and political news. Furthermore, the network is embedded via node embeddings and 0.9 million paths taken by messages are defined. The same CNN classifier is then trained on the paths. The resulting context-based sentiment classifiers performs at 81% accuracy without reading the content of a message. We therefore show that knowledge of context of a message brings a predictive power almost as high as knowing the message's content. The results conclude that poorly interpretable embeddings can be used to build highly interpretable features for each user.

Status:

Work in progress to be completed in May 2018.

Released under GNU GPL v2.1 License.

Resources:

Community detection in graphs: a user guide - A wonderful summary of academic approaches to centrality measures and community detection.

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
LICENSE		LICENSE
README.md		README.md
graph_creator.py		graph_creator.py
graph_maker_hash.py		graph_maker_hash.py
node_embedding.ipynb		node_embedding.ipynb
node_embedding_update.ipynb		node_embedding_update.ipynb
node_split.py		node_split.py
nodes_embedding.py		nodes_embedding.py
nodes_paths_with_sentiment.py		nodes_paths_with_sentiment.py
nodes_paths_with_sentiment_balanced.py		nodes_paths_with_sentiment_balanced.py
nodes_paths_with_sentiment_balanced_all.py		nodes_paths_with_sentiment_balanced_all.py
nodes_paths_with_sentiment_balanced_update.py		nodes_paths_with_sentiment_balanced_update.py
opm.py		opm.py
tweet_reader.py		tweet_reader.py
tweets2list.py		tweets2list.py
utils.py		utils.py
w2v.py		w2v.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Social network discovery

Project explanation:

Status:

Resources:

About

Releases

Packages

Languages

License

positivedefinite/social_network_discovery

Folders and files

Latest commit

History

Repository files navigation

Social network discovery

Project explanation:

Status:

Resources:

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages