Skip to content

A test suite to assess the ability of tools to properly handle different kinds of emojis, including skin tone and composite emojis, with regard to tokenization and various natural language processing tasks.

Notifications You must be signed in to change notification settings

abushoeb/Emoji-Test-Suite

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

4 Commits
 
 
 
 
 
 

Repository files navigation

Emoji Test Suite 👨🏽‍🔬

Assessing Emoji Use in Modern Text Processing Tools

About

We have developed a test suite to assess the ability of tools to properly handle different kinds of emojis, including skin tone and composite emojis, with regard to tokenization and various natural language processing tasks.

Results available for the following tasks:

  • Tokenization
  • Part of speech tagging
  • Dependency Parsing
  • Sentiment analysis

List of tools used in the experiments:

  • Gensim
  • NLTK
  • NLTK Tweet Tokenizer
  • PyNLPl
  • SpaCy
  • SpaCyMoji
  • Stanford CoreNLP
  • Stanza
  • Textblob

Preprint

Contact

About

A test suite to assess the ability of tools to properly handle different kinds of emojis, including skin tone and composite emojis, with regard to tokenization and various natural language processing tasks.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages