Skip to content

sangv/swsd

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

48 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Simple Word Sense Disambiguation Library (swsd)

A simple word sense disambiguation library written in scala with a java api

Dependencies:

Wordnet 3.0 (using extJWNL)

StanforNLP

Neo4j graph database and scala-gremlin

Approach:

On input text, perform part-of-speech tagging

On tagged text, derive intersection points for the different possible synsets of the word

Disambiguate words that can be associated with one or more intersection points

For the remaining words, disambiguate to most popular context for that part of speech

Future Work:

Add support for word compoundification

Provide a simple play webapp to deploy this as a REST service

Research into other alternatives to using StanfordNLP

Research into using other graph databases or in-memory graphs like TinkerGraph etc.

About

A simple word sense disambiguation library

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published