Skip to content
This repository has been archived by the owner on Aug 31, 2022. It is now read-only.
/ spacy-benchmarks Public archive

💫 Runtime performance comparison of spaCy against other NLP libraries

Notifications You must be signed in to change notification settings

explosion/spacy-benchmarks

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

37 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Runtime performance comparison of spaCy against other NLP libraries

⚠️ This repository is old and deprecated. For up-to-date benchmark scripts, see the projects repo.

Set up the corpus DB

The speed test expects to read documents from a simple SQLite table. More corpus injestors need to be written. So far there's one to create the table from the Gigaword corpus.

fab corpus.giga:path_to_gigaword/

Set up the tools

fab init

This should download and install spaCy and other NLP libraries.

Run a benchmark

fab speed:parse,spacy,n=1000
fab speed:tag,spacy
fab speed:tag,spacy,nltk,n=10000
fab speed:tokenize,spacy,clearnlp

About

💫 Runtime performance comparison of spaCy against other NLP libraries

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages