
Returns the latest research results by crawling arXiv papers and summarizing their abstracts. Helps you stay afloat with so many new papers every day.

chiphuyen/sotawhat

sotawhat

This script runs using Python 3.

First, install the required packages. This script only requires nltk and PyEnchant.

$ pip3 install -r requirements.txt

If you run into an error saying that the package punkt doesn't exist, download it by opening a Python shell and running:

$ python3

>>> import nltk
>>> nltk.download('punkt')
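
If you would rather not download punkt every time, the check can be scripted. The following is a minimal sketch, not part of sotawhat itself: `ensure_punkt` is a hypothetical helper name, and its `find` and `download` parameters exist only so the logic can be exercised without network access. It assumes that `nltk.data.find` raises `LookupError` when a resource is missing, which is how NLTK reports missing data.

```python
def ensure_punkt(find=None, download=None):
    """Download the punkt tokenizer data only if it is not already installed.

    Hypothetical helper, not part of sotawhat. Returns True if a download
    was triggered and False if punkt was already available.
    """
    if find is None or download is None:
        # Import lazily so the defaults are only needed when actually used.
        import nltk
        find = find or nltk.data.find
        download = download or nltk.download
    try:
        # Raises LookupError when the resource is not in any NLTK data path.
        find("tokenizers/punkt")
        return False
    except LookupError:
        download("punkt")
        return True
```

Calling `ensure_punkt()` with no arguments uses the real NLTK functions, so it can still hit the SSL error on macOS.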

On macOS, you may get the following SSL error:

[nltk_data] Error loading punkt: <urlopen error [SSL:
[nltk_data]     CERTIFICATE_VERIFY_FAILED] certificate verify failed:
[nltk_data]     unable to get local issuer certificate (_ssl.c:1045)>

This can be fixed by reinstalling the certificates:

$ /Applications/Python\ 3.x/Install\ Certificates.command

To query for a certain keyword, run:

$ python3 sotawhat.py "[keyword]" [number of results]

For example:

$ python3 sotawhat.py "perplexity" 10

If you don't specify the number of results, the script returns 5 results by default. Each result contains the title of the paper with its authors and publication date, a summary of the abstract, and a link to the paper.
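
The command-line shape described above (a required keyword plus an optional result count that defaults to 5) could be handled roughly as follows. This is only an illustrative sketch using the standard library's argparse. It is not taken from sotawhat.py, and `parse_args`, `keyword`, and `num_results` are assumed names.

```python
import argparse


def parse_args(argv=None):
    """Parse `sotawhat.py "[keyword]" [number of results]` style arguments.

    Hypothetical sketch; the real script may read its arguments differently.
    """
    parser = argparse.ArgumentParser(prog="sotawhat.py")
    parser.add_argument(
        "keyword",
        help="search term; quote it if it contains spaces",
    )
    parser.add_argument(
        "num_results",
        nargs="?",      # optional positional argument
        type=int,
        default=5,      # matches the default described above
        help="number of results to return (default: 5)",
    )
    return parser.parse_args(argv)
```

With this sketch, `parse_args(["perplexity", "10"])` yields a keyword of `"perplexity"` and a result count of 10, while omitting the second argument falls back to 5.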

We've found that this script works well with keywords that are:

  • a model (e.g. transformer, wavenet, ...)
  • a dataset (e.g. wikitext, imagenet, ...)
  • a task (e.g. 'language model', 'machine translation', 'fuzzing', ...)
  • a metric (e.g. BLEU, perplexity, ...)
  • random stuff
