wiki-use-annoy

Look up short Wikipedia articles using Google's USE (Universal Sentence Encoder) and Annoy (Approximate Nearest Neighbors Oh Yeah).

Note: This does not look up all of Wikipedia ;). Check out the disclaimer below.

Installation

Clone the repo

git clone https://github.com/jaganlal/wiki-use-annoy.git
cd wiki-use-annoy/

Install the required libs

pip install -r requirements.txt

Usage

Model Download

Download the universal-sentence-encoder-large model using the download-use.py script

python download-use.py
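
As a rough illustration of what this step involves, the sketch below loads the encoder through TensorFlow Hub and turns sentences into vectors. It is not the contents of download-use.py. It assumes TensorFlow 2 with the tensorflow_hub package and version 5 of the model; the script in this repo may use a different API or model version. The names USE_LARGE_URL, load_encoder and embed_sentences are hypothetical.

```python
# Assumed TF2 model handle; the repository's script may pin another version.
USE_LARGE_URL = "https://tfhub.dev/google/universal-sentence-encoder-large/5"


def load_encoder(url=USE_LARGE_URL):
    """Download the encoder (or reuse a cached copy) and return it."""
    import tensorflow_hub as hub  # third-party; imported lazily

    return hub.load(url)


def embed_sentences(encoder, sentences):
    """Return one embedding (a list of floats) per input sentence."""
    # The loaded model is callable on a list of strings and returns a tensor.
    return encoder(sentences).numpy().tolist()
```

With the dependencies installed, embed_sentences(load_encoder(), ["Music is an art form."]) should return a single 512-dimensional vector.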

Build Annoy Index

Build the Annoy index for the short-wiki.csv file

python build-short-wiki-annoy-index.py
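
Annoy stores items under non-negative integer ids, so string ids such as music-wikipedia need a lookup table kept alongside the index. A minimal sketch of that bookkeeping and of the index build follows. It is not the code in build-short-wiki-annoy-index.py: the function names and the n_trees default are assumptions, and the vectors are expected to be embeddings produced by the encoder.

```python
def make_id_maps(article_ids):
    """Map string article ids to the integer ids Annoy requires, and back."""
    id_to_int = {article_id: i for i, article_id in enumerate(article_ids)}
    int_to_id = {i: article_id for article_id, i in id_to_int.items()}
    return id_to_int, int_to_id


def build_annoy_index(vectors, index_path, n_trees=10):
    """Build and save an angular-distance Annoy index (hypothetical helper).

    vectors[i] is the embedding of the article with integer id i.
    """
    from annoy import AnnoyIndex  # third-party; imported lazily

    index = AnnoyIndex(len(vectors[0]), "angular")
    for i, vector in enumerate(vectors):
        index.add_item(i, vector)
    index.build(n_trees)  # more trees: better accuracy, larger index file
    index.save(index_path)
    return index
```

The integer-to-string map has to be saved too, since the index file itself only knows the integer ids.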

Find Similarities

Find similar articles by providing an id

python find-similar-wiki-articles.py

Key in an id, for example music-wikipedia. You'll see results like the following (shown as ids for simplicity)

pop-wikipedia
guitar-wikipedia
brain-wikipedia
world-wikipedia
science-wikipedia
malayalam-wikipedia
sourashtra-wikipedia
apple-wikipedia
usa-wikipedia
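
For intuition, the lookup can be pictured as an exact, brute-force nearest-neighbor search over the article embeddings. The sketch below is not what find-similar-wiki-articles.py does: Annoy answers the same question approximately from a prebuilt index, which scales much better. The helper names and the toy 3-dimensional vectors are made up for illustration; the distance formula is the one Annoy documents for its "angular" metric.

```python
import math


def angular_distance(u, v):
    """Annoy's "angular" metric: sqrt(2 * (1 - cosine_similarity))."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    cosine = dot / (norm_u * norm_v)
    # Clamp at 0 to guard against tiny negative values from rounding.
    return math.sqrt(max(0.0, 2.0 * (1.0 - cosine)))


def most_similar(query_id, vectors_by_id, k=9):
    """Return the k ids closest to query_id, nearest first, excluding itself."""
    query = vectors_by_id[query_id]
    others = [
        (angular_distance(query, vector), article_id)
        for article_id, vector in vectors_by_id.items()
        if article_id != query_id
    ]
    return [article_id for _, article_id in sorted(others)[:k]]


# Toy vectors, made up for illustration; real USE embeddings have 512 dimensions.
toy_vectors = {
    "music-wikipedia": [0.9, 0.1, 0.0],
    "pop-wikipedia": [0.8, 0.2, 0.1],
    "guitar-wikipedia": [0.7, 0.3, 0.0],
    "apple-wikipedia": [0.0, 0.2, 0.9],
}
print(most_similar("music-wikipedia", toy_vectors, k=2))
```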

Disclaimer

I started short-wiki.csv as a collection of short introductions to selected articles (source: Wikipedia) about places, people, culture, and so on. This application looks up only those articles; check out short-wiki.csv for details. Think of it as a lookup over cleaned-up data. If you want to contribute code or data, please feel free to fork the repo and create a PR.

Note

As you may have noticed, the scripts have no error handling.

References

https://jaganlal.github.io/ui-sentence-similarity/

https://github.com/jaganlal/wiki-use-annoy-tf2/blob/master/README.md

https://towardsdatascience.com/use-cases-of-googles-universal-sentence-encoder-in-production-dd5aaab4fc15

https://medium.com/@vineet.mundhra/finding-similar-sentences-using-wikipedia-and-tensorflow-hub-dee2f52ed587

