Skip to content

extracts, cleans, and formats data tables from wikipedia into JSON

Notifications You must be signed in to change notification settings

chrisscottromano/wiki-table-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

wiki-table-scraper

extracts, cleans, and formats data tables from wikipedia into JSON

web page url is defined in the code, not by CL arguments

with no edits, this script will generate a (mostly) cleaned JSON table of the data from this wiki page as "table.json"

to scrape a different page, you must supply:

url = " " (with a url)

If multiple tables are on page (and you want to scrape them all)

for i in range (1, x): (x = number of tables on page + 1)

If single table, comment out the for loops

I plan to turn this into an application that lets you do all this through a GUI instead of editing code

Maybe a website, maybe a chrome extension — we shall see

About

extracts, cleans, and formats data tables from wikipedia into JSON

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages