Skip to content

RAM-16/WebScraping

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Instructables.com WebScraping

This was built using BeautifulSoup, Selenium, PhantomJS


Instructions:

  • Download PhantomJS
  • Add it in the same directory where main.py is located
  • Run

Features:

  • Works on both Windows and MacOS
  • Only accepts URLs from instructables.com

Attributes extracted from the URL:

  • Scraped URL

  • Header Title

  • Views Count

  • Favorites Count

  • Comment Count

  • Supplies (If available)

  • Steps Titles

  • Youtube Video URL


Tested on URLs:

  1. https://www.instructables.com/Building-a-Self-Driving-Boat-ArduPilot-Rover/
  2. https://www.instructables.com/Hydraulic-Craft-Stick-Box/
  3. https://www.instructables.com/How-to-Make-a-Self-Watering-Plant-Stand/

Result ScreenShots:

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%