Skip to content

gehaustein/bundestag_crawler

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

2 Commits
 
 
 
 
 
 

Repository files navigation

mdb_crawler

Prerequisites

  1. The mdb_crawler uses selenium and BeautifulSoup to download all social media links from the bundestag.de website. You can use pip to install the two packages pip install selenium bs4.
  2. Selenium needs a webdriver to navigate the website. Chromedriver is used in the notebook and you would therefore need to download chromedriver on your system. Please download chromedriver from https://chromedriver.chromium.org/, unzip the file and copy the path to chromedriver into the notebook.

Chromedriver configuration

Chromedriver might not run unless the quarantine on the file is lifted. If you run which chromedriver an error might occur.
Navigate to the directory of chromedriver on your machine in terminal and run xattr -d com.apple.quarantine chromedriver to lift the quarantine.
Verify the success by running which chromedriver again without error.

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published