Skip to content

This repository serves as a proof of concept for scraping Tweets from a twitter user. This repository is educational purposes only. Please follow Twitters ToS.

Notifications You must be signed in to change notification settings

KeiranHines/TwitterScraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

7 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Proof of concept web scraper

Requirements

Python 3 required, tested with Python 3.8.5

Third-party Python modules listed in requirements.txt and can be installed using pip3 install -r requirements.txt

Firefox is required to be installed and the Firefox GeckoDriver installed and accessible in the PATH environment variable.

  • GeckoDriver can be downloaded here

Directory Structure

src: All source
|
|-->apps: Apps based on these scrapers.
|
|-->framework: The framework for this an future web-scrapers.
        |
        |-->exceptions: Exception handling for the framework. 
|
|-->handlers: Handlers that process AbstactDataClass data
|
|-->twitter: A Twitter specific user scraper

Usages

  • apps/tweet_logger.py
    • Description: gets the last I tweets for a given twitter handle and logs to stdout, updating every M minutes
    • Usage: tweet_logger.py -u -i -m
    • Defaults:
      • If -i is not specified default to 5.
      • If -m is not specified default to 10.

About

This repository serves as a proof of concept for scraping Tweets from a twitter user. This repository is educational purposes only. Please follow Twitters ToS.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages