Scrape any kerboodle textbook and convert it into a PDF
kerboodle-scraper scrapes any kerboodle textbook from kerboodle.com and combines its pages into a single PDF file.
- Clone the repository
- Install project requirements
$ pip install Pillow
$ pip install wget
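Once Pillow is installed, the step of combining page images into a PDF can be sketched as below. This is a minimal illustration, not the code in main.py: the function name images_to_pdf and the idea that the pages already exist as local image files are assumptions made for the example.

```python
from pathlib import Path

from PIL import Image


def images_to_pdf(image_paths, pdf_path):
    """Combine page images, in the order given, into one multi-page PDF.

    Hypothetical helper for illustration; not taken from main.py.
    """
    if not image_paths:
        raise ValueError("no page images to combine")

    # Convert every page to RGB so that all pages share a mode the
    # PDF writer accepts (for example, images with transparency do not).
    pages = [Image.open(path).convert("RGB") for path in image_paths]

    # Pillow writes a multi-page PDF when save_all=True and the
    # remaining pages are passed through append_images.
    first, rest = pages[0], pages[1:]
    first.save(pdf_path, "PDF", save_all=True, append_images=rest)
    return Path(pdf_path)
```

Calling images_to_pdf(["1.jpg", "2.jpg"], "book.pdf") would write a two-page book.pdf, provided both image files exist in the current directory.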
When running the program, you will be asked to enter a kerboodle textbook ID. This ID can be obtained with the following steps.
- Log in to your kerboodle account
- Open the textbook you would like to scrape
- Open Inspect Element (F12 or right click, then Inspect Element)
- Search the HTML for the string 'img' (CTRL-F)
- An image element should be located; its src attribute contains the image file location
- Copy the image src
- The image src has the following format, where {book_id} is the book ID
//assets-runtime-production-oxed-oup.avallain.net/ebooks/{book_id}/images/1.jpg
- Copy the book_id
- Run the program, and paste the book_id when prompted
$ python main.py
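If you would rather not pick the ID out of the URL by hand, the extraction can be sketched in a few lines of Python. The helper names extract_book_id and page_url are hypothetical and are not part of main.py. The https scheme and the assumption that later pages follow the same numbering as 1.jpg are guesses based on the URL format above. The ID abc123 in the usage note is made up.

```python
import re

# Matches the path part of the image URL format shown above.
# The host is deliberately not checked, so a src with or without
# a scheme prefix is accepted.
_BOOK_ID_RE = re.compile(r"/ebooks/([^/]+)/images/\d+\.jpg")


def extract_book_id(src):
    """Return the book ID embedded in an image src (hypothetical helper)."""
    match = _BOOK_ID_RE.search(src)
    if match is None:
        raise ValueError(f"no book ID found in: {src!r}")
    return match.group(1)


def page_url(book_id, page):
    """Build the image URL for one page (hypothetical helper).

    Assumptions: the https scheme, and that page N is stored as N.jpg
    in the same way page 1 is stored as 1.jpg.
    """
    return (
        "https://assets-runtime-production-oxed-oup.avallain.net"
        f"/ebooks/{book_id}/images/{page}.jpg"
    )
```

For example, extract_book_id("//assets-runtime-production-oxed-oup.avallain.net/ebooks/abc123/images/1.jpg") returns "abc123", which is the value to paste into the program.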
Copyright © 2022 Sanjay Sunil
Distributed under the {license} License. See LICENSE for more information.