Scraping Books from Internet

This project contains a collection of scripts designed to scrape books, primarily from the Internet Archive. The results are stored in CSV files for easy access and processing.

Features

Book Scraping: Extract metadata and details of books from the Internet Archive.
CSV Output: The scraped data is saved in well-structured CSV files.

Requirements

Ensure you have Python installed. Then, install the necessary dependencies:

pip install -r requirements.txt

Usage

Clone the repository:

git clone https://github.com/your_username/scraping_books_from_internet.git

Navigate to the project directory:
```
cd scraping_books_from_internet
```
Run the scraping script:
```
python api7.py
```
Or:
```
python scraper.py
```
The resulting CSV files will be located in the output/ directory.

Current Status

The project is functional but contains some known bugs. If you encounter issues, feel free to report them or contribute a fix.

Contributing

Contributions are welcome! If you'd like to enhance the project or fix bugs:

Fork the repository
Make your changes
Submit a pull request

Let’s improve this project together!

Notes

The scraping scripts are optimized for the Internet Archive, but they might be adaptable to other sources with some modifications or other data like videos , images ...
Ensure compliance with the terms of service of the websites you scrape.

Happy scraping! 🕵️‍♂️📚

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Readme.md		Readme.md
api.py		api.py
api2.py		api2.py
api3.py		api3.py
api4.py		api4.py
api5.py		api5.py
api6.py		api6.py
api7.py		api7.py
api_basic.py		api_basic.py
books_info.csv		books_info.csv
books_info_with_images.csv		books_info_with_images.csv
books_info_with_images4.csv		books_info_with_images4.csv
main.py		main.py
scraper.py		scraper.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Scraping Books from Internet

Features

Requirements

Usage

Current Status

Contributing

Notes

About

Releases

Packages

Languages

Locussta/scraping_books_from_internet

Folders and files

Latest commit

History

Repository files navigation

Scraping Books from Internet

Features

Requirements

Usage

Current Status

Contributing

Notes

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages