add python image scrape script

This commit is contained in:
Mara Karagianni
2024-10-31 19:25:04 +01:00
parent 7ef8f2ffd5
commit ad3a364347
2 changed files with 78 additions and 0 deletions

12
python/scrape/README.md Normal file
View File

@@ -0,0 +1,12 @@
## A script that scrapes images from a given URL
we need to use the requests and BeautifulSoup libraries to retrieve and parse the HTML content. `os` and `shutil` are helpful for managing files and saving the images. Scraping should be done ethically, following the website's robots.txt rules and terms of service.
```
pip install requests beautifulsoup4 tldextract
```
Run the script with:
```
python cyberfeminist_images.py
```