mirror of
https://git.erg.school/P039/art_num_2024.git
synced 2026-02-04 04:59:18 +01:00
add python image scrape script
This commit is contained in:
12
python/scrape/README.md
Normal file
12
python/scrape/README.md
Normal file
@@ -0,0 +1,12 @@
|
||||
## A script that scrapes images from a given URL
|
||||
we need to use the requests and BeautifulSoup libraries to retrieve and parse the HTML content. `os` and `shutil` are helpful for managing files and saving the images. Scraping should be done ethically, following the website's robots.txt rules and terms of service.
|
||||
|
||||
```
|
||||
pip install requests beautifulsoup4 tldextract
|
||||
|
||||
```
|
||||
|
||||
Run the script with:
|
||||
```
|
||||
python cyberfeminist_images.py
|
||||
```
|
||||
Reference in New Issue
Block a user