mirror of
https://git.erg.school/P039/art_num_2024.git
synced 2026-02-04 04:59:18 +01:00
13 lines
433 B
Markdown
13 lines
433 B
Markdown
## A script that scrapes images from a given URL
|
|
we need to use the requests and BeautifulSoup libraries to retrieve and parse the HTML content. `os` and `shutil` are helpful for managing files and saving the images. Scraping should be done ethically, following the website's robots.txt rules and terms of service.
|
|
|
|
```
|
|
pip install requests beautifulsoup4 tldextract
|
|
|
|
```
|
|
|
|
Run the script with:
|
|
```
|
|
python cyberfeminist_images.py
|
|
```
|