mirror of
https://git.erg.school/P039/art_num_2024.git
synced 2026-02-04 13:09:20 +01:00
translate scraping README in french
This commit is contained in:
@@ -1,12 +1,16 @@
|
|||||||
## A script that scrapes images from a given URL
|
## Un script qui extrait des images depuis une URL donnée
|
||||||
we need to use the requests and BeautifulSoup libraries to retrieve and parse the HTML content. `os` and `shutil` are helpful for managing files and saving the images. Scraping should be done ethically, following the website's robots.txt rules and terms of service.
|
|
||||||
|
|
||||||
|
Nous devons installer:
|
||||||
```
|
```
|
||||||
pip install requests beautifulsoup4 tldextract
|
pip install requests beautifulsoup4 tldextract
|
||||||
|
|
||||||
```
|
```
|
||||||
|
|
||||||
Run the script with:
|
Exécutez le script avec :
|
||||||
```
|
```
|
||||||
python cyberfeminist_images.py
|
python get_images.py https://www.freepik.com/images
|
||||||
```
|
```
|
||||||
|
Remplacez l’URL par le lien que vous souhaitez extraire.
|
||||||
|
**Remarque:** Le scraping doit être effectué de manière éthique, en respectant les règles du fichier robots.txt et les conditions d'utilisation du site.
|
||||||
|
|
||||||
|
|
||||||
|
|||||||
Reference in New Issue
Block a user