s500042/webscraper

Go to file

paprykdev 053a9b20de Some checks failed Docker Image CI / build (push) Has been cancelled Details docs: update README.md Signed-off-by: paprykdev <58005447+paprykdev@users.noreply.github.com>		2025-01-28 17:17:10 +01:00
.github/workflows	fix: update docker workflow to build image using start.py instead of docker compose	2024-11-16 00:50:38 +01:00
app	feat: add fully functional torres scraper	2025-01-28 16:26:29 +01:00
scripts	feat: scraper for monet arts	2024-12-18 01:41:12 +01:00
.gitignore	feat: add fully functional torres scraper	2025-01-28 16:26:29 +01:00
LICENSE	docs: add MIT LICENSE	2024-11-12 05:17:24 +01:00
README.md	docs: update README.md	2025-01-28 17:17:10 +01:00

README.md

Web scraper 🔍

Description

This project is a web scraper designed to extract data from websites.

How to use

Clone the repository
cd webscraper
cd app
pip3 install -r requirements.txt
python3 scripts/monet.py for the monet scraper
python3 scripts/torres.py for the torres scraper

How to use with Docker

Clone the repository
cd webscraper/app
docker compose up -d
docker exec -it docker exec -it scraper xvfb-run --auto-servernum --server-num=1 --server-args='-screen 0, 1920x1080x24' python3 scripts/monet.py for the monet scraper
docker exec -it docker exec -it scraper xvfb-run --auto-servernum --server-num=1 --server-args='-screen 0, 1920x1080x24' python3 scripts/torres.py for the torres scraper
docker compose down

Video

Watch the video