webscraper/README.md

# Web scraper 🔍

## Description

This project is a web scraper designed to extract data from websites.

## Features

☑️ Extracts data from web pages

## Usage

### With Docker

1. Clone the repository:

```bash
git clone https://git.wmi.amu.edu.pl/s500042/webscraper
```

2. Navigate to the project directory:

```bash
cd webscraper
```

3. Build the Docker image and run it using `start.py` script:

```bash
python scripts/start.py
```

On Mac, you'll have to use

```bash
python3 scripts/start.py
```

4. Check `/app/dist/data.json` file to see the extracted data.

### Without Docker

1. Clone the repository:

```bash
git clone https://git.wmi.amu.edu.pl/s500042/webscraper
```

2. Install the required dependencies:

```bash
pip install -r app/requirements.txt
```

If you're on Arch Linux, you'll need to create a virtual environment.
Here's is a [Step by step guide](#) that will help you create it.

3. Run `run_with_no_docker.py` script:

```bash
python scripts/run_with_no_docker.py
```

On Mac you'll, need to use:

```bash
python3 scripts/run_with_no_docker.py
```

4. Check `/app/dist/data.json` file to see the extracted data.

## License

This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.
docs: update README.md 2024-11-12 05:17:45 +01:00			`# Web scraper 🔍`

			`## Description`

docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`This project is a web scraper designed to extract data from websites.`
docs: update README.md 2024-11-12 05:17:45 +01:00
			`## Features`

docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`☑️ Extracts data from web pages`
docs: update README.md 2024-11-12 05:17:45 +01:00
docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`## Usage`
docs: update README.md 2024-11-12 05:17:45 +01:00
docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`### With Docker`
docs: update README.md 2024-11-12 05:17:45 +01:00
			`1. Clone the repository:`

			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`git clone https://git.wmi.amu.edu.pl/s500042/webscraper`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

			`2. Navigate to the project directory:`

			```bash
			`cd webscraper`
			```

docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			3. Build the Docker image and run it using `start.py` script:
docs: update README.md 2024-11-12 05:17:45 +01:00
			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`python scripts/start.py`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`On Mac, you'll have to use`
docs: update README.md 2024-11-12 05:17:45 +01:00
			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`python3 scripts/start.py`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			4. Check `/app/dist/data.json` file to see the extracted data.

docs: update README.md 2024-11-12 05:17:45 +01:00			`### Without Docker`

			`1. Clone the repository:`

			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`git clone https://git.wmi.amu.edu.pl/s500042/webscraper`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`2. Install the required dependencies:`
docs: update README.md 2024-11-12 05:17:45 +01:00
			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`pip install -r app/requirements.txt`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

			`If you're on Arch Linux, you'll need to create a virtual environment.`
			`Here's is a [Step by step guide](#) that will help you create it.`

docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			3. Run `run_with_no_docker.py` script:
docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00
			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`python scripts/run_with_no_docker.py`
docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			```
docs: update README.md 2024-11-12 05:17:45 +01:00
docs: update README to simplify usage instructions and remove outdated content 2024-11-15 17:13:29 +01:00			`On Mac you'll, need to use:`
docs: update README.md 2024-11-12 05:17:45 +01:00
			```bash
docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			`python3 scripts/run_with_no_docker.py`
docs: update README.md 2024-11-12 05:17:45 +01:00			```

docs: update README to correct script paths and improve instructions 2024-11-15 22:40:07 +01:00			4. Check `/app/dist/data.json` file to see the extracted data.

docs: update README.md 2024-11-12 05:17:45 +01:00			`## License`

			`This project is licensed under the MIT License. See the [LICENSE](LICENSE) file for details.`