# ium_444018

Tasks completed as part of Machine Learning Engineering (Inżynieria Uczenia Maszynowego).
## Dataset

***IMDB Movies Dataset***

https://www.kaggle.com/datasets/harshitshankhdhar/imdb-dataset-of-top-1000-movies-and-tv-shows?select=imdb_top_1000.csv
## Requirements

- `python3`
- `pip`
- API token from `kaggle.com`
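
Before running anything that talks to Kaggle, it can help to confirm that the API token is actually visible. The sketch below is not part of this repo; it assumes the usual Kaggle API conventions (a `kaggle.json` file in `~/.kaggle`, or the `KAGGLE_USERNAME` and `KAGGLE_KEY` environment variables, with `KAGGLE_CONFIG_DIR` as an optional override). The function name `find_kaggle_credentials` is hypothetical.

```python
import os
from pathlib import Path


def find_kaggle_credentials(env=None, config_dir=None):
    """Return where Kaggle credentials were found, or None if missing.

    Hypothetical helper: it only checks the locations the Kaggle API
    is assumed to read, it does not validate the token itself.
    """
    env = os.environ if env is None else env

    # Option 1: credentials passed through environment variables.
    if env.get("KAGGLE_USERNAME") and env.get("KAGGLE_KEY"):
        return "environment"

    # Option 2: a kaggle.json token file in the config directory.
    if config_dir is None:
        config_dir = env.get("KAGGLE_CONFIG_DIR", Path.home() / ".kaggle")
    token = Path(config_dir) / "kaggle.json"
    return str(token) if token.is_file() else None
```

Calling `find_kaggle_credentials()` with no arguments checks the current environment; a `None` result suggests the token still needs to be downloaded from the Kaggle account settings.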
## Running

- Install the required packages:

```sh
$ pip install -r requirements.txt
```
- Download the dataset from Kaggle. The script in the repo downloads the data and splits it into subsets:

```sh
$ ./download_dataset.sh
```
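
To illustrate what "splitting into subsets" might look like, here is a minimal Python sketch of a shuffled train/dev/test split of a CSV file. It is not the contents of `download_dataset.sh`, which are not shown here; the function names, the 80/10/10 proportions, the fixed seed, and the output file names are all assumptions made for the example.

```python
import csv
import random


def split_rows(rows, dev_frac=0.1, test_frac=0.1, seed=42):
    """Shuffle rows reproducibly and cut them into train/dev/test lists.

    The fractions and the seed are illustrative defaults, not values
    taken from the repo's script.
    """
    rows = list(rows)
    random.Random(seed).shuffle(rows)
    n_dev = int(len(rows) * dev_frac)
    n_test = int(len(rows) * test_frac)
    dev = rows[:n_dev]
    test = rows[n_dev:n_dev + n_test]
    train = rows[n_dev + n_test:]
    return train, dev, test


def split_csv(path, out_prefix="imdb", **kwargs):
    """Write <out_prefix>_train.csv, _dev.csv and _test.csv, each with the header."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.reader(f)
        header = next(reader)
        parts = split_rows(reader, **kwargs)
    for name, part in zip(("train", "dev", "test"), parts):
        with open(f"{out_prefix}_{name}.csv", "w", newline="", encoding="utf-8") as f:
            writer = csv.writer(f)
            writer.writerow(header)
            writer.writerows(part)
```

For example, `split_csv("imdb_top_1000.csv")` would write three files next to the current directory, assuming the CSV named in the dataset URL has already been downloaded. Using the `csv` module rather than line-based tools keeps quoted fields that contain commas or newlines intact.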