|
7d27ba0fe1
|
Merge pull request 'scrapper' (#1) from scrapper into master
Reviewed-on: #1
|
2025-01-02 18:11:53 +01:00 |
|
|
44d6306c76
|
-Now movies with error(s) are being skipped -added error_saver -added movies_data.csv
|
2025-01-02 11:22:42 +01:00 |
|
|
f284af608b
|
Added comments
|
2025-01-01 17:41:07 +01:00 |
|
|
72739bd8e2
|
Fixed bug in error handling.
|
2025-01-01 17:39:59 +01:00 |
|
|
34bab33075
|
Changed error handling in getting movie data
|
2024-12-30 10:25:32 +01:00 |
|
|
689298559c
|
-removed debug line in get_movie_data
|
2024-12-23 22:12:43 +01:00 |
|
|
96bdc0f4d6
|
-fixed typo in config -added genres sorting -added checking if the movie already exists in scrapped data
|
2024-12-23 22:11:54 +01:00 |
|
|
b2b202ba7c
|
-merged main_genres and sub_genres into genres in whole data handling
|
2024-12-23 22:00:34 +01:00 |
|
|
464387577c
|
-removed duplications in movies list for genre when movie has multiple genres with same parent genre -changed saving to_csv in get_movie_data for test purposes -removed extending movie main_genres
|
2024-12-23 21:57:59 +01:00 |
|
|
6f1141c36a
|
-Implemented whole scrapper -Changed logic where button with movies is not found in get_movies_links
|
2024-12-23 20:46:51 +01:00 |
|
|
8f42247244
|
- Implemented get_movie_data.py - Saved example of movie data - Changed columns for movie data - Added new packages (pandas, tabulate)
|
2024-12-23 17:50:03 +01:00 |
|
|
5d6b2c4427
|
Added way to return movies links and wrote simple docs in get_movies_links
|
2024-12-23 16:09:55 +01:00 |
|
|
a0ce4e175f
|
-Added scrapper for interests -Added scrapper for movies links for interests -generated whole data for interests (genre,subgenres names and links), example of movies links for action subgenre -added config.py for whole scrapping purpouses -modified .gitignore to ignore __pycache__ folders
|
2024-12-23 00:19:02 +01:00 |
|
|
0bed2c8765
|
Modified readme
|
2024-12-22 23:01:15 +01:00 |
|
|
db0980fef8
|
Added simple readme.md
|
2024-12-22 22:10:00 +01:00 |
|
|
9a6aad98fb
|
Initial commit. Made project structure, created requirements.txt
|
2024-12-22 22:07:47 +01:00 |
|