1
0
Commit Graph

66 Commits

Author SHA1 Message Date
7d27ba0fe1 Merge pull request 'scrapper' () from scrapper into master
Reviewed-on: 
2025-01-02 18:11:53 +01:00
44d6306c76 -Now movies with error(s) are being skipped -added error_saver -added movies_data.csv 2025-01-02 11:22:42 +01:00
f284af608b Added comments 2025-01-01 17:41:07 +01:00
72739bd8e2 Fixed bug in error handling. 2025-01-01 17:39:59 +01:00
34bab33075 Changed error handling in getting movie data 2024-12-30 10:25:32 +01:00
689298559c -removed debug line in get_movie_data 2024-12-23 22:12:43 +01:00
96bdc0f4d6 -fixed typo in config -added genres sorting -added checking if the movie already exists in scrapped data 2024-12-23 22:11:54 +01:00
b2b202ba7c -merged main_genres and sub_genres into genres in whole data handling 2024-12-23 22:00:34 +01:00
464387577c -removed duplications in movies list for genre when movie has multiple genres with same parent genre -changed saving to_csv in get_movie_data for test purposes -removed extending movie main_genres 2024-12-23 21:57:59 +01:00
6f1141c36a -Implemented whole scrapper -Changed logic where button with movies is not found in get_movies_links 2024-12-23 20:46:51 +01:00
8f42247244 - Implemented get_movie_data.py - Saved example of movie data - Changed columns for movie data - Added new packages (pandas, tabulate) 2024-12-23 17:50:03 +01:00
5d6b2c4427 Added way to return movies links and wrote simple docs in get_movies_links 2024-12-23 16:09:55 +01:00
a0ce4e175f -Added scrapper for interests -Added scrapper for movies links for interests -generated whole data for interests (genre,subgenres names and links), example of movies links for action subgenre -added config.py for whole scrapping purpouses -modified .gitignore to ignore __pycache__ folders 2024-12-23 00:19:02 +01:00
0bed2c8765 Modified readme 2024-12-22 23:01:15 +01:00
db0980fef8 Added simple readme.md 2024-12-22 22:10:00 +01:00
9a6aad98fb Initial commit. Made project structure, created requirements.txt 2024-12-22 22:07:47 +01:00