gonito-data/README.md

9 lines
487 B
Markdown
Raw Normal View History

2021-05-02 14:55:33 +02:00
# Amazon Products (Japanese)
This challenge requires extracting product category from product description.
The data is taken from Japanese amazon and consists of over 8000 product offers.
It was scraped using a simple Python bot. Most of the product descriptions contain
the category as a substring somewhere in the text (or alternatively some synonym of the category).
There is also no predefined set of all possible categories. Hence this task is NOT about
sequence classification.