Mechanism for preparing corpora for Concordia, built around the Fast-aligner software.
bad-words               redesign             2019-06-13 12:34:19 +02:00
dictionaries            dictionaries, paths  2019-06-13 12:44:16 +02:00
censor_sources.py       redesign             2019-06-13 12:34:19 +02:00
collect_dict.py         dictionaries, paths  2019-06-13 12:44:16 +02:00
get_alignments.py       redesign             2019-06-13 12:34:19 +02:00
Makefile                lemmatizer           2019-06-26 09:08:00 +02:00
prepare_corpus.py       redesign             2019-06-13 12:34:19 +02:00
sentence_lemmatizer.py  lemmatizer           2019-06-26 09:08:00 +02:00