2021-11-04 11:40:00 +01:00
install PyTorch
install transformers from source: pip install git+https://github.com/huggingface/transformers
install datasets: pip install datasets
2021-11-04 12:06:29 +01:00
install tokenizers: pip install tokenizers
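
A quick way to confirm that the four installs succeeded is to try importing each package. This is a minimal sketch, assuming `python3` is the interpreter that `pip` installed into; it only reports status and installs nothing. Note that PyTorch is imported under the name `torch`.

```shell
# Try importing each package from the install steps and report the result.
# Assumption: python3 is the interpreter that pip installed into.
report=""
for pkg in torch transformers datasets tokenizers; do
  if python3 -c "import ${pkg}" >/dev/null 2>&1; then
    status="ok"
  else
    status="MISSING"
  fi
  report="${report}${pkg}: ${status}
"
done
printf '%s' "$report"
```

Any package reported as MISSING needs its install step repeated before running the scripts.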
2021-11-04 11:40:28 +01:00
to run: go to a solution directory and run: bash run.sh
2021-11-04 12:24:59 +01:00
parameters to adjust:
per_device_train_batch_size
per_device_eval_batch_size
gradient_accumulation_steps
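
The two training parameters interact: the number of examples per optimizer update is the per-device train batch size times the gradient accumulation steps times the number of devices. The sketch below shows that arithmetic, assuming these names are the standard Hugging Face `Trainer` arguments. All values, and the `num_devices` variable, are hypothetical and not taken from run.sh.

```shell
# Hypothetical values, not taken from run.sh.
per_device_train_batch_size=16
gradient_accumulation_steps=2
num_devices=1   # assumption: a single GPU

# Examples consumed per optimizer update.
effective_batch_size=$((per_device_train_batch_size * gradient_accumulation_steps * num_devices))
echo "effective batch size: ${effective_batch_size}"

# If memory runs out: halve the per-device batch and double the accumulation steps.
smaller_batch=$((per_device_train_batch_size / 2))
more_steps=$((gradient_accumulation_steps * 2))
echo "same effective batch size: $((smaller_batch * more_steps * num_devices))"
```

per_device_eval_batch_size only affects memory use and speed during evaluation, so it can be changed without altering training.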
if you want to run the scripts on another dataset, remember to adjust max_seq_length