paper-cutter/{{cookiecutter.paper_repo_name}}/helpers/get-sentences.sh
Filip Gralinski 67c788fe69 Init from an internal repo.
Commit d5b6f8e831fc5c933af5ceb1267f51ef6af6c438
2020-11-24 08:33:07 +01:00

#!/bin/bash
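# Convert the PDF given as $1 to plain text, drop the reference list,
# join all lines into one space-separated stream, then split it into sentences with syntok.
bash helpers/pdf-to-plain-text.sh "$1" | perl helpers/strip-references.pl | perl -pe 'chomp; $_ .= " "' | python3 -m syntok.segmenter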