Tools for working with the S800 corpus
Tools for working with the S800 corpus (http://species.jensenlab.org/).
.ann
standoffwget http://species.jensenlab.org/files/S800-1.0.tar.gz
mkdir original-data
tar xzf S800-1.0.tar.gz -C original-data
./convert_s800.sh original-data standoff
./split_s800.sh
mkdir conll
git clone https://github.com/spyysalo/standoff2conll.git
for i in train devel test; do
python3 standoff2conll/standoff2conll.py split-standoff/$i > conll/$i.tsv
done