Skip to content

alexbrandsen/jsonl2bio

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

jsonl2bio

Usage: python jsonl2bio.py [input file location] [output file name without extension] [user id to BIO tag file]

Script that converts JSONL output from Doccano (https://github.com/chakki-works/doccano) to the BIO format

This script assumes content has been tokenised before uploading to Doccano (i.e., it uses spaces to split sentences into words)

Will produce multiple files if annotations from multiple users are present

Needs a file containing translations of Docanno user IDs to required BIO tag, separated by spaces, 1 tag per line. Example can be found in the tagtable.txt file.

About

Script that converts JSONL output from Doccano to the BIO format

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages