Output format: training-output to build supervised models
#801
base: master
The crossScalaVersion does need to be changed for publication.
export/src/main/scala/org/clulab/reach/export/TrainingDataExporter.scala
@enoriega, this is being built for both Scala 2.11 and 2.12. The earlier version does not like trailing/dangling commas like the ones in TrainingDataExporter, so it doesn't compile. One can use
That TrainingDataExporter still needs a comma removed at line 76 in order to work on Scala 2.11.
Summary
Added a new output format suitable for training classifiers with a Python pipeline. It "flattens" activations and regulations and creates a JSON array with the tokens, spans, label, and polarity for each event.
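On the consuming side, a Python pipeline might read such a file roughly as sketched below. This is only an illustration: the key names (`tokens`, `spans`, `label`, `polarity`), the span encoding as half-open token offsets, and the sample values are assumptions made for the sketch, not the exporter's confirmed schema.

```python
import json

# Hypothetical sample; key names and values are assumptions for illustration only.
SAMPLE = """
[
  {
    "tokens": ["A", "activates", "B"],
    "spans": [[0, 1], [2, 3]],
    "label": "activation",
    "polarity": "positive"
  }
]
"""


def load_events(text):
    """Parse the exported text and check that it is a JSON array."""
    records = json.loads(text)
    if not isinstance(records, list):
        raise ValueError("expected a JSON array of events")
    return records


def to_examples(records):
    """Turn each event record into a (tokens, spans, label, polarity) tuple."""
    examples = []
    for rec in records:
        tokens = rec["tokens"]
        # Assumed encoding: each span is a [start, end) pair of token offsets.
        spans = [tuple(span) for span in rec["spans"]]
        for start, end in spans:
            if not 0 <= start < end <= len(tokens):
                raise ValueError(f"span {(start, end)} is outside the token list")
        examples.append((tokens, spans, rec["label"], rec["polarity"]))
    return examples


examples = to_examples(load_events(SAMPLE))
```

Validating the spans against the token list at load time keeps malformed records from reaching the classifier; adjust the key names and span convention to match the actual exported file.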
Example