💫 Merge conllu converters into one function #3373
Labels
enhancement
Feature requests and improvements
feat / cli
Feature: Command-line interface
help wanted (easy)
Contributions welcome! (also suited for spaCy beginners)
help wanted
Contributions welcome!
I was just updating the
spacy convert
command and docs and noticed that the converters are currently a bit messy. We might be able to eliminate some code duplication and make them a bit nicer overall. It'd be nice if we could merge theconll
converters (or at leastconll
/conllu
andconllubio
) into a single script.(That said, we do have to accept that they'll always be kinda hacky, simply because the formats we're dealing with aren't always 100% consistent. There are several variations of the
.conll
format alone that we need to handle).Resources
spacy.cli.converters
: https:/explosion/spaCy/tree/develop/spacy/cli/convertersconllubio
converterFYI: Future plans
Doc
objects instead of JSON objects. This means that the newDoc.to_json
object will be the single source of truth for the JSON format and we don't end up with arbitrary data transformation logic all over the place.The text was updated successfully, but these errors were encountered: