Intent Parser Multi-word Entities #2944

jfantell · 2018-11-18T20:30:45Z

I am trying to train a model using the following script:

https:/explosion/spacy/blob/master/examples/training/train_intent_parser.py

I am providing the model this sample data:

("show me the best Marriot hotel in New York", {
'heads': [0, 0, 5, 5, 5, 0, 7, 5, 5],
'deps': ['ROOT', '-', '-', 'QUALITY', 'PLACE', 'PLACE', '-', 'LOCATION', 'LOCATION']
})

Currently, this produces the following output:

[('show', 'ROOT', 'show'), ('best', 'QUALITY', 'hotel'), ('Marriot', 'PLACE', 'hotel'), ('hotel', 'PLACE', 'show'), ('New', 'LOCATION', 'hotel'), ('York', 'LOCATION', 'hotel')]

Instead, I want it to produce this output:

[('show', 'ROOT', 'show'), ('best', 'QUALITY', 'hotel'), ('Marriot hotel', 'PLACE', 'hotel'), ('New York', 'LOCATION', 'hotel')]

I could not find any documentation on how multi-word entities such as "New York" and "Marriot hotel" can be extracted using the intent parser. Could someone please advise me on how this could be done? Thank you in advance for your time!

honnibal · 2018-11-26T12:40:49Z

You could try merging the named entities into one token each before training your intent parser. Alternatively, you might be better off leaving them as multiple tokens, and then dealing with the subtree afterwards. You can find docs about the merge method here: https://spacy.io/api/doc#merge
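As a rough illustration of the second option (leaving the tokens separate and grouping them afterwards), here is a minimal sketch in plain Python. It assumes the parser's predictions for the sample sentence are the same as the annotation in the question, and it joins runs of adjacent tokens that share a label. The function name `merge_labelled_runs` and the choice to report the head of the first token in each run are assumptions made for this example, not part of spaCy or the training script.

```python
from typing import List, Tuple


def merge_labelled_runs(
    words: List[str], heads: List[int], deps: List[str], ignore: str = "-"
) -> List[Tuple[str, str, str]]:
    """Join adjacent tokens that carry the same label into one triple.

    Returns (text, label, head text) triples. Tokens labelled `ignore`
    are skipped, which also keeps runs from merging across them.
    """
    merged = []
    i = 0
    while i < len(words):
        label = deps[i]
        if label == ignore:
            i += 1
            continue
        # Extend the run while the next token has the same label.
        j = i + 1
        while j < len(words) and deps[j] == label:
            j += 1
        text = " ".join(words[i:j])
        # Assumption: report the head of the first token in the run.
        head_text = words[heads[i]]
        merged.append((text, label, head_text))
        i = j
    return merged


# Stand-in for the parser output: the sample annotation from the question.
words = "show me the best Marriot hotel in New York".split()
heads = [0, 0, 5, 5, 5, 0, 7, 5, 5]
deps = ["ROOT", "-", "-", "QUALITY", "PLACE", "PLACE", "-", "LOCATION", "LOCATION"]

result = merge_labelled_runs(words, heads, deps)
print(result)
```

With a parsed spaCy `Doc`, the three lists could be built from `t.text`, `t.head.i` and `t.dep_` for each token `t`. One limitation of this sketch is that two different entities with the same label standing directly next to each other would be joined into one.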

lock · 2018-12-26T13:04:47Z

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.

honnibal closed this as completed Nov 26, 2018

honnibal added the usage General spaCy usage label Nov 26, 2018

lock bot locked as resolved and limited conversation to collaborators Dec 26, 2018
