
Retrain transformer-based models in allennlp_models.pretrained #4457

Closed
5 tasks done
epwalsh opened this issue Jul 9, 2020 · 7 comments

epwalsh (Member) commented on Jul 9, 2020

Updates from the new transformers/tokenizers release have broken some of these models.

epwalsh added the Models label (Issues related to the allennlp-models repo) on Jul 9, 2020
epwalsh added this to the 1.1 milestone on Jul 9, 2020
dirkgr (Member) commented on Jul 14, 2020

I started doing BERT SRL too because I already had the environment for that spun up.

dirkgr (Member) commented on Jul 17, 2020

@AkshitaB, when you touch the RoBERTa SST model, try it without the cls_is_last_token setting. It's more correct to not have it.
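
For reference, a minimal sketch of what leaving the flag out looks like in code (this assumes AllenNLP's `cls_pooler` seq2vec encoder, and the embedding dimension is illustrative, not taken from the real SST config):

```python
# Hedged sketch: the pooler for a RoBERTa classifier without cls_is_last_token.
# ClsPooler is AllenNLP's "cls_pooler" seq2vec encoder; 768 is an illustrative
# hidden size, not the value from the actual SST config.
from allennlp.modules.seq2vec_encoders import ClsPooler

# RoBERTa, like BERT, puts its <s> (CLS-style) token at the start of the
# sequence, so the default cls_is_last_token=False pools the correct position.
pooler = ClsPooler(embedding_dim=768)
```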

matt-gardner (Contributor) commented

I see BERT SRL checked here, but there's an issue saying that performance actually doesn't match, with a solution that seems to fix it: #4392 (comment). Have you checked against the originally reported performance? If this is an issue, it seems like a simple config file fix would get performance back up.

I was just looking at the SRL and BERT SRL models, though, and I think we can probably just combine them at this point. I don't think we gain much from the srl_bert.py code, and it's hard-coded against an earlier version of transformers in a way that makes it hard to update for roberta, which would surely be better. But that should be a separate issue.

dirkgr (Member) commented on Aug 3, 2020

The srl_bert.py code is simpler though. It's a much more transformer-native implementation. If we can get that one to perform as well as srl.py, I might argue for keeping srl_bert.py and removing the other one.
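
To make "transformer-native" concrete, here is a rough sketch of that style (illustrative names only, not the actual srl_bert.py code): the model constructs and calls a transformers `BertModel` directly, which keeps the code short but ties it to BERT.

```python
# Hedged sketch of the "transformer-native" style: simple, but BERT-specific.
# This is illustrative, not the actual srl_bert.py implementation.
from typing import Dict

import torch
from transformers import BertModel


class BertOnlyTagger(torch.nn.Module):
    def __init__(self, model_name: str, num_labels: int) -> None:
        super().__init__()
        self.bert = BertModel.from_pretrained(model_name)
        self.tag_projection = torch.nn.Linear(self.bert.config.hidden_size, num_labels)

    def forward(
        self, input_ids: torch.Tensor, attention_mask: torch.Tensor
    ) -> Dict[str, torch.Tensor]:
        # The first element of BertModel's output is the per-token hidden states.
        sequence_output = self.bert(input_ids=input_ids, attention_mask=attention_mask)[0]
        return {"tag_logits": self.tag_projection(sequence_output)}
```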

matt-gardner (Contributor) commented

"transformer-native" means "I have to use transformers, and I can't even try using anything else." It kind of goes against the whole point of the abstractions that we have.

dirkgr (Member) commented on Aug 3, 2020

While that's true, it's hard to argue in print for a solution that's more complicated and different from the standard if it doesn't also improve results.

In principle the srl_bert.py approach should work with any token embedder, but it might not do as well as a custom architecture.
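
For contrast, a hedged sketch of the embedder-agnostic style (illustrative names, not the real srl.py model): the model depends only on the `TextFieldEmbedder` abstraction, so BERT, RoBERTa, or a non-transformer embedder can be swapped in from the config without touching the model code.

```python
# Hedged sketch: a tagger written against AllenNLP's TextFieldEmbedder
# abstraction, so the choice of embedder lives entirely in the config.
# Illustrative only, not the actual srl.py model.
from typing import Dict

import torch
from allennlp.data import TextFieldTensors, Vocabulary
from allennlp.models import Model
from allennlp.modules import TextFieldEmbedder


class EmbedderAgnosticTagger(Model):
    def __init__(self, vocab: Vocabulary, embedder: TextFieldEmbedder) -> None:
        super().__init__(vocab)
        self.embedder = embedder
        self.tag_projection = torch.nn.Linear(
            embedder.get_output_dim(), vocab.get_vocab_size("labels")
        )

    def forward(self, tokens: TextFieldTensors) -> Dict[str, torch.Tensor]:
        # (batch, seq_len, dim) regardless of which embedder the config chose.
        embeddings = self.embedder(tokens)
        return {"tag_logits": self.tag_projection(embeddings)}
```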

dirkgr (Member) commented on Aug 24, 2020

We're tracking the remaining issue in #4521.

dirkgr closed this as completed on Aug 24, 2020