Update some models for AMP training #104

epwalsh · 2020-07-31T18:29:20Z

Fixes models that use an RNN cell by wrapping the forward call to the cell within an autocast(False) context.

epwalsh · 2020-08-01T02:09:43Z

Current failure may be related to pytorch/pytorch#36428, but note that we're actually using LSMTCell directly here and still seeing a failure, despite this comment which says AMP should work fine when using just RNN cells.

matt-gardner

LGTM! Sorry, I missed the review request earlier somehow. Luckily, you've been on vacation, so no harm done :).

allennlp_models/generation/models/copynet_seq2seq.py

matt-gardner · 2020-08-07T16:38:18Z

tests/generation/models/copynet_test.py

+ overrides="{'trainer.use_amp':true,'trainer.cuda_device':0}",
+ )
+
+ # NOTE: as of writing this test, AMP does not work with RNNs. Hence we had


Again here, holdover from earlier.

Co-authored-by: Matt Gardner <[email protected]>

tests/generation/models/copynet_test.py

allennlp_models/generation/models/simple_seq2seq.py

allennlp_models/generation/modules/decoder_nets/lstm_cell.py

add amp test to copynet

77f8c3c

epwalsh added 6 commits August 1, 2020 10:10

wrap RNN cell with autocast(False)

948acf1

update CHANGELOG

708fbfe

also convert decoder state to right dtype

e7f0277

add test that will fail when AMP works with RNNs

7caa398

update comment

b8e52f1

update SimpleSeq2Seq

5d36a42

epwalsh mentioned this pull request Aug 3, 2020

add 'use_amp' option to transformer embedders allenai/allennlp#4526

Closed

epwalsh requested a review from matt-gardner August 3, 2020 16:43

epwalsh added 3 commits August 3, 2020 09:46

update CHANGELOG

4396896

Merge branch 'master' into update-some-models-for-amp

a172c31

Merge branch 'master' into update-some-models-for-amp

54fa6b0

matt-gardner approved these changes Aug 7, 2020

View reviewed changes

Update allennlp_models/generation/models/copynet_seq2seq.py

3a6f79b

Co-authored-by: Matt Gardner <[email protected]>

epwalsh commented Aug 10, 2020

View reviewed changes

tests/generation/models/copynet_test.py Outdated Show resolved Hide resolved

Update tests/generation/models/copynet_test.py

70ea391

epwalsh commented Aug 10, 2020

View reviewed changes

allennlp_models/generation/models/simple_seq2seq.py Outdated Show resolved Hide resolved

epwalsh commented Aug 10, 2020

View reviewed changes

allennlp_models/generation/modules/decoder_nets/lstm_cell.py Outdated Show resolved Hide resolved

Apply suggestions from code review

9a48dac

epwalsh merged commit e5f5c62 into master Aug 10, 2020

epwalsh deleted the update-some-models-for-amp branch August 10, 2020 16:46

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Update some models for AMP training #104

Update some models for AMP training #104

epwalsh commented Jul 31, 2020 •

edited

Loading

epwalsh commented Aug 1, 2020

matt-gardner left a comment

matt-gardner Aug 7, 2020

Update some models for AMP training #104

Update some models for AMP training #104

Conversation

epwalsh commented Jul 31, 2020 • edited Loading

epwalsh commented Aug 1, 2020

matt-gardner left a comment

Choose a reason for hiding this comment

matt-gardner Aug 7, 2020

Choose a reason for hiding this comment

epwalsh commented Jul 31, 2020 •

edited

Loading