
Bert #105

Merged
merged 2 commits into from
Jan 15, 2021

Conversation

chenkelmann
Contributor

Channel for questions

Description

An implementation for Bert pretraining, an example for it, a simple base class for listeners to save boilerplate, a new initializer and learning rate tracker to get the same results as the TF reference implementation. A sketch of a utility class that should fix #84 by executing training in parallel.
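The TF BERT reference implementation warms the learning rate up linearly over the first steps and then decays it linearly toward zero, which is presumably what the new learning rate tracker reproduces. A minimal sketch of that schedule (class and method names here are illustrative, not the PR's actual API):

```java
/**
 * Hypothetical sketch of a warm-up-then-linear-decay learning rate schedule,
 * as used by the TF BERT reference implementation. Not the PR's actual class.
 */
public final class WarmUpLinearDecay {
    private final float baseLr;
    private final int warmupSteps;
    private final int totalSteps;

    public WarmUpLinearDecay(float baseLr, int warmupSteps, int totalSteps) {
        this.baseLr = baseLr;
        this.warmupSteps = warmupSteps;
        this.totalSteps = totalSteps;
    }

    /** Returns the learning rate to use at the given global step. */
    public float getValue(int step) {
        if (step < warmupSteps) {
            // linear ramp from 0 up to baseLr over the warm-up phase
            return baseLr * step / warmupSteps;
        }
        // linear decay from baseLr down to 0 at totalSteps
        float progress = (float) (step - warmupSteps) / (totalSteps - warmupSteps);
        return baseLr * Math.max(0f, 1f - progress);
    }
}
```

Matching this schedule step-for-step matters when trying to reproduce the TF reference results, since BERT pretraining is sensitive to the warm-up length.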

Checklist

Essentials

Please feel free to remove inapplicable items for your PR.

  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage: no
  • Code is well-documented:
  • comments could do with updating; some had to be deleted again due to the never-ending checkstyle errors. The time budget for commenting was spent on quieting the style checkers.
  • To the best of my knowledge, examples and Jupyter notebooks are either not affected by this change or have been fixed to be compatible with it

Changes

  • Bert Pretraining Blocks
  • Utility class for easier memory management
  • New learning rate tracker
  • New initializer
  • Base class for training listeners to avoid boilerplate
  • Example for Bert pretraining
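On the new initializer: the TF BERT reference initializes weights from a truncated normal distribution (stddev 0.02), rejecting draws more than two standard deviations from the mean. A minimal rejection-sampling sketch of that idea (the class name is illustrative, not the PR's actual initializer):

```java
import java.util.Random;

/**
 * Hypothetical sketch of a truncated normal initializer in the style of the
 * TF BERT reference: samples N(0, stdDev^2) and resamples anything outside
 * [-2*stdDev, 2*stdDev]. Not the PR's actual class.
 */
public final class TruncatedNormal {
    private final Random random;
    private final float stdDev;

    public TruncatedNormal(long seed, float stdDev) {
        this.random = new Random(seed);
        this.stdDev = stdDev;
    }

    /** Draws one sample, rejecting values outside two standard deviations. */
    public float sample() {
        while (true) {
            float v = (float) random.nextGaussian() * stdDev;
            if (Math.abs(v) <= 2 * stdDev) {
                return v;
            }
        }
    }

    /** Fills an array of n samples, e.g. for a flattened weight tensor. */
    public float[] sample(int n) {
        float[] out = new float[n];
        for (int i = 0; i < n; i++) {
            out[i] = sample();
        }
        return out;
    }
}
```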

Comments

  • The example does not log anything - are the default listeners broken?
  • With this code, the parallel gpu problem and the batch norm performance problems can be further explored.
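The utility class mentioned above targets #84 (devices being driven one after another instead of concurrently). One plausible shape for such a utility, sketched with plain `java.util.concurrent` primitives and illustrative names rather than the PR's actual classes: submit one task per device to a thread pool and block until all of them finish.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutionException;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;

/**
 * Hypothetical sketch of a parallel-execution utility: instead of looping
 * over devices and running each forward/backward pass in turn, run one task
 * per device concurrently. Not the PR's actual class.
 */
public final class ParallelExecutor {

    /** Runs one task per device concurrently; returns results in task order. */
    public static <T> List<T> runOnDevices(List<Callable<T>> perDeviceTasks) {
        ExecutorService pool = Executors.newFixedThreadPool(perDeviceTasks.size());
        try {
            List<Future<T>> futures = new ArrayList<>();
            for (Callable<T> task : perDeviceTasks) {
                futures.add(pool.submit(task));
            }
            List<T> results = new ArrayList<>();
            for (Future<T> f : futures) {
                try {
                    results.add(f.get()); // blocks until this device's task finishes
                } catch (InterruptedException | ExecutionException e) {
                    throw new RuntimeException("device task failed", e);
                }
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }
}
```

Collecting results via `Future.get()` in submission order keeps the per-device outputs aligned with the device list, which matters when gradients are gathered and averaged afterwards.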

chenkelmann and others added 2 commits January 14, 2021 13:01
Added all classes necessary for a vanilla bert pretraining.

Added a training listener base class to save on boilerplate when implementing custom listeners.

Added Bert classes & simple example

Removed comments to shut up checkstyle.

Fixed PMD errors.

Fixed more PMD errors.

Made code layout less readable and less pretty with ./gradlew formatJava.
Change-Id: I5e46341fc48853f7ae0dfa4deaa1923fa5bb5c6a
@zachgk zachgk merged commit dac7c07 into deepjavalibrary:master Jan 15, 2021
Development

Successfully merging this pull request may close these issues.

Multiple GPUs are used sequentially, not parallel
3 participants