Add `ResNetBackbone` and `ResNetImageClassifier` #1765

james77777778 · 2024-08-09T07:31:52Z

This PR largely follows #1737 but introduces some improvements:

Supports both "channels_last" and "channels_first"
Add support for mixed precision
Improve the docstrings

The preact parameter in ResNetBackbone determines whether to instantiate ResNet or ResNetV2 (similar to keras.applications.ResNet*)
https:/keras-team/keras/blob/b45be337c6156d90f220beba9f68eeb2e52e2b0d/keras/src/applications/resnet.py#L50

I have a small question regarding model implementation:
When creating a Dense layer in ImageClassifier, the dtype_policy is not configurable. I used backbone.dtype_policy as a workaround but it might be fragile.

The implementation has been verified by using the model weights from KerasCV.
Please refer to this colab: https://colab.research.google.com/drive/1HGktF-4TkTdTjlMVv7qr3D_ysspJXDmo?usp=sharing

@mattdangerw @divyashreepathihalli @SamanehSaadat

divyashreepathihalli

Thanks for the PR! Looks great! I have 2 minor NIT comments. Otherwise LGTM, subject to Matt's approval.

keras_nlp/src/models/resnet/resnet_backbone.py

keras_nlp/src/models/resnet/resnet_backbone_test.py

mattdangerw · 2024-08-09T17:55:13Z

keras_nlp/src/models/resnet/resnet_backbone_test.py

+ ("v2_block_channels_last", True, "block", "channels_last"),
+ )
+ def test_backbone_basics(self, preact, block_type, data_format):
+ if (


Actually maybe we should add this data format test to our test utils somehow. self.run_backbone_test(..., test_channels=True). Or even self.run_vision_backbone_test that wraps self.run_backbone_test with more options.

No strong opinion on the factoring, but some way to easily add this without much code for all coming vision models.

Added run_vision_backbone_test with the run_data_format_check option.
This should simplify the code for future vision models.

mattdangerw · 2024-08-09T17:56:55Z

keras_nlp/src/models/resnet/resnet_backbone.py

+ stackwise_num_filters,
+ stackwise_num_blocks,
+ stackwise_num_strides,
+ block_type,


Should we have a default for the next three args? In particular include_rescaling which I think has a default in pali gemma.

Updated.
The new signature:

def __init__( self, stackwise_num_filters, stackwise_num_blocks, stackwise_num_strides, block_type, use_pre_activation=False, include_rescaling=True, input_image_shape=(None, None, 3), pooling="avg", data_format=None, dtype=None, **kwargs, ):

I think we can change input_image_shape from a fixed (224, 224, 3) to (None, None, 3). This should be more user-friendly.

mattdangerw · 2024-08-09T18:47:57Z

keras_nlp/src/models/resnet/resnet_backbone.py

+
+
+@keras_nlp_export("keras_nlp.models.ResNetBackbone")
+class ResNetBackbone(Backbone):


Do we think this makes more sense as one architecture or two? We can do both here. Either ResNetBackbone and ResNet2Backbone or the one like we have here.

Most of the args makes sense here, but block_type: "block" vs block_type: "basic_block" is a little hard to understand. But also night to have less duplication.

Basically it's on us to define what constitutes a new architecture.

how about providing different architectures as main backbone subclasses with built in configs?

Talked offline. Let's just stick with the one class for now, nice and simple.

If we can think of something still short that's a little more descriptive than block_type: "block" and block_type: "basic_block" let's go for it, but I think this is fine as is.

Updated to block_type: "bottleneck_block" and block_type: "basic_block".
This should be consistent with the original paper: https://arxiv.org/abs/1512.03385

mattdangerw

Left some minor comments, but all good on my end once addressed. @divyashreepathihalli I'll leave the final merge to you!

james77777778 · 2024-08-10T09:45:05Z

@divyashreepathihalli @mattdangerw
Thanks for reviewing. All comments should be resolved.

But my question still remains:

When creating a Dense layer in ImageClassifier, the dtype_policy is not configurable.

Any ideas about that?

james77777778 · 2024-08-12T01:25:01Z

Another question:
Should we support pyramid feature outputs in vision backbones?
This is crucial for any downstream algorithms (object detection, segmentation, ...)

https:/keras-team/keras-cv/blob/5cfe7c3d8f2d758b91bbfcedf7f0a29931236c68/keras_cv/src/models/backbones/backbone.py#L153-L176

divyashreepathihalli · 2024-08-12T16:15:03Z

Another question: Should we support pyramid feature outputs in vision backbones? This is crucial for any downstream algorithms (object detection, segmentation, ...)

https:/keras-team/keras-cv/blob/5cfe7c3d8f2d758b91bbfcedf7f0a29931236c68/keras_cv/src/models/backbones/backbone.py#L153-L176

This sounds like a good idea! Lets add it in new PR

divyashreepathihalli

LGTM! Thanks!

* Add ResNetV1 and ResNetV2 * Address comments

* Add VGG16 backbone (#1737) * Agg Vgg16 backbone * update names * update tests * update test * add image classifier * incorporate review comments * Update test case * update backbone test * add image classifier * classifier cleanup * code reformat * add vgg16 image classifier * make vgg generic * update doc string * update docstring * add classifier test * update tests * update docstring * address review comments * code reformat * update the configs * address review comments * fix task saved model test * update init * code reformatted * Add `ResNetBackbone` and `ResNetImageClassifier` (#1765) * Add ResNetV1 and ResNetV2 * Address comments * Add CSP DarkNet backbone and classifier (#1774) * Add CSP DarkNet * Add CSP DarkNet * snake_case function names * change use_depthwise to block_type * Add `FeaturePyramidBackbone` and port weights from `timm` for `ResNetBackbone` (#1769) * Add FeaturePyramidBackbone and update ResNetBackbone * Simplify the implementation * Fix CI * Make ResNetBackbone compatible with timm and add FeaturePyramidBackbone * Add conversion implementation * Update docstrings * Address comments * Add DenseNet (#1775) * Add DenseNet * fix testcase * address comments * nit * fix lint errors * move description * Add ViTDetBackbone (#1776) * add vit det vit_det_backbone * update docstring * code reformat * fix tests * address review comments * bump year on all files * address review comments * rename backbone * fix tests * change back to ViT * address review comments * update image shape * Add Mix transformer (#1780) * Add MixTransformer * fix testcase * test changes and comments * lint fix * update config list * modify testcase for 2 layers * update input_image_shape -> image_shape (#1785) * update input_image_shape -> image_shape * update docstring example * code reformat * update tests * Create __init__.py (#1788) add missing __init__ file to vit_det * Hack package build script to rename to keras-hub (#1793) This is a temporary way to test out the keras-hub branch. - Does a global rename of all symbols during package build. - Registers the "old" name on symbol export for saving compat. - Adds a github action to publish every commit to keras-hub as a new package. - Removes our descriptions on PyPI temporarily, until we want to message this more broadly. * Add CLIP and T5XXL for StableDiffusionV3 (#1790) * Add `CLIPTokenizer`, `T5XXLTokenizer`, `CLIPTextEncoder` and `T5XXLTextEncoder`. * Make CLIPTextEncoder as Backbone * Add `T5XXLPreprocessor` and remove `T5XXLTokenizer` Add `CLIPPreprocessor` * Use `tf = None` at the top * Replace manual implementation of `CLIPAttention` with `MultiHeadAttention` * Add Bounding Box Utils (#1791) * Bounding box utils * - Correct test cases * - Remove hard tensorflow dtype * - fix api gen * - Fix import for test cases - Use setup for converters test case * - fix api_gen issue * - FIx api gen * - Fix api gen error * - Correct test cases as per new api changes * mobilenet_v3 added in keras-nlp (#1782) * mobilenet_v3 added in keras-nlp * minor bug fixed in mobilenet_v3_backbone * formatting corrected * refactoring backbone * correct_pad_downsample method added * refactoring backbone * parameters updated * Testcaseupdated, expected output shape corrected * code formatted with black * testcase updated * refactoring and description added * comments updated * added mobilenet v1 and v2 * merge conflict resolved * version arg removed, and config options added * input_shape changed to image_shape in arg * config updated * input shape corrected * comments resolved * activation function format changed * minor bug fixed * minor bug fixed * added vision_backbone_test * channel_first bug resolved * channel_first cases working * comments resolved * formatting fixed * refactoring --------- Co-authored-by: ushareng <[email protected]> * Pkgoogle/efficient net migration (#1778) * migrating efficientnet models to keras-hub * merging changes from other sources * autoformatting pass * initial consolidation of efficientnet_backbone * most updates and removing separate implementation * cleanup, autoformatting, keras generalization * removed layer examples outside of effiicient net * many, mainly documentation changes, small test fixes * Add the ResNet_vd backbone (#1766) * Add ResNet_vd to ResNet backbone * Addressed requested parameter changes * Fixed tests and updated comments * Added new parameters to docstring * Add `VAEImageDecoder` for StableDiffusionV3 (#1796) * Add `VAEImageDecoder` for StableDiffusionV3 * Use `keras.Model` for `VAEImageDecoder` and follows the coding style in `VAEAttention` * Replace `Backbone` with `keras.Model` in `CLIPTextEncoder` and `T5XXLTextEncoder` (#1802) * Add pyramid output for densenet, cspDarknet (#1801) * add pyramid outputs * fix testcase * format fix * make common testcase for pyramid outputs * change default shape * simplify testcase * test case change and add channel axis * Add `MMDiT` for StableDiffusionV3 (#1806) * Add `MMDiT` * Update * Update * Update implementation * Add remaining bbox utils (#1804) * - Add formats, iou, utils for bounding box * - Add `AnchorGenerator`, `BoxMatcher` and `NonMaxSupression` layers * - Remove scope_name not required. * use default keras name scope * - Correct format error * - Remove layers as of now and keep them at model level till keras core supports them * - Correct api_gen * Fix timm conversion for rersnet (#1814) * Add `StableDiffusion3` * Fix `_normalize_inputs` * Separate CLIP encoders from SD3 backbone. * Simplify `text_to_image` function. * Address comments * Minor update and add docstrings. * Add VGG16 backbone (#1737) * Agg Vgg16 backbone * update names * update tests * update test * add image classifier * incorporate review comments * Update test case * update backbone test * add image classifier * classifier cleanup * code reformat * add vgg16 image classifier * make vgg generic * update doc string * update docstring * add classifier test * update tests * update docstring * address review comments * code reformat * update the configs * address review comments * fix task saved model test * update init * code reformatted * Add `ResNetBackbone` and `ResNetImageClassifier` (#1765) * Add ResNetV1 and ResNetV2 * Address comments * Add CSP DarkNet backbone and classifier (#1774) * Add CSP DarkNet * Add CSP DarkNet * snake_case function names * change use_depthwise to block_type * Add `FeaturePyramidBackbone` and port weights from `timm` for `ResNetBackbone` (#1769) * Add FeaturePyramidBackbone and update ResNetBackbone * Simplify the implementation * Fix CI * Make ResNetBackbone compatible with timm and add FeaturePyramidBackbone * Add conversion implementation * Update docstrings * Address comments * Add DenseNet (#1775) * Add DenseNet * fix testcase * address comments * nit * fix lint errors * move description * Add ViTDetBackbone (#1776) * add vit det vit_det_backbone * update docstring * code reformat * fix tests * address review comments * bump year on all files * address review comments * rename backbone * fix tests * change back to ViT * address review comments * update image shape * Add Mix transformer (#1780) * Add MixTransformer * fix testcase * test changes and comments * lint fix * update config list * modify testcase for 2 layers * update input_image_shape -> image_shape (#1785) * update input_image_shape -> image_shape * update docstring example * code reformat * update tests * Create __init__.py (#1788) add missing __init__ file to vit_det * Hack package build script to rename to keras-hub (#1793) This is a temporary way to test out the keras-hub branch. - Does a global rename of all symbols during package build. - Registers the "old" name on symbol export for saving compat. - Adds a github action to publish every commit to keras-hub as a new package. - Removes our descriptions on PyPI temporarily, until we want to message this more broadly. * Add CLIP and T5XXL for StableDiffusionV3 (#1790) * Add `CLIPTokenizer`, `T5XXLTokenizer`, `CLIPTextEncoder` and `T5XXLTextEncoder`. * Make CLIPTextEncoder as Backbone * Add `T5XXLPreprocessor` and remove `T5XXLTokenizer` Add `CLIPPreprocessor` * Use `tf = None` at the top * Replace manual implementation of `CLIPAttention` with `MultiHeadAttention` * Add Bounding Box Utils (#1791) * Bounding box utils * - Correct test cases * - Remove hard tensorflow dtype * - fix api gen * - Fix import for test cases - Use setup for converters test case * - fix api_gen issue * - FIx api gen * - Fix api gen error * - Correct test cases as per new api changes * mobilenet_v3 added in keras-nlp (#1782) * mobilenet_v3 added in keras-nlp * minor bug fixed in mobilenet_v3_backbone * formatting corrected * refactoring backbone * correct_pad_downsample method added * refactoring backbone * parameters updated * Testcaseupdated, expected output shape corrected * code formatted with black * testcase updated * refactoring and description added * comments updated * added mobilenet v1 and v2 * merge conflict resolved * version arg removed, and config options added * input_shape changed to image_shape in arg * config updated * input shape corrected * comments resolved * activation function format changed * minor bug fixed * minor bug fixed * added vision_backbone_test * channel_first bug resolved * channel_first cases working * comments resolved * formatting fixed * refactoring --------- Co-authored-by: ushareng <[email protected]> * Pkgoogle/efficient net migration (#1778) * migrating efficientnet models to keras-hub * merging changes from other sources * autoformatting pass * initial consolidation of efficientnet_backbone * most updates and removing separate implementation * cleanup, autoformatting, keras generalization * removed layer examples outside of effiicient net * many, mainly documentation changes, small test fixes * Add the ResNet_vd backbone (#1766) * Add ResNet_vd to ResNet backbone * Addressed requested parameter changes * Fixed tests and updated comments * Added new parameters to docstring * Add `VAEImageDecoder` for StableDiffusionV3 (#1796) * Add `VAEImageDecoder` for StableDiffusionV3 * Use `keras.Model` for `VAEImageDecoder` and follows the coding style in `VAEAttention` * Replace `Backbone` with `keras.Model` in `CLIPTextEncoder` and `T5XXLTextEncoder` (#1802) * Add pyramid output for densenet, cspDarknet (#1801) * add pyramid outputs * fix testcase * format fix * make common testcase for pyramid outputs * change default shape * simplify testcase * test case change and add channel axis * Add `MMDiT` for StableDiffusionV3 (#1806) * Add `MMDiT` * Update * Update * Update implementation * Add remaining bbox utils (#1804) * - Add formats, iou, utils for bounding box * - Add `AnchorGenerator`, `BoxMatcher` and `NonMaxSupression` layers * - Remove scope_name not required. * use default keras name scope * - Correct format error * - Remove layers as of now and keep them at model level till keras core supports them * - Correct api_gen * Fix timm conversion for rersnet (#1814) * Fix * Update * Rename to diffuser and decoder * Define functional model * Merge from upstream/master * Delete old SD3 * Fix copyright * Rename to keras_hub * Address comments * Update * Fix CI * Fix bugs occurred in keras3.1 --------- Co-authored-by: Divyashree Sreepathihalli <[email protected]> Co-authored-by: Sachin Prasad <[email protected]> Co-authored-by: Matt Watson <[email protected]> Co-authored-by: Siva Sravana Kumar Neeli <[email protected]> Co-authored-by: Usha Rengaraju <[email protected]> Co-authored-by: ushareng <[email protected]> Co-authored-by: pkgoogle <[email protected]> Co-authored-by: gowthamkpr <[email protected]>

Add ResNetV1 and ResNetV2

39beae5

james77777778 requested review from mattdangerw, SamanehSaadat and divyashreepathihalli August 9, 2024 07:32

divyashreepathihalli mentioned this pull request Aug 9, 2024

add EffecientNetV1 and EffecientNetV2 #1746

Closed

divyashreepathihalli approved these changes Aug 9, 2024

View reviewed changes

keras_nlp/src/models/resnet/resnet_backbone.py Outdated Show resolved Hide resolved

keras_nlp/src/models/resnet/resnet_backbone.py Outdated Show resolved Hide resolved

keras_nlp/src/models/resnet/resnet_backbone_test.py Outdated Show resolved Hide resolved

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Aug 9, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Aug 9, 2024

divyashreepathihalli mentioned this pull request Aug 9, 2024

Add the ResNet_vd backbone #1766

Merged

mattdangerw reviewed Aug 9, 2024

View reviewed changes

mattdangerw approved these changes Aug 9, 2024

View reviewed changes

Address comments

e939d07

james77777778 force-pushed the add-resnet branch from ff16482 to e939d07 Compare August 10, 2024 09:50

james77777778 requested review from divyashreepathihalli and mattdangerw August 10, 2024 09:51

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Aug 12, 2024

kokoro-team removed kokoro:force-run Runs Tests on GPU labels Aug 12, 2024

divyashreepathihalli approved these changes Aug 12, 2024

View reviewed changes

divyashreepathihalli merged commit 7a17731 into keras-team:keras-hub Aug 12, 2024
10 checks passed

divyashreepathihalli pushed a commit that referenced this pull request Aug 12, 2024

Add ResNetBackbone and ResNetImageClassifier (#1765)

73b7bad

* Add ResNetV1 and ResNetV2 * Address comments

james77777778 deleted the add-resnet branch August 13, 2024 00:53

mattdangerw mentioned this pull request Aug 14, 2024

Add CSP DarkNet backbone and classifier #1774

Merged

mattdangerw pushed a commit to mattdangerw/keras-hub that referenced this pull request Sep 10, 2024

Add ResNetBackbone and ResNetImageClassifier (keras-team#1765)

fdf135c

* Add ResNetV1 and ResNetV2 * Address comments

mattdangerw pushed a commit that referenced this pull request Sep 11, 2024

Add ResNetBackbone and ResNetImageClassifier (#1765)

33854c6

* Add ResNetV1 and ResNetV2 * Address comments

mattdangerw pushed a commit that referenced this pull request Sep 13, 2024

Add ResNetBackbone and ResNetImageClassifier (#1765)

f305f64

* Add ResNetV1 and ResNetV2 * Address comments

sineeli pushed a commit to sineeli/keras-hub that referenced this pull request Sep 16, 2024

Add ResNetBackbone and ResNetImageClassifier (keras-team#1765)

2857d6f

* Add ResNetV1 and ResNetV2 * Address comments

mattdangerw pushed a commit that referenced this pull request Sep 17, 2024

Add ResNetBackbone and ResNetImageClassifier (#1765)

d180498

* Add ResNetV1 and ResNetV2 * Address comments

sineeli pushed a commit to sineeli/keras-hub that referenced this pull request Sep 19, 2024

Add ResNetBackbone and ResNetImageClassifier (keras-team#1765)

1c9e6fb

* Add ResNetV1 and ResNetV2 * Address comments

sineeli pushed a commit to sineeli/keras-hub that referenced this pull request Sep 19, 2024

Add ResNetBackbone and ResNetImageClassifier (keras-team#1765)

cc970ea

* Add ResNetV1 and ResNetV2 * Address comments

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add `ResNetBackbone` and `ResNetImageClassifier` #1765

Add `ResNetBackbone` and `ResNetImageClassifier` #1765

james77777778 commented Aug 9, 2024 •

edited

Loading

divyashreepathihalli left a comment

mattdangerw Aug 9, 2024

james77777778 Aug 10, 2024

mattdangerw Aug 9, 2024

james77777778 Aug 10, 2024

mattdangerw Aug 9, 2024

divyashreepathihalli Aug 9, 2024

mattdangerw Aug 9, 2024

james77777778 Aug 10, 2024

mattdangerw left a comment •

edited

Loading

james77777778 commented Aug 10, 2024

james77777778 commented Aug 12, 2024

divyashreepathihalli commented Aug 12, 2024 •

edited

Loading

divyashreepathihalli left a comment



		@keras_nlp_export("keras_nlp.models.ResNetBackbone")
		class ResNetBackbone(Backbone):

Add ResNetBackbone and ResNetImageClassifier #1765

Add ResNetBackbone and ResNetImageClassifier #1765

Conversation

james77777778 commented Aug 9, 2024 • edited Loading

divyashreepathihalli left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

mattdangerw left a comment • edited Loading

Choose a reason for hiding this comment

james77777778 commented Aug 10, 2024

james77777778 commented Aug 12, 2024

divyashreepathihalli commented Aug 12, 2024 • edited Loading

divyashreepathihalli left a comment

Choose a reason for hiding this comment

Add `ResNetBackbone` and `ResNetImageClassifier` #1765

Add `ResNetBackbone` and `ResNetImageClassifier` #1765

james77777778 commented Aug 9, 2024 •

edited

Loading

mattdangerw left a comment •

edited

Loading

divyashreepathihalli commented Aug 12, 2024 •

edited

Loading