Adding Video Classifier wrapper #1805

ushareng · 2024-08-30T15:09:17Z

No description provided.

* Add Vgg16 backbone * update names * update tests * update test * add image classifier * incorporate review comments * Update test case * update backbone test * add image classifier * classifier cleanup * code reformat * add vgg16 image classifier * make vgg generic * update doc string * update docstring * add classifier test * update tests * update docstring * address review comments * code reformat * update the configs * address review comments * fix task saved model test * update init * code reformatted

* Add ResNetV1 and ResNetV2 * Address comments

* Add CSP DarkNet * Add CSP DarkNet * snake_case function names * change use_depthwise to block_type

…Backbone` (keras-team#1769) * Add FeaturePyramidBackbone and update ResNetBackbone * Simplify the implementation * Fix CI * Make ResNetBackbone compatible with timm and add FeaturePyramidBackbone * Add conversion implementation * Update docstrings * Address comments

* Add DenseNet * fix testcase * address comments * nit * fix lint errors * move description

* add vit det vit_det_backbone * update docstring * code reformat * fix tests * address review comments * bump year on all files * address review comments * rename backbone * fix tests * change back to ViT * address review comments * update image shape

* Add MixTransformer * fix testcase * test changes and comments * lint fix * update config list * modify testcase for 2 layers

* update input_image_shape -> image_shape * update docstring example * code reformat * update tests

add missing __init__ file to vit_det

This is a temporary way to test out the keras-hub branch. - Does a global rename of all symbols during package build. - Registers the "old" name on symbol export for saving compat. - Adds a github action to publish every commit to keras-hub as a new package. - Removes our descriptions on PyPI temporarily, until we want to message this more broadly.

* Add `CLIPTokenizer`, `T5XXLTokenizer`, `CLIPTextEncoder` and `T5XXLTextEncoder`. * Make CLIPTextEncoder as Backbone * Add `T5XXLPreprocessor` and remove `T5XXLTokenizer` Add `CLIPPreprocessor` * Use `tf = None` at the top * Replace manual implementation of `CLIPAttention` with `MultiHeadAttention`

* Bounding box utils * - Correct test cases * - Remove hard tensorflow dtype * - fix api gen * - Fix import for test cases - Use setup for converters test case * - fix api_gen issue * - Fix api gen * - Fix api gen error * - Correct test cases as per new api changes

* mobilenet_v3 added in keras-nlp * minor bug fixed in mobilenet_v3_backbone * formatting corrected * refactoring backbone * correct_pad_downsample method added * refactoring backbone * parameters updated * Testcase updated, expected output shape corrected * code formatted with black * testcase updated * refactoring and description added * comments updated * added mobilenet v1 and v2 * merge conflict resolved * version arg removed, and config options added * input_shape changed to image_shape in arg * config updated * input shape corrected * comments resolved * activation function format changed * minor bug fixed * minor bug fixed * added vision_backbone_test * channel_first bug resolved * channel_first cases working * comments resolved * formatting fixed * refactoring --------- Co-authored-by: ushareng <[email protected]>

* migrating efficientnet models to keras-hub * merging changes from other sources * autoformatting pass * initial consolidation of efficientnet_backbone * most updates and removing separate implementation * cleanup, autoformatting, keras generalization * removed layer examples outside of efficient net * many, mainly documentation changes, small test fixes

* Add ResNet_vd to ResNet backbone * Addressed requested parameter changes * Fixed tests and updated comments * Added new parameters to docstring

* Add `VAEImageDecoder` for StableDiffusionV3 * Use `keras.Model` for `VAEImageDecoder` and follows the coding style in `VAEAttention`

…TextEncoder` (keras-team#1802)

mattdangerw · 2024-09-03T16:53:51Z

keras_nlp/src/models/video_classifier.py

+
+
+@keras_nlp_export("keras_nlp.models.VideoClassifier")
+class VideoClassifier(Task):


Why do we need this? What models use this? What are the expected input formats?

ushareng · 2024-09-03

This wrapper will be used by video classifier models such as video_swin. The expected input format is (depth, height, width, channel).
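To make the stated input format concrete, here is a minimal sketch in plain Python of what a per-sample shape of (depth, height, width, channel) implies. The names `VideoShape`, `parse_video_shape`, and `classifier_output_shape` are hypothetical helpers written for illustration; they are not part of keras_nlp or of this PR, and the assumption that the task returns one score per class per video is mine, not something the thread confirms.

```python
from typing import NamedTuple, Sequence, Tuple


class VideoShape(NamedTuple):
    """Per-sample video shape, channels last (hypothetical helper type)."""

    depth: int  # number of frames in the clip
    height: int
    width: int
    channels: int


def parse_video_shape(shape: Sequence[int]) -> VideoShape:
    """Validate a per-sample shape of the form (depth, height, width, channel)."""
    if len(shape) != 4:
        raise ValueError(
            "expected (depth, height, width, channel), "
            f"got {len(shape)} dimensions"
        )
    if any(not isinstance(dim, int) or dim <= 0 for dim in shape):
        raise ValueError(f"all dimensions must be positive integers: {tuple(shape)}")
    return VideoShape(*shape)


def classifier_output_shape(
    batch_size: int, video_shape: Sequence[int], num_classes: int
) -> Tuple[int, int]:
    """Logits shape under the assumption of one score per class per video."""
    parse_video_shape(video_shape)  # reject malformed inputs early
    return (batch_size, num_classes)


# A 32-frame RGB clip at 224x224, batched 8 at a time, with 400 classes.
clip = parse_video_shape((32, 224, 224, 3))
logits_shape = classifier_output_shape(8, clip, num_classes=400)
```

With a batch dimension in front, a model input under this layout would have five axes: (batch, depth, height, width, channel). Whether the wrapper also supports a channels-first layout is not stated in the thread.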

divyashreepathihalli · 2024-09-03T21:36:54Z

@ushareng has the VideoSwin model been added yet? The video classifier should be added once we have the VideoSwin backbone in. That will help us verify the implementation.

divyashreepathihalli and others added 19 commits August 12, 2024 17:17

Add ResNetBackbone and ResNetImageClassifier (keras-team#1765)

73b7bad

Add CSP DarkNet backbone and classifier (keras-team#1774)

26afc7e

Add DenseNet (keras-team#1775)

9860756

Merge remote-tracking branch 'upstream/master' into keras-hub

ececd14

Add Mix transformer (keras-team#1780)

fc485d6

update input_image_shape -> image_shape (keras-team#1785)

2797851

Create __init__.py (keras-team#1788)

18f8880

Add the ResNet_vd backbone (keras-team#1766)

be8888d

Add VAEImageDecoder for StableDiffusionV3 (keras-team#1796)

536474a

Replace Backbone with keras.Model in CLIPTextEncoder and `T5XXL…

0fbd84b

…TextEncoder` (keras-team#1802)

video_classifier wrapper added

e97865d

ushareng marked this pull request as ready for review August 31, 2024 14:45

ushareng and others added 2 commits September 3, 2024 17:06

added video classifier in api

955f5f1

Merge branch 'keras-hub' into video_classifier

42d0ca2

mattdangerw reviewed Sep 3, 2024

mattdangerw force-pushed the keras-hub branch 3 times, most recently from 753047d to a5e5d8f on September 13, 2024 20:00
