
[Flux] Port Flux Core Model #1864

Merged · 74 commits into keras-team:master · Nov 13, 2024

Conversation

DavidLandup0 (Collaborator) commented Sep 23, 2024

This PR ports the core model into a Keras model and includes a weight conversion script.
The VAE and the rest of the pipeline would make sense in a separate PR.

Each layer is numerically compared against the original PyTorch implementation here: https://colab.research.google.com/drive/1Jr5pa9BGAxP6lZPimlpb22rD5DMijN3H#scrollTo=Bi_WbOjk7C4k
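
(For context, a per-layer check of this kind boils down to running the same input through both frameworks and comparing the results with np.allclose. The sketch below uses a tanh-approximated GELU purely as a stand-in to illustrate the methodology; it is not the PR's actual test code.)

import numpy as np
import torch
import keras

# Stand-in check (hypothetical, not the PR's test code): run the same input
# through the PyTorch op and the Keras op, then compare element-wise.
x = np.random.rand(2, 16, 64).astype("float32")

out_pt = torch.nn.functional.gelu(torch.from_numpy(x), approximate="tanh").numpy()
out_keras = keras.ops.convert_to_numpy(keras.activations.gelu(x, approximate=True))

print(np.allclose(out_keras, out_pt, atol=1e-5))  # expected: True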

Modules included:

  • Maths module
    • Timestep embedding
    • RoPE
    • Attention
    • Scaled dot product attention re-implementation in Keras (to match the PyTorch one)
  • Layers module
    • MLPEmbedder
    • RMSNorm
    • QKNorm
    • SelfAttention
    • Modulation
    • DoubleStreamBlock
    • SingleStreamBlock
    • LastLayer
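
To give a flavor of what these layers look like in Keras 3, here is a minimal RMSNorm sketch. It only illustrates the technique; it is not the code in flux_layers.py (the class name and defaults are assumptions):

import keras
from keras import ops

class RMSNorm(keras.layers.Layer):
    # Minimal sketch (not the PR's implementation): normalize the last axis
    # by its root mean square, then apply a learned per-channel scale.
    def __init__(self, epsilon=1e-6, **kwargs):
        super().__init__(**kwargs)
        self.epsilon = epsilon

    def build(self, input_shape):
        self.scale = self.add_weight(
            name="scale", shape=(input_shape[-1],), initializer="ones"
        )

    def call(self, x):
        rms = ops.sqrt(ops.mean(ops.square(x), axis=-1, keepdims=True) + self.epsilon)
        return (x / rms) * self.scale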

Output Comparison

The core model's outputs are latents. We plot the PCA of the output from the original implementation and the Keras re-implementation on the same input:

[image: PCA of the latents from the original implementation and the Keras re-implementation]
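
(The exact plotting code lives in the linked Colab; the sketch below is only one way such a PCA comparison could be produced, assuming output_keras and output_pt are the latent tensors compared right after.)

import numpy as np
import matplotlib.pyplot as plt

def pca_2d(latents):
    # Flatten to (samples, channels), center, and project onto the top two
    # principal components via SVD.
    flat = np.asarray(latents).reshape(-1, latents.shape[-1]).astype("float64")
    flat -= flat.mean(axis=0, keepdims=True)
    _, _, vt = np.linalg.svd(flat, full_matrices=False)
    return flat @ vt[:2].T

proj_pt = pca_2d(output_pt.detach().numpy())
proj_keras = pca_2d(output_keras.numpy())

fig, axes = plt.subplots(1, 2, figsize=(8, 4), sharex=True, sharey=True)
axes[0].scatter(proj_pt[:, 0], proj_pt[:, 1], s=2)
axes[0].set_title("Original (PyTorch) latents")
axes[1].scatter(proj_keras[:, 0], proj_keras[:, 1], s=2)
axes[1].set_title("Keras port latents")
plt.show()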

Numerically, the outputs are equivalent within an absolute tolerance of 1e-3:

>>> np.allclose(output_keras.numpy(), output_pt.detach().numpy(), atol=1e-3)
True

sachinprasadhs marked this pull request as draft September 23, 2024 16:51
DavidLandup0 changed the title from "[Flux] Port Flux Model and Pipeline" to "[Flux] Port Flux Core Model" Oct 2, 2024
DavidLandup0 marked this pull request as ready for review October 3, 2024 12:10
DavidLandup0 (Collaborator, Author)

@divyashreepathihalli turned it into a functional subclassing module - had to wrestle a bit with shapes/autograph, but it should be ready for another review.
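
(For anyone unfamiliar with the pattern: "functional subclassing" means a keras.Model subclass whose __init__ builds a functional graph and then passes inputs/outputs to super().__init__. The toy sketch below illustrates the pattern only; it is not this PR's Flux backbone.)

import keras

class ToyBackbone(keras.Model):
    # Hypothetical illustration of the functional-subclassing pattern.
    def __init__(self, hidden_dim=64, **kwargs):
        # === Layers ===
        dense_1 = keras.layers.Dense(hidden_dim, activation="silu", name="dense_1")
        dense_2 = keras.layers.Dense(hidden_dim, name="dense_2")

        # === Functional graph ===
        image_input = keras.Input(shape=(None, hidden_dim), name="image")
        x = dense_1(image_input)
        outputs = dense_2(x)

        # Wire the graph into the subclassed model.
        super().__init__(inputs=image_input, outputs=outputs, **kwargs)
        self.hidden_dim = hidden_dim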

Here's the notebook showing numerical equivalence to atol=1e-5 on all modules, as well as the final output of the core model: https://colab.research.google.com/drive/1Jr5pa9BGAxP6lZPimlpb22rD5DMijN3H#scrollTo=Bi_WbOjk7C4k

Once a preprocessing flow is added, we can open a PR for integrating with T5 and CLIP.


class FluxBackboneTest(TestCase):
    def setUp(self):
        vae = VAEBackbone(
DavidLandup0 (Collaborator, Author)

Will be part of the generation pipeline so these are added preemptively and unused for now

DavidLandup0 mentioned this pull request Oct 23, 2024
DavidLandup0 (Collaborator, Author)

@divyashreepathihalli could we do another review here?

divyashreepathihalli added the kokoro:force-run (Runs Tests on GPU) label Oct 28, 2024
kokoro-team removed the kokoro:force-run (Runs Tests on GPU) label Oct 28, 2024
divyashreepathihalli (Collaborator) left a comment

Thanks David! Left a few comments.
Do you have a demo colab to verify the outputs?

return self.out_layer(x)


# TODO: Maybe this can be exported as part of the public API? Seems to have enough reusability.
Collaborator

here - keras_hub/src/layers/modeling
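
(If the layer does get promoted, the usual pattern - an assumption here, not something decided in this thread - would be to move it under keras_hub/src/layers/modeling/ and register it with the repo's API-export decorator, roughly:)

# Hypothetical sketch; assumes the keras_hub_export decorator used for
# public symbols elsewhere in the repository, and a placeholder class name.
from keras_hub.src.api_export import keras_hub_export
import keras

@keras_hub_export("keras_hub.layers.RMSNormalization")
class RMSNormalization(keras.layers.Layer):
    ...  # implementation as in flux_layers.py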

DavidLandup0 (Collaborator, Author) commented Oct 29, 2024

> Thanks David! Left a few comments. Do you have a demo colab to verify the outputs?

Yes - here: https://colab.research.google.com/drive/1Jr5pa9BGAxP6lZPimlpb22rD5DMijN3H#scrollTo=_ys5NSkcoQ_O

With converted weights (included in the Colab as well), we get matching outputs between the official model and the port, within an absolute tolerance of 1e-3:
[image: output comparison between the official model and the Keras port]
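
(As a rough illustration of what a conversion script has to handle - not the actual convert_flux_checkpoints.py logic - porting a single linear layer mostly comes down to transposing the PyTorch weight into a Keras kernel:)

import numpy as np
import torch
import keras

# Toy example, not the real conversion script: a PyTorch Linear and an
# equivalent Keras Dense layer.
torch_linear = torch.nn.Linear(8, 4)
keras_dense = keras.layers.Dense(4)
keras_dense.build((None, 8))

# PyTorch stores Linear weights as (out_features, in_features); a Keras Dense
# kernel is (in_features, out_features), so the weight matrix is transposed.
keras_dense.set_weights([
    torch_linear.weight.detach().numpy().T,
    torch_linear.bias.detach().numpy(),
])

x = np.random.rand(2, 8).astype("float32")
out_pt = torch_linear(torch.from_numpy(x)).detach().numpy()
out_keras = keras.ops.convert_to_numpy(keras_dense(x))
print(np.allclose(out_keras, out_pt, atol=1e-5))  # expected: True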

divyashreepathihalli (Collaborator) left a comment

LGTM

divyashreepathihalli merged commit 0756fb4 into keras-team:master Nov 13, 2024
6 of 7 checks passed