Conv Layer Incorrect output #14236

Open · 2 tasks
shwetankTT opened this issue Oct 24, 2024 · 6 comments
Assignees: shwetankTT
Labels: bug · CNN_bug · conv generality · op_cat: conv2D · P1

Comments
shwetankTT (Contributor) commented Oct 24, 2024

  • out_channels > 256 with output in row-major layout: incorrect output from pack_untilize_dst
  • has_bias, packer_l1_acc, and fp32_accum flags all enabled for use_max_rows (output_layout is ROW_MAJOR)

Will add more details.
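For reference, a hedged sketch of the two failing configurations as a single predicate; every name below is an illustrative stand-in, not the actual tt-metal conv2d field:

```cpp
// Sketch only: names are hypothetical stand-ins for the conv2d op's config.
enum class Layout { ROW_MAJOR, TILE };

bool hits_reported_conv2d_bugs(int out_channels, Layout output_layout,
                               bool has_bias, bool packer_l1_acc, bool fp32_accum) {
    const bool row_major = (output_layout == Layout::ROW_MAJOR);
    const bool pack_untilize_case = (out_channels > 256) && row_major;           // bullet 1
    const bool max_rows_case =
        has_bias && packer_l1_acc && fp32_accum && row_major;                    // bullet 2
    return pack_untilize_case || max_rows_case;
}
```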

shwetankTT added the bug label Oct 24, 2024
shwetankTT self-assigned this Oct 24, 2024
shwetankTT changed the title from "Conv output mismatch" to "Incorrect output" Oct 24, 2024
shwetankTT changed the title from "Incorrect output" to "Conv Layer Incorrect output" Oct 24, 2024
shwetankTT (Contributor, Author) commented:

Attachment: test_new_conv2d.txt

jvasilje (Collaborator) commented:

@shwetankTT you opened a P0 bug and assigned it to yourself? P0 bugs are company-wide show stoppers - is that what this is?

jvasilje added P1 and removed P0 labels Oct 30, 2024
shwetankTT (Contributor, Author) commented Nov 2, 2024

> @shwetankTT you opened a P0 bug and assigned it to yourself? P0 bugs are company-wide show stoppers - is that what this is?

Yeah, it was definitely not a P0. Thanks for changing it to P1.

shwetankTT (Contributor, Author) commented:

Seems like a hardware limitation? Let's take an example where the input and output are 8x320x16x16, distributed across 64 cores, so that each core holds a shard of shape (32, 320). That equates to 1 tile row and 10 tile columns. Since the output is in row-major order, completing each output row requires data from all 10 tiles, but we only have 8 dst tiles. @mywoodstock
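To make the arithmetic concrete, a small self-contained sketch (the 32-wide tile is the standard Tensix tile size; the 8-tile dst capacity is taken from the comment above, not queried from hardware):

```cpp
#include <cstdio>

int main() {
    constexpr int TILE_W = 32;       // standard Tensix tile width
    constexpr int DST_TILES = 8;     // dst register capacity, per the comment above
    constexpr int shard_w = 320;     // width of the (32, 320) per-core shard
    constexpr int tiles_per_row = shard_w / TILE_W;  // = 10 tile columns
    // Untilizing one row-major output row needs all 10 tiles resident at once,
    // but only 8 fit in dst, so the op would have to iterate in 8-tile blocks.
    std::printf("tiles per row = %d, dst capacity = %d, fits = %s\n",
                tiles_per_row, DST_TILES, tiles_per_row <= DST_TILES ? "yes" : "no");
}
```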

shwetankTT added a commit that referenced this issue Nov 11, 2024
shwetankTT added a commit that referenced this issue Nov 11, 2024
mywoodstock (Contributor) commented:

Thanks @shwetankTT, yes: if the width is > 8 tiles, we will need to iterate over the 8-tile blocks. Can you add a TT_FATAL for this for now?
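A minimal sketch of the requested guard. TT_FATAL is the real tt-metal assert macro; output_shard_width_ntiles and MAX_DST_TILES are assumed names, not the actual conv2d internals:

```cpp
// Hedged sketch: reject row-major conv2d outputs wider than the dst capacity
// until 8-tile-block iteration is implemented. Names are illustrative.
constexpr uint32_t MAX_DST_TILES = 8;
TT_FATAL(
    output_shard_width_ntiles <= MAX_DST_TILES,
    "Row-major conv2d output shard is {} tiles wide, but pack_untilize can only "
    "handle {} dst tiles; widths > 8 tiles are not yet supported",
    output_shard_width_ntiles,
    MAX_DST_TILES);
```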

rtawfik01 (Contributor) commented:

Hi guys, I see that pack_untilize has been tested at the unit level with a Float32 destination register and Float32 output format: https://github.com/tenstorrent/tt-metal/blob/main/tests/tt_metal/tt_metal/unit_tests/compute/test_untilize_tilize.cpp

But it seems that in the use case above, the input to the copy + pack_untilize is Float16_b, the input to the packer is Float32, and the output of the packer is possibly back to Float16_b. We can add a unit test case that replicates the behavior you see to verify whether it is a kernel issue; in the meantime @mywoodstock, can you make sure the data format reconfigs are being used correctly with the copy + pack_untilize above?

@ncvetkovicTT @nvelickovicTT The issue above is that once the Float32 destination register + Float32 packer input format is set, the results start being incorrect, and the pattern looks like each row of the Float32 destination register (failing result) holds half the datums of the Float16_b register (passing result). If possible, it would be good to add their specific test to our unit test infra.
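For reference, a rough sketch of the reconfig ordering being asked about, written as a tt-metal compute-kernel fragment. The calls shown (reconfig_data_format, copy_tile, pack_reconfig_data_format, pack_untilize_dst) exist in tt-metal, but their exact overloads vary between versions, and the CB indices and block size here are illustrative assumptions:

```cpp
// Assumed setup: cb_in holds Float16_b tiles, dst is Float32 (fp32_dest_acc_en),
// cb_out is Float16_b again. CB ids and block_ct_dim are illustrative.
constexpr uint32_t cb_in = 0, cb_out = 16;
constexpr uint32_t block_ct_dim = 8;

reconfig_data_format(cb_in, cb_in);        // unpacker: expect Float16_b input
copy_tile_init(cb_in);
tile_regs_acquire();
copy_tile(cb_in, /*tile_index=*/0, /*dst_index=*/0);  // lands in Float32 dst
tile_regs_commit();

pack_reconfig_data_format(cb_out);         // packer: Float32 dst in, Float16_b out
pack_untilize_dst_init_short<block_ct_dim>(cb_out);
tile_regs_wait();
pack_untilize_dst<block_ct_dim>(cb_out);   // suspect path when dst is Float32
tile_regs_release();
```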
