-
Notifications
You must be signed in to change notification settings - Fork 9
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
fix attribute for arrray of bools (#279)
* fix attribute for arrray of bools * test * remove uses of unsafe pointer calls
- Loading branch information
Showing
6 changed files
with
44 additions
and
68 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
b92a7d8
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reactant.jl Benchmarks
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1350587665
ns1356899596
ns1.00
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1383284659
ns1387243563
ns1.00
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1318521515
ns1307710781
ns1.01
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
3203498216
ns3391199395
ns0.94
ViT base (256 x 256 x 3 x 32)/forward/CUDA/Lux
211895034
ns288422658
ns0.73
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
5192693081
ns7038978698
ns0.74
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant
5105568219
ns4922184280
ns1.04
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
6018217981
ns5148694966
ns1.17
ViT base (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
11767333133
ns7443923750
ns1.58
ViT base (256 x 256 x 3 x 32)/forward/CPU/Lux
37329667055
ns35583150632
ns1.05
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1352978597
ns1326518186
ns1.02
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1301145716
ns1335465488
ns0.97
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1251932856
ns1335531663
ns0.94
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
3185200110
ns3467076041
ns0.92
ViT small (256 x 256 x 3 x 4)/forward/CUDA/Lux
8590755
ns9067176.5
ns0.95
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1620067599
ns1555452102
ns1.04
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant
1562833541
ns1525737118
ns1.02
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1659962028
ns1533932694
ns1.08
ViT small (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
3277801510
ns3300553638
ns0.99
ViT small (256 x 256 x 3 x 4)/forward/CPU/Lux
6065735254.5
ns2534998626
ns2.39
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1274332633
ns1328057662
ns0.96
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1312980865
ns1259249953
ns1.04
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1297238205
ns1529248493
ns0.85
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
3128524863
ns3214165683
ns0.97
ViT tiny (256 x 256 x 3 x 32)/forward/CUDA/Lux
21248949
ns25864507
ns0.82
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
2146461465
ns2138887589
ns1.00
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant
2127880295
ns2148375491
ns0.99
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
2168496609
ns2178317010
ns1.00
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
3916001186
ns3916690304
ns1.00
ViT tiny (256 x 256 x 3 x 32)/forward/CPU/Lux
6604223402
ns5911275014.5
ns1.12
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1287472160
ns1298681699
ns0.99
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1443583640.5
ns1457354852
ns0.99
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1313342058.5
ns1321007313
ns0.99
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
3220364383
ns3192117311
ns1.01
ViT tiny (256 x 256 x 3 x 4)/forward/CUDA/Lux
7502452
ns7662836.5
ns0.98
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1416410227
ns1457203535
ns0.97
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant
1419844842
ns1435056828
ns0.99
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1409644754
ns1438646883
ns0.98
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
3159215564
ns3204423699
ns0.99
ViT tiny (256 x 256 x 3 x 4)/forward/CPU/Lux
3073518446
ns1204850367
ns2.55
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1415141711
ns1609570351
ns0.88
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1303150301
ns1288271363
ns1.01
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1319157274
ns1305692496
ns1.01
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
3404838803
ns3210305896
ns1.06
ViT tiny (256 x 256 x 3 x 16)/forward/CUDA/Lux
11353488
ns15484395
ns0.73
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
1723706202
ns1721890751
ns1.00
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant
1709602541
ns1722159183
ns0.99
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
1695411960
ns1701220224
ns1.00
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
3432537936
ns3501882082
ns0.98
ViT tiny (256 x 256 x 3 x 16)/forward/CPU/Lux
5387012592
ns2892153610
ns1.86
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1329618389
ns1304479984
ns1.02
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1343532174
ns1315566933
ns1.02
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1352869420
ns1537931496
ns0.88
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
3185476720
ns3352632273
ns0.95
ViT small (256 x 256 x 3 x 16)/forward/CUDA/Lux
25673267
ns25923540
ns0.99
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
2152691101
ns2184873795
ns0.99
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant
2143865762
ns2206178636
ns0.97
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
2141145130
ns2173015030
ns0.99
ViT small (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
4249708829
ns3992993680
ns1.06
ViT small (256 x 256 x 3 x 16)/forward/CPU/Lux
16244609813
ns5901204125.5
ns2.75
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :after_enzyme)
1310104193
ns1337918756
ns0.98
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant
1506787737
ns1292347450
ns1.17
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :before_enzyme)
1269748486
ns1325281512
ns0.96
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Reactant (optimize = :only_enzyme)
3262767860
ns3307275999
ns0.99
ViT small (256 x 256 x 3 x 32)/forward/CUDA/Lux
50691269
ns56296298.5
ns0.90
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :after_enzyme)
6933624742
ns2942888604
ns2.36
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant
4060945281
ns2995273898
ns1.36
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :before_enzyme)
2967657517
ns3018336488
ns0.98
ViT small (256 x 256 x 3 x 32)/forward/CPU/Reactant (optimize = :only_enzyme)
4892229925
ns4949580089
ns0.99
ViT small (256 x 256 x 3 x 32)/forward/CPU/Lux
11162799884
ns14511536613
ns0.77
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :after_enzyme)
1351893597
ns1309758052
ns1.03
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant
1298214491
ns1309349784
ns0.99
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :before_enzyme)
1321267516
ns1528578322
ns0.86
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Reactant (optimize = :only_enzyme)
3172547882
ns3242193874
ns0.98
ViT base (256 x 256 x 3 x 16)/forward/CUDA/Lux
68134904
ns72533617.5
ns0.94
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :after_enzyme)
3112388445
ns3137681593
ns0.99
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant
3311374032
ns3252812510
ns1.02
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :before_enzyme)
3408895903
ns3113127307
ns1.10
ViT base (256 x 256 x 3 x 16)/forward/CPU/Reactant (optimize = :only_enzyme)
5053122323
ns5045606499
ns1.00
ViT base (256 x 256 x 3 x 16)/forward/CPU/Lux
26079756832
ns15819654237
ns1.65
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :after_enzyme)
1300428116
ns1316381206
ns0.99
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant
1329236745
ns1321440422
ns1.01
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :before_enzyme)
1335241615
ns1321148926
ns1.01
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Reactant (optimize = :only_enzyme)
3163422535
ns3248024917
ns0.97
ViT base (256 x 256 x 3 x 4)/forward/CUDA/Lux
19502662
ns20424385
ns0.95
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :after_enzyme)
1818841619
ns1868979954
ns0.97
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant
1833108489
ns1974840414
ns0.93
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :before_enzyme)
1850734843
ns1995499997
ns0.93
ViT base (256 x 256 x 3 x 4)/forward/CPU/Reactant (optimize = :only_enzyme)
3648898677
ns3753168538
ns0.97
ViT base (256 x 256 x 3 x 4)/forward/CPU/Lux
8063384938
ns3215953877.5
ns2.51
This comment was automatically generated by workflow using github-action-benchmark.