Releases: YichengDWu/MoYe.jl
Releases · YichengDWu/MoYe.jl
v4.0.0
MoYe v4.0.0
Merged pull requests:
- Update docs (#141) (@YichengDWu)
- update docs (#142) (@YichengDWu)
- Update docs, fix bugs (#143) (@YichengDWu)
v3.2.0
MoYe v3.2.0
Merged pull requests:
- assume pointers are already recasted before calling UniversalCopy (#137) (@YichengDWu)
- Add docs on vectorized async copy (#138) (@YichengDWu)
- docs on pipelining (#139) (@YichengDWu)
- add a shortcut in pointer (#140) (@YichengDWu)
v3.1.0
MoYe v3.1.0
Tensor core support added
Merged pull requests:
- remove unused macro (#129) (@YichengDWu)
- update docs (#131) (@YichengDWu)
- update docs (#132) (@YichengDWu)
- add comments (#133) (@YichengDWu)
- Add memcpy_async docs (#134) (@YichengDWu)
- build docs (#135) (@YichengDWu)
- Docs on tensor cores (#136) (@YichengDWu)
v3.0.0
MoYe v3.0.0
Merged pull requests:
- fix Composition bugs (#123) (@YichengDWu)
- update slicing docs (#124) (@YichengDWu)
- rework copy (#125) (@YichengDWu)
- add tiled copy docs (#126) (@YichengDWu)
- Allow Int32 indexing (#127) (@YichengDWu)
v2.0.0
MoYe v2.0.0
Merged pull requests:
- Bump Julia version to 1.10 (#104) (@YichengDWu)
- change @collective syntax (#105) (@YichengDWu)
- Improve docs (#106) (@YichengDWu)
- CompatHelper: bump compat for Adapt to 4, (keep existing compat) (#109) (@github-actions[bot])
- CompatHelper: bump compat for BFloat16s to 0.5, (keep existing compat) (#110) (@github-actions[bot])
- Bump codecov/codecov-action from 3 to 4 (#111) (@dependabot[bot])
- Bump julia-actions/setup-julia from 1 to 2 (#112) (@dependabot[bot])
- Change printing style of staticint (#113) (@YichengDWu)
- add unflatten (#114) (@YichengDWu)
- remove compose in favor of composition, fix complement (#115) (@YichengDWu)
- add array docs (#116) (@YichengDWu)
- update tile parallelize (#117) (@YichengDWu)
- make all mma ops (#118) (@YichengDWu)
- rework mma ops, traits and atoms, all functioning (#119) (@YichengDWu)
- Change broadcasting style so it respect the stride (#120) (@YichengDWu)
- Update MatMul docs (#121) (@YichengDWu)
- add code to void bank conflicts (#122) (@YichengDWu)
Closed issues:
- add a comment on reshape in the docs (#107)
v1.2.4
MoYe v1.2.4
Merged pull requests:
- add more methods (#97) (@YichengDWu)
- use cp_async in matmul (#98) (@YichengDWu)
- Bump actions/checkout from 3 to 4 (#99) (@dependabot[bot])
- CompatHelper: bump compat for StrideArraysCore to 0.5, (keep existing compat) (#101) (@github-actions[bot])
- CompatHelper: bump compat for CUDA to 5, (keep existing compat) (#102) (@github-actions[bot])
- CompatHelper: bump compat for LLVM to 6, (keep existing compat) (#103) (@github-actions[bot])
Closed issues:
v1.2.3
MoYe v1.2.3
Merged pull requests:
- add
foreach
map!
overloads (#91) (@YichengDWu) - allow constructing
MoYeSharedArray
from a static shape (#92) (@YichengDWu) - test tiled_mma (#94) (@YichengDWu)
- Some improvements on tiled mma (#95) (@YichengDWu)
- Tensor core programming example (#96) (@YichengDWu)
v1.2.2
v1.2.1
MoYe v1.2.1
Merged pull requests:
- Add ∘(x::MoYeArray, layout1, layouts...) (#88) (@YichengDWu)
- Fix logical product (#89) (@YichengDWu)
v1.2.0
MoYe v1.2.0
Merged pull requests:
- Remove unused code (#78) (@YichengDWu)
- Avoid local function calling itself (#79) (@YichengDWu)
- Avoid the use of local functions (#80) (@YichengDWu)
- improve docs (#81) (@YichengDWu)
- Test recast (#82) (@YichengDWu)
- doc complement (#83) (@YichengDWu)
- fixed zipped product & tiled product & tiled divide (#84) (@YichengDWu)
- Fix layout algebra on array (#85) (@YichengDWu)
- Fix ldmatrix traits (#86) (@YichengDWu)
- fix copy_unpack (#87) (@YichengDWu)