[SW] Fix kernel performance calculation #265

Merged
mp-17 merged 30 commits into main from feat/plot
Dec 3, 2023

mp-17 (Collaborator) commented Nov 28, 2023

Improve performance measurement scripts and kernels.

Changelog

Fixed

  • Fix dump vtrace script for vsetvli instructions without x0 (ideal dispatcher)
  • Fix Pathfinder and FFT performance

Added

  • Add kernels-Vl performance plot
  • Print I$/D$ stall metrics

Changed

  • Optimize Jacobi2d
  • Benchmark only the vector kernel in roi_align
  • Improve cache warming functions
  • [f]dotproduct works on the vector length in elements
  • Optimize DWT
  • Fix Pathfinder performance

Checklist

  • Automated tests pass
  • Changelog updated
  • Code style guideline is observed

mp-17 self-assigned this Nov 28, 2023
mp-17 changed the title from "Feat/plot" to "[SW] Fix kernel performance calculation" Nov 28, 2023
mp-17 marked this pull request as ready for review December 2, 2023 11:08
mp-17 merged commit 4b8cd46 into main Dec 3, 2023
197 checks passed
mp-17 deleted the feat/plot branch December 3, 2023 14:08