Home
Given an innermost loop kernel in assembly, this tool automatically fetches the instructions from the assembly code and predicts the kernel's runtime, including a throughput analysis and detection of the critical path and loop-carried dependencies.
In computer science and computational applications in general, performance is the key to efficient software. Optimizing modern programs is therefore an important part of computational science, but it turns out to be complex and extremely difficult.
This is an effort to develop an easy-to-use yet powerful static performance analysis tool for Linux operating systems. OSACA is capable of analyzing ... TODO
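To make the terms "throughput analysis" and "critical path" more concrete, here is a minimal toy sketch in Python. It is not OSACA's implementation or API: the kernel, the port names, the latencies, and the functions `throughput_bound` and `critical_path` are all hypothetical and chosen only for illustration. It assumes each instruction occupies exactly one port for one cycle.

```python
from collections import defaultdict

# Hypothetical toy kernel, listed in program order:
# (name, execution port, latency in cycles, names of instructions it depends on).
# Ports and latencies are illustrative assumptions, not real machine data.
KERNEL = [
    ("load_a", "P2", 4, []),
    ("load_b", "P2", 4, []),
    ("mul",    "P0", 4, ["load_a", "load_b"]),
    ("add",    "P1", 4, ["mul"]),
    ("store",  "P4", 1, ["add"]),
]


def throughput_bound(kernel):
    """Cycles per iteration if only port occupancy limits execution."""
    pressure = defaultdict(int)
    for _name, port, _latency, _deps in kernel:
        # Assumption: every instruction occupies its port for one cycle.
        pressure[port] += 1
    # The busiest port determines the bound.
    return max(pressure.values())


def critical_path(kernel):
    """Length in cycles of the longest latency chain within one iteration."""
    finish = {}
    for name, _port, latency, deps in kernel:
        # An instruction can start once all of its inputs are available.
        start = max((finish[d] for d in deps), default=0)
        finish[name] = start + latency
    return max(finish.values())


if __name__ == "__main__":
    print("throughput bound [cy/it]:", throughput_bound(KERNEL))
    print("critical path [cy]:", critical_path(KERNEL))
```

In this toy model, the two loads share one port, so that port limits the throughput, while the load, multiply, add, store chain determines the critical path. A loop-carried dependency would be a chain of this kind that feeds a value from one iteration into the next.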
If you use OSACA for scientific work you can cite us as:
J. Laukemann, J. Hammer, J. Hofmann, G. Hager and G. Wellein, "Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures," 2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), Dallas, TX, USA, 2018, pp. 121-131, doi: 10.1109/PMBS.2018.8641578. Pre-print: https://arxiv.org/abs/1809.00912
BibTeX:
@INPROCEEDINGS{8641578,
  author={J. {Laukemann} and J. {Hammer} and J. {Hofmann} and G. {Hager} and G. {Wellein}},
  booktitle={2018 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)},
  title={Automated Instruction Stream Throughput Prediction for Intel and AMD Microarchitectures},
  year={2018},
  volume={},
  number={},
  pages={121-131},
  doi={10.1109/PMBS.2018.8641578}}
Furthermore, in 2019, we added support for non-x86 microarchitectures:
J. Laukemann, J. Hammer, G. Hager and G. Wellein, "Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels," 2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS), Denver, CO, USA, 2019, pp. 1-6, doi: 10.1109/PMBS49563.2019.00006. Pre-print: https://arxiv.org/abs/1910.00214
BibTeX:
@INPROCEEDINGS{9059263,
  author={J. {Laukemann} and J. {Hammer} and G. {Hager} and G. {Wellein}},
  booktitle={2019 IEEE/ACM Performance Modeling, Benchmarking and Simulation of High Performance Computer Systems (PMBS)},
  title={Automatic Throughput and Critical Path Analysis of x86 and ARM Assembly Kernels},
  year={2019},
  volume={},
  number={},
  pages={1-6},
  doi={10.1109/PMBS49563.2019.00006}}
- ISAs
- Architectures
- Tutorials
- Adding new architectures
  - Microbenchmarking
  - Database integration