Code Optimization - mostly C++
g++ autovectorization tips and issues - SSE,AVX
Is Parallel Programming Hard, And, If So, What Can You Do About It?
- Paul E. McKenney
Performance Monitor
- What are perf cache events meaning?
- perfmon2
- perf wiki
  - also
  - Where Do I get Source package for Perf tool
- Linux Performance Monitoring, any way to monitor per-thread?
  - An easy-to-use interface to the Linux perf_event API.
- oprofile
  - tl;dr
```
operf [ options ] [ --system-wide | --pid=<PID> | [ command [ args ] ] ]
A typical usage might look like this:
operf ./my_test_program my_arg
```
MMU - TLB - huge pages
Cache issues
Other Memory issues
- stream memory benchmarks
- Poor memcpy Performance on Linux - AVX code for memcpy
- various memcpy tips
- fast memcpy
- new technologies from Intel : Cache Monitoring Technology (CMT), Memory Bandwidth Monitoring (MBM), Cache Allocation Technology (CAT) and Code and Data Prioritization (CDP) Technology
Timers
- Precise Linux Timing - What Determines the Resolution of clock_gettime()?
- Measure time in Linux - time vs clock vs getrusage vs clock_gettime vs gettimeofday vs timespec_get?
Videos from The 3rd annual JuliaCon 2016 (MIT)
- The 3rd annual JuliaCon 2016 (MIT) | Introduction to Writing High Performance Julia | Arch D. Robison
Useful pieces of information
- Difference between Mutex, Semaphore & Spin Locks
- Does endless While loop take up CPU resources?
Assembler
- Compiler Explorer C++
- Whirlwind Tour of ARM Assembly
Scientific Methodology and Performance Evaluation
Binary Instrumentation

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

performance_optimization_tips.md

performance_optimization_tips.md

Files

performance_optimization_tips.md

Latest commit

History

performance_optimization_tips.md

File metadata and controls