Simple AVX examples

This repository contains five simple examples showing how to use AVX intrinsics to accelerate a program. Please bear in mind that I am a beginner with CPU vector instructions, and I'm not claiming this code is exemplary.

To build this repository, make sure your CPU supports AVX2. To find out, run the command

cat /proc/cpuinfo | grep avx2

References

Problem 1: add two (properly aligned) arrays of floats

We assume not only that both inputs are correctly aligned (to 32 bytes, as the aligned AVX load and store instructions require), but also that their lengths are multiples of 256 bits, i.e. 8 floats.
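A minimal sketch of this case, not necessarily the code in the examples folder. The function name add_aligned is hypothetical, and the target attribute is a GCC/Clang extension used here so the snippet compiles without passing -mavx. The assertions spell out the two assumptions made above.

```cpp
#include <immintrin.h>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Hypothetical helper: true when p sits on a 32-byte boundary.
static bool is_aligned32(const void* p) {
    return reinterpret_cast<std::uintptr_t>(p) % 32 == 0;
}

// Computes out[i] = a[i] + b[i], 8 floats per iteration.
// Assumes 32-byte aligned pointers and n being a multiple of 8.
__attribute__((target("avx")))
void add_aligned(const float* a, const float* b, float* out, std::size_t n) {
    assert(n % 8 == 0);
    assert(is_aligned32(a) && is_aligned32(b) && is_aligned32(out));
    for (std::size_t i = 0; i < n; i += 8) {
        __m256 va = _mm256_load_ps(a + i);   // aligned load of 8 floats
        __m256 vb = _mm256_load_ps(b + i);
        _mm256_store_ps(out + i, _mm256_add_ps(va, vb));  // aligned store
    }
}
```

Stack or static arrays can be given the required alignment with alignas(32).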

Problem 2: add two arbitrary arrays of floats

Do the two arrays have to be aligned in any particular way with respect to each other, or can we take any two arrays of floats anywhere in memory?
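One possible answer, sketched under assumptions: the unaligned load and store intrinsics (_mm256_loadu_ps, _mm256_storeu_ps) accept any address, so the two arrays need no particular alignment relative to each other, and a scalar loop can handle the leftover elements. The name add_any is hypothetical, and the target attribute is a GCC/Clang extension standing in for -mavx.

```cpp
#include <immintrin.h>
#include <cassert>
#include <cstddef>

// Computes out[i] = a[i] + b[i] for arrays of any alignment and any length.
__attribute__((target("avx")))
void add_any(const float* a, const float* b, float* out, std::size_t n) {
    assert(n == 0 || (a != nullptr && b != nullptr && out != nullptr));
    std::size_t i = 0;
    // Vector part: 8 floats per iteration, using unaligned loads and stores.
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);
        __m256 vb = _mm256_loadu_ps(b + i);
        _mm256_storeu_ps(out + i, _mm256_add_ps(va, vb));
    }
    // Scalar tail: the remaining n % 8 elements.
    for (; i < n; ++i) {
        out[i] = a[i] + b[i];
    }
}
```

Whether unaligned access costs anything measurable depends on the CPU, so it is worth benchmarking against the aligned version rather than assuming.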

Problem 3: dot product

Let's calculate the dot product of two vectors.
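A rough sketch of one way to do it, assuming float vectors of arbitrary length and alignment. The name dot is hypothetical, and the target attribute is a GCC/Clang extension used in place of -mavx. The idea is to keep 8 partial sums in one register, then add them together at the end.

```cpp
#include <immintrin.h>
#include <cassert>
#include <cstddef>

// Returns the sum of a[i] * b[i] for i in [0, n).
__attribute__((target("avx")))
float dot(const float* a, const float* b, std::size_t n) {
    assert(n == 0 || (a != nullptr && b != nullptr));
    __m256 acc = _mm256_setzero_ps();  // 8 running partial sums
    std::size_t i = 0;
    for (; i + 8 <= n; i += 8) {
        __m256 va = _mm256_loadu_ps(a + i);
        __m256 vb = _mm256_loadu_ps(b + i);
        acc = _mm256_add_ps(acc, _mm256_mul_ps(va, vb));
    }
    // Horizontal sum: spill the 8 lanes to memory and add them up.
    float lanes[8];
    _mm256_storeu_ps(lanes, acc);
    float sum = 0.0f;
    for (float x : lanes) {
        sum += x;
    }
    // Scalar tail for the remaining n % 8 elements.
    for (; i < n; ++i) {
        sum += a[i] * b[i];
    }
    return sum;
}
```

Because the additions happen in a different order than in a plain scalar loop, the result can differ from the scalar one in the last few bits.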

Problem 4: linear search through an array

I have seen it asserted online that brute force linear search can beat binary search for arrays of size up to 10K. The calculations people give to support this claim involve vector instructions. Let's try writing a vectorized linear search.
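A sketch of what such a search could look like, assuming 32-bit integer elements (the text above does not fix an element type). The name find_first is hypothetical; the target attribute and __builtin_ctz are GCC/Clang extensions. Each iteration compares 8 elements against the key at once and turns the comparison result into an 8-bit mask.

```cpp
#include <immintrin.h>
#include <cassert>
#include <cstddef>
#include <cstdint>

// Returns the index of the first element equal to key, or -1 if there is none.
__attribute__((target("avx2")))
std::ptrdiff_t find_first(const std::int32_t* data, std::size_t n, std::int32_t key) {
    assert(n == 0 || data != nullptr);
    const __m256i vkey = _mm256_set1_epi32(key);  // key copied into all 8 lanes
    std::size_t i = 0;
    for (; i + 8 <= n; i += 8) {
        __m256i v = _mm256_loadu_si256(reinterpret_cast<const __m256i*>(data + i));
        __m256i eq = _mm256_cmpeq_epi32(v, vkey);  // all-ones in each matching lane
        // One bit per 32-bit lane: bit k is set when data[i + k] == key.
        int mask = _mm256_movemask_ps(_mm256_castsi256_ps(eq));
        if (mask != 0) {
            // The lowest set bit corresponds to the first match in this block.
            return static_cast<std::ptrdiff_t>(i) + __builtin_ctz(static_cast<unsigned>(mask));
        }
    }
    // Scalar tail for the remaining n % 8 elements.
    for (; i < n; ++i) {
        if (data[i] == key) {
            return static_cast<std::ptrdiff_t>(i);
        }
    }
    return -1;
}
```

This says nothing by itself about the claim regarding binary search; that would need a benchmark on the array sizes in question.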

Problem 5: alignment issues for AVX-accelerated classes

When an object is created dynamically, its address is determined at runtime. However, before C++17 the default operator new in the C++ runtime library does not take the class's alignment specifier into account, so we need to overload operator new for the class.

In addition, if we dynamically create an object of a class that has an aligned class as a member, the member's overloaded operator new is not called, which can leave the member misaligned. The solution is somewhat tricky: it requires users to add a macro to their own class. See the code for details.
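The idea can be sketched as follows. This is an illustration under assumptions, not the repository's actual code: the macro name AVX_ALIGNED_OPERATOR_NEW and the types Vec8 and Particle are hypothetical, and _mm_malloc/_mm_free are used as one possible aligned allocator. The macro gives a class its own operator new and operator delete, and it has to be repeated in every class that contains an aligned member.

```cpp
#include <immintrin.h>
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <new>

// Hypothetical helper: true when p sits on a 32-byte boundary.
static bool is_aligned32(const void* p) {
    return reinterpret_cast<std::uintptr_t>(p) % 32 == 0;
}

// Hypothetical macro: class-level operator new/delete that return
// 32-byte aligned storage. Only the single-object forms are covered here.
#define AVX_ALIGNED_OPERATOR_NEW                                   \
    static void* operator new(std::size_t size) {                  \
        void* p = _mm_malloc(size, 32);                            \
        if (p == nullptr) throw std::bad_alloc();                  \
        return p;                                                  \
    }                                                              \
    static void operator delete(void* p) noexcept { _mm_free(p); }

// A class whose data must be 32-byte aligned for aligned AVX loads.
struct alignas(32) Vec8 {
    float v[8] = {};
    Vec8() { assert(is_aligned32(this)); }  // catches misaligned construction
    AVX_ALIGNED_OPERATOR_NEW
};

// A class that merely contains a Vec8. "new Particle" does not go through
// Vec8's operator new, so Particle needs the macro as well.
struct Particle {
    Vec8 position;
    int id = 0;
    AVX_ALIGNED_OPERATOR_NEW
};
```

With this in place, both new Vec8 and new Particle should return 32-byte aligned objects. Array forms (new[]) would need matching overloads of their own.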

Reference: http://eigen.tuxfamily.org/dox-devel/group__DenseMatrixManipulation__Alignement.html
