OpenBLAS 0.2.16 version

@xianyi xianyi released this 15 Mar 19:03

Version 0.2.16
15-Mar-2016

common:

  • Upgrade LAPACK to version 3.6.0.
    Add BUILD_LAPACK_DEPRECATED option in Makefile.rule to build
    LAPACK deprecated functions.
  • Add MAKE_NB_JOBS option in Makefile.
    Force number of make jobs. This is particularly
    useful when using distcc. (#735. Thanks, Jerome Robert)
  • Redesign unit tests. Run unit/regression tests at every build (Travis-CI and Appveyor).
  • Disable multi-threading for small size swap and ger. (#744. Thanks, Jerome Robert)
  • Improve small zger, zgemv, ztrmv using stack allocation. (#727. Thanks, Jerome Robert)
  • Let openblas_get_num_threads return the number of active threads.
    (#760. Thanks, Jerome Robert)
  • Support illumos(OmniOS). (#749. Thanks, Lauri Tirkkonen)
  • Fix LAPACK Dormbr, Dormlq bug. (#711, #713. Thanks, Brendan Tracey)
  • Update scipy benchmark script. (#745. Thanks, John Kirkham)
  • Avoid potential getenv segfault. (#716)
  • Import LAPACK svn bugfixes #142-#147, #150-#155.
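
A minimal sketch of how the two build options listed above might be set. It assumes both are ordinary make variables that can be assigned in Makefile.rule (or passed on the make command line); the job count of 4 is an arbitrary illustrative value, not one taken from these release notes.

```
# Makefile.rule (illustrative values only)
# Also build the LAPACK functions deprecated as of LAPACK 3.6.0.
BUILD_LAPACK_DEPRECATED = 1
# Force the number of make jobs (4 is an arbitrary example).
MAKE_NB_JOBS = 4
```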

x86/x86_64:

  • Optimize trsm kernels for AMD Bulldozer, Piledriver, Steamroller.
  • Detect Intel Avoton.
  • Detect AMD Trinity, Richland, E2-3200.
  • Fix gemv performance bug on Mac OSX Intel Haswell.
  • Fix some bugs with CMake and Visual Studio.
  • Optimize c/zgemv for AMD Bulldozer, Piledriver, Steamroller.
  • Fix bug with scipy linalg test.

ARM:

  • Support and optimize Cortex-A57 AArch64.
    (#686. Thanks, Ashwin Sekhar TK)
  • Fix Android build on ARMV7. (#778. Thanks, Paul Mustiere)
  • Update ARMV6 kernels.
  • Improve DGEMM for ARM Cortex-A57. (Thanks, Ashwin Sekhar T K)

POWER:

  • Fix detection of POWER architecture.
    (#684. Thanks, Sebastien Villemot)
  • Optimize D and Z BLAS3 functions for Power8.

md5sum
8fae7cebfefa073c8640e99c4454dc03 OpenBLAS-0.2.16.zip
fef46ab92463bdbb1479dcec594ef6dc OpenBLAS-0.2.16.tar.gz

Download OpenBLAS