Performance of mul!(x, transpose(B), y) #42

matthieugomez · 2019-08-08T15:30:34Z

Thanks for the great package! I am trying to use a BandedBlockBandedMatrix as a Jacobian in NLsolve (following SciML/DifferentialEquations.jl#483). However, performance is worse than with a sparse matrix. I think the slowdown comes from a line in NLsolve that multiplies by the transposed Jacobian. For example:

using LinearAlgebra, SparseArrays, BlockBandedMatrices, BenchmarkTools
x = rand(10000)
y = rand(10000)
J = BandedBlockBandedMatrix(Ones(10000, 10000), (fill(100, 100), fill(100, 100)), (1, 1), (1, 1))
@btime mul!($x, transpose($J), $y)
# 2.528 s (1 allocation: 16 bytes)
sparseJ = sparse(J)
@btime mul!($x, transpose($sparseJ), $y)
#  60.817 μs (1 allocation: 16 bytes)

Is there a way to make this operation faster?

ChrisRackauckas · 2019-08-08T15:34:00Z

If that matrix is a Jacobian, you probably just want to do reverse mode AD instead.
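
To illustrate the idea: the product transpose(J) * y is a vector-Jacobian product, which reverse-mode AD can compute without ever forming J. Below is a minimal sketch, assuming Zygote.jl as the AD package (it is not named in this thread) and a made-up residual function `f` chosen only so the result can be checked by hand.

```julia
using Zygote

# Hypothetical residual function, for illustration only.
# Its Jacobian is J = Diagonal(2u) + ones(n, n).
f(u) = u .^ 2 .+ sum(u)

u = rand(5)
y = rand(5)

# pullback returns f(u) and a closure that maps y to transpose(J) * y.
val, back = Zygote.pullback(f, u)
vjp = back(y)[1]

# Hand-derived reference: transpose(J) * y = 2u .* y .+ sum(y).
@assert vjp ≈ 2 .* u .* y .+ sum(y)
```

Whether this fits depends on whether the solver accepts a Jacobian-free operator; that part is not shown here.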

dlfivefifty · 2019-08-08T15:49:29Z

Yes, I haven't yet made sure that Transpose{T,<:BandedBlockBandedMatrix} implements the necessary routines. This should be possible in principle, though it will take some work.

If you are eager, the starting point is to create a BandedBlockBandedRowMajor() memory layout, see:
https://github.com/JuliaMatrices/BlockBandedMatrices.jl/blob/24c5d6b3f030a7735ece01b0f918b3d634802957/src/BandedBlockBandedMatrix.jl#L279

Then we'd need to make sure sub-blocks are BandedRowMajor():
https://github.com/JuliaMatrices/BlockBandedMatrices.jl/blob/24c5d6b3f030a7735ece01b0f918b3d634802957/src/BandedBlockBandedMatrix.jl#L456

We probably need a call to @lazymul to make sure mul! lowers to the LazyArrays.jl multiplication routines.

In theory then we're done: mul!(x, transpose(A), y) should lower to

https://github.com/JuliaMatrices/BlockBandedMatrices.jl/blob/24c5d6b3f030a7735ece01b0f918b3d634802957/src/linalg.jl#L33

which then will call BandedMatrices.jl's implementation of BandedRowMajor multiplication. Though there is likely missing functionality that will be hit.
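
Until that exists, one possible stopgap, suggested by the timings in the first post, is to keep a sparse copy of the matrix and use it for the transposed product. This is only a sketch: it reuses the constructor form from the example above (which may differ in other versions of BlockBandedMatrices.jl) on a smaller problem, and the sparse copy has to be rebuilt whenever the entries of J change, which may cancel the gain if J is updated on every iteration.

```julia
using LinearAlgebra, SparseArrays, BlockBandedMatrices

# Same constructor form as in the first post, on a smaller 100×100 problem.
J = BandedBlockBandedMatrix(Ones(100, 100), (fill(10, 10), fill(10, 10)), (1, 1), (1, 1))
y = rand(100)
x = similar(y)

# Convert once; rebuild this copy whenever the entries of J change.
sparseJ = sparse(J)

# Transposed product on the sparse copy, the fast case in the timings above.
mul!(x, transpose(sparseJ), y)

# Sanity check against a dense reference.
@assert x ≈ transpose(Matrix(J)) * y
```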
