add a note about `reinterpret`'s memory layout #199

johnnychen94 · 2021-08-25T17:03:51Z

I didn't add the `StructArray{Point{Float64}}(X, dims=2)` trick here because it still seems quite strange to me: it makes a strong assumption about how the raw contents are interpreted. The confusion I had in #197 was a different one, namely how to get the "actual" memory layout from an already constructed StructArray.

closes #197

README.md

piever · 2021-08-26T09:51:25Z

Thanks, I've suggested some wording changes, plus a couple of sentences that could go at the beginning for context.

> I didn't add the `StructArray{Point{Float64}}(X, dims=2)` trick here because it is still quite strange to me; it has a strong assumption to how you interpret the data from the raw contents

I actually think it should be mentioned in this section, because it is the easiest way to get a StructArray from a higher-dimensional array of primitive types. Some of the confusion, IMO, comes from the fact that, unlike `Vector{ComplexF64}`, a `StructArray{ComplexF64}` has no single "block of memory" to speak of: it holds two separate vectors, which live at distinct locations. The only way to associate it with a single "block of memory" is to choose those vectors as contiguous views of an existing matrix.
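To make the "no single block of memory" point concrete, here is a minimal sketch. It assumes only the `StructArray(a)` constructor for an array of structs and property access to the components (`s.re`, `s.im`); the variable names are illustrative.

```julia
using StructArrays

# A plain Vector{ComplexF64} stores real and imaginary parts interleaved
# in one block, so it can be reinterpreted as a flat vector of Float64.
a = ComplexF64[1 + 2im, 3 - 1im]
flat = reinterpret(Float64, a)   # elements: 1.0, 2.0, 3.0, -1.0

# A StructArray built from `a` keeps one separate array per field instead.
s = StructArray(a)
re_part = s.re                   # [1.0, 3.0]
im_part = s.im                   # [2.0, -1.0]
```

The two component arrays are independent objects, which is why there is no single buffer to point at unless both are views into the same parent array.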

As a meta comment, I think this section should go among the "Advanced" sections at the end of the README. `reinterpret` is a bit on the technical side in my opinion.

johnnychen94 · 2021-08-26T10:08:50Z

README.md

+StructArrays also provides a way to reconstruct from a given memory block via `dims` keyword:
+
+```julia
+julia> v = Float64[1 3; 2 -1]


If I use v = [1 3; 2 -1] then I get

julia> StructArray{ComplexF64}(v, dims=1)
0-element StructArray(StructArray(), StructArray()) with eltype ComplexF64 with indices 1:0

Is this expected?

Are you sure? I get

julia> v = Float64[1 3; 2 -1]
2×2 Matrix{Float64}:
 1.0   3.0
 2.0  -1.0

julia> StructArray{ComplexF64}(v, dims=1)
2-element StructArray(view(::Matrix{Float64}, 1, :), view(::Matrix{Float64}, 2, :)) with eltype ComplexF64:
 1.0 + 2.0im
 3.0 - 1.0im

which is the expected behavior. Btw, dims=ndims(v) would be the way to get contiguous views as component arrays (selecting on the last dimension).
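The difference between `dims=1` and `dims=ndims(v)` can be seen by inspecting the strides of the component views. This is a small sketch using the same `v` as above; `strides` is the standard Base function, and the variable names are illustrative.

```julia
using StructArrays

v = Float64[1 3; 2 -1]

# dims=1: each component is a row of `v`, so consecutive elements
# are two Float64s apart in the column-major parent.
s_rows = StructArray{ComplexF64}(v, dims=1)
row_strides = strides(s_rows.re)           # (2,)

# dims=ndims(v): each component is a column of `v`, i.e. a contiguous view.
s_cols = StructArray{ComplexF64}(v, dims=ndims(v))
col_strides = strides(s_cols.re)           # (1,)

# The two choices also pair the numbers differently.
s_rows == ComplexF64[1 + 2im, 3 - 1im]     # rows become (re, im)
s_cols == ComplexF64[1 + 3im, 2 - 1im]     # columns become (re, im)
```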

Sorry, I meant:

julia> v = Int[1 3; 2 -1]
2×2 Matrix{Int64}:
 1   3
 2  -1

julia> StructArray{ComplexF64}(v, dims=1)
0-element StructArray(StructArray(), StructArray()) with eltype ComplexF64 with indices 1:0

julia> StructArray{ComplexF64}(reinterpret(Float64, v), dims=1)
2-element StructArray(view(reinterpret(Float64, ::Matrix{Int64}), 1, :), view(reinterpret(Float64, ::Matrix{Int64}), 2, :)) with eltype ComplexF64:
 5.0e-324 + 1.0e-323im
 1.5e-323 + NaN*im

julia> v = ComplexF64[1 3; 2 -1]
2×2 Matrix{ComplexF64}:
 1.0+0.0im   3.0+0.0im
 2.0+0.0im  -1.0+0.0im

julia> StructArray{ComplexF64}(v, dims=1)
2-element view(::Matrix{ComplexF64}, 1, :) with eltype ComplexF64:
 1.0 + 0.0im
 3.0 + 0.0im

julia> StructArray{ComplexF64}(v, dims=2)
2-element view(::Matrix{ComplexF64}, :, 1) with eltype ComplexF64:
 1.0 + 0.0im
 2.0 + 0.0im

Interpreting the output is quite, hmmm, unintuitive. Maybe I just hit some undefined behavior.

Ah, I see, it behaves a bit funny if the types don't match. It's probably because it has some logic to support nested cases; I've opened #200 to track this. Note that this constructor does not allocate, so it can't use a matrix of integers to store the components of a `ComplexF64`.
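When the raw data has the wrong element type, one way to stay within the supported case is to convert it explicitly before calling the constructor. The sketch below assumes that an extra copy is acceptable; `Float64.(v)` is ordinary broadcasting, not a StructArrays feature, and `vf` is an illustrative name.

```julia
using StructArrays

v = Int[1 3; 2 -1]

# Convert the values (not the bits) to Float64. This allocates a new matrix,
# so the resulting StructArray is backed by `vf`, not by `v`.
vf = Float64.(v)

s = StructArray{ComplexF64}(vf, dims=1)
s == ComplexF64[1 + 2im, 3 - 1im]   # true
```

Unlike `reinterpret(Float64, v)`, which reuses the integer bit patterns and produces the denormal values shown earlier, the conversion keeps the numeric values.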

piever · 2021-08-26T12:39:35Z

README.md

+ 1.0   3.0
+ 2.0  -1.0
+
+julia> StructArray{ComplexF64}(v, dims=1) # the actual memory is `([1.0, 3.0], [2.0, -1.0])`


Actually, here and below the memory is always `v`: this constructor does not allocate.

Oh, thanks for pointing this out! I hadn't realized it. I only did a quick check with `@btime StructArray{ComplexF64}(v, dims=1)` on the small `v = Float64[1 3; 2 -1]`, and I assumed the reported allocations meant a copy 😂

julia> @btime StructArray{ComplexF64}(v, dims=1);
  521.754 ns (8 allocations: 352 bytes)
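A nonzero allocation count from `@btime` does not by itself show that the data was copied; small allocations can come from other sources, for example benchmarking a non-interpolated global variable. A more direct check, sketched here with only `parent` from Base and ordinary indexing, is to test whether the components alias `v`:

```julia
using StructArrays

v = Float64[1 3; 2 -1]
s = StructArray{ComplexF64}(v, dims=1)

# The components are views whose parent is `v` itself, not a copy.
same_re = parent(s.re) === v
same_im = parent(s.im) === v

# Writes through the StructArray land in `v` ...
s[1] = 10 + 20im
written = (v[1, 1], v[2, 1])   # (10.0, 20.0)

# ... and writes to `v` are visible through the StructArray.
v[1, 2] = 7
second = s[2]                  # 7.0 - 1.0im
```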

piever · 2021-08-26T12:42:47Z

README.md

+This, however, depends on the underlying data layout and how you interpret the memory block. You
+should use this with caution because otherwise it might give you unexpected results. To get the
+"same" memory layout with the raw data `v`, you can always pass `dims=ndims(v)`.


It may be best to just mention the actual caveat, which is that `v` must be typed correctly. The point of `dims=ndims(v)` is not memory layout (the only in-memory object is always `v`) but performance: the components of the StructArray are then contiguous views (i.e., things like `@view v[:,1]`, which are the most efficient in column-major languages).

johnnychen94 · 2021-08-26T14:15:07Z

Getting a better understanding of this now. Thanks for the feedback!

Hope this is the last commit 😄

README.md

piever · 2021-08-27T15:26:32Z

Great, I'm glad doing this was instructive!

johnnychen94 added 2 commits August 26, 2021 00:48

add a note about reinterpret's memory layout

a84ff9c

rephrase the words

f30a9ab

piever requested changes Aug 26, 2021

apply suggestions

3774918

johnnychen94 added 2 commits August 26, 2021 17:53

move to advanced section

bb5247d

add example for dims keyword

75f7e1d

johnnychen94 commented Aug 26, 2021

one more note on dims=ndims(v)

ae40727

piever reviewed Aug 26, 2021

explain the memory order and the view perspective

1469c84

piever reviewed Aug 27, 2021

Minor wording change

7856ded

piever approved these changes Aug 27, 2021

piever merged commit 8958925 into JuliaArrays:master Aug 27, 2021

johnnychen94 deleted the jc/reinterpret branch August 28, 2021 00:04

johnnychen94 mentioned this pull request May 5, 2022

Docs: improve copy-vs-view, mutability, and advanced API #225

Merged
