LPSPI: embedded-hal 1.0 rework #145

Finomnis · 2023-11-18T22:08:58Z

No description provided.

mciantyre

Thanks for working through this, and sorry for the slow review. There's lots of work in this branch, including changes to CI tools and updated dependencies. If you think these are helpful to include separately, I can take a look and help with separation.

We're free to break APIs and explore new drivers. But I'll admit: I'm going to miss the 0.5 Lpspi driver. It wasn't perfect, but I appreciated that the driver exposed the lower-level details of the peripheral to users. I've used this to drive I/O from interrupts (without async conveniences). And although I hadn't tried, I thought it could be the foundation for a future async driver, either in this package or another package.

During your prototyping, did you hit limitations of the existing Lpspi driver that prevented it from being usable as this driver's foundation? (I may be in the minority about the "give users low-level control" design decisions, so I'm happy to scrap it.)

Our approach is a single Lpspi driver that supports blocking, interrupt-driven async, and DMA-driven async IO in one or both directions. Having the interrupt and DMA async IO in a single implementation lets the user decide if they want interrupts to send data and DMA to receive data, DMA to send and receive data, etc. I'm curious if there's a way to let the user compose these features themselves, without it all being in one larger driver. Here's a rough outline of what it might look like:

Outline, trade-offs

lpspi::InterruptTransmit for asynchronously sending data with interrupts. Includes its own InterruptHandler-style object, which is used in the LPSPI ISR. Doesn't care about embedded-hal-async (EHA).
lpspi::InterruptReceive for async receiving data with interrupts. Includes its own InterruptHandler-style object, used inside the LPSPI ISR. Doesn't care about EHA.
lpspi::DmaTransmit for async-sending data with DMA. Includes an InterruptHandler-style object, used inside a DMA ISR. Doesn't care about EHA.
lpspi::DmaReceive for asynchronously receiving data with DMA. Includes a complementary InterruptHandler-style object, used inside a DMA ISR. Doesn't care about EHA.
lpspi::AsyncLpspi accepts a combo of the *Transmit and *Receive splits. The user is already juggling the InterruptHandlers, and that's how we realize the async IO. This combiner implements the EHA async traits. The user constructs this.
Some internal means of splitting the LPSPI instance to realize these APIs. Waves hands waves hands.
Keep blocking behaviors separate.

The user manages two InterruptHandler objects, instead of the single InterruptHandler shown today. AsyncLpspi supports u16 IO only when composed of the two Interrupt* halves. This approach might not let us realize some kinds of optimizations ("use interrupts, not DMA, when you can saturate the TX FIFO and quickly return to caller") without planning.

My goal with this design would be to build a solid interrupt-driven async foundation, then make it easy to eventually develop DMA-driven async without changing the large LPSPI driver's plumbing. It looks like we're approaching this idea with the internal read- and write-halves; one more cut, and we're nearly there.
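A minimal sketch of how the outlined composition could look at the type level. Everything here is hypothetical and host-runnable only: the `Half` and `Moves<W>` traits, the `KIND` labels, and the `describe`, `word_bytes`, and `release` methods are illustration names that do not exist in the crate. The word-size split (interrupt halves move u8/u16/u32, DMA halves move u32 only) is an assumption, chosen so that u16 IO only type-checks when both halves are interrupt-driven.

```rust
use core::mem::size_of;

/// Marker trait (hypothetical): the implementing half can move words of type `W`.
pub trait Moves<W> {}

/// Common to all halves; `KIND` only labels the mechanism for this sketch.
pub trait Half {
    const KIND: &'static str;
}

pub struct InterruptTransmit;
pub struct InterruptReceive;
pub struct DmaTransmit;
pub struct DmaReceive;

impl Half for InterruptTransmit { const KIND: &'static str = "interrupt"; }
impl Half for InterruptReceive { const KIND: &'static str = "interrupt"; }
impl Half for DmaTransmit { const KIND: &'static str = "DMA"; }
impl Half for DmaReceive { const KIND: &'static str = "DMA"; }

// Assumption: interrupt-driven halves handle every word size.
impl Moves<u8> for InterruptTransmit {}
impl Moves<u16> for InterruptTransmit {}
impl Moves<u32> for InterruptTransmit {}
impl Moves<u8> for InterruptReceive {}
impl Moves<u16> for InterruptReceive {}
impl Moves<u32> for InterruptReceive {}

// Assumption: DMA-driven halves only handle full u32 words.
impl Moves<u32> for DmaTransmit {}
impl Moves<u32> for DmaReceive {}

/// The combiner: owns one transmit half and one receive half.
/// A real version would implement the embedded-hal-async traits here.
pub struct AsyncLpspi<T, R> {
    tx: T,
    rx: R,
}

impl<T: Half, R: Half> AsyncLpspi<T, R> {
    pub fn new(tx: T, rx: R) -> Self {
        Self { tx, rx }
    }

    /// Human-readable summary of the chosen composition.
    pub fn describe(&self) -> String {
        format!("tx via {}, rx via {}", T::KIND, R::KIND)
    }

    /// Only compiles when *both* halves can move words of type `W`.
    pub fn word_bytes<W>(&self) -> usize
    where
        T: Moves<W>,
        R: Moves<W>,
    {
        size_of::<W>()
    }

    /// Hand the halves back to the user.
    pub fn release(self) -> (T, R) {
        (self.tx, self.rx)
    }
}
```

With these bounds, `AsyncLpspi::new(InterruptTransmit, DmaReceive).word_bytes::<u16>()` is rejected at compile time, while the same call on two interrupt halves is accepted.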

mciantyre · 2023-12-17T14:40:38Z

src/common/lpspi/status_watcher.rs

+    transfer_complete_waker: Option<Waker>,
+    error_caught: Option<LpspiError>,
+    error_caught_waker: Option<Waker>,
+    tx_fifo_watermark_busy: bool,
+    rx_fifo_watermark_busy: bool,
+    tx_fifo_watermark_waker: Option<Waker>,
+    rx_fifo_watermark_waker: Option<Waker>,


We're tracking four wakers in this shared state, each specific to a certain condition of the peripheral. I'm curious if this is necessary. What's the condition when at least two of these wakers will be associated with distinct futures produced by the LPSPI driver?

From what I can tell, the user can only produce one future from the LPSPI driver. This is because the embedded-hal-async traits all require exclusive references (&mut Lpspi<...>), and we're not exposing methods that produce futures from shared LPSPI references (&Lpspi<...>). So even when we internally compose futures with select_biased! and join!, the waker associated with all of those state machines is the same.

If that's correct, could we get away with one Waker? Once we wake that one waker for any condition, the executor polls our top-level LPSPI future, which in turn figures out if transfers completed, or if errors are caught, or if watermarks were crossed.

(My concern is that we're setting up execution where we could excessively follow function pointers and wake an executor inside a critical section. Heads up that I'm not measuring any of this, so this is just coming from code study. I've written code that does this too, so I'm generally looking for different approaches.)
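A minimal sketch of the single-waker idea, written against `std` so it runs on a host. The assumptions are loud ones: `SharedState`, `TransferDone`, `CountingWaker`, and the `ReceiveOverflow` variant are stand-ins invented for illustration, and a `std::sync::Mutex` replaces whatever critical-section primitive the real driver uses. The interrupt side records a condition and wakes the one stored waker; the top-level future then works out which condition fired.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::{Arc, Mutex};
use std::task::{Context, Poll, Wake, Waker};

/// Stand-in error type; the real `LpspiError` variants are not shown in the diff.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum LpspiError {
    ReceiveOverflow,
}

/// Shared between the (simulated) ISR and the future: flags plus ONE waker.
#[derive(Default)]
pub struct SharedState {
    pub transfer_complete: bool,
    pub error_caught: Option<LpspiError>,
    waker: Option<Waker>,
}

impl SharedState {
    /// Called by the ISR after it has recorded any condition.
    pub fn wake(&mut self) {
        if let Some(waker) = self.waker.take() {
            waker.wake();
        }
    }
}

/// Top-level future: on every poll it inspects all conditions itself.
pub struct TransferDone<'a>(pub &'a Mutex<SharedState>);

impl Future for TransferDone<'_> {
    type Output = Result<(), LpspiError>;

    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<Self::Output> {
        let mut state = self.0.lock().unwrap();
        if let Some(err) = state.error_caught.take() {
            return Poll::Ready(Err(err));
        }
        if state.transfer_complete {
            return Poll::Ready(Ok(()));
        }
        // Nothing happened yet: remember who to wake, whatever the condition.
        state.waker = Some(cx.waker().clone());
        Poll::Pending
    }
}

/// Test helper that counts how often it was woken.
pub struct CountingWaker(pub AtomicUsize);

impl Wake for CountingWaker {
    fn wake(self: Arc<Self>) {
        self.0.fetch_add(1, Ordering::SeqCst);
    }
}
```

The sketch only holds if the driver really hands out a single top-level future at a time, which is the guarantee questioned in this thread.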

Finomnis · Dec 17, 2023

That might actually be true. Haven't thought of it like this yet. It must be guaranteed though, otherwise this falls back to busy waiting.

Finomnis · 2023-12-17T17:32:49Z

> I may be in the minority about the "give users low-level control" design decisions

I personally don't like this, because it introduces ways for the user to break the internal state of the driver. I kind of consider this a functional unsoundness. I found the existing driver very hard to use because it requires me to understand the peripheral, which I as a user don't want to have to understand. I felt I would have to write a driver for the driver to actually use it in a project.

I would personally rather go the other route: close it off completely, then add features step by step as required. But that's just my personal opinion.

Finomnis · 2023-12-17T17:47:38Z

> Here's a rough outline of what it might look like:

Let me think about this. You might be onto something.

Be aware that:

not every u8 transfer supports dma. Full dma (read and write together) is only possible if the read and write buffers have the same alignment relative to a u32 boundary. Otherwise we always have to decide whether to dma the read or the write; both aren't possible when the buffers are misaligned relative to each other.
there will always be a benefit from using interrupts, even in the dma case. For error handling and for the flush call.
the few unaligned bytes before and after a transfer also benefit from interrupts, and would still be required on the dma version.
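To make the alignment constraint concrete, here is a small host-side sketch. It assumes that "same u32 alignment" means both buffers start at the same byte offset within a 4-byte word; `u32_phase`, `plan`, `split_for_dma`, `DmaPlan`, and `Aligned` are hypothetical names, and the actual DMA rules of the hardware may be stricter.

```rust
/// Byte offset of the buffer start within its enclosing 4-byte word (0..=3).
pub fn u32_phase(buf: &[u8]) -> usize {
    buf.as_ptr() as usize % 4
}

#[derive(Debug, PartialEq, Eq)]
pub enum DmaPlan {
    /// Both buffers share a phase, so read and write can both use DMA.
    Both,
    /// Phases differ: only one direction can use DMA at a time.
    OneDirectionOnly,
}

/// Decide whether a full-duplex transfer can DMA both directions.
pub fn plan(read: &[u8], write: &[u8]) -> DmaPlan {
    if u32_phase(read) == u32_phase(write) {
        DmaPlan::Both
    } else {
        DmaPlan::OneDirectionOnly
    }
}

/// Split a byte buffer into (unaligned head, u32-aligned body, leftover tail).
/// Head and tail are the "few unaligned bytes" that still need interrupts
/// or polling; only the body is a candidate for word-wise DMA.
pub fn split_for_dma(buf: &[u8]) -> (&[u8], &[u8], &[u8]) {
    let head_len = ((4 - u32_phase(buf)) % 4).min(buf.len());
    let (head, rest) = buf.split_at(head_len);
    let body_len = rest.len() / 4 * 4;
    let (body, tail) = rest.split_at(body_len);
    (head, body, tail)
}

/// Helper for experiments: a byte array guaranteed to start on a u32 boundary.
#[repr(align(4))]
pub struct Aligned(pub [u8; 16]);
```

For a 14-byte slice that starts one byte past a word boundary, this yields a 3-byte head, an 8-byte body, and a 3-byte tail.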

teburd · 2023-12-17T17:53:06Z

I think it’s cool that you are adding support for eh 1.0. I’d note that interrupts are not free of cost and sometimes it really is faster to poll. I saw this in a few spi peripherals in Zephyr that I’ve modified/reviewed. At times interrupts can cause slowdowns, especially when doing XIP or on cores with longer pipelines and icache/dcache involved, like the M7.

Finomnis · 2023-12-17T18:23:04Z

That might be true!

If we

drop dma support for u8/u16
assume busy waiting for hal-async's flush call is acceptable

Then we can go with @mciantyre's architecture proposal.

Finomnis · 2023-12-17T18:40:58Z

The biggest open question I still have is error handling. But I have to clarify:

errors can only occur if NOSTALL is activated. Otherwise it's guaranteed that the overflow errors can never occur. So the entire error discussion is irrelevant if we don't allow this flag to be set. I would like to keep the possibility, though.
without interrupts, errors would have to be polled repeatedly. According to the RM, if an error occurs, the transmission would have to be aborted immediately. If we relax this requirement for the dma case, this might be possible.

mciantyre · 2023-12-17T18:59:33Z

> I found the existing driver very hard to use because it requires me to understand the peripheral, which I as a user don't want to have to understand.

We support embedded-hal for hiding the peripheral's complexity. We still need to give the user APIs for driver configurations, since embedded-hal doesn't help us here. But after you configure your driver and pass ownership into your embedded-hal-using component, it should become difficult to break the driver's internal state.

I'm sure there's use-cases I'm not considering, but that's the thinking for giving users the lower-level control.

> there will always be a benefit from using interrupts, even in the dma case. For error handling and for the flush call.

We could require the user to transform InterruptTransmit into DmaTransmit, for example. Once DmaTransmit takes ownership of InterruptTransmit, it could select the I/O behaviors, wait for flush, check errors.

> I’d note that interrupts are not free of cost and sometimes it really is faster to poll.

Good point. We're free to spin in async code if we find it profitable. The cost is that we block the executor and other tasks while we spin.
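One way to picture "spin when profitable" is a future that busy-polls a condition for a bounded number of iterations and only then yields. This is a sketch under assumptions: `SpinThenWait`, its `spin_budget`, and `WakeCounter` are hypothetical names, and the fallback path simply re-wakes itself, where a real driver would enable the relevant interrupt and store the waker instead.

```rust
use std::future::Future;
use std::pin::Pin;
use std::sync::atomic::{AtomicUsize, Ordering};
use std::sync::Arc;
use std::task::{Context, Poll, Wake, Waker};

/// Polls `ready` up to `spin_budget` times per poll before giving up the CPU.
pub struct SpinThenWait<F> {
    ready: F,
    spin_budget: u32,
}

impl<F: FnMut() -> bool + Unpin> SpinThenWait<F> {
    pub fn new(ready: F, spin_budget: u32) -> Self {
        Self { ready, spin_budget }
    }
}

impl<F: FnMut() -> bool + Unpin> Future for SpinThenWait<F> {
    type Output = ();

    fn poll(self: Pin<&mut Self>, cx: &mut Context<'_>) -> Poll<()> {
        let this = self.get_mut();
        // Fast path: the executor and all other tasks are blocked while we spin.
        for _ in 0..this.spin_budget {
            if (this.ready)() {
                return Poll::Ready(());
            }
            std::hint::spin_loop();
        }
        // Slow path (placeholder): ask to be polled again and let others run.
        cx.waker().wake_by_ref();
        Poll::Pending
    }
}

/// Test helper that counts wake-ups.
pub struct WakeCounter(pub AtomicUsize);

impl Wake for WakeCounter {
    fn wake(self: Arc<Self>) {
        self.0.fetch_add(1, Ordering::SeqCst);
    }
}
```

The spin budget is the tuning knob: zero always yields, while a large budget trades executor latency for fewer wake-ups.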

Finomnis · 2023-12-19T08:03:23Z

> But after you configure your driver and pass ownership into your embedded-hal-using component, it should become difficult to break the driver's internal state.

I agree. So we should maybe look into something like a builder pattern?
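A rough sketch of what a builder could look like, purely as an illustration: `LpspiBuilder`, `Lpspi`, `Mode`, `ConfigError`, the default values, and every method name are hypothetical and not the crate's API. Only the NOSTALL option is taken from this discussion. The point is that all configuration happens before `build()`, and the finished driver exposes no setters that could disturb its internal state.

```rust
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Mode {
    Mode0,
    Mode3,
}

#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum ConfigError {
    ZeroClock,
}

/// Collects configuration; consumed by `build()`.
pub struct LpspiBuilder {
    clock_hz: u32,
    mode: Mode,
    no_stall: bool,
}

impl LpspiBuilder {
    /// Defaults are arbitrary placeholders for this sketch.
    pub fn new() -> Self {
        Self { clock_hz: 1_000_000, mode: Mode::Mode0, no_stall: false }
    }

    pub fn clock_hz(mut self, hz: u32) -> Self {
        self.clock_hz = hz;
        self
    }

    pub fn mode(mut self, mode: Mode) -> Self {
        self.mode = mode;
        self
    }

    /// Opt in to NOSTALL, which is what makes overflow errors possible.
    pub fn no_stall(mut self, enabled: bool) -> Self {
        self.no_stall = enabled;
        self
    }

    /// Validate once, then hand out a driver with a frozen configuration.
    pub fn build(self) -> Result<Lpspi, ConfigError> {
        if self.clock_hz == 0 {
            return Err(ConfigError::ZeroClock);
        }
        Ok(Lpspi { clock_hz: self.clock_hz, mode: self.mode, no_stall: self.no_stall })
    }
}

/// The configured driver: private fields, read-only accessors, no setters.
pub struct Lpspi {
    clock_hz: u32,
    mode: Mode,
    no_stall: bool,
}

impl Lpspi {
    pub fn clock_hz(&self) -> u32 {
        self.clock_hz
    }

    pub fn mode(&self) -> Mode {
        self.mode
    }

    pub fn no_stall(&self) -> bool {
        self.no_stall
    }
}
```

Usage would read as `LpspiBuilder::new().clock_hz(4_000_000).mode(Mode::Mode3).build()`, after which the value can be passed to an embedded-hal-using component.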

Finomnis · 2023-12-19T19:27:41Z

@mciantyre I might have to take a step back over the next couple of days/weeks. If someone has an idea, feel free to use my code as a base/reference. There are a couple of things in there that took a while to figure out, like the proper settings of the timing registers and the clock multiplier calculation, which I think improve on the original version in a couple of details.

If I do continue (which I will if nobody else wants to take over), I might start incorporating a couple of improvements into the existing driver, if you think it would be better to start with the existing one as a low-level backend. We could then add a high-level driver that takes ownership of the low-level one and does the embedded-hal-async stuff. @mciantyre, I would need some guidance on the naming of things, though, because I feel like both the low-level and the high-level driver deserve the name Lpspi.

Finomnis · 2023-12-19T19:55:08Z

Moved discussions over to #147.

Finomnis · 2024-03-09T20:26:45Z

src/common/lpspi/bus.rs

+
+    /// Returns whether or not the busy flag is set.
+    fn busy(&self) -> bool {
+        ral::read_reg!(ral::lpspi, self.lpspi(), SR, MBF == MBF_1)


Turns out this is buggy on the imxrt1062 (or maybe on all chips?). Writing to the FIFO does not set MBF immediately, so when compiled with optimizations, busy() has a chance to report "not busy" although a transfer is in progress.

Finomnis added 22 commits November 17, 2023 22:48

Initial ideas

7724d42

More ideas

42422ef

More work

854f727

Dummy implement SPI traits

9a65d36

Minor comment

d7bc1b5

More work; remove device, as it seems to be best practices to leave t…

47ff70a

…hat to the user

Add some comments

c4d938d

Bla

07a8343

Make DMA compile time configurable

b2619de

Small fixes

bfc7619

Refactor dma stuff int lpspi/dma.rs

9fa738b

More work

95109a9

Add word_types

a74fc69

Bla.

e6ae197

Add error handling to SPI bus

c91cfb3

Add data buffer tests

b27d56e

Implement blocking transfer

8711814

Refactoring to enable in-place transfer

ddef8d8

Remove unused variable

210e86f

First time compiling! Not working yet, though.

af51c0a

More rework; split data into dma and non-dma parts. To be determined …

5275bdc

…if byte order is correct.

Make dma config a member again

6f54cc9

mciantyre mentioned this pull request Nov 25, 2023

Road to embedded-hal 1.0 #142

Open

6 tasks

Finomnis added 2 commits November 25, 2023 23:19

Fix example and board

26482d4

Remove rtic-sync dependency

ae6ee7d

mciantyre self-requested a review November 27, 2023 13:22

Finomnis added 4 commits December 4, 2023 12:34

Fix lpspi clock config

ef4d4bc

Add comment to set_clock_hz

fd90b71

Partial rewrite

844d1c4

Remove obsolete bat script

c426668

Finomnis added 16 commits December 15, 2023 18:03

Update cargo.toml

7c40563

Fix tests

fe44f27

Refactor transfer_actions

88a5d60

Attempt to add read_single_word

64ff0c9

Fix read

3027e05

Add u32 stream; finish read part

9e5a427

Redistribute unsafe tags

1a2caab

Fix cleanup procedure

2ffab65

Remove finished TODO

86ef4ed

Remove lpspi_old driver

8bf18b3

Remove unnecessary pubs

7492ba2

Add TODO comments

b75ede3

Simplify read part

9cbd05e

Prepare write DMA

c70ea1a

Add TODO

0c5dcae

Adjust visibility of DMA mappings

0cfe3a7

mciantyre reviewed Dec 17, 2023

Check in latest example version, does not work yet.

649748d

mciantyre mentioned this pull request Dec 26, 2023

LPSPI Rework Open Discussion Points #147

Open

11 tasks
