SocketWrapper MbedClient debugging readSocket #912

JAndrassy · 2024-06-29T13:23:02Z

In connect we have to delete the socket object, because all other functions test it for null and then use mutex which is null, because configureSocket was not invoked. And there are too few sockets in total.
In readSocket the condition of the inner loop was always true and the inner loop never exited to wait for event or timeout on event. The inner loop only paused for yield(). With the PR if no data are available the inner loop does break to outer loop where the thread waits for event or timeout.

~~3) I think we have to do mutex in write too~~

andreagilardoni · 2024-07-01T09:15:37Z

Hi @JAndrassy, I agree with the changes you made for point 1.
For point 2 are you trying to fix a particular issue that is affecting you? Apart from missing mutex locks and unlocks and the nice code reordering, I don't see any huge changes in code that may impact functionality on MbedClient.
For point 3, I expect it to work without the need of mutex, since I think tx and rx buffer are separated, I will take a look at the lower level implementation in order to be sure

JAndrassy · 2024-07-01T09:38:45Z

@andreagilardoni so should we add everywhere to if (sock == NULL), && mutex == NULL?

https://os.mbed.com/docs/mbed-os/v6.16/mbed-os-api-doxy/class_socket.html#abc36eff670a3bee145f1c629a7eb7ee3
has

If connect() fails it is recommended to close the Socket and create a new one before attempting to reconnect.

I didn't look in the source but I think the unconnected socket still counts as used one from socket-max

andreagilardoni · 2024-07-01T11:55:44Z

I think the only method missing the check for sock==nullptr is read(), I think it is enough to return -1 there. I think it is just enough to check for sock, since mutex should be != nullptr when the sock is != nullptr after the changes you proposed here.

The changes on lines 123:124 are necessary.
The changes on line 220, 224 I don't think that may affect anything except performances.

I only need to understand why you performed changes on readSocket(), since to me everything seems to be a rearrangement on if statements (with some additional actions on mutex). Did you experience any kind of issue that I can try to replicate?

JAndrassy · 2024-07-01T12:00:38Z

changes on readSocket()

continue change to brreak if no data are available to read as I write in the description of the PR

JAndrassy · 2024-07-02T04:23:53Z

changes:

for unsuccessful connect socket->close (it was delete sock in the first version of the PR)
using mutex == null as a check for a not configured client

libraries/SocketWrapper/src/MbedClient.cpp

andreagilardoni · 2024-07-02T08:12:52Z

@JAndrassy after looking at that everything seems ok, I need to try to run some examples and then I think we can merge this PR. I like how you reorganized the thread function, thanks for your contribution!

JAndrassy · 2024-07-02T19:16:14Z

There is one more problem with the readSocket thread, but if I patch the solution in, it screams for a rewrite of the whole inner loop.

the problem is that on err < 0 the thread ends. it can be just that the peer closed the connection. on a new connect without stop(), the connection is established, but the readSocket thread is not started. It is not possible to restart a thread, so it should not end until sock is null.

andreagilardoni · 2024-07-03T11:46:51Z

I quickly tested this PR and I don't see any issues with this. About the other issue you are describing, what err < 0 are you talking about?

JAndrassy · 2024-07-03T11:51:15Z

sorry ret not err

      if (ret < 0 && ret != NSAPI_ERROR_WOULD_BLOCK) {
        mutex->unlock();
        goto cleanup;
      }

andreagilardoni · 2024-07-03T13:58:36Z

If the peer closed the connection than, I think, it is expected that the socket has to be closed and needs to be restarted. In connect I can see that this check is performed

ArduinoCore-mbed/libraries/SocketWrapper/src/MbedClient.cpp

Lines 84 to 89 in 2ece915

    
           if (sock && reader_th) { 
        
             // trying to reuse a connection, let's call stop() to cleanup the state 
        
             char c; 
        
             if (sock->recv(&c, 1) < 0) { 
        
               stop(); 
        
             }

Did I get your point?

JAndrassy · 2024-07-03T14:01:31Z

then it is ok

maidnl

I tried to analyse and test the proposed changes and I have a few comments (I hope you can find them meaningful).

One thing I found really confusing is the use of the _status flag which is set / reset in a lot of different places (and this make difficult to understand the underling logic).
In the readSocket() thread is appears that is not really necessary to set _status=true at the end of each innermost do-while cicle.
There is no need to set _status=true after the call of the configureSocket() function since is the configureSocket() function itself sets _status=true;. This happens twice: in the connect() and the connectSSL() function.
As additional simplification I would reset _status to false at the beginning of those 2 functions (if all goes well the configureSocket() function will set it to true and in case of problem it will remain set to false removing the need of setting if to false in case of errors). This removes the need to set _status to false in case of problem and ensure it is false if something wrong happened.

About the mutex lock/unlock in the write() function, my understanding is that the mutex is used to prevent access to the rxBuffer from different threads, so it appears to be not necessary here. One point that is certainly wrong is that the changes removed the check about the sock pointer: it is necessary to reintroduce here the check

if (sock == nullptr) {
   return 0;
}

this is very important because a user can call write() with an invalid sock and this would crash the program.

Since the mutex is used to prevent "common" access to rxBuffer I noticed that the peek() function that uses rxBuffer is unprotected: I would add the mutex lock/unlock mechanism to the peek() function.

A possible improvement is related to the setSocket() function: this function is called when client is create by a server. However the server allocates a Socket only if there is an incoming request, in case no request is made the server will always call setSocket(nullptr) (please check the EthernetServer::available() function in the EthernetServer.cpp file and verify if my understanding is correct).
In case like this it is probably pointless to call configureSocket() so I would add a check and execute the body of the function only if _sock is different from nullptr.

My last remark is more a doubt: in the connect() and connectSSL() function almost at the end of the function in case ret is not 1 it has been added the statement sock->close(), but this only closes the Socket. Would not be better to call directly the MbedClient stop() function? This would reset all the variables held by the client and not only "close the socket".

libraries/SocketWrapper/src/MbedClient.cpp

JAndrassy · 2024-07-24T11:58:07Z

_status only exists for status(). there is no other simple way. It has no internal use in MbedClient. This PR is not about _status.

I avoid doing unnecessary changes in my PR. Where would I stop if I begin to cleanup the code. So I don't even start. So the superfluous _status=true stay there for this PR.

The readSocket() function runs in a separate thread so it can't switch _status to false if it isn't definitive.

I can remove the lock from write. (btw if socket is null, then mutex is null too)

yes. peek() needs the lock. I add it.

My last remark is more a doubt: in the connect() and connectSSL() function almost at the end of the function in case ret is not 1 it has been added the statement sock->close(), but this only closes the Socket. Would not be better to call directly the MbedClient stop() function? This would reset all the variables held by the client and not only "close the socket".

as I understand it, the idea is that the socket can be reused for next try to connect, that is why it is not deleted. calling stop() would delete it. all other fields are not initialized because configureSocket didn't run

JAndrassy · 2024-07-25T07:10:46Z

I can remove the lock from write. (btw if socket is null, then mutex is null too)
yes. peek() needs the lock. I add it.

I made these changes ^^^

facchinm · 2024-09-05T13:52:26Z

Maybe this patch could help fixing #937 ?

schnoberts1 · 2024-09-09T17:57:30Z

libraries/SocketWrapper/src/MbedClient.cpp

@@ -22,28 +22,30 @@ void arduino::MbedClient::readSocket() {
    int ret = NSAPI_ERROR_WOULD_BLOCK;
    do {
      mutex->lock();
-      if (sock != nullptr && rxBuffer.availableForStore() == 0) {
+      if (sock == nullptr) {


Is this subject to the same issues as "Another racy example" here: https://queue.acm.org/detail.cfm?id=2088916? sock is loop invariant but changed in another thread. mutex is the same isn't it?

outdated

facchinm requested review from andreagilardoni and pennam July 1, 2024 07:54

JAndrassy force-pushed the mbedclient_readsocket_fixes branch 2 times, most recently from 00b5add to a5fc13f Compare July 2, 2024 04:20

JAndrassy force-pushed the mbedclient_readsocket_fixes branch 2 times, most recently from 444339a to 00bc011 Compare July 2, 2024 05:07

andreagilardoni reviewed Jul 2, 2024

View reviewed changes

libraries/SocketWrapper/src/MbedClient.cpp Outdated Show resolved Hide resolved

andreagilardoni reviewed Jul 2, 2024

View reviewed changes

libraries/SocketWrapper/src/MbedClient.cpp Outdated Show resolved Hide resolved

andreagilardoni reviewed Jul 2, 2024

View reviewed changes

libraries/SocketWrapper/src/MbedClient.cpp Outdated Show resolved Hide resolved

JAndrassy force-pushed the mbedclient_readsocket_fixes branch from 00bc011 to a40c63f Compare July 2, 2024 10:37

andreagilardoni previously approved these changes Jul 3, 2024

View reviewed changes

maidnl requested changes Jul 24, 2024

View reviewed changes

libraries/SocketWrapper/src/MbedClient.cpp Show resolved Hide resolved

JAndrassy force-pushed the mbedclient_readsocket_fixes branch from a40c63f to dbe0b07 Compare July 24, 2024 17:59

SocketWrapper MbedClient debugging readSocket

3428d75

JAndrassy force-pushed the mbedclient_readsocket_fixes branch from dbe0b07 to 3428d75 Compare July 24, 2024 18:04

schnoberts1 reviewed Sep 9, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

SocketWrapper MbedClient debugging readSocket #912

SocketWrapper MbedClient debugging readSocket #912

JAndrassy commented Jun 29, 2024 •

edited

Loading

andreagilardoni commented Jul 1, 2024

JAndrassy commented Jul 1, 2024 •

edited

Loading

andreagilardoni commented Jul 1, 2024

JAndrassy commented Jul 1, 2024 •

edited

Loading

JAndrassy commented Jul 2, 2024 •

edited

Loading

andreagilardoni commented Jul 2, 2024

JAndrassy commented Jul 2, 2024

andreagilardoni commented Jul 3, 2024

JAndrassy commented Jul 3, 2024

andreagilardoni commented Jul 3, 2024

JAndrassy commented Jul 3, 2024

maidnl left a comment

JAndrassy commented Jul 24, 2024

JAndrassy commented Jul 25, 2024

facchinm commented Sep 5, 2024

schnoberts1 Sep 9, 2024 •

edited

Loading

SocketWrapper MbedClient debugging readSocket #912

Are you sure you want to change the base?

SocketWrapper MbedClient debugging readSocket #912

Conversation

JAndrassy commented Jun 29, 2024 • edited Loading

andreagilardoni commented Jul 1, 2024

JAndrassy commented Jul 1, 2024 • edited Loading

andreagilardoni commented Jul 1, 2024

JAndrassy commented Jul 1, 2024 • edited Loading

JAndrassy commented Jul 2, 2024 • edited Loading

andreagilardoni commented Jul 2, 2024

JAndrassy commented Jul 2, 2024

andreagilardoni commented Jul 3, 2024

JAndrassy commented Jul 3, 2024

andreagilardoni commented Jul 3, 2024

JAndrassy commented Jul 3, 2024

maidnl left a comment

Choose a reason for hiding this comment

JAndrassy commented Jul 24, 2024

JAndrassy commented Jul 25, 2024

facchinm commented Sep 5, 2024

schnoberts1 Sep 9, 2024 • edited Loading

Choose a reason for hiding this comment

JAndrassy commented Jun 29, 2024 •

edited

Loading

JAndrassy commented Jul 1, 2024 •

edited

Loading

JAndrassy commented Jul 1, 2024 •

edited

Loading

JAndrassy commented Jul 2, 2024 •

edited

Loading

schnoberts1 Sep 9, 2024 •

edited

Loading