Introduce socket service API #66758

jukkar · 2023-12-21T15:03:56Z

This PR introduces a socket service API. It can be used to create a single socket listener that calls a callback when any socket in a registered set has activity. This saves memory: instead of creating multiple threads for a set of sockets, one main thread can listen to all of the needed sockets. The concept is similar to how inetd works in Linux, although it is implemented differently.
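To make the idea concrete, here is a minimal sketch of the "one listener, many sockets" pattern using plain POSIX poll(). It is not the Zephyr socket service API: `struct listener`, `listener_add()`, `listener_poll_once()` and `count_and_read()` are hypothetical names chosen for illustration, and the fixed `MAX_FDS` limit is an assumption.

```c
#include <poll.h>
#include <stddef.h>
#include <unistd.h>

/* Hypothetical names for illustration; NOT the Zephyr socket service API. */
#define MAX_FDS 4

typedef void (*activity_cb)(int fd, void *user_data);

struct listener {
	struct pollfd fds[MAX_FDS];
	activity_cb cbs[MAX_FDS];
	void *user_data[MAX_FDS];
	int count;
};

/* Register a descriptor and the callback to run when it becomes readable. */
static int listener_add(struct listener *l, int fd, activity_cb cb,
			void *user_data)
{
	if (l->count >= MAX_FDS) {
		return -1;
	}

	l->fds[l->count].fd = fd;
	l->fds[l->count].events = POLLIN;
	l->fds[l->count].revents = 0;
	l->cbs[l->count] = cb;
	l->user_data[l->count] = user_data;
	l->count++;

	return 0;
}

/* One pass of the listener: wait for activity, then call each ready callback.
 * Returns the number of callbacks invoked, or -1 if poll() fails.
 */
static int listener_poll_once(struct listener *l, int timeout_ms)
{
	int called = 0;

	if (poll(l->fds, (nfds_t)l->count, timeout_ms) < 0) {
		return -1;
	}

	for (int i = 0; i < l->count; i++) {
		if (l->fds[i].revents & POLLIN) {
			l->cbs[i](l->fds[i].fd, l->user_data[i]);
			called++;
		}
	}

	return called;
}

/* Example callback: consume one byte and count the invocation. */
static void count_and_read(int fd, void *user_data)
{
	char c;

	if (read(fd, &c, 1) == 1) {
		(*(int *)user_data)++;
	}
}
```

A single thread calling `listener_poll_once()` in a loop can serve every registered descriptor, which is where the memory saving over one thread per socket comes from.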

jukkar · 2023-12-21T15:08:47Z

This is still a draft; we can change the APIs, etc., or discard the PR if it is not needed. Anyway, please take a look.

subsys/net/lib/sockets/Kconfig

subsys/net/lib/sockets/sockets_service.c

include/zephyr/net/socket_service.h

rlubos · 2023-12-21T15:44:06Z

include/zephyr/net/socket_service.h

+ * User should create needed sockets and then setup the poll struct and
+ * then register the sockets to be monitored at runtime.
+ */
+struct net_socket_service_desc {


I wonder if we really need this static configuration for the socket service? It seems a bit problematic, at least for my use case, a DHCP server. Here, ideally, I'd need a single socket for each interface that wants to support the DHCP server functionality. But I don't think we have a way to calculate the number of interfaces in the system at build time to supply the static configuration macros. Purely runtime registration would be less cumbersome in such a case.

I originally experimented with runtime configuration, but the API needs some housekeeping data, and allocating that at runtime requires malloc etc. There is a call to malloc in this PR too, but hopefully we can get rid of it.

We could place struct zsock_pollfd into the DHCP config of the network interface; that way the information this API needs would already be part of the network interface.

Devicetree could help here, just sayin 🤷🏼‍♂️

rlubos · 2023-12-21T15:46:48Z

subsys/net/lib/sockets/sockets_service.c

+		return -ENOENT;
+	}
+
+	if (svc->work_q != NULL) {


Shouldn't we call the registered callback directly from the services thread? I see a potential race here - we do submit the work to execute, but don't read the data (in case POLLIN was monitored) from the socket until the work executes. If the workqueue thread has a lower priority than the services thread, we may end up busy looping (poll() will keep reporting POLLIN because the work item has not had a chance to execute).
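The busy-loop concern follows from poll() being level-triggered: it reports POLLIN on every call until the pending data is actually read. The sketch below shows this with plain POSIX calls on any readable descriptor; `count_ready_polls()` and `drain_one()` are hypothetical helper names, and `drain_one()` merely stands in for the deferred handler.

```c
#include <poll.h>
#include <unistd.h>

/* Count how many of max_tries consecutive non-blocking poll() calls report
 * POLLIN on fd. Nothing is read here, so a pending byte stays pending.
 */
static int count_ready_polls(int fd, int max_tries)
{
	int ready = 0;

	for (int i = 0; i < max_tries; i++) {
		struct pollfd pfd = { .fd = fd, .events = POLLIN };

		if (poll(&pfd, 1, 0) == 1 && (pfd.revents & POLLIN)) {
			ready++;
		}
	}

	return ready;
}

/* Stand-in for the deferred handler: consume the pending byte. */
static int drain_one(int fd)
{
	char c;

	return (int)read(fd, &c, 1);
}
```

With one byte pending, every call in `count_ready_polls()` reports readiness; only after `drain_one()` runs does the count drop to zero. If the handler runs in a lower-priority thread, the polling thread can spin in exactly this state.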

Shouldn't we call the registered callback directly from the services thread?

I originally planned to do that, but it felt more natural to use the existing APIs and resources in the system, and a work queue achieves the same thing. But we can certainly change this; it only requires changing one function call.

When using workqueue(s), we can serve (at least theoretically) multiple socket services at the "same" time, and the priority of the thread using the services API determines which one gets called first. If the callback is called directly from the services thread, that option is not available.

I see a potential race here - we do submit the work to execute, but don't read the data (in case POLLIN was monitored) from the socket until the work executes.

Note that k_work_submit() does nothing if the work is already pending. It is indeed possible that we might busy-loop in the service thread if the service user does not "run" fast enough. I tried to mitigate this problem by setting the service thread priority to the lowest application thread priority.

There seems to be an issue when calling the "callback" via the workqueue: depending on the thread priorities, we may start to loop through poll() if the callback does not get to run. So it might be better to call the callback directly (synchronously) without using the workqueue. I will investigate this more.

Left the async call in place but fixed the code so that the callback work is only called once. Also fixed the tests that were not working properly with native_sim.
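One common way to get "queue the handler only once" behaviour with plain poll() is to mask the descriptor while its handler is pending, since poll() ignores entries whose fd is negative. The sketch below illustrates that general technique only; it is not necessarily how this PR fixes the problem, and `struct deferred_entry`, `poll_and_queue()` and `run_handler()` are hypothetical names.

```c
#include <poll.h>
#include <stdbool.h>
#include <unistd.h>

struct deferred_entry {
	struct pollfd pfd;
	bool pending; /* handler has been queued but has not run yet */
};

/* Poll once without blocking. If the descriptor is readable, mark the
 * handler as pending and mask the descriptor so that later polls skip it.
 * Returns true only when the handler was newly queued.
 */
static bool poll_and_queue(struct deferred_entry *e)
{
	if (poll(&e->pfd, 1, 0) == 1 && (e->pfd.revents & POLLIN)) {
		e->pending = true;
		e->pfd.fd = ~e->pfd.fd; /* negative for any valid fd, so ignored */
		return true;
	}

	return false;
}

/* Stand-in for the deferred handler: consume one byte, then unmask. */
static void run_handler(struct deferred_entry *e)
{
	char c;
	int fd = ~e->pfd.fd; /* recover the original descriptor */
	ssize_t n = read(fd, &c, 1);

	(void)n; /* a real handler would act on the data or the error */
	e->pfd.fd = fd;
	e->pending = false;
}
```

While the entry is masked, the polling thread blocks or times out normally instead of spinning on a descriptor whose data has not been consumed yet.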

To be honest, I'm not personally convinced about this async approach. Having multiple work queues kind of defeats the purpose the socket services were created for (saving resources), so most of the handlers will likely end up on the system workqueue anyway. Plus, having the handlers called directly from the services thread allows reusing the services stack and fine-tuning its size for the worst-case scenario (I imagine all of the registered services would need to allocate buffers for socket send/recv calls, and a common stack area seems like a good place for this instead of using static/heap/custom stack memory).

But it's just my personal opinion, no strong push from my side to change it, if that's the preferred way.

Personally I prefer the async way of doing things, although it certainly complicates matters. But I can see that the sync way is also possible here. I think we can support both methods quite easily and let the user select how the callback should be called. I will propose a new version that supports both methods.
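As a rough illustration of what "let the user select how the callback should be called" could look like, here is a sketch in plain C with a per-service flag. All names (`struct svc`, `svc_dispatch()`, `worker_run()`, `bump()`) are hypothetical and the fixed-size queue is an assumption; `worker_run()` stands in for a worker thread, and nothing here is taken from the PR's actual code.

```c
#include <stdbool.h>
#include <stddef.h>

typedef void (*svc_cb)(void *ctx);

struct svc {
	svc_cb cb;
	void *ctx;
	bool sync;   /* true: run in the listener; false: defer to a worker */
	bool queued; /* only meaningful for async services */
};

#define QUEUE_LEN 8

static struct svc *queue[QUEUE_LEN];
static size_t queue_count;

/* Called by the listener when a service's socket has activity. */
static void svc_dispatch(struct svc *s)
{
	if (s->sync) {
		/* Synchronous: the callback runs on the listener's stack. */
		s->cb(s->ctx);
	} else if (!s->queued && queue_count < QUEUE_LEN) {
		/* Asynchronous: queue at most once until the worker runs it. */
		s->queued = true;
		queue[queue_count++] = s;
	}
}

/* Stand-in for the worker thread: run everything queued so far.
 * Returns the number of callbacks that were run.
 */
static size_t worker_run(void)
{
	size_t n = queue_count;

	for (size_t i = 0; i < n; i++) {
		queue[i]->queued = false;
		queue[i]->cb(queue[i]->ctx);
	}
	queue_count = 0;

	return n;
}

/* Example callback: increment the counter passed as context. */
static void bump(void *ctx)
{
	(*(int *)ctx)++;
}
```

The trade-off discussed in the thread shows up directly: the sync path shares the listener's stack and needs no extra thread, while the async path lets handlers run at their own priority at the cost of a queue and a worker.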

Yes - sync should be a relatively trivial wrapper around async ... or at least I thought it was... 🤔

cfriedt · 2023-12-21T19:18:28Z

Looks reasonable. It would be great to get a compile-time const number for network interfaces. Devicetree can fix that 😅

jukkar · 2023-12-22T07:07:10Z

It would be great to get a compile-time const number for network interfaces. Devicetree can fix that

Indeed, that info would help a lot in various parts of the network stack. That said, this PR should not depend on the number of network interfaces.

jukkar · 2023-12-22T09:12:30Z

Updates:

removed malloc and allocated a static array of pollfd with CONFIG_NET_SOCKETS_POLL_MAX entries
tweak the tests so that they are not run in native_sim board (because eventfd does not work there)

cfriedt · 2023-12-23T10:26:37Z

tweak the tests so that they are not run in native_sim board (because eventfd does not work there)

Ugh.. that's annoying. I kept getting reassurances that posix arch and posix api were not mutually exclusive, but I often see that is still the case.

jukkar · 2023-12-27T13:45:59Z

Ugh.. that's annoying. I kept getting reassurances that posix arch and posix api were not mutually exclusive, but I often see that is still the case.

Indeed, that is bad. If I try to run the PR sample or the tests in native_sim/posix boards, they just "hang" and do not proceed. Pressing ctrl-c does not terminate the zephyr.exe even though the control is returned to user and the process looks like it terminated.

jukkar · 2024-01-03T12:19:23Z

Changed the sample to let CI pass.
Rebased on top of latest main

jukkar · 2024-01-03T16:25:24Z

re-enabled native_sim after finding out why the test was hanging

kartben

Looks good from docs standpoint, thanks!

samples/net/sockets/echo_service/prj.conf

subsys/net/lib/sockets/sockets_service.c

rlubos

One issue spotted when testing.

subsys/net/lib/sockets/sockets_service.c

The socket service provides functionality similar to what inetd provides in Linux. It listens on user-registered sockets for any activity and then launches a k_work for it. This way each application does not need to create a thread to listen on a blocking socket. Signed-off-by: Jukka Rissanen <[email protected]>

Simple tests that verify that the socket service API works as expected. Signed-off-by: Jukka Rissanen <[email protected]>

The echo-service sample demonstrates how to use the socket service API. Signed-off-by: Jukka Rissanen <[email protected]>

Add the socket service users to the "net sockets" command output. Signed-off-by: Jukka Rissanen <[email protected]>

If CONFIG_POSIX_API is enabled, then the socket.h is found under zephyr/posix/sys/socket.h etc. This allows one to compile the socket test applications without error prints. Signed-off-by: Jukka Rissanen <[email protected]>

gcc prints this warning message: "'strncat' specified bound 1 equals source length [-Wstringop-overflow=]" for line 58, strncat(fd, "C", 1). There was no error in the code, but avoid the warning by not using strncat(). Signed-off-by: Jukka Rissanen <[email protected]>
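For reference, a single character can be appended without strncat() by writing it and the terminator directly. This is a sketch of one possible replacement, not necessarily what the commit does; `append_char()` is a hypothetical helper.

```c
#include <stddef.h>
#include <string.h>

/* Append one character to the NUL-terminated string held in a buffer of
 * buf_size bytes. Returns 0 on success, or -1 if there is no room for the
 * character plus the terminator.
 */
static int append_char(char *buf, size_t buf_size, char c)
{
	size_t len = strlen(buf);

	if (len + 1 >= buf_size) {
		return -1;
	}

	buf[len] = c;
	buf[len + 1] = '\0';

	return 0;
}
```

Because the bound check uses the destination size rather than the source length, this form also avoids the pattern that triggers the -Wstringop-overflow diagnostic.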

jukkar · 2024-01-15T16:56:05Z

New version fixes the issue that Robert was seeing.

rlubos

Thanks

jukkar added the area: Networking label Dec 21, 2023

jukkar requested review from rlubos, cfriedt and pdgendt December 21, 2023 15:03

jukkar changed the title ~~Introduce Socket service API~~ Introduce socket service API Dec 21, 2023

pdgendt reviewed Dec 21, 2023

subsys/net/lib/sockets/Kconfig

rlubos reviewed Dec 21, 2023

jukkar mentioned this pull request Dec 22, 2023

net: context: Fix the v4 mapped address handling in sendto #66872

Merged

jukkar force-pushed the devel/socket-service branch from daf4a3d to 358fc23 Compare December 22, 2023 09:10

jukkar force-pushed the devel/socket-service branch from 358fc23 to e967a88 Compare December 22, 2023 10:52

jukkar force-pushed the devel/socket-service branch from e967a88 to 75147f3 Compare December 27, 2023 13:41

jukkar force-pushed the devel/socket-service branch from 75147f3 to 236e10d Compare January 3, 2024 12:18

jukkar marked this pull request as ready for review January 3, 2024 12:19

zephyrbot added the area: Linker Scripts, area: Samples and area: Sockets labels Jan 3, 2024

zephyrbot requested review from kartben, nashif, ssharks and tbursztyka January 3, 2024 12:20

zephyrbot assigned rlubos Jan 3, 2024

jukkar force-pushed the devel/socket-service branch from de3065a to 37ab5ef Compare January 4, 2024 09:25

kartben previously approved these changes Jan 15, 2024

pdgendt requested changes Jan 15, 2024

samples/net/sockets/echo_service/prj.conf

subsys/net/lib/sockets/sockets_service.c

jukkar dismissed stale reviews from kartben and rlubos via 543bf54 January 15, 2024 13:19

jukkar force-pushed the devel/socket-service branch from 95884e4 to 543bf54 Compare January 15, 2024 13:19

jukkar requested review from pdgendt, kartben and rlubos January 15, 2024 13:38

pdgendt previously approved these changes Jan 15, 2024

kartben previously approved these changes Jan 15, 2024

rlubos approved these changes Jan 15, 2024

rlubos requested changes Jan 15, 2024

subsys/net/lib/sockets/sockets_service.c

jukkar added 6 commits January 15, 2024 18:50

tests: net: socket: service: Add tests for socket service API

16fae69

samples: net: sockets: Add echo-service sample

6b18136

net: shell: Add sockets services prints

310ff08

tests: net: socket: Add correct path to socket.h for POSIX_API

1b1db56

jukkar dismissed stale reviews from kartben and pdgendt via c6684c5 January 15, 2024 16:55

jukkar force-pushed the devel/socket-service branch from 543bf54 to c6684c5 Compare January 15, 2024 16:55

jukkar requested review from rlubos, pdgendt and kartben January 15, 2024 16:56

rlubos approved these changes Jan 15, 2024

cfriedt approved these changes Jan 16, 2024

carlescufi merged commit 6033161 into zephyrproject-rtos:main Jan 16, 2024
22 checks passed

jukkar deleted the devel/socket-service branch January 16, 2024 09:16

JordanYates mentioned this pull request Oct 5, 2024

net: socket_service: remove work_q parameter #79446

Merged
