Launch cockpit-session via socket activation on /run/cockpit/session #16808

Draft

allisonkarlitskaya wants to merge 18 commits into cockpit-project:main from allisonkarlitskaya:cockpit-session-socket

Member

allisonkarlitskaya commented Jan 10, 2022 •

edited by martinpitt

This is a precondition for our goal of removing the static cockpit users.

A nice side effect of this is that we can now connect to unix sockets from cockpitauth, which is useful for https://github.com/allisonkarlitskaya/cockpit-cloud

Fixes #21201

allisonkarlitskaya requested a review from martinpitt

January 10, 2022 18:06

allisonkarlitskaya temporarily deployed to cockpit-dist

January 10, 2022 18:12

Inactive

allisonkarlitskaya force-pushed the cockpit-session-socket branch from aaefe97 to 7ebf9e6 Compare

January 11, 2022 07:56

allisonkarlitskaya mentioned this pull request

use DynamicUser=yes for all cockpit components #16811

Draft

1 task

allisonkarlitskaya added the blocked label

allisonkarlitskaya temporarily deployed to cockpit-dist

January 11, 2022 08:03

Inactive

allisonkarlitskaya mentioned this pull request

ws: add support for connecting to unix sockets #16819

Merged

martinpitt removed their request for review

February 2, 2022 06:41

martinpitt removed the blocked label

martinpitt marked this pull request as draft

February 2, 2022 06:41

allisonkarlitskaya force-pushed the cockpit-session-socket branch from 7ebf9e6 to 1330a6a Compare

February 10, 2022 14:34

allisonkarlitskaya temporarily deployed to cockpit-dist

February 10, 2022 14:40

Inactive

allisonkarlitskaya force-pushed the cockpit-session-socket branch from 1330a6a to ca43353 Compare

February 10, 2022 15:06

allisonkarlitskaya added the blocked label

allisonkarlitskaya temporarily deployed to cockpit-dist

February 10, 2022 15:11

Inactive

KKoukiou added the review-2022-12 label

martinpitt mentioned this pull request

systemd: Use sysusers.d to create ws users #18112

Closed

1 task

martinpitt added needs-rebase and removed blocked review-2022-12 labels

martinpitt force-pushed the cockpit-session-socket branch from ca43353 to 9ed95a5 Compare

January 4, 2023 17:57

martinpitt removed the needs-rebase label

martinpitt force-pushed the cockpit-session-socket branch from 9ed95a5 to 6778062 Compare

January 4, 2023 18:02

martinpitt temporarily deployed to cockpit-dist

January 4, 2023 18:07

— with

GitHub Actions Inactive

martinpitt force-pushed the cockpit-session-socket branch from 6778062 to f1e5a6a Compare

January 4, 2023 18:12

martinpitt temporarily deployed to cockpit-dist

January 4, 2023 18:17

— with

GitHub Actions Inactive

martinpitt added the no-test label

martinpitt force-pushed the cockpit-session-socket branch 2 times, most recently from fc0605b to d986656 Compare

November 12, 2024 15:34

martinpitt force-pushed the cockpit-session-socket branch from d986656 to 1e3eccd Compare

November 12, 2024 16:13

martinpitt marked this pull request as ready for review

November 12, 2024 17:03

allisonkarlitskaya commented

Member Author

allisonkarlitskaya left a comment

Lots of optional minor/style issues here and a couple of fairly serious problems. I trust you to know the difference by now :)

src/common/cockpitframe.c

-                unsigned char *buffer = malloc (size);
+                /* We now have size equal to the number of bytes we need to return. Add an extra byte for a NUL terminator,
+                 * to be friendly to callers for text frames. */
+                unsigned char *buffer = calloc (1, size + 1);

Member Author

allisonkarlitskaya Nov 12, 2024

grumble grumble about the unnecessary zero-initialization. These authorize messages might be large! This is performance critical!!

Member

martinpitt Nov 12, 2024

Yeah, as large as a kilobyte for an authorize message, it'll take dozens of nanoseconds!! 😁

(more seriously: I pondered adding a \0 but the heck with it. All memory ought to be zeroed initially, especially in security critical stuff)

src/session/session-utils.c Outdated

+                char *key_match;
+                int key_match_len = asprintf (&key_match, "\"%s\":\"", key);
+                if (key_match_len < 0)
+                  errx (EX, "out of memory");

Member Author

allisonkarlitskaya Nov 12, 2024

Can asprintf() actually fail? Learn something new every day :)

Member Author

allisonkarlitskaya Nov 12, 2024

Oh. This isn't alloca() backed, but malloc() backed. I think you're missing a free().

Member

martinpitt Nov 12, 2024

Fixed. Also used our shiny asprintfx.
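
For readers following along, here is a minimal sketch of the kind of wrapper this thread is about: format into a malloc()-backed string and abort on failure, so that callers only have to remember the free(). The name `format_or_abort` is hypothetical; this is not the project's asprintfx, just an illustration built on glibc's vasprintf().

```c
#define _GNU_SOURCE /* for vasprintf() */
#include <assert.h>
#include <stdarg.h>
#include <stdio.h>
#include <stdlib.h>

/* Hypothetical helper (not the project's asprintfx): format into a newly
 * malloc()ed string and abort on failure, so callers need no error path.
 * The caller owns the result and must free() it. */
char *
format_or_abort (const char *fmt, ...)
{
  assert (fmt != NULL);

  char *result = NULL;
  va_list ap;
  va_start (ap, fmt);
  int len = vasprintf (&result, fmt, ap);
  va_end (ap);

  if (len < 0)
    abort (); /* allocation or formatting failure */

  return result;
}
```

A caller would build the match pattern with `format_or_abort ("\"%s\":\"", key)` and free() it once the strstr() lookup is done.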

src/session/session-utils.c Outdated

+                  errx (EX, "out of memory");
+                const char *match = strstr (json, key_match);
+                if (match)

Member Author

allisonkarlitskaya Nov 12, 2024

Matter of taste (Geschmacksache): I'd treat match == NULL as the error case here, leaving the main part of the body as the non-error case.

Member

martinpitt Nov 12, 2024

Yes, agreed. Updated.

src/session/session-utils.c Outdated

+                    size_t value_len = strcspn (match, "\"");
+                    if (match[value_len] != '"')
+                      errx (EX, "syntax error, expected closing quote");
+                    char *res = strndup (match, value_len);

Member Author

allisonkarlitskaya Nov 12, 2024

Any reason for the extra variable?

Member

martinpitt Nov 12, 2024

Leftover from when I had an extra debug() in there 🙈 fixed.

src/session/session-utils.c


                 char *get_authorize_key (const char *json, const char *key, bool required)

Member Author

allisonkarlitskaya Nov 12, 2024

It might make sense to change bool required for const char *default and hard-fail in the case that default = NULL.

Member

martinpitt Nov 12, 2024

Hmm, that works poorly with the "negotiate" case.

src/session/client-certificate.c

+                /* start time is the token at index 19 after the '(process name)' entry - since only this
+                * field can contain the ')' character, search backwards for this to avoid malicious
+                * processes trying to fool us; See proc_pid_stat(5) */
+                char *p = strrchr (buffer, ')');

Member Author

allisonkarlitskaya Nov 12, 2024

You didn't nul-terminate buffer after the read. I'm a bit surprised that this didn't crash.

...well, I guess it's likely to hit a nul at some point. The stack grows down, after all....

Member

martinpitt Nov 12, 2024

Whoops! Fixed.
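
A minimal sketch of the pattern discussed in this thread, assuming a plain read() loop: reserve one byte of the buffer and always write the terminating NUL before handing the data to string functions such as strrchr(). The helper name `read_text` is hypothetical and not taken from the PR.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stddef.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: read from fd until EOF or until the buffer is full,
 * always reserving one byte for the terminating NUL. Returns false on
 * read errors. */
bool
read_text (int fd, char *buffer, size_t size)
{
  assert (buffer != NULL && size > 0);

  size_t used = 0;
  while (used < size - 1)
    {
      ssize_t n = read (fd, buffer + used, size - 1 - used);
      if (n < 0)
        {
          if (errno == EINTR)
            continue; /* interrupted by a signal, retry */
          return false;
        }
      if (n == 0)
        break; /* EOF */
      used += (size_t) n;
    }

  buffer[used] = '\0'; /* now safe for strrchr() and friends */
  return true;
}
```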

src/session/client-certificate.c

+                unsigned long long start_time = strtoull (p, &endptr, 10);
+                if (*endptr != ' ')
+                  errx (EX, "Failed to parse start time in /proc/pid/stat from %s", p);
+                return start_time;

Member Author

allisonkarlitskaya Nov 12, 2024

I guess this is the kernel so we don't have to worry about bounds-checking the returned value...

I think we can't have other error cases here. It's worth noting that strtoull eats leading whitespace, so if the field were "empty" we'd potentially end up reading whatever the next field is...

Member

martinpitt Nov 12, 2024

It actually doesn't -- this was failing initially, before I added the ++p; /* skip over the space */ above. I.e. it refused to parse " 1234".

Member

martinpitt Nov 12, 2024

I don't think we can impose a meaningful bound here. https://man7.org/linux/man-pages/man5/proc_pid_stat.5.html makes no restriction other than monotonicity.
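
To make the parsing concern concrete, here is a hedged sketch of extracting the start time from an already NUL-terminated /proc/pid/stat line. It follows the approach in the diff (search backwards for ')' and take the token at index 19 after it), but the function name `parse_start_time` and the explicit digit check are assumptions for illustration. The digit check sidesteps the question of how strtoull() treats leading whitespace or signs.

```c
#include <assert.h>
#include <errno.h>
#include <stdbool.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper: extract the start time (token at index 19 after the
 * "(process name)" entry) from a NUL-terminated /proc/<pid>/stat line. */
bool
parse_start_time (const char *stat_line, unsigned long long *start_time)
{
  assert (stat_line != NULL && start_time != NULL);

  /* Only the process name may contain ')', so search from the end. */
  const char *p = strrchr (stat_line, ')');
  if (p == NULL)
    return false;
  p++;

  /* Skip the 19 space-separated tokens before the start time. */
  for (int i = 0; i < 19; i++)
    {
      while (*p == ' ')
        p++;
      if (*p == '\0')
        return false; /* line is too short */
      while (*p != ' ' && *p != '\0')
        p++;
    }
  while (*p == ' ')
    p++;

  /* Insist on a digit so that strtoull() cannot skip whitespace or signs. */
  if (*p < '0' || *p > '9')
    return false;

  errno = 0;
  char *end;
  unsigned long long value = strtoull (p, &end, 10);
  if (errno != 0 || (*end != ' ' && *end != '\n' && *end != '\0'))
    return false;

  *start_time = value;
  return true;
}
```

A process name such as `evil) R 1` does not confuse this, because everything up to the last ')' is ignored.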

src/session/client-certificate.c

                     return NULL;
                   }
+                /* check if stdin is a Unix socket; then we are being called via cockpit-session@.service and need to

Member Author

allisonkarlitskaya Nov 12, 2024

Do we use a pipe for the direct-spawn case? I forget and/or think we changed this at some point?

Member Author

allisonkarlitskaya Nov 12, 2024

Isn't passing our activating socket on stdin considered by the systemd folks to be a bit gauche? I recall reading somewhere that they'd prefer we use fd 3 and an environment variable, or something...

Member

martinpitt Nov 12, 2024

> Do we use a pipe for the direct-spawn case?

Yes, g_spawn_async_with_pipes() in session_start_process() (unless it's an unix socket path).

src/session/client-certificate.c Outdated

+                if (getsockopt (STDIN_FILENO, SOL_SOCKET, SO_PEERCRED, &ucred, &(socklen_t){sizeof ucred}) == 0)
+                  {
+                    debug ("unix socket mode, reading cgroup from peer pid %d", ucred.pid);
+                    ws_cgroup_fd = open_proc_pid (ucred.pid);

Member Author

allisonkarlitskaya Nov 12, 2024

This name is highly misleading: it doesn't have anything to do with cgroups (yet). It's an fd to the /proc/ entry for our peer process.

Member

martinpitt Nov 12, 2024

Renamed to ws_pid_dirfd, similarly my_pid_dirfd.
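
For context, a small sketch of the SO_PEERCRED lookup that the diff above relies on, assuming Linux and glibc. The helper name `get_peer_pid` is hypothetical. On a pipe (the direct-spawn case) the getsockopt() call fails, which is how the two modes can be told apart.

```c
#define _GNU_SOURCE /* for struct ucred */
#include <assert.h>
#include <stdbool.h>
#include <sys/socket.h>
#include <sys/types.h>
#include <unistd.h>

/* Hypothetical helper: if fd is a connected Unix socket, store the pid the
 * peer had when the connection was made and return true. Returns false for
 * anything that is not such a socket (for example a pipe). */
bool
get_peer_pid (int fd, pid_t *pid)
{
  assert (pid != NULL);

  struct ucred ucred;
  socklen_t len = sizeof ucred;
  if (getsockopt (fd, SOL_SOCKET, SO_PEERCRED, &ucred, &len) != 0)
    return false;
  if (len != sizeof ucred)
    return false;

  *pid = ucred.pid;
  return true;
}
```

The pid alone is only a snapshot from connect() time, which is why the PR additionally compares process start times.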

src/session/client-certificate.c Outdated

+                    debug ("peer start time: %lu, my start time: %lu", ws_start_time, my_start_time);
+                    close (my_fd);
+                    /* guard against pid recycling: ws must have started earlier than ourselves */

Member Author

allisonkarlitskaya Nov 12, 2024

Maybe write a bit more about the attack scenario here.

Member Author

allisonkarlitskaya Nov 12, 2024

Another thing we could do is alarm() ourselves to prevent people from keeping long-open connections to us. That's highly suspicious and really only useful for the attack scenario outlined in the PR. The -ws should send the authorize message almost immediately, no?

Member

martinpitt Nov 12, 2024

> Maybe write a bit more about the attack scenario here.

It's in the commit message, but I put it into the comment as well.

> alarm()

Ah, we have that on the ws side, but not in session. I'll check that tomorrow.

Member

martinpitt Nov 13, 2024

I added a commit which does the alarm() thing.

martinpitt force-pushed the cockpit-session-socket branch from 1e3eccd to 24ddd16 Compare

November 12, 2024 19:48

martinpitt mentioned this pull request

Preparations for launching cockpit-session via unix socket activation #21258

Open

martinpitt added the blocked label

martinpitt marked this pull request as draft

November 12, 2024 19:56

martinpitt added 14 commits

November 13, 2024 10:55


          session: Add rhost assertions

5883d8e

The various perform_*() functions all assume a non-NULL rhost, as
several functions such as `btmp_log()` do an unchecked strncpy() on it.
Add assertions to (1) make them fail more gracefully and usefully, and
(2) document that requirement more explicitly.

This is currently guaranteed by having an explicit fallback to `""` if
`$COCKPIT_REMOTE_PEER` isn't set.


          doc: Clarify that $COCKPIT_REMOTE_PEER is optional

215e65e

cockpit_session_launch() doesn't set this env if the remote host is
unknown. That is never the case in practice at least in our tests, but
callers should still be aware of it.


          session: Drop cgroupsv1 compatibility for certificate authentication

cc5bdc7

None of our supported operating systems use cgroupsv1 any more. This
simplifies and robustifies the code a bit.


          session: Exit cleanly on EOF

28184a3

Stop failing with "didn't receive expected authorize message", as that's
confusing -- the user did nothing wrong. Instead, silently exit
successfully, and let cockpit-ws handle the timeout.

This mostly occurs when Negotiate (Kerberos) authentication is enabled,
as cockpit-ws then always starts *two* cockpit-session attempts (one for
Negotiate, one for PAM), as in TestIPA.testQualifiedUsers.


          common: NUL-terminate cockpit_frame_read() result

b1d1bc4

This makes it easier for callers to treat the reply as a textual string.
The only remaining user is cockpit-session, where we will need this
behaviour in the following commit.


          session: Support more keys in authorize JSON message parser

f70c365

We want to parse more fields than just "response" from the authorize
message. So remove the very rigidly hardcoded parsing in
`read_authorize_response()` and let it return the whole JSON string instead.
Add a new `get_authorize_key()` function that parses a single value from that,
and adjust the callers accordingly.

This is a very restricted parser to keep things simple: No spaces in the
structure, and no escaping. We can assume all that as cockpit-ws sends
very controlled messages, and '"' isn't used in base64 values. In the
worst case we get a truncated value, which will just fail
authentication.

Localize some variables while we are at it.
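
As an illustration of the restricted parser described in this commit message, here is a minimal sketch under the same stated assumptions (no spaces in the structure, no escaping). The function name `find_string_value` is hypothetical and the code is not the PR's `get_authorize_key()`: it simply searches for the literal `"key":"` and copies everything up to the next quote.

```c
#define _POSIX_C_SOURCE 200809L /* for strndup() */
#include <assert.h>
#include <stdio.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical sketch of a restricted lookup: find "key":"value" in a JSON
 * string without spaces or escapes and return a malloc()ed copy of value,
 * or NULL if the key is missing or the closing quote is absent. */
char *
find_string_value (const char *json, const char *key)
{
  assert (json != NULL && key != NULL);

  /* Build the literal pattern "key":" (key plus four extra characters). */
  size_t pattern_len = strlen (key) + 4;
  char *pattern = malloc (pattern_len + 1);
  if (pattern == NULL)
    return NULL;
  snprintf (pattern, pattern_len + 1, "\"%s\":\"", key);

  const char *match = strstr (json, pattern);
  free (pattern); /* match points into json, not into pattern */
  if (match == NULL)
    return NULL;

  const char *value = match + pattern_len;
  size_t value_len = strcspn (value, "\"");
  if (value[value_len] != '"')
    return NULL; /* syntax error: no closing quote */

  return strndup (value, value_len);
}
```

This shares the limitation named above: a value containing an escaped quote would be truncated, which in the authorize case just makes authentication fail.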


          ws: Re-use session-utils.c in mock-auth-command.c

7b299d5

Drop the duplicated (but not identical!) implementation of
`read_authorize_response()` and use the real implementation.


          selinux: Fix policy for socket-activated cockpit-session

8b9e14b

Run socket-activated cockpit-session with correct context. By default it
is `init_t`, but that will produce a memfd with the wrong permissions, which
the session cannot read.

Allow cockpit-ws to connect to /run/cockpit/session.

Allow restricted user_t and sysadm_t sessions to communicate with (but not
connect to) cockpit-ws through the session unix socket. (Covered by
TestLogin.testSELinuxRestrictedUser)


          systemd: Allow cockpit-session@.service to fail with some known exit codes

8c48b95

cockpit-session exits with 5 on authentication failures, including
`authentication-unavailable`; or with 127 on authentication timeouts.
These aren't a reason to mark the unit as failed.


          session: Rename my_cgroup* to ws_cgroup*

The point of this is really to determine the cgroup of our caller, i.e.
the cockpit-ws process. With cockpit-session@.service this is different
than cockpit-session's own cgroup. Rename the variables to clarify this.


          session: Send no-cockpit init problem if cockpit-bridge fails to exec

2568dbd

If cockpit-ws directly spawns cockpit-session, it can process the 127
exit code by itself. But there is no exit code if that happens via unix
socket. Check this condition explicitly and report `no-cockpit` via the
protocol.

This triggers a more specific error path in cockpit-ws and the login
page, adjust TestConnection.testBasic accordingly.


          test: Ignore session timeout messages in TestIPA.testQualifiedUsers

396f0c1

The test idles on the login page for more than the 60s authorize
timeout. When running cockpit-session via unix socket, this causes some
unsightly "session timed out during authentication" messages. This ought
to be handled better in cockpit-ws, but for the time being ignore these
messages.


          ws, session: Pass remote peer address in authorize message

12d9c5d

Passing the remote peer from ws to cockpit-session via the
`$COCKPIT_REMOTE_PEER` environment variable does not work in unix socket mode.
So make that part of the protocol instead and attach it to the authorize
response.


          session: Support Unix socket mode for client certificates

ab10f4c

When cockpit-session's stdin is a Unix socket, it is being spawned
by cockpit-ws through cockpit-session@.service. In that case it doesn't
make sense to look at its own cgroup, but we need to check the cgroup of
the socket peer (i.e. cockpit-ws).

We must guard against PID recycling attacks:

1. Eve logs into cockpit, gets ws pid E, and hacks ws: it connects to
   session, forks, keeps the session fd in a different process, and
   kills pid E.

2. Eve waits until Alice logs in again and happens to get ws pid E
   (which can happen with a sufficient number of forks, social
   engineering, and some luck). cockpit-session checks that pid E
   is in cgroup /cockpit/alice, and starts an alice session for Eve's
   ws. (Note: SO_PEERCRED gives you pid/uid/gid at the time connect()
   was made.)

Thus require that the peer (ws) must have started earlier than
cockpit-session. This is the same approach that polkit uses as a
fallback if pidfds are not available:
https://github.com/polkit-org/polkit/blob/main/src/polkit/polkitunixprocess.c

Note that pidfds don't help us: There is no API to get directly from a
pidfd to a cgroup, startup time, or /proc/<pid> dirfd; this has to
happen via `pidfd_getpid()` and opening /proc/pid. But that is exactly
what we want to avoid, so using pidfds would be pointless (`SO_PEERPIDFD`,
which hands out a pidfd for the socket peer, is also only available
since kernel 6.5).

martinpitt force-pushed the cockpit-session-socket branch from 24ddd16 to b1ce12f Compare

November 13, 2024 09:57

martinpitt and others added 4 commits

November 13, 2024 11:36


          session: Add authorize response timeout

ddfef1b

ws times out the authorization attempt after one minute
(`cockpit_ws_auth_response_timeout`). Introduce a similar timeout on the
session side. This makes PID recycling attacks harder, as their victim
now has to log in within one minute.
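
A hedged sketch of what such a session-side timeout could look like with alarm(). The function names are hypothetical and the handler is an assumption for illustration. The exit code 127 and the message text are borrowed from the other commit messages in this PR, not from the actual implementation.

```c
#define _POSIX_C_SOURCE 200809L /* for sigaction() */
#include <assert.h>
#include <signal.h>
#include <stdlib.h>
#include <string.h>
#include <unistd.h>

/* Runs in signal context: only async-signal-safe calls are allowed here. */
static void
on_authorize_timeout (int signum)
{
  (void) signum;
  static const char msg[] = "session timed out during authentication\n";
  ssize_t ignored = write (STDERR_FILENO, msg, sizeof msg - 1);
  (void) ignored;
  _exit (127);
}

/* Hypothetical helper: exit the process unless the timeout is disarmed
 * within the given number of seconds. */
void
arm_authorize_timeout (unsigned int seconds)
{
  assert (seconds > 0);

  struct sigaction sa;
  memset (&sa, 0, sizeof sa);
  sa.sa_handler = on_authorize_timeout;
  sigemptyset (&sa.sa_mask);
  if (sigaction (SIGALRM, &sa, NULL) != 0)
    abort ();

  alarm (seconds);
}

/* Cancel the pending timeout; returns the number of seconds that were left. */
unsigned int
disarm_authorize_timeout (void)
{
  return alarm (0);
}
```

The session would arm the timeout before waiting for the authorize message and disarm it as soon as the message has been read.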


          tools: Move cockpit-session.socket to cockpit-ws package

2cb374b


          ws: connect to cockpit-session via socket

61ccb39

Unless it's otherwise specified in the configuration file, we now spawn
cockpit-session by connecting to /run/cockpit/session if that exists.
Fall back to calling cockpit-session directly for custom setups.

We leave the cockpit_ws_session_program variable in place to allow the
tests to override things.

Update the unit files for cockpit-ws to ensure that the socket is
available when cockpit-ws is running.

Adjust TestConnection.testBasic accordingly: When running
cockpit-session via unix socket activation, its group permissions are
irrelevant. More thoroughly move the binary away and also disable the
socket, to fail both of cockpit-ws' session creation attempts.

Co-Authored-By: Martin Pitt <[email protected]>
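
The "connect to the socket if it exists, otherwise fall back" behaviour described in this commit can be sketched as follows. This is an illustration using plain POSIX sockets, not the cockpit-ws code. The helper name `connect_unix` is hypothetical, and the fallback to spawning cockpit-session directly is left to the caller.

```c
#include <assert.h>
#include <string.h>
#include <sys/socket.h>
#include <sys/un.h>
#include <unistd.h>

/* Hypothetical helper: connect to a Unix stream socket at path. Returns a
 * connected fd, or -1 if the path is too long, the socket does not exist,
 * or nobody is listening. The caller can then fall back to spawning the
 * session program directly. */
int
connect_unix (const char *path)
{
  assert (path != NULL);

  struct sockaddr_un addr;
  memset (&addr, 0, sizeof addr);
  addr.sun_family = AF_UNIX;
  if (strlen (path) >= sizeof addr.sun_path)
    return -1;
  strcpy (addr.sun_path, path);

  int fd = socket (AF_UNIX, SOCK_STREAM, 0);
  if (fd < 0)
    return -1;

  if (connect (fd, (struct sockaddr *) &addr, sizeof addr) != 0)
    {
      close (fd);
      return -1;
    }

  return fd;
}
```

With socket activation, a successful `connect_unix ("/run/cockpit/session")` is what makes systemd start a cockpit-session instance for that connection.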


          cockpit-session: stop installing setuid root

bf54d09

systemd spawns this for us now, so we don't need the setuid bit anymore.
Clean up the statoverride in the Debian packaging on upgrades.

However, that means that cockpit-ws cannot be run as
`cockpit-wsinstance` user outside of the unit any more. Adjust our tests
to run it as root instead.

martinpitt force-pushed the cockpit-session-socket branch from b1ce12f to bf54d09 Compare

November 13, 2024 10:37

Labels

blocked