Trim undefined behaviors from acctz #121

earies · 2023-08-18T23:38:35Z

This PR is to start discussion on various ambiguous or undefined behaviors.

Choosing to do this as a PR vs. issue to adjust as we discuss. Will tag items
as inline comments post submission and if we come to consensus on items, will
adjust the final commit message for a squash merge.

acctz/acctz.proto

earies · 2023-08-18T23:53:42Z

acctz/acctz.proto

 }

 // Command details for shell/vendor-CLI
 message CommandService {
  enum CmdServiceType {
    CMD_SERVICE_TYPE_UNSPECIFIED = 0;
-    CMD_SERVICE_TYPE_SHELL = 1;


Underlying "shell" accounting to my knowledge is not implemented across any shipping network implementation today so I guess I would like to bring this one back to "how" this would be suggested, what this might entail first before adding a variant here which cascades to the remainder of other messages/field options here.

Authorization and accounting of supported NBI is very well supported but since most systems implement on top of some alternate base OS, the underlying OS shell capabilities (either control or forwarding plane) are not subject to the same AAA. Generally this stops at if a user can access the shell or not coupled with an underlying group mapping that may or may not allow privilege escalation (su/sudo).

Each implementation could support a number of underlying shells (csh, sh, bash, zsh, etc..) and while there are techniques to "intercept" and log commands via shell profiles, this could likely be overridden by a user and would also call for those subsystems to be able to hook into accounting subsystems outside of general logging (file/syslog)

I fully comprehend that this means that there is lack of visibility into what a user that has these privileges may do in an underlying shell but maybe we can bring this back to if or how this is done in the compute space first and then reintroduce capabilities back into gNSI once defined.

Accounting is also the only service currently concerned with the underlying shell. Authorization would be an entirely other conversation as well.

None of that means that this accounting is not useful or that the issues with the method could not be overcome. I do not see a reason to remove this.

The reason why I would suggest removal and trimming such behaviors is because its currently an undefined wish list item is all. A protobuf IDL with fields and comments is not the same as a specification that considers how to implement and will result in misperception and ambiguity and thus differences and partial support among implementations.

At a high level, I do not disagree to the usefulness because without such, there is loss of visibility into some actions that are performed by already trusted classes of users however...

A network element is mostly a distributed system wrapped up to abstract a single management interface - there are fully supported interfaces into the network element and then there can be many non-supported interfaces/debug shells such as:

Underlying controller card OS shells

Linecard OS/ukern shells

Other components, SoC - BCRM debug shell, etc...

In that means there are various methods to essentially "jump" or "attach" to such endpoints by breaking out of the supported interface - it's many "systems" internally. Some but not all capabilities are brought up to the fully supported interfaces (e.g. common and absolutely necessary often providing some abstraction) - these generally inherit the supported AAA infra.

Each one of these interfaces has vastly different underlying resources and may or may not be modifiable from implementation to implementation so I'd like to peel this one back a bit to what this might look like and what might be realistic when we're talking about accounting to various (sub)systems which also as I mentioned would also really apply to any vision of authorization as well. This problem statement need not apply to network elements either and would pose a similar issue in various compute host scenarios as well.

So agree that it's useful but would start at dissecting the problem statement w/ various scenarios and then talk solutions before adding it to the API definition as that's the easier of all above.

My point is that it exists today, though it has the caveat that you mention. But the caveat exists not be cause it is not possible to avoid, but due to the laziness of the implementation.

Could you clarify the last statement? Are you saying command accounting (and/or authorization) for all of the underlying shells/subsystems specified above does exist today? I had asked in the initial comment if or how this might be done in the compute space because the analogy can be drawn to any OS and not just captive shells.

It seems that this can be overcorrected and solved by not giving access to the subsystems by any public API (CLI in this bucket as well) but that would be counterproductive in reality for everyone.

IMO its a problem statement that needs brainstorming and collaboration on solutions and not just a field in an API declaration

The intent of the PR is to trim undefined behaviors (which this is) with possible reintroduction back at a later point

"for all of the underlying shells/subsystems specified above does exist today?"

yes, for one platform we use, it does...

Is the problem you (ebben) see:
"Today no platform will be able to account for 100% of user actions, so having this toggle in proto is not helpful"
(this is your aspirational comment I think?)

I suppose that's a fine argument, but what if we open FR's with all vendors we deal with to say:

"Hey, all user input on the cli MUST be accounted, full stop."

yup, we'll be waiting a while but.. that really is the long term goal, I think.
we COULD also just go add the enum / etc at the time that the features start to be realized.

These non-command-like services should be split into a different (new) service-type.

earies · 2023-08-18T23:55:45Z

acctz/acctz.proto

-
-  // Optional repeated task_id that represent tasks that were used to
-  // accomplish the request on the system.
-  repeated string task_ids = 32;


My understanding off this 2-line comment would be that this could represent any system defined method to "trace" a call. Could you elaborate on it's intent or shall we mark for removal and define what this might mean later?

I think these are intended to be IDs that could be used to correlate processes etc with an RPC/cmd/etc. I believe that some systems include such IDs in syslog msgs. But, I was not around when this was created, so I am speculating.

Similar to other cases, if there is no specification or concrete intent considering how such would be implemented consistently, etc.. then I would suggest it's removal to be reintroduced at a later point in time should there be a need and detailed specification.

Marcus & Morrow should comment on this; they are more likely to know the intention.

Today we get this sort of notation in tac+ accounting logs, it's what stitches together all commands/etc sent over the life of a user's "session".

That's the intent of this field.
That may not matter for gRPRC things - 'connect send config, disconnect' - session-id is same as singular record.
It does matter for user-that-sshs sessions though. (I think it does, still matter)

Acct-Session-Id and Acct-Multi-Session-Id in Radius.

@morrowc You mentioned this field matters for SSH access, it can be the pid of the sshd instance serving the connection ?

Could be ssh-pid, sure!
I think I figured the universal answer: "uuid" would work here too.
but yup, sshd pid seems ok too.

earies · 2023-08-19T00:04:05Z

acctz/acctz.proto


  // In case of STATUS_DENY, cause for the deny
-  string deny_cause = 4;
+  string deny_cause = 3;
 }

 // Command details for shell/vendor-CLI
 message CommandService {


We have a split of gRPC and "command" services however the command services listed here are a mix of separate APIs and each API behavior may or may not be able to be broken down into the constructs of commands and command arguments.

For CLI, it might be natural to break this down in such a fashion

However for other NBI APIs you have a mix of protocol level semantics coupled with data payloads in various structures, some of this is analogous to gRPC services where currently the plan is to encapsulate the entire message in a PROTO_ANY

I think this will need some additional consideration if we were to elaborate with examples on all services. Currently in JUNOS, we log/record across various API calls by mostly breaking it down at the RPC/request "name" level only but not with full visibility into the remainder of request contents/payload.

The full requests are analogous to debug/tracing which come with additional cost

You seem to be conflating expanding the list service_requests with this log message?

I do not know what NBI is.

I do not see how the junos behavior you describe is different from the GrpcService message.

Sorry - NBI == "Northbound Interface" - NETCONF, RESTCONF, etc...

The GrpcService message can take the RPC from the HTTP header for the path and encode in the rpc_name (The examples in the comments mean some transformation would need to take place) then one can take the entire protobuf binary encoded contents of the request and wrap into the repeated payloads field (Not entirely sure what repeated buys here as well as I'm not sure this applies here) - this would include more complex data structures and anything outside of the well known service types would require access to the protobuf IDL to deserialize.

So for the GrpcService - it is structurally the same because it would reflect the encoded request payload directly

But for the CommandService the structure really represents something more like CLI/shell commands as typed out

For something like NETCONF, if you were to encode the cmd you could take the RPC name as I mentioned but that isn't the full intention most of the time (only partial) - the question would be how to pack the RPC structures in addition to any further data payload. In the case of a normal workflow of <edit-config> w/ XML encoding, you might be able to strip the RPC elements, put the start tag into the cmd and dump the entire remainder into a single cmd_args field.... I think you see where I'm going

For RESTCONF, we have similar where you have URIs, HTTP methods that encapsulate the RPC (or command) invocation but other headers and content that would need to find a proper placement

The repeated string of cmd_args is then problematic for non-XML/JSON type framing/contents as implied by the services listed here - binary representations are also a possibility here (e.g. CBOR)

You are suggesting splitting netconf/restconf in its own service_request type, because they more closely resemble (g)rpc and their arguments might not be representable in the string of cmd_args. That seems reasonable to me.

For http, what I've seen of these interfaces, they are just conduit of sorts to the cli. I realize that might not be universal and it could carry binary request data. I suppose it co-exist with netconf, or be its own type. Binary data could also be base64; which seems common.

I do not know what any of this has to do with the deny_cause field. :)

The comment is tagged to the message CommandService line but review rendering here makes it look like its against the changed line - Since this is a PR, not every line can hold a comment but rather requires a change within proximity (I had considered opening this for discussion w/ 1 or more distinct issues but opted for 1 PR to open up a wider discussion). deny_cause has nothing to do w/ this but rather just had a alignment in field numbers within proximity.

That aside, I do think each distinct API will need need further analysis that is likely to be conveyed as their own message structs otherwise an additional ruleset would need to be devised on top of how to appropriately map, what will not map, etc...

Since none of that is defined today, it's just an additional point that there are versions of gNSI being cut (currently tagged as 1.2.2 at time of this comment) and implementation underway where there is too much ambiguity which is likely to create a significant amount of experimental backwards incompatible churn. My suggestion is to eliminate such cases (removal) and build back up as time needs to be spent up front on the dissection of problem statements and specification vs. IDL.

The purpose of 'commandservice' is actually to account for 'chris uses ssh/telnet to connect to a device and types things'.

In a world where folk use some gNMI for config management (for instance) and gNOI for 'collection of stats/etc' MOST interaction is likely robots not humans. Where a human may still connect: "why is this bgp session not coming up?" "Why did this interface not do what I wanted?" "Oh, great, vendor bug, now I have to type 12 'test crash' type commands :("

we want to have the accounting data for that come over the gNSI path.

acctz/acctz.proto

Also see openconfig#121 (comment)

Also see #121 (comment)

Also see openconfig#121 (comment)

rnhaddad · 2024-08-15T20:04:21Z

acctz/acctz.proto

  enum AuthenStatus {
    AUTHEN_STATUS_UNSPECIFIED = 0;
    AUTHEN_STATUS_PERMIT = 1;
    AUTHEN_STATUS_DENY = 2;
  }
-  AuthenStatus status = 3;
+  AuthenStatus status = 2;


for backwards compatibility, perhaps it is best to NOT change this value? Use [deprecated=true]; ?

Comment applies throughout this PR

Trim undefined behaviors from acctz

734e225

earies commented Aug 18, 2023

View reviewed changes

acctz/acctz.proto Show resolved Hide resolved

earies commented Aug 18, 2023

View reviewed changes

earies commented Aug 19, 2023

View reviewed changes

Adjust wording on gRPC message truncation

0533f1a

earies commented Aug 19, 2023

View reviewed changes

acctz/acctz.proto Show resolved Hide resolved

haussli added a commit to haussli/gnsi that referenced this pull request Sep 5, 2023

Change privilege_level to a string named group.

c41ae00

Also see openconfig#121 (comment)

morrowc pushed a commit that referenced this pull request Nov 3, 2023

Change privilege_level to a string named group.

1ba75c6

Also see #121 (comment)

morrowc approved these changes Nov 17, 2023

View reviewed changes

nmahabaleshwar pushed a commit to nmahabaleshwar/gnsi that referenced this pull request Jan 3, 2024

Change privilege_level to a string named group.

01f1603

Also see openconfig#121 (comment)

rnhaddad reviewed Aug 15, 2024

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Trim undefined behaviors from acctz #121

Trim undefined behaviors from acctz #121

earies commented Aug 18, 2023

earies Aug 18, 2023

haussli Aug 19, 2023 •

edited

Loading

earies Aug 21, 2023

haussli Aug 21, 2023

earies Aug 21, 2023

morrowc Aug 22, 2023

haussli Sep 5, 2023

earies Aug 18, 2023

haussli Aug 19, 2023

earies Aug 21, 2023

haussli Aug 21, 2023

morrowc Aug 22, 2023

haussli Aug 22, 2023

nmahabaleshwar Nov 16, 2023

morrowc Nov 17, 2023

earies Aug 19, 2023 •

edited

Loading

haussli Aug 19, 2023

earies Aug 21, 2023

haussli Aug 21, 2023

earies Aug 21, 2023

morrowc Aug 22, 2023

rnhaddad Aug 15, 2024

rnhaddad Aug 15, 2024

Trim undefined behaviors from acctz #121

Are you sure you want to change the base?

Trim undefined behaviors from acctz #121

Conversation

earies commented Aug 18, 2023

Choose a reason for hiding this comment

haussli Aug 19, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

earies Aug 19, 2023 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

haussli Aug 19, 2023 •

edited

Loading

earies Aug 19, 2023 •

edited

Loading