Clarify note about eager vs. lazy evaluation #901

catamorphism · 2024-10-08T18:29:39Z

Closes #784

aphillips

Good start. A couple of minor comments.

aphillips · 2024-10-08T18:45:56Z

spec/formatting.md

+> Users may write custom functions that have observable side effects.
+> Lazy evaluation may involve evaluating the same expression multiple times


It's good practice to avoid the normative words like 'may' in non-normative contexts like this.

Suggested change

> Users may write custom functions that have observable side effects.

> Lazy evaluation may involve evaluating the same expression multiple times

> Users or implementations can provide functions that have observable side effects.

> Lazy evaluation might involve evaluating the same _expression_ multiple times

Notice that I removed the word "custom" (which might be presumed?) and bring in implementations.

Consider adding a health warning to this paragraph?

Removed the "may". What do you mean by "health warning"?

By health warning, something like: It's a bad idea to create selectors or formatter functions with such side effects

I see. I'm not sure I would agree across the board. For example, you might want a currentTime function that returns the system clock time. This doesn't have "side effects" as such, but returns a different value when it's called at different times, so:

.local $time = {:currentTime} {{ {$time} {$time} }}

could have a different result depending on whether the implementation evaluates the RHS of the variable time once or twice.

(The jargon is that currentTime is not referentially transparent.)

Perhaps some language saying that care should be taken when defining such functions and they should be avoided when possible?

I thought we already had something in the spec about side effects, but the closest we come to is

Function handler access to the formatting context MUST be minimal and read-only,
and execution time SHOULD be limited.

which relies on an implementation choosing to call "everything that a function can access" as its formatting context, because we've not explicitly prohibited access outside it.

That is to say, I agree with @aphillips that we ought to have stronger language to recommend against functions that mutate anything as a side effect.

I added some more language to (a) clarify that the problem also occurs with functions that don't mutate anything, but that depend on external mutable state; and (b) recommend against functions that mutate anything observably.

Hmm...

side effect == touches the resolved value?

Lots of functions have "side effects" if that's the case. @eemeli your suggestion of limiting the resolved value of :date and :time would be a side effect if we were to implement it. currentTime is a good example. random is another. Some operations, such as case-folding, alter the value in an irreversible way. Other operations, such as unfloating a time value, change the meaning.

Our design tries to be immutable, but we're actually mutable to the depth of one expression. Chaining .local or reannotating a placeholder can make these side-effects more visible.

Perhaps this paragraphs wants to be more like:

Evaluating an expression can affect the resolved value associated with
a given variable.
When combined with eager vs. lazy evaluation,
in which the same expression might be evaluated multiple times,
the side-effects of the function on the resolved value
might alter the result of formatting the message.

side effect == touches the resolved value?

Lots of functions have "side effects" if that's the case. @eemeli your suggestion of limiting the resolved value of :date and :time would be a side effect if we were to implement it.

Defining the resolved value returned by a function handler is not a side effect. Modifying the operand would be a side effect. But a function handler will quite rarely have a reason to return its input operand; in general, it will instead create a new resolved value and return that.

currentTime is a good example. random is another. Some operations, such as case-folding, alter the value in an irreversible way. Other operations, such as unfloating a time value, change the meaning.

A reasonable function handler for any of these would not have any side effects, but would instead create a new resolved value and return it.

Our design tries to be immutable, but we're actually mutable to the depth of one expression. Chaining .local or reannotating a placeholder can make these side-effects more visible.

Being able to use a resolved value as an operand does not introduce mutability.

The phrase "side effect" is clearly confusing, so I rewrote the paragraph without using it. Please take another look!

Co-authored-by: Addison Phillips <[email protected]>

…unctions that write state are not recommended

aphillips · 2024-10-09T00:25:00Z

spec/formatting.md

-> that returns the current time and date.)
+> In some environments, users or implementations can provide functions
+> that mutate state that is external to the message formatter
+> (for example, deleting a file in the filesystem). This is not recommended.


"not recommended" turns out to be a 2119 keyword 😉

Is this a likely example? Maybe a better example would be modifying the system or application locale (in some environments, setting the locale is not thread-safe)

Suggested change

> (for example, deleting a file in the filesystem). This is not recommended.

(for example, setting or modifying the system locale).

Actually... I think the problem here is the audience for your addition. This turns out not to be in a note block. It's in a normative "IMPORTANT" section. It should be written in a normative way or it should be in an informative NOTE inside the IMPORTANT block.

I'd actually do the former, replacing this paragraph with something like:

Implementations and users creating custom functions SHOULD avoid
creating function handlers that mutate external program state
or which depend on external mutable state in a way that would
result in different resolved values for a variable depending on
whether a given expression is evaluated at most once (call-by-need)
or multiple times (call-by-name).

The :currentDateTime is a good example of a function that implementation might very well want to provide and which produces a different result every time you call it (but at least a few ms, right?). Users of that function would need to know that they can't rely on it producing the exact same number each time they call it (the system clock is running) or that they can't rely on it to not provide the same value every time in a given formatting call (the system clock is running, but the function only evaluates it once).

The warning about mutating program state doesn't seem to belong with eager/lazy? Can you think of an example of a real function that an implementation might provide (which isn't a security exploit)? I mean, my "set locale" suggestion above might quality in C, for example. That would certainly affect the formatting of the message (unless the "formatting context" was caching the locale). It would also be cause for spanking the developer who created/installed it.

I follow, but I'm a little unsure as to what you're suggesting the normative change should be. If it's "SHOULD avoid...", as in your suggested wording, doesn't that suggest there shouldn't be a currentDateTime or random function? But I can imagine these functions being useful, it's just that it may be surprising that different implementations with different evaluation strategies would handle them differently. Do we want to say to not provide functions like these, or just to be judicious about it?

I tried splitting the note into a normative part and an editorial "note" part, maybe that's closer to what you have in mind?

macchiati · 2024-10-09T00:31:44Z

I think a SHOULD NOT "provide functions that mutate" ... is warranted.

…

On Tue, Oct 8, 2024 at 5:25 PM Addison Phillips ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In spec/formatting.md <#901 (comment)> : > @@ -58,14 +58,19 @@ nor be made available to _function handlers_. > have already been evaluated in the order in which the relevant _declarations_ > appear in the _message_. > -> Users or implementations can provide functions that have observable side effects -> or whose results depend on external mutable state (for example, a function -> that returns the current time and date.) +> In some environments, users or implementations can provide functions +> that mutate state that is external to the message formatter +> (for example, deleting a file in the filesystem). This is not recommended. "not recommended" turns out to be a 2119 keyword 😉 Is this a likely example? Maybe a better example would be modifying the system or application locale (in some environments, setting the locale is not thread-safe) ⬇️ Suggested change -> (for example, deleting a file in the filesystem). This is not recommended. +(for example, setting or modifying the system locale). — Reply to this email directly, view it on GitHub <#901 (review)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACJLEMDOXZPZRZIH2ZCO2VTZ2RZXLAVCNFSM6AAAAABPS3UZPGVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDGNJVG42DENZTGA> . You are receiving this because you are subscribed to this thread.Message ID: ***@***.***>

macchiati · 2024-10-09T00:53:01Z

+1

…

On Tue, Oct 8, 2024 at 5:47 PM Addison Phillips ***@***.***> wrote: ***@***.**** commented on this pull request. ------------------------------ In spec/formatting.md <#901 (comment)> : > @@ -58,14 +58,19 @@ nor be made available to _function handlers_. > have already been evaluated in the order in which the relevant _declarations_ > appear in the _message_. > -> Users or implementations can provide functions that have observable side effects -> or whose results depend on external mutable state (for example, a function -> that returns the current time and date.) +> In some environments, users or implementations can provide functions +> that mutate state that is external to the message formatter +> (for example, deleting a file in the filesystem). This is not recommended. Actually... I think the problem here is the audience for your addition. This turns out not to be in a note block. It's in a normative "IMPORTANT" section. It should be written in a normative way or it should be in an informative NOTE inside the IMPORTANT block. I'd actually do the former, replacing this paragraph with something like: Implementations and users creating custom functions SHOULD avoid creating *function handlers* that mutate external program state or which depend on external mutable state in a way that would result in different *resolved values* for a *variable* depending on whether a given *expression* is evaluated at most once (call-by-need) or multiple times (call-by-name). The :currentDateTime is a good example of a function that implementation might very well want to provide and which produces a different result every time you call it (but at least a few ms, right?). Users of that function would need to know that they can't rely on it producing the exact same number each time they call it (the system clock is running) or that they can't rely on it to *not* provide the same value every time in a given formatting call (the system clock is running, but the function only evaluates it once). The warning about mutating program state doesn't seem to belong with eager/lazy? Can you think of an example of a real function that an implementation might provide (which isn't a security exploit)? I mean, my "set locale" suggestion above might quality in C, for example. That would certainly affect the formatting of the message (unless the "formatting context" was caching the locale). It would also be cause for spanking the developer who created/installed it. — Reply to this email directly, view it on GitHub <#901 (comment)>, or unsubscribe <https://github.com/notifications/unsubscribe-auth/ACJLEMCH2PKIYR4A3XCNFCDZ2R4I7AVCNFSM6AAAAABPS3UZPGVHI2DSMVQWIX3LMV43YUDVNRWFEZLROVSXG5CSMV3GSZLXHMZDGNJVG42TONRSGE> . You are receiving this because you commented.Message ID: ***@***.***>

Co-authored-by: Addison Phillips <[email protected]>

gibson042 · 2024-10-10T00:14:40Z

spec/formatting.md

+> Lazy evaluation might involve evaluating the same _expression_ multiple times
+> (call-by-name) or evaluating every expression at most once (call-by-need).


I really think it's a mistake to allow the same expression to ever evaluate to a different value inside a single message formatting operation. This is something that CEL (to pick a roughly analogous technology) gets very right.

spec/formatting.md

eemeli · 2024-10-10T20:11:19Z

Should we include some language encouraging implementations to sandbox or otherwise restrict the access of function handlers to external state?

Co-authored-by: Addison Phillips <[email protected]>

catamorphism · 2024-10-21T18:10:34Z

Per the 2024-10-21 call, I added stricter language saying that call-by-name implementations are not allowed because of the presence of impure functions. Please take another look!

Clarify note about eager vs. lazy evaluation

1d89191

catamorphism requested review from aphillips, eemeli, mihnita and echeran October 8, 2024 18:29

aphillips reviewed Oct 8, 2024

View reviewed changes

catamorphism and others added 3 commits October 8, 2024 11:50

Update spec/formatting.md

519d5dd

Co-authored-by: Addison Phillips <[email protected]>

Avoid using 'may'

d74ea7e

Add language about writing vs. reading mutable state; add note that f…

f87e643

…unctions that write state are not recommended

aphillips added editorial formatting LDML46.1 MF2.0 Draft Candidate labels Oct 8, 2024

Try again

61a4bd2

aphillips reviewed Oct 9, 2024

View reviewed changes

aphillips added normative and removed editorial labels Oct 9, 2024

catamorphism and others added 4 commits October 9, 2024 16:27

Update spec/formatting.md

648d365

Co-authored-by: Addison Phillips <[email protected]>

Split the addition into a normative part and an editorial part

cda3e57

Fix NOTE markup

1fb1c73

Fix markup again

4d5f4fe

gibson042 reviewed Oct 10, 2024

View reviewed changes

aphillips reviewed Oct 10, 2024

View reviewed changes

spec/formatting.md Outdated Show resolved Hide resolved

Update spec/formatting.md

bf1cb2f

Co-authored-by: Addison Phillips <[email protected]>

macchiati approved these changes Oct 10, 2024

View reviewed changes

Make language stricter and say that call-by-name is forbidden

2d152c4

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Clarify note about eager vs. lazy evaluation #901

Clarify note about eager vs. lazy evaluation #901

catamorphism commented Oct 8, 2024

aphillips left a comment

aphillips Oct 8, 2024

catamorphism Oct 8, 2024

aphillips Oct 8, 2024

catamorphism Oct 8, 2024

eemeli Oct 8, 2024

catamorphism Oct 8, 2024

aphillips Oct 8, 2024

eemeli Oct 8, 2024

catamorphism Oct 8, 2024

aphillips Oct 9, 2024

aphillips Oct 9, 2024

catamorphism Oct 9, 2024

catamorphism Oct 9, 2024

macchiati commented Oct 9, 2024 via email

macchiati commented Oct 9, 2024 via email

gibson042 Oct 10, 2024 •

edited

Loading

eemeli commented Oct 10, 2024

catamorphism commented Oct 21, 2024

		> Users may write custom functions that have observable side effects.
		> Lazy evaluation may involve evaluating the same expression multiple times

	> (for example, deleting a file in the filesystem). This is not recommended.
	(for example, setting or modifying the system locale).

		> Lazy evaluation might involve evaluating the same _expression_ multiple times
		> (call-by-name) or evaluating every expression at most once (call-by-need).

Clarify note about eager vs. lazy evaluation #901

Are you sure you want to change the base?

Clarify note about eager vs. lazy evaluation #901

Conversation

catamorphism commented Oct 8, 2024

aphillips left a comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

macchiati commented Oct 9, 2024 via email

macchiati commented Oct 9, 2024 via email

gibson042 Oct 10, 2024 • edited Loading

Choose a reason for hiding this comment

eemeli commented Oct 10, 2024

catamorphism commented Oct 21, 2024

gibson042 Oct 10, 2024 •

edited

Loading