-
Notifications
You must be signed in to change notification settings - Fork 3.4k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Deprecate struct update syntax, such as %URI{x | ...}
#13974
Comments
Is there a discussion about this anywhere? I don't see anything in the elixir-lang-core mailing list but I could have missed it. I know this isn't the place to comment on the issues validity and I 100% trust the core team whatever you decide - however I would like to see any discussion on the issue to see the different points of view and reasoning if possible. For what its worth I like the current syntax given that structs are made to feel slightly special than normal maps and I think the status quo is quite pleasant when you are returning an updated struct at the end of a function, It is immediately obvious that you are returning a struct without looking up at what the Saying that, I am speaking only from the developers POV and not from the compiler which this issue seems primarily concerned with. Sorry to add this here but I'm not sure where else to comment 🫡 |
It is completely fine to discuss it here. :)
While I do agree this is true, it is worth noting this is pretty much a special case of structs. You don't denote in return types when you are returning an integer, updating a tuple, etc. Typically in Elixir, you specify the types of variables in patterns and guards, and not throughout your function. Other than that, the rationale is pretty much in the issue description. All cases where you use the update syntax, you will be better served by pattern matching on it instead. |
Just wanted to add here that doing this with |
That's a good point and you make it well - Thank you |
Is there really a need to deprecate the struct update syntax? I can see the point of prefering pattern matching, but it doesn't feel like a great justification for getting rid of the struct update at the same time. Things are easily visible here in this short example, but consider 20ish lines between the |
I'd say so, because, if our official advice is to use the pattern matching syntax 99% of the cases, there is no reason to keep a pattern in the language that we wouldn't recommend. As said above, I understand the signaling can be helpful but it is also inconsistent. Structs are the only data structure that get those and only for updates. You don't signal integers, tuples, struct access, etc. And it will always do a worse job than pattern matching. |
I can kinda see that, but also should we deprecate
I don't necessarily agree with that, given I don't see a pattern matching as an alternative to the struct update syntax. To me I guess I'd be fine with deprecating the native syntax for struct updates, but if that happens imo the proper replacement would be |
So a more apt comparison is
|
Just mentioning for the record, but this change is not strictly equivalent and can possibly accidentally filter non-matching entries (e.g. nils) instead of raising. One might want to use |
Is there a reason that this example... for entry <- conf.entries do
%UploadEntry{entry | preflighted?: entry.preflighted? || entry.ref in refs}
end cannot provide useful typing information for expressions inside the block? It seems to me that, in order to succeed, for entry <- conf.entries do
%UploadEntry{} = entry
%UploadEntry{entry | preflighted?: entry.preflighted? || entry.ref in refs}
end If that "phantom match" could be created automatically, it would find typos in I'll acknowledge that this is only one example and this code could be modified such that you get more/better errors by pattern matching where |
Your proposal above would work for that example but what if someone wrote this: for entry <- conf.entries do
preflighted? = entry.preflighted? || entry.ref in refs
%UploadEntry{entry | preflighted?: preflighted?}
end What do we rewrite it to? If we do this: for entry <- conf.entries do
preflighted? = entry.preflighted? || entry.ref in refs
%UploadEntry{} = entry
%UploadEntry{entry | preflighted?: preflighted?}
end Then we still wouldn't have caught a bug if The simplest way to reason about the type system is that types are assigned to variables when they are defined. When we have type signatures, that makes total sense and it should just work as you expect, because the signatures will provide the type. But all Elixir code written today relies on inference and doing any sort of refinement during inference, which is partially what you are suggesting, is expensive because you need to either hoist it or go back and retype all prior uses of All in all, it is much simpler to say "variables get their types when they are defined" and those are the patterns we should promote.* We also need to accept that, as we work on the type system, we will discover Elixir idioms which are just not useful if we had types. Some struct features were designed exactly because we didn't have a type system. And other features are too lax. For example, the Overall, we want to keep the deprecations too a minimum, but I think we have to revisit our practices to make the best use of our new tools as we move forward. *There is a feature we will add to the type system called occurrence tying, which allows us to refine types in |
I was indeed considering hoisting it to the top of the scope (or just after I do agree that a separate pass just to pre-assign structs, resulting in a 2-5% slow down, would not be worth it. I know there's a delicate balance between inference quality and expense, so introducing things like this may not be worth it. If there were other cases, however, where inference could be improved by a separate pass, then the cost/benefit might change, but I'm not sure that those cases exist.
Agreed 👍. To be clear, though I do like the update syntax occasionally for the various reasons that others have stated, I'm not opposed to this change. Another thought that crossed my mind is that, in the future when typing information is exposed through some compiler API, editors will be able to re-expose this information through things like inlay hints. |
%URI{x | ...}
One difference between the original examples is if you mistype the field in the updated struct. The struct update syntax produces a compilation error, while the pattern match only produces a warning, if I am updating a field that doesn't exist. That can worse the experience for projects not using Would changing these warnings to errors be a part of the deprecation to keep the same developer experience? |
I would say having them as warnings is a benefit, as it allows compilation to continue and find other warnings, rather than stopping midway. In fact, we generally do our best to emit warnings and only emit errors when we cannot produce an actual artifact. |
I've been thinking about it some more, and we maybe missed the point of my comment a little. It's not really important whether it's an error or a warning technically. I feel it's the difference in the strength of feedback that makes the new proposed syntax worse. A typo in a field seems like a situation where the compiler knows the code will not work. And if it knows that I feel like it's good the compilation will fail and not let me run the code. I don't really care whether we use errors or warnings for that purpose. But using warnings without any other mechanism than So maybe this is not a point against this proposed deprecation, but rather a suggestion to have some multi-level warnings some of which could fail the compilation by default (but not in the same way as errors, as you mention above), if the compiler is sure the code is wrong. I am sure what is the right solution. But I know I myself would prefer using the old syntax for this very reason. |
It is very important to not break the user flow. Phoenix compiles during requests, tests compile before running. Being able to run and debug a program, even if it says that it will error, is better than aborting the user process and leave them to "fight the compiler" (or the type checker) as the only option available. Forcing compilation to fail can be very disruptive to the developer experience. If you really want that, then set |
This assumes that the field is missing because of a typo, but that may not be the only case. Imagine that we remove a field from a struct, while working on a subset of the code. In such case it can be useful to run the app and tests, without going and fixing all other places where the field is already used. |
Originally I wrote this:
But this is no longer true as of #14145. We do perform single pass inference now, which means we can find some bugs on the update syntax (but not all). Take this code: def bad_code(x) do
x.w
%Point{x | x: :zero}
end where x is a Point struct with fields
However, if we defined the struct upfront in pattern matching, we get this error instead:
So while we alleviate a bit the reasons for deprecating the syntax, it may be that finding more errors with better warnings are still enough reasons to discourage/deprecate it. Thoughts? |
I'd be curious how those "more errors" look like specifically. Because the better error on the example don't really come from deprecating the update syntax, but from making me add the struct on the pattern match, which I can do independently from the update syntax as well. Adding that pattern match I guess would be a quite natural reaction to seeing the typesystem consider |
The idea of the deprecation, since the beginning, is to nudge people towards pattern matching. The reason why you would use For completeness, here is an example of a warning that would not be caught with the current inference but would with pattern matching: def absolute_z(condition, point) do
point.w
if condition do
%Point{point | z: abs(point.z)}
else
point
end
end Any conditional is problematic because we can only assume that |
Isn't there some performance implications of using struct update syntax? I thought I had read something to that effect years ago. If there is no performance impact then deprecating the syntax then I see no reason not to. However, I think we need an equivalent function to replacement with that fails if a key is not already present in the map. Something like `Struct.put`. Then the formatter can migrate to that function.
|
We already have such syntax, it is |
Ahhh, okay. We're deprecating *struct* update syntax only not map update syntax as well. Understood.
|
I was a bit skeptical at first for the deprecation but this is a very convincing argument for me. +1 for being more assertive early on. I can imagine a couple of cases which can't be trivially be rewritten with pattern matching or might lose readablily, none are blockers IMHO but bringing it up for the sake of discussion:
|
I'd say the comprehension is the only tricky one, as it can lead to a subtle change of behaviour. And I would say if you wrote |
I think I somewhat missed the crux of the rationale previously (or maybe just forgot), it seems like the issue (from the types side) would be also solved by adding the pattern per: def bad_code(%Point{} = x) do
x.w
%Point{x | x: :zero}
end Personally I would use both already and my inclination is that removing one does not naturally lead someone to adding the other unless they understand why. Could it be solved with a warning or lsp message or a credo rule? Could it lead to a situation where the below code is produced by a user instead? On the face of it this would seem to be an even worse outcome for the code in question. def bad_code(x) do
x.w
%{x | x: :zero}
end I take your point about the signalling being inconsistent with the primitive types and I fully agree however I would like to add a slightly different perspective which is that in the trenches one typically treats each struct as their own individual types and from that angle the stats would most likely swing the other way where the the number of types in a given codebase that have return type signalling available ( Saying all this that I fully understand your position of not wanting to leave potential foot-guns laying around but I would be remiss if I didn't give the struct update syntax the defense it deserves. This feels like less of a foot gun and more of a "by the way you would get better type inference here if you added the pattern" which (i think) would be true for all functions that take structs as args, not just those that update them per the examples above. Either way thanks for everything and I trust the core team no matter which direction you choose, Cheers Tl;dr: Can we have our cake |
I think the struct update syntax is worth not-deprecating so long as it doesn't pose an obstruction to future work. Struct update syntax like |
Should perhaps this be addressed ultimately in the type system? for entry <- conf.entries do
%UploadEntry{entry | preflighted?: entry.preflighted? || entry.rfe in refs}
end There is theoretically enough information there to determine that Just a thought that if there is a possibility that the type system will one day be "smart enough" to catch that, then we may be removing the syntax unnecessarily. |
I have already address this in a later reply: #13974 (comment)
Right, the only reason it is being deprecated is because it poses obstruction to future work and code quality. The update syntax will give you fewer guarantees than pattern matching and, while we can keep on improving things, that will always be the case unless we start introducing false positives in dynamic code (i.e. warn about typing violations that won't happen in runtime, which is generally a worse idea). See the comment linked above and this one. In other words, we can confidently say that pattern matching on the struct will always be better than the struct update syntax. |
Another way to look at it is: statically typed languages have you declare types when the variable is defined, not when the variable is used. The struct update syntax is all about the latter. :) |
Instead, we should prefer pattern matching. With the type system, we already get the benefits of checking if fields exist or not, in a more reliable and extensible way.
Here is an example from LiveView:
However, the code above does not catch typos on
entry.ref
. This code, however, would:Since we attach type information to variables when they are defined.
The text was updated successfully, but these errors were encountered: