-
Notifications
You must be signed in to change notification settings - Fork 380
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
Signed-off-by: Nicolas Werner <[email protected]>
- Loading branch information
1 parent
81c7893
commit 6868738
Showing
1 changed file
with
151 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,151 @@ | ||
# MSC2781: Deprecate fallbacks in the specification | ||
|
||
Currently replies require clients to send and parse a fallback representation of | ||
the replied to message. Something similar is planned for edits. While at least | ||
in theory fallbacks should make it simpler to start supporting replies in a new | ||
client, they actually introduce a lot of complexity and implementation issues | ||
and block a few valuable features. This MSC proposes to deprecate and eventually | ||
remove those fallbacks. It is an alternative to | ||
[MSC2589](https://github.com/matrix-org/matrix-doc/pull/2589), which intends to | ||
double down on fallbacks. | ||
|
||
### Issues with the current fallbacks | ||
|
||
#### Stripping the fallback | ||
|
||
To reply to a reply, a client needs to strip the existing fallback of the first | ||
reply. Otherwise replies will just infinitely nest replies. [While the spec doesn't necessarily require that](https://github.com/matrix-org/matrix-doc/issues/1541), | ||
not doing so risks running into the event size limit, but more importantly, it | ||
just leads to a bad experience for clients actually relying on the fallback. | ||
|
||
Stripping the fallback sadly is not trivial. Most clients did that wrong at some | ||
point. At the point of writing this, FluffyChat seems to fail stripping the | ||
fallback, while Riot/Element did fail to do so in the past, when `<mx-reply>` | ||
tags were not evenly matched. Since clients can send arbitrary byte sequences, | ||
every client needs to basically parse untrusted input and sanitize it. A normal | ||
html parser will fail stripping a `</mx-reply><mx-reply></mx-reply>` sequence in | ||
some cases. If you use a regex to strip the fallback, you may be able to be | ||
attacked using regex DOS attacks (although that is somewhat mitigated by the | ||
maximum event size). In the end there are quite a few edge cases, that are not | ||
covered in the existing specification, since it doesn't specify the exact rules | ||
to strip a fallback at all. | ||
|
||
Stripping the fallback from body is even more complicated, since there is no way | ||
to distinguish a quote from a reply reliably. | ||
|
||
#### Creating a new fallback | ||
|
||
To create a new fallback, a client needs to add untrusted html to its own | ||
events. This is an easy attack vector to inject your own content into someone | ||
elses reply. While this can be prevented with enough care, since Riot basically | ||
had to fix this issue twice, it can be expected that other clients can also be | ||
affected by this. | ||
|
||
#### Requirement of html for replies | ||
|
||
The spec requires rich replies to have a fallback using html: | ||
|
||
> Rich replies MUST have a format of org.matrix.custom.html and therefore a formatted_body alongside the body and appropriate msgtype. | ||
This means you can't reply using only a `body` and you can't reply with an | ||
image, since those don't have a `formatted_body` property currently. This means | ||
a text only client, that doesn't want to display html, still needs to support | ||
html anyway and that new features are blocked, because of fallbacks. | ||
|
||
This is also an issue with edits, where the combination of edit fallback and | ||
reply fallback is somewhat interesting: You need to send a fallback, since the | ||
event is still a reply, but you can't send a reply relation, since this is an | ||
edit. So for clients relying on the edit fallback, you send a broken reply | ||
fallback, that doesn't get stripped, even when the client supports rich replies. | ||
|
||
#### Replies leak history | ||
|
||
A reply includes the `body` of another event. This means a reply to an event can | ||
leak data to users, that joined this room at a later point, but shouldn't be | ||
able to see the event because of visibility rules or encryption. While this | ||
isn't a big issue, there is still an issue about it: https://github.com/matrix-org/matrix-doc/issues/1654 | ||
|
||
Replies are also sometimes localized. In those cases they leak the users | ||
language selection for their client, which may be personal information. | ||
|
||
#### Low value of fallbacks | ||
|
||
The above issues would be somewhat acceptable, if reply fallbacks would provide | ||
some value to clients. Generally they don't though. The Qt html renderer breaks | ||
on `<mx-reply>` tags, so you need to strip them anyway to render replies. | ||
Bridges usually try to bridge to native replies, so they need to strip the reply | ||
part (https://github.com/matrix-org/matrix-doc/issues/1541). Even the IRC bridge | ||
seems to send a custom fallback, because the default fallback is not that | ||
welcome to the IRC crowd. | ||
|
||
Clients that actually use the fallback (original Riot/Android for example) tend | ||
to do so for quite some while, because the fallbacks are "good enough", but in | ||
the end that provides a worse experience for everyone involved. For example you | ||
couldn't see what image was replied to for the longest time. Just "X replied to | ||
an image with: I like this one" is not valuable, when 3 images were sent. | ||
|
||
Only replies to actual text messages have a somewhat reasonable fallback. The | ||
other ones do not provide any more value than a plain "this is a reply" tag, | ||
unless the client also already supports event links. | ||
|
||
## Proposal | ||
|
||
Deprecate the rich reply fallback. Clients should stop sending them and should | ||
consider treating `<mx-reply>` parts as either something to be unconditionally | ||
stripped or as something to be escaped as invalid html. In the future the | ||
fallback should be removed from the spec completely with only a note left, that | ||
it may exist in old events. | ||
|
||
Furthermore, no fallback should be used for edits, since just adding an asterisk | ||
before the same message does not provide much value to users and it complicates | ||
the implementation of edits for no tangible benefit. | ||
|
||
The fallback for more niche features, like in room verification can stay, since | ||
that feature is actually somewhat hard to implement and some clients may never | ||
support encryption. | ||
|
||
As a result of this, you would be able to reply with an image or edit a video. | ||
New clients would also be able to implement edits and replies more easily, as | ||
they can sidestep a lot of pitfalls. | ||
|
||
## Potential issues | ||
|
||
Obviously you can't remove the fallback from old events. As such clients would | ||
still need to do something with them in the near future. I'd say just not | ||
handling them in a special way should be okay after some unspecified period of | ||
time. | ||
|
||
Clients not implementing rich replies or edits may show some slightly more | ||
confusing messages to users as well. I'd argue though that in most cases, the | ||
reply is close enough to the replied to message, that you would be able to guess | ||
the correct context. Replies would also be somewhat easier to implement and | ||
worst case, a client could very easily implement a little "this is a reply" | ||
marker to at least mark replies visually. | ||
|
||
Same applies to edits. If 2 very similar messages appear after one another, | ||
someone new to online messaging would assume, this is a correction to the | ||
previous message. That may even be more obvious to them than if the message was | ||
prefixed with a `*`, since that has been confusing to users in the past. Since | ||
edits would now look exactly like a normal message, they would also be | ||
considerably easier to implement, since you jus need to replace the former | ||
message now, similar to a redaction, and not merge `content` and `new_content`. | ||
|
||
## Alternatives | ||
|
||
[MSC2589](https://github.com/matrix-org/matrix-doc/pull/2589): This adds the | ||
reply text as an addional key. While this solves the parsing issues, it | ||
doesn't address the other issues with fallbacks. | ||
|
||
One could also just stick with the current fallbacks and make all clients pay | ||
the cost for a small number of clients actually benefitting from them. | ||
|
||
## Security considerations | ||
|
||
Removing the fallback from the spec may lead to issues, when clients experience | ||
the fallback in old events. This should not add any security issues the | ||
client didn't already have from interpreting untrusted html, though. In all | ||
other cases this should **reduce** security issues. | ||
|
||
## Unstable prefix | ||
|
||
Seems unnecessary, since this only removes stuff. |