[Bug] From field and timestamp circoms generated with negate regexes are invalid. #19

SoraSuegami · 2023-09-30T03:02:30Z

If from field and timestamp circoms are generated with negation regexes, their tests do not pass.
https://github.com/zkemail/zk-regex/blob/feat/invalid_dfa/packages/circom/circuits/common/from_addr.json
https://github.com/zkemail/zk-regex/blob/feat/invalid_dfa/packages/circom/circuits/common/timestamp.json

In particular, my current implementation generates a DFA by regarding the negation [^abc] as a group [\u{ff}abc]=(\u{ff}|a|b|c). When generating a circom based on the DFA, if the edge string contains \u{ff}, it makes constraints that the state moves only if the input character is not in the list of the edge string except for \u{ff}.
We can regard this modification of constraints as the modification of edges in the already-generated DFA.
Therefore, it turns DFA back into an NFA with multiple possible edges.

One simple idea is that we convert [^abc], to a non-negate regex (\u00 | \u01 | \u02 | ... | \uff), which means a group regex of all 1-byte characters except for ones in the group of the negate regex, before generating the first DFA.
However, it will not be able to reduce the circom size because the resulting DFA will be still complex.
I am not sure this complexity is due to theoretical limitations or not.

SoraSuegami · 2023-09-30T14:54:33Z

New idea:

Change the negation [^abc] to a group [\u{ff}abc]=(\u{ff}|a|b|c) and generate its minimized DFA.
The DFA is converted to an object of NFA compatible with nfaToDfa in regex.js.
Find nodes whose edge contains a character \u{ff}.
For each node in 3, Find other edges that do not contain \u{ff}. Let C be a set of characters on those edges.
For each node in 3, if the edge with \u{ff} goes to state x, add characters in C to that edge in the converted NFA.
Apply transitions of NFA->DFA->minimized DFA to the NFA in 5 again.

SoraSuegami · 2023-10-03T09:41:06Z

This is solved based on the first approach.
9273a2e

SoraSuegami closed this as completed Oct 3, 2023

SoraSuegami mentioned this issue Oct 11, 2023

Support international emails: Add regex support for . with negation #23

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug] From field and timestamp circoms generated with negate regexes are invalid. #19

[Bug] From field and timestamp circoms generated with negate regexes are invalid. #19

SoraSuegami commented Sep 30, 2023

SoraSuegami commented Sep 30, 2023

SoraSuegami commented Oct 3, 2023

[Bug] From field and timestamp circoms generated with negate regexes are invalid. #19

[Bug] From field and timestamp circoms generated with negate regexes are invalid. #19

Comments

SoraSuegami commented Sep 30, 2023

SoraSuegami commented Sep 30, 2023

SoraSuegami commented Oct 3, 2023