Codec Vectors #12

davxy · 2024-08-30T16:33:03Z

ASN.1 schema
JAM compact integers codec

Once successfully processed by some of the teams, the codec will be applied to:

Partially address #10

vekexasia · 2024-08-31T07:49:18Z

I believe the disputes_exstrinsic.json test is wrong.

the encoded value starts with
02dd1b65c036547750d2f84ff4c6fac7de56944658530a62e81c6cc290087440d00300000001
mine is
02dd1b65c036547750d2f84ff4c6fac7de56944658530a62e81c6cc290087440d00301

that 1 is/shouldbe the vote which by formula 277 is encoded with generic E.

According to 272, x being between 0 and 2^(7) means that l` is 0 hence (2^8 - 2^(8-0) + 1/1 = 1

looks like E_4 was applied instead of E

davxy · 2024-08-31T09:34:17Z

@vekexasia 03000000 is the verdict "epoch index" (in json "age", which is an u32).
Follows the bool which is encoded as 01 as expected. Looks like in your string the epoch index in instead encoded as just 03 (a u8).

I think eq. 277 encoding of a must be explicitly specified as E_4 (gavofyork/graypaper#67)

From (272)

Note that at present this is utilized only in encoding the length prefix of variable-length sequences

Which make sense.

vekexasia · 2024-08-31T09:41:09Z

ha! yes sorry the a component has nothing specified so I assumed E not E_4 . but indeed it makes sense to constantly encode it as E_4

vekexasia · 2024-08-31T12:56:40Z

Also I think i found another 2 issues while tesitng assurances.

First of all it seems you're using E_2 instead of E when encoding validatorIndex. the codec for Ea is not formally specified hence I believe that we should use E or formally write the codec in the paper. To be honest I would just stick to E even if _2 is more than enough for jam.
Are you testing the decoding from bin? because (276) has no intrinsic length discriminator. this means that either we specify Ea codec to include a discriminator on the bitsequence length or I find it impossible to sequentially decode the extrinsic unless we use a bottom-up approach which I think is not the ideal solution.

davxy · 2024-08-31T14:17:46Z

@vekexasia

First of all it seems you're using $E_2$ instead of $E$

This situation is analogous to the previous one. I align with the guidance provided in the GP, just below equation 272, which I report here:

"Note that at present this is utilized only in encoding the length prefix of variable-length sequences."

Introducing variable-length encoding for arbitrary fields (e.g., using $E$ for some u16 fields and $E_2$ for others) is a bit of a trap for implementers and may lead to interoperability issues.

While it's true that, the given vector has at most $V$ entries, this approach could save a small amount of bytes (less than $V$), I question whether such a minor gain is truly worth. In my opinion, it may not be. However, I acknowledge that my perspective may not be definitive in this matter.

For now, I'll adhere to the guidance provided just below equation 272. Should it become a requirement, ideally with a clear rationale, I will gladly adjust the vectors accordingly. :-)

If is not required then maybe is the case to make it explicit in the GP.

cc @gavofyork ^^^

Are you testing the decoding from bin?

If you are referring to $f \in \mathbb{B}_C$ (eq. 123), this is a fixed-length bit sequence, so there is no need for a length discriminator.

In the proposed vectors, C is 2. I've updated the README with some notes about this subject

vekexasia · 2024-08-31T16:31:59Z

From (272)

Note that at present this is utilized only in encoding the length prefix of variable-length sequences

oh my. I'm sorry I must have missed this from your previous statement. I must also have missed it from the graypaper when I implemented the codecs a while ago.

Ultimately it makes sense to have a stricter codec ( Eg: E₂ instead of E) but then I believe it should be explicit in the paper.

For example it makes a lot of sense to use E₂ for the validatorIndex but leaving it open to the developer implementation should be avoided.

I believe one of the reasons the graypaper exists is to have a formal specification and avoid relying on "pseudocode" or reference implementation.

vekexasia · 2024-08-31T19:39:42Z

Question.

https://github.com/davxy/jam-test-vectors/blob/codec-vectors/codec/data/work_result_1.json#L1-L9

I see you set gas_ratio < 0 but if i am not mistaken g in L defined in (121) forbids it to be negative being in N_G which is an alias of N_2⁶⁴.

I know the readme states that values may conflict but if my previous statement above is right, the codec should also forbid encoding negative values:

(282):

(271):

davxy · 2024-09-01T10:19:52Z

@vekexasia fixed

vekexasia · 2024-09-01T11:55:19Z

hey @davxy looks like the work report (Set W) is also wrongly serialized.

According to (283) first entry should be x_a while it seems your binary has x_s as first entry

davxy · 2024-09-01T12:22:32Z

@vekexasia this has been reported here #9 (comment)

see this gavofyork/graypaper#67 for a modification proposal to the GP

codec/schema.asn

Co-authored-by: Xiliang Chen <[email protected]>

emielsebastiaan · 2024-09-12T21:02:06Z

codec/data/assurances_extrinsic.json

+[
+    {
+        "anchor": "0x0cffbf67aae50aeed3c6f8f0d9bf7d854ffd87cef8358cbbaa587a9e3bd1a776",
+        "bitfield": "0x01",


If we'd like to be more explicit about the actual value of the bitsequence bitfield (and not the octet representation, I'd change this JSON-testvector to show the following value. The size of the fixed length bitsequence is set to 2 (GP_constant_C). The suggested change below avoids any confusion about its length. That said, this is not really necessary, since the byte representation is also correct (and JSON will eventually not be used).

"bitfield": [ true, false ],

boymaas · 2024-09-13T10:39:29Z

@davxy I have a question regarding the header binary test vectors: I'm curious as to why we use four bytes to encode an integer as defined in (272) when a single byte would suffice. We can determine the length by reading the prefix, even though it's defined as a u32 we can pack it in one byte. Isn't it the purpose of (272) to make it as compact as possible?

boymaas · 2024-09-13T10:54:58Z

Another question concerning the validator_count: I assume that this count can change under certain conditions. Would it be more practical to make these validators variable-length and prefix them?

davxy · 2024-09-13T12:14:14Z

@boymaas The reasons are design decisions, so you should eventually raise your concerns in the graypaper.
The test vectors just follows what prescribed by the paper.

In any case, IMHO, it is better to maintain a uniform design for u16/u32 encoding in the protocol rather than saving 1 byte here and there by introducing pitfalls related to variable-length encoding.

xlc · 2024-09-16T01:43:04Z

This should be good to merge?

davxy · 2024-09-16T07:24:33Z

@xlc As far as I'm concerned, yes, I have nothing to add to this PR, and we haven't received any pushback so far.
However, I do not have the permissions to perform the merge myself.
I will proceed with regenerating the other vectors using the new compact codec.

davxy · 2024-09-16T12:08:36Z

Now is ready :-)

davxy added 3 commits August 30, 2024 16:40

Codec test vectors

40afa00

Use new compact codec for variable length sequences

04f8616

README

e179af8

davxy marked this pull request as ready for review August 30, 2024 16:35

davxy added 3 commits August 31, 2024 16:27

Improve README

e4c8bca

Update README with var-length interpretation

33f9445

README update

c82e04f

davxy added 3 commits September 1, 2024 11:38

ASN.1 schema and validation

9945270

README udpate

c8c49af

Work item gas is N_{2^64}

c7d09ee

davxy mentioned this pull request Sep 1, 2024

Disputes, Verdicts and Judgements STF Test Vectors (GP Section 10) #9

Open

7 tasks

Update schema.asn

a3c8dbe

xlc reviewed Sep 1, 2024

View reviewed changes

codec/schema.asn Outdated Show resolved Hide resolved

davxy and others added 3 commits September 2, 2024 09:16

Update schema.asn

63aa879

Co-authored-by: Xiliang Chen <[email protected]>

Update binary files: WorkItem::index U32 -> U16

f041922

Add some size constraints to sequences

c2f50c2

basedafdev mentioned this pull request Sep 4, 2024

Serialization adjustments gavofyork/graypaper#67

Merged

emielsebastiaan reviewed Sep 12, 2024

View reviewed changes

davxy added 2 commits September 16, 2024 14:05

Small changed in item name

2ffa167

Adjust ASN.1 schema accordingly

a0c0c44

gavofyork merged commit 7a96598 into w3f:master Sep 18, 2024
1 check passed

davxy deleted the codec-vectors branch September 18, 2024 03:06

davxy mentioned this pull request Oct 2, 2024

Provide JAM codec test vectors #10

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Codec Vectors #12

Codec Vectors #12

davxy commented Aug 30, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024

davxy commented Aug 31, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024

davxy commented Aug 31, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024

vekexasia commented Aug 31, 2024 •

edited

Loading

davxy commented Sep 1, 2024

vekexasia commented Sep 1, 2024

davxy commented Sep 1, 2024 •

edited

Loading

emielsebastiaan Sep 12, 2024 •

edited

Loading

boymaas commented Sep 13, 2024

boymaas commented Sep 13, 2024

davxy commented Sep 13, 2024

xlc commented Sep 16, 2024

davxy commented Sep 16, 2024

davxy commented Sep 16, 2024

Codec Vectors #12

Codec Vectors #12

Conversation

davxy commented Aug 30, 2024 • edited Loading

vekexasia commented Aug 31, 2024

davxy commented Aug 31, 2024 • edited Loading

vekexasia commented Aug 31, 2024 • edited Loading

vekexasia commented Aug 31, 2024

davxy commented Aug 31, 2024 • edited Loading

vekexasia commented Aug 31, 2024

vekexasia commented Aug 31, 2024 • edited Loading

davxy commented Sep 1, 2024

vekexasia commented Sep 1, 2024

davxy commented Sep 1, 2024 • edited Loading

emielsebastiaan Sep 12, 2024 • edited Loading

Choose a reason for hiding this comment

boymaas commented Sep 13, 2024

boymaas commented Sep 13, 2024

davxy commented Sep 13, 2024

xlc commented Sep 16, 2024

davxy commented Sep 16, 2024

davxy commented Sep 16, 2024

davxy commented Aug 30, 2024 •

edited

Loading

davxy commented Aug 31, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024 •

edited

Loading

davxy commented Aug 31, 2024 •

edited

Loading

vekexasia commented Aug 31, 2024 •

edited

Loading

davxy commented Sep 1, 2024 •

edited

Loading

emielsebastiaan Sep 12, 2024 •

edited

Loading