Adds IonWriter_1_1 interface #656

popematt · 2023-11-22T21:35:50Z

Issue #, if available:

Description of changes:

Adds an interface that can be used for raw writers (and extended for other writer types that build on top of it).

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

tgregg · 2023-11-22T22:53:07Z

src/main/java/com/amazon/ion/impl/Ion11Writer.kt

+ *
+ * This interface allows the user to write Ion data without being concerned about which output format is being used.
+ */
+interface Ion11Writer {


We can change this if necessary, but we should decide on a convention for naming Ion 1.1 classes/methods and stick with it. I think we've both used _1_1 in some places, which is ugly due to the underscores but at least doesn't look like the number eleven. If we go with 11 as a convention, should it be Ion11Writer or IonWriter11?

I kind of like Ion11..., but I'll go with IonWriter_1_1 for now to be consistent with what we already have.

tgregg · 2023-11-22T22:55:14Z

src/main/java/com/amazon/ion/impl/Ion11Writer.kt

+    fun writeAnnotations(annotation0: Long, annotation1: Long)
+
+    /** Writes three or more annotations for the next value. */
+    fun writeAnnotations(annotation0: Long, annotation1: Long, vararg annotations: Long)


Do we need variants that allow mixing Strings and Longs, for use with the FlexSym encoding?

That's a good point. I'll add overloads that accept SymbolTokens.

SymbolToken isn't ideal because it may require allocating a new one just to specify that we want text and not a symbol ID, or vice versa. Is there some way to finesse this in Kotlin? Otherwise, maybe the API is addAnnotation with variants for both String and Long...

I could create inline value classes.

@JvmInline value class SymbolText(val text: String): SymbolToken { override fun getText(): String = text override fun assumeText(): String = text override fun getSid(): Int = 0 } @JvmInline value class SymbolId(val id: Int): SymbolToken { override fun getText(): String? = null override fun assumeText(): Nothing = throw UnknownSymbolException(id) override fun getSid(): Int = id }

If that's not okay, I can change to something like this.

fun addAnnotation(annotation: CharSequence) fun addAnnotation(annotation: Long) fun writeAnnotations()

Yeah, the inline value classes look like a good start. Let's go with that for now. (Note: the SID for SymbolText should be -1 (there's an UNKNOWN_SID = -1 constant somewhere), since 0 is a valid SID.

Actually, the value classes might not play nice with Java. I'll have to double check that.

I ended up just adjusting the contract for writeAnnotations so that it is legal to call more than once preceding a value. I don't want to have a mix of addAnnotations and writeAnnotations because it would bloat the number of APIs for writing annotations and because it could be confusing having two different ways of writing annotations with different semantics.

tgregg · 2023-11-22T22:56:07Z

src/main/java/com/amazon/ion/impl/Ion11Writer.kt

+     * Writes the field name for the next value. Must be called while in a struct and must be called before [writeAnnotations].
+     * @throws com.amazon.ion.IonException if annotations are already written for the value or if not in a struct.
+     */
+    fun writeFieldName(text: String)


When will this be written inline vs as a symbol ID?

Implementation detail. The raw writer will write the string method as inline field name and the long method as a SID. A configuration-driven, managed writer may choose to intern the field name in the symbol table, but it also depends on what sort of struct the writer is currently in.

tgregg · 2023-11-22T22:58:57Z

src/main/java/com/amazon/ion/impl/Ion11Writer.kt

+    fun writeTimestamp(value: Instant)
+
+    fun writeSymbol(id: Long)
+    fun writeSymbol(text: CharSequence)


We use String some places and CharSequence in others here. What's the rationale for each?

When is this written inline vs as a symbol ID?

The String vs CharSequence issue is something I need to clean up. No rationale for having both.

Re. When inline be symbol ID?
That's an implementation detail. The raw writer will write the string arg as an inline symbol, but a "managed" writer could choose to intern the symbol text and write a numeric ID.

I don't think it's quite that simple, because (as with all symbol tokens) the user may want to specify that some symbols should be inlined rather than interned. Or, we need to provide a detection mechanism with some configurable options to allow the user to influence when things should be inlined vs. interned. Is that what you had in mind?

Yes, what I had in mind is configuration driven "managed" writers. I say writers (plural) because I think we''ll want managed writers for both text and binary so that the text writer will also be able to write out templates.

I suspect that users who care about how individual symbols are encoded will probably want to manage their own symbol table too, so they can just use the raw writer, which would give them that control.

I suspect that users who care about how individual symbols are encoded will probably want to manage their own symbol table too

I don't think that's necessarily true, but I'm happy to move forward for now.

Adds Ion11Writer interface

65f80aa

popematt assigned tgregg Nov 22, 2023

tgregg reviewed Nov 22, 2023

View reviewed changes

popematt changed the title ~~Adds Ion11Writer interface~~ Adds IonWriter_1_1 interface Nov 22, 2023

Adds suggested changes

9782d24

popematt force-pushed the raw11writer branch from 7f36f3e to 9782d24 Compare November 24, 2023 20:56

popematt added 2 commits November 24, 2023 14:31

Changes symbol and template IDs to int; remove some extraneous functions

403cacc

Remove method to write timestamp from LocalDate

478f534

popematt force-pushed the raw11writer branch from 4a2c25b to 478f534 Compare November 27, 2023 17:25

tgregg approved these changes Nov 27, 2023

View reviewed changes

popematt merged commit 085ea3f into amazon-ion:ion-11-encoding Nov 27, 2023
17 of 27 checks passed

tgregg pushed a commit that referenced this pull request Jan 22, 2024

Adds IonWriter_1_1 interface (#656)

9d74734

tgregg pushed a commit that referenced this pull request Feb 10, 2024

Adds IonWriter_1_1 interface (#656)

92f5d6c

tgregg mentioned this pull request Feb 21, 2024

Evaluating user-defined template macros #730

Open

tgregg pushed a commit that referenced this pull request May 2, 2024

Adds IonWriter_1_1 interface (#656)

0a09a8e

tgregg pushed a commit that referenced this pull request Jun 28, 2024

Adds IonWriter_1_1 interface (#656)

a89e875

tgregg pushed a commit that referenced this pull request Sep 9, 2024

Adds IonWriter_1_1 interface (#656)

d5ca028

tgregg pushed a commit that referenced this pull request Dec 13, 2024

Adds IonWriter_1_1 interface (#656)

e895835

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds IonWriter_1_1 interface #656

Adds IonWriter_1_1 interface #656

popematt commented Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

popematt Nov 24, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 22, 2023

popematt Nov 22, 2023

tgregg Nov 27, 2023

Adds IonWriter_1_1 interface #656

Adds IonWriter_1_1 interface #656

Conversation

popematt commented Nov 22, 2023

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment