Cast using Grok's data conversion syntax #4928

philrz · 2023-12-07T17:26:04Z

tl;dr

The reference Grok implementation has a :type "type conversion" syntax that may appear in patterns. This is not yet supported in Zed's grok() function.

Details

From Elastic's page for the Grok filter plugin for Logstash:

Optionally you can add a data type conversion to your grok pattern. By default all semantics are saved as strings. If you wish to convert a semantic’s data type, for example change a string to an integer then suffix it with the target data type. For example %{NUMBER:num:int} which converts the num semantic from a string to an integer. Currently the only supported conversions are int and float.
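To make the quoted behavior concrete, here is a minimal sketch in Python of how a `%{SYNTAX:SEMANTIC:TYPE}` token might be handled. It is not Zed's or Logstash's implementation: the two-entry `PATTERNS` table, the `compile_grok` and `grok` helpers, and the simplified `NUMBER` regex are all assumptions made for illustration. Only the `int` and `float` conversions come from the documentation quoted above.

```python
import re

# Hypothetical, tiny pattern library; real Grok ships many more definitions,
# and its NUMBER pattern is more elaborate than this simplified one.
PATTERNS = {
    "NUMBER": r"[+-]?(?:\d+(?:\.\d+)?|\.\d+)",
    "WORD": r"\w+",
}

# The two conversions named in the Logstash documentation quoted above.
CONVERTERS = {"int": int, "float": float}

# Matches %{SYNTAX}, %{SYNTAX:semantic} and %{SYNTAX:semantic:type}.
TOKEN = re.compile(r"%\{(\w+)(?::(\w+))?(?::(\w+))?\}")


def compile_grok(pattern):
    """Translate a grok pattern into a regex plus a field -> converter map."""
    casts = {}

    def expand(match):
        syntax, semantic, type_name = match.groups()
        body = PATTERNS[syntax]
        if semantic is None:
            # No field name given: match the text but do not capture it.
            return f"(?:{body})"
        if type_name is not None:
            # Remember the requested conversion for this field.
            casts[semantic] = CONVERTERS[type_name]
        return f"(?P<{semantic}>{body})"

    return re.compile(TOKEN.sub(expand, pattern)), casts


def grok(pattern, text):
    """Return the named captures as a dict, or None if the text does not match."""
    regex, casts = compile_grok(pattern)
    match = regex.search(text)
    if match is None:
        return None
    # Fields without a type suffix stay strings, which is Grok's default.
    return {
        name: casts.get(name, str)(value)
        for name, value in match.groupdict().items()
    }
```

With this sketch, `grok("%{WORD:name} %{NUMBER:num:int}", "widgets 42")` yields `num` as the integer `42`, while the same pattern without the `:int` suffix leaves `num` as the string `"42"`. Note that the sketch simply calls Python's `int()`, so an input such as `"3.5"` with an `:int` suffix raises an error; how a real implementation should treat that case is left open here.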

This syntax is used in many of the examples out on the Internet that users may find as they're learning Grok with intent to apply it in Zed (e.g., here).

The initial grok() implementation added to Zed via #4827 accepts the syntax but effectively ignores it, so the parsed values become strings. We rationalized this simplification since the user can apply Zed's casting functions downstream in the pipeline to turn these strings into richer Zed types if they wish. However, in a complex log-parsing config this could mean repeating many field names, which makes the Zed less readable and harder to maintain. Therefore we may want to add support for this syntax at some point.
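The repetition concern can be illustrated with a small Python analogue of "parse first, cast downstream". This is not Zed code: the log line, the regex, and the field names are hypothetical, and Python's `int()` and `float()` stand in for Zed's casting functions.

```python
import re

# Hypothetical log line and pattern, for illustration only.
LINE = "GET /index.html 200 1532 0.042"
REGEX = re.compile(
    r"(?P<method>\w+) (?P<path>\S+) (?P<status>\d+) "
    r"(?P<bytes>\d+) (?P<duration>[\d.]+)"
)

# Step 1: parsing alone yields a string for every field.
record = REGEX.match(LINE).groupdict()

# Step 2: a separate casting stage has to name each numeric field again.
record["status"] = int(record["status"])
record["bytes"] = int(record["bytes"])
record["duration"] = float(record["duration"])
```

Each numeric field appears once in the pattern and again in the casting stage. With inline type suffixes, the second mention would not be needed.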

Note that this could ultimately create a unique differentiator in Zed: other JSON-centric implementations of Grok can only convert to the small set of JSON types, whereas the Zed implementation could support the full set of rich Zed data types.


philrz · 2024-08-16T20:57:19Z

A user found themselves asking about this functionality in a recent community Slack thread. In their own words:

i guess this probably couldn’t easily be dealt with … using NONNEGINT results in a string in the zson … but I get that at a Go code level, it’s just a regex and there’s no real data there that could tell it “hey, this is a number, don’t write it to zson as a string”

philrz mentioned this issue Dec 7, 2023

Parsing text lines into records (Grok) #4140

Closed

philrz added the community label Aug 16, 2024
