💬 Prompt updates #48

tylermaran · 2024-10-07T01:02:33Z

No description provided.

pradhyumna85 · 2024-10-07T20:00:13Z

@tylermaran, I am thinking we could define a JSON file that holds constants such as the default system prompt, and have both the JS and Python SDKs load the default system prompt from it. This would reduce redundancy across zerox's SDKs.
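A rough sketch of what that could look like on the JS side, under a few assumptions: the key name `defaultSystemPrompt`, the helper `parseSharedConstants`, and the placeholder prompt text are all hypothetical and not taken from zerox. The JSON is inlined as a string here so the snippet runs on its own; in practice each SDK would read the same file from a shared location.

```javascript
// Hypothetical contents of a shared constants file. The key name and the
// prompt text are placeholders, not zerox's actual values.
const sharedConstantsJson = `{
  "defaultSystemPrompt": "<placeholder prompt text>"
}`;

// Hypothetical helper: parse the shared JSON and check that the one key
// both SDKs rely on is present and non-empty.
const parseSharedConstants = (jsonText) => {
  const constants = JSON.parse(jsonText);
  if (
    typeof constants.defaultSystemPrompt !== "string" ||
    constants.defaultSystemPrompt.length === 0
  ) {
    throw new Error("shared constants must define a non-empty defaultSystemPrompt");
  }
  return constants;
};

const { defaultSystemPrompt } = parseSharedConstants(sharedConstantsJson);
console.log(defaultSystemPrompt);
```

The Python SDK could apply the same check after `json.load`, so a malformed shared file fails loudly in both SDKs instead of silently falling back to different prompts.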

tylermaran · 2024-10-13T18:02:59Z

Hey @pradhyumna85, agreed. We could definitely have a shared prompt library, as well as some shared tests, since it's really hard to tell how the output changes with a prompt change.

On the subject of shared code, I'm planning on writing some tests that do something like:

- Run ~10 documents through zerox.
- Do a regex test on the resulting markdown for a bunch of discrete indicators that we know it should contain.

For example, if we ran this doc: https://github.com/getomni-ai/zerox/blob/main/examples/cs101.pdf

We could do a regex test:

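// Indicators we expect to find in the markdown generated from cs101.pdf.
const bits = [
  "byte",
  "16-bit signed 2s complement integer",
  "Short",
  "32-bit signed 2s complement integer",
  "Integer",
  "64-bit signed 2s complement integer",
  "Long",
  "float",
  "32-bit IEEE 754 floating point number",
];

// Escape regex metacharacters so each indicator is matched literally.
const escapeRegExp = (s) => s.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");

// Count how many times any of the indicators appears in the text.
const countMatches = (text) => {
  const pattern = new RegExp(bits.map(escapeRegExp).join("|"), "g");
  const matches = text.match(pattern);
  return matches ? matches.length : 0;
};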
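A total match count can hide which indicator went missing, so a per-indicator check might be a useful companion to the count above. This is only a sketch: `findMissingIndicators` is a hypothetical helper, and the sample markdown is made up for illustration rather than real zerox output.

```javascript
// Hypothetical helper: return the indicators that do not appear in the markdown.
const findMissingIndicators = (markdown, indicators) =>
  indicators.filter((indicator) => !markdown.includes(indicator));

// Illustrative sample only; this is not real zerox output.
const sampleMarkdown = [
  "| Type | Description |",
  "| byte | 8-bit signed 2s complement integer |",
  "| Short | 16-bit signed 2s complement integer |",
].join("\n");

const expected = ["byte", "Short", "16-bit signed 2s complement integer", "Long"];

// "Long" is the only expected indicator absent from the sample above.
const missing = findMissingIndicators(sampleMarkdown, expected);
console.log(missing);
```

A test could then assert that `missing` is empty and print the list on failure, which makes it easier to see what a prompt change dropped.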

tylermaran added 2 commits September 28, 2024 22:01

1.0.29 (58a2402)

prompt updates (490cc00)

pradhyumna85 mentioned this pull request Oct 7, 2024

Feat. Postprocessing control - custom page separator, postprocess function etc #40

Open

pradhyumna85 mentioned this pull request Oct 11, 2024

Missing Content in PDF to Markdown Conversion #50

Closed
