Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Convert tables and figures in the report to RDF with multimodal LLMs #4

Open
wbcbugfree opened this issue Jun 6, 2024 · 2 comments
Assignees

Comments

@wbcbugfree
Copy link
Collaborator

wbcbugfree commented Jun 6, 2024

The report contains a large number of tables and figures that contain much information not mentioned in the text. In this stage, we focus on converting the text to RDF, but these tables and figures also contain information that is crucial for the soil health knowledge graph. Multimodal LLMs are a feasible solution for extracting this information and converting it to RDF, and will be tested soon.

@wbcbugfree wbcbugfree self-assigned this Jun 6, 2024
@robknapen
Copy link

I want to try out LlamaParse soon-ish :-) At the moment it scans 1000 pages per day for free. So maybe you can already use on the report and check if results improve.

https://cloud.llamaindex.ai/parse

@wbcbugfree
Copy link
Collaborator Author

Refer to the prompt on processing figures, especially flowcharts and diagrams:

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants