-
Dear authors, I would like to ask how I can use part of your work to clean and compress a raw HTML file into a new, smaller HTML file. After reading the official website, I tried doing this with the Python library functions you provide. Is that the right approach?
Replies: 1 comment
-
Although you may find the library useful for cleaning HTML files, that is not its primary goal. Our goal is to process HTML files so they can be ingested by LLMs for predicting web actions.
You can use the DMR retriever to dynamically find relevant candidates given a context (an action history similar to those in WebLINX). For a concrete example, see: https://github.com/McGill-NLP/webllama/blob/main/examples/complete/run_all.py
If I understand your question correctly, you can use the DMR model as part of your automation pipeline to get candidates automatically from the raw HTML, the bounding box coordinates, and the action/dialogue history.
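If all you need is a smaller HTML file rather than candidate retrieval, a generic pass with the Python standard library may be enough. The sketch below is not part of this library's API: `HtmlCompressor`, `compress_html`, and `DROP_TAGS` are hypothetical names chosen for illustration, and the choice of which tags to drop is an assumption you should adjust. It removes `script`, `style`, and `noscript` elements and comments, and collapses whitespace runs to a single space. It does not special-case `<pre>`, where whitespace is significant.

```python
import re
from html import escape
from html.parser import HTMLParser

# Assumption: these elements are not needed downstream, so their content is dropped.
DROP_TAGS = {"script", "style", "noscript"}


class HtmlCompressor(HTMLParser):
    """Re-emits the HTML it parses, minus dropped elements and comments."""

    def __init__(self):
        super().__init__(convert_charrefs=True)
        self.parts = []       # output fragments, joined at the end
        self.skip_depth = 0   # > 0 while inside a dropped element

    def handle_starttag(self, tag, attrs):
        if tag in DROP_TAGS:
            self.skip_depth += 1
        elif not self.skip_depth:
            # Keep the start tag exactly as written in the source.
            self.parts.append(self.get_starttag_text())

    def handle_startendtag(self, tag, attrs):
        # Self-closing tags such as <br/> have no separate end tag.
        if tag not in DROP_TAGS and not self.skip_depth:
            self.parts.append(self.get_starttag_text())

    def handle_endtag(self, tag):
        if tag in DROP_TAGS:
            self.skip_depth = max(0, self.skip_depth - 1)
        elif not self.skip_depth:
            self.parts.append(f"</{tag}>")

    def handle_data(self, data):
        if self.skip_depth:
            return
        # Collapse whitespace runs, then re-escape &, <, > in the text.
        self.parts.append(escape(re.sub(r"\s+", " ", data), quote=False))

    def handle_decl(self, decl):
        # Preserve declarations such as <!DOCTYPE html>.
        self.parts.append(f"<!{decl}>")

    # Comments are dropped: the default handle_comment does nothing.


def compress_html(raw: str) -> str:
    parser = HtmlCompressor()
    parser.feed(raw)
    parser.close()
    return "".join(parser.parts)


if __name__ == "__main__":
    sample = "<body><!-- note -->\n<p>Hello    world</p><script>var a = 1;</script></body>"
    print(compress_html(sample))
```

The output of such a pass is still plain HTML, so it could be saved to a new file or passed on to whatever processing you run next.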