You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
We have a form which target `` URL which is not rewritten because warc2zim consider this URL is external. This is only partially true because the form never calls this URL without a query parameter, and with the query parameter the URL become internal, i.e. proper ressource is insise the ZIM ... too bad.
Should we detect that <form> URL is internal because we have a least one URL with a query parameter matching the form URL? Clearly a bit nasty and would not work in 100% of the cases, but would probably be ok for at least a majority... to be investagated further.
The text was updated successfully, but these errors were encountered:
Note that on this website we are lucky to only have a warc2zim issue because we have prev / next links which allowed the crawler to properly fetch all required pages. But from a general PoV, we have also an issue with browsertrix crawler which is not capable to crawl combobox urls automatically. I've just opened webrecorder/browsertrix-crawler#702
ZIM request: openzim/zim-requests#604
Problem: see openzim/zim-requests#604 (comment)
We have a form which target `` URL which is not rewritten because warc2zim consider this URL is external. This is only partially true because the form never calls this URL without a query parameter, and with the query parameter the URL become internal, i.e. proper ressource is insise the ZIM ... too bad.
Should we detect that
<form>
URL is internal because we have a least one URL with a query parameter matching the form URL? Clearly a bit nasty and would not work in 100% of the cases, but would probably be ok for at least a majority... to be investagated further.The text was updated successfully, but these errors were encountered: