support_description_long_description #183

joe-rabbit · 2024-01-04T09:11:49Z

I have added a function called handle_descriptions that takes in three inputs: default_description, description, and long_description. It returns the description either as long_description or description. I have also made changes in the get_zim_info function.

Additionally, the newer version of make_zim_file does not take favicon as an input parameter, so I changed it to illustrations.

benoit74

Thank you for this first version. Some modifications are necessary.

Please use the "Fix ####" format in your first comment to link this PR to the original issue (see https://docs.github.com/en/get-started/writing-on-github/working-with-advanced-formatting/using-keywords-in-issues-and-pull-requests)

Please also not that it might not be that clear in the original issue but we need to check for validity of provided description and long description as soon as possible. Same for other ZIM parameters (e.g. illustation). Here we check after get_content has been called, which is an issue because it means the scraper can spend hours to retrieve the whole content and only then fail because the description is too long (for instance). You need to find a way to change this and check that ZIM metadata (illustration, description, title, long_description, ...) are valid. Basically a call to validate_metadata must be made asap to check everything is ok before downloading all content.

benoit74 · 2024-01-08T08:34:37Z

openedx2zim/scraper.py

@@ -831,6 +842,24 @@ def render(self):
            self.build_dir.joinpath("assets"),
        )

+
+    def handle_descriptions(self, default_description, description=None, long_description=None):


Instead of this custom code, please use the one from https://github.com/openzim/python-scraperlib/blob/4dc30126a54040b4383ffed3617a1394a51b5a78/src/zimscraperlib/inputs.py#L56

I have handled this change

openedx2zim/scraper.py

requirements.txt

joe-rabbit · 2024-01-08T14:44:08Z

okay sure will make the necessary changes :)

joe-rabbit · 2024-01-08T17:43:09Z

@benoit74 ,Regarding the check, before downloading the zim file may i add the check before prepare_mooc_data() to check the ZIM_MetaData and call it again after the favicon gets downloaded ???

benoit74 · 2024-01-09T10:19:40Z

@benoit74 ,Regarding the check, before downloading the zim file may i add the check before prepare_mooc_data() to check the ZIM_MetaData and call it again after the favicon gets downloaded ???

This check must be done ASAP.

To be honest, I think you shouldn't spend too much time on this issue, the scraper is not working anymore and complex, you won't be able to really test your change. Taking an issue from youtube / ted / freecodecamp / kolibri / zimfarm would be more appropriate.

benoit74 · 2024-02-01T08:55:30Z

Closing this for now, feel free to reopen

support_description_long_description

1c6631b

joe-rabbit mentioned this pull request Jan 4, 2024

Upgrade python-scraperlib to 3.x, including CLI support for description / long_description flags #181

Open

benoit74 self-requested a review January 8, 2024 08:31

benoit74 assigned benoit74 and joe-rabbit and unassigned benoit74 Jan 8, 2024

benoit74 requested changes Jan 8, 2024

View reviewed changes

joe-rabbit added 2 commits January 8, 2024 22:52

Update dependencies in requirements.txt

d85a272

Resolved minor issues

630cb66

benoit74 closed this Feb 1, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

support_description_long_description #183

support_description_long_description #183

joe-rabbit commented Jan 4, 2024 •

edited

Loading

benoit74 left a comment

benoit74 Jan 8, 2024

joe-rabbit Jan 8, 2024

joe-rabbit commented Jan 8, 2024

joe-rabbit commented Jan 8, 2024 •

edited

Loading

benoit74 commented Jan 9, 2024

benoit74 commented Feb 1, 2024

support_description_long_description #183

support_description_long_description #183

Conversation

joe-rabbit commented Jan 4, 2024 • edited Loading

benoit74 left a comment

Choose a reason for hiding this comment

benoit74 Jan 8, 2024

Choose a reason for hiding this comment

joe-rabbit Jan 8, 2024

Choose a reason for hiding this comment

joe-rabbit commented Jan 8, 2024

joe-rabbit commented Jan 8, 2024 • edited Loading

benoit74 commented Jan 9, 2024

benoit74 commented Feb 1, 2024

joe-rabbit commented Jan 4, 2024 •

edited

Loading

joe-rabbit commented Jan 8, 2024 •

edited

Loading