-
-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
New request: (A few playlists) of Youtube Channel "C'est pas sorcier" #1182
Comments
@benoit74 I wanted to scrape https://www.youtube.com/watch?v=Ofa1OV6d5xc&list=PLh-qVJTuss13TJpf3Fdd8cbLaPbT18a9Z at https://farm.openzim.org/recipes/cest-pas-sorcier_fr_astronomie and I failed twice. It seems I'm not able to gather/configure properly the playlist id. The FAQ does also not explain how to retrieve the ID in case of a playlist. Do I just have done something wrong or is something unclear or maybe even buggy? |
See https://github.com/openzim/youtube/wiki/FAQ---FEE#how-do-i-find-a-channel--user--handle-technical-id (I promise it is here since Friday, and it was even here before but a bit harder to find I have to admit)
In https://farm.openzim.org/pipeline/0409f83a-76b0-476e-a552-971fbdf79e97/debug you should have set "type" to "playlist". I fixed this and requested recipe again. I also opened openzim/youtube#361 |
This is now mostly working, but a new bug in Youtube scraper appeared: openzim/youtube#362 |
We have a working ZIM as can be seen at https://dev.library.kiwix.org/#lang=&tag=&q=Magazine (build with Note that this ZIM is impacted by a bug at the library / library generation / scraper level around tags: kiwix/operations#286 |
@kelson42 I let you continue with other playlists you wanted to create? |
May I wait to openzim/youtube#369 to be fixed? Or is that not recommend by you? |
@benoit74 Do you mean https://github.com/openzim/youtube/wiki/Frequently-Asked-Questions#how-do-i-find-a-channel--user--handle-technical-id ? This does not speak of "playlist"... and if it does, then this is unclear from a user perspective if this concerns playlist On the top of it: If I looks to the source of https://www.youtube.com/playlist?list=PLh-qVJTuss13TJpf3Fdd8cbLaPbT18a9Z, there is nothing like this:
|
I'm halfway (50%) of all playlists. |
I've added https://github.com/openzim/youtube/wiki/Frequently-Asked-Questions#how-do-i-find-a-playlist-id for playlists, didn't realized at that time you were speaking about playlist, read your comment too fast, sorry.
Cool
Sure. Will be done somewhere this week at the latest, but setting up the recipes and checking they work as expected is not wasting Zimfarm time, especially since the Zimfarm is mostly empty ATM. |
@benoit74 I have creates all recipes (one per playlist). This is ready to review. See https://farm.openzim.org/recipes?name=sorcier |
I don't get what is the reasoning behind title / description / long description metadata which seems to follow more or less a pattern but still be quite different from time to time, or even a bit inconsistent from time to time (or I miss the logic, e.g why Can you explain how you've reasoned about it? And more precisely, why do you consider that:
Something like this card make me think something is broken or at least very odd (upper case in the middle of a string without upper case at the beginning): And finally, I have only 17 ZIMs (instead of 20) with https://dev.library.kiwix.org/#lang=&q=sorcier, I don't get why. From my perspective so far, title and description are meant to:
This is at least what guided us in https://library.kiwix.org/#lang=eng&q=ted (where I have to admit we've been lucky to have "TED" and nothing longer). Here I feel like we are realizing that the 30 and 80 chars limits are too short and we are fighting against them with (a bit ugly) hacks. |
My best guess: this is a consequence of having recipes of the Zimfarm with the same ZIM metadata "Name". At least was the case at some point. Somehow this has impact on the script building the library of dev.library.kiwix.org. This is why i called the impact "vicious" and requested to act quickly on this. |
@benoit74 I have modified/fixed all the recipes according do your remarks and our discussion. Unfortunately it seems latest version of Youtube scraper fails! None of them seem to pass anymore. |
As discussed on Slack, this has nothing to do with youtube scraper, but a Zimfarm bug openzim/zimfarm#668 Please replace |
The full channel is already available but is huge. I would like to be able to distribute only part of the channel, and being able to have only a few playlist seems to be the most easy way to do that.
The text was updated successfully, but these errors were encountered: