Please support dzen.ru quickly #32654

aflamrip · 2023-12-02T21:39:17Z

Checklist

I'm reporting a new site support request
I've verified that I'm running youtube-dl version 2021.12.17
I've checked that all provided URLs are alive and playable in a browser
I've checked that none of provided URLs violate any copyrights
I've searched the bugtracker for similar site support requests including closed ones

Example URLs

Single video: https://dzen.ru/video/watch/6471e50e06863726828b435c
Single video: https://dzen.ru/embed/vnVEaPfaSym8?from_block=partner&from=zen&mute=0&autoplay=0&tv=0
Playlist: nothing

Description

It is a very popular Russian site and needed to be supported

dirkf · 2023-12-03T04:16:27Z

Please review and complete the checklist (why is it there?).

The non-JS page seen by yt-dl contains a gigantic hydration JSON object as a parameter of a JS function that defines the page. Video links can be extracted using the traversal path 'data', ..., 0, 'videoViewer', 'items', ..., 0, 'rawStreams', 'SingleStream', ..., 'StreamInfo', ..., 'OutputStream'. 'Title' and 'Thumbnail' are available alongside 'StreamInfo'. I was able to watch the clip about the fake IKEA (why not ШведГаус?) using one of these links in mpv.

aflamrip · 2023-12-03T20:16:45Z

I have no experience in understanding these matters, but I hope that you will support downloading from this site, as it is a strong competitor to ok.ru and vk.com.

dirkf · 2023-12-03T21:47:34Z

I encourage someone who's interested to take this up. Otherwise it's at the end of a long queue.

Revisto · 2023-12-09T16:36:05Z

Hi, can I work on this issue as my first contribution to youtube-dl?

dirkf · 2023-12-09T20:07:52Z

By all means. Are you happy that you know what to do? See #29310 in addition to the manual and FAQ.

3052 · 2023-12-12T01:15:04Z

note this uBlock Origin rule is needed for the web client:

dzen.ru dzeninfra.ru * noop

abhimessi16 · 2023-12-21T12:07:54Z

Hi @dirkf, is @Revisto still working on the issue else i would like to work on it

dirkf · 2023-12-21T14:36:21Z

There's no PR yet. Unless we hear otherwise, carry on.

mccarreon · 2024-01-29T02:40:39Z

Hi, I'm working on this right now but was wondering if there's a certain way we should handle redirects and how you got the page downloaded, @dirkf? I'm running into an issue where it hits these two URL's before the final webpage with the video and JSON object:

https://sso.passport.yandex.ru/push?uuid=d738a747-82b1-432d-8536-30918ebb94aa&retpath=https%3A%2F%2Fdzen.ru%2Fvideo%2Fwatch%2F6471e50e06863726828b435c - sends back a 200
https://sso.dzen.ru/install?uuid=604f2166-bd61-4d1a-bfed-715e331e51d6 - sends back a 200

So the _download_webpage_handle function is downloading the empty page at the first page it gets redirected to since they aren't sending back 302 for redirect afterwards.

mccarreon · 2024-01-29T02:52:22Z

Hm looking at some other issues actually, seems like this could be bypassed by grabbing the cookies after visiting those URL's

dirkf · 2024-01-29T03:04:06Z

Exactly.

I found that I could just download the second problem video and find the media link. Looking at the first one, it seems to go to the SSO link that you mention, presumably because it needs a login. So the first thing to try is passing cookies from a logged-in browser session that can play that video.

Getting --username ... --password ... to work would be ideal but experience indicates that today's login procedures are either too hard to implement or too transient or both. But maybe passport.yandex.ru has been handled already -- I haven't checked.

aflamrip added the site-support-request Add extractor(s) for a new domain label Dec 2, 2023

dirkf added the Good first issue An issue that should be easier to solve label Dec 3, 2023

dirkf assigned dirkf and Revisto and unassigned dirkf Dec 9, 2023

This comment was marked as off-topic.

Sign in to view

This comment was marked as resolved.

Sign in to view

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Please support dzen.ru quickly #32654

Please support dzen.ru quickly #32654

aflamrip commented Dec 2, 2023 •

edited

dirkf commented Dec 3, 2023

aflamrip commented Dec 3, 2023

dirkf commented Dec 3, 2023

Revisto commented Dec 9, 2023

dirkf commented Dec 9, 2023

3052 commented Dec 12, 2023

This comment was marked as off-topic.

This comment was marked as resolved.

abhimessi16 commented Dec 21, 2023 •

edited

dirkf commented Dec 21, 2023

mccarreon commented Jan 29, 2024

mccarreon commented Jan 29, 2024 •

edited

dirkf commented Jan 29, 2024

Please support dzen.ru quickly #32654

Please support dzen.ru quickly #32654

Comments

aflamrip commented Dec 2, 2023 • edited

Checklist

Example URLs

Description

dirkf commented Dec 3, 2023

aflamrip commented Dec 3, 2023

dirkf commented Dec 3, 2023

Revisto commented Dec 9, 2023

dirkf commented Dec 9, 2023

3052 commented Dec 12, 2023

This comment was marked as off-topic.

This comment was marked as resolved.

abhimessi16 commented Dec 21, 2023 • edited

dirkf commented Dec 21, 2023

mccarreon commented Jan 29, 2024

mccarreon commented Jan 29, 2024 • edited

dirkf commented Jan 29, 2024

aflamrip commented Dec 2, 2023 •

edited

abhimessi16 commented Dec 21, 2023 •

edited

mccarreon commented Jan 29, 2024 •

edited