Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Please support dzen.ru quickly #32654

Open
5 tasks done
aflamrip opened this issue Dec 2, 2023 · 13 comments
Open
5 tasks done

Please support dzen.ru quickly #32654

aflamrip opened this issue Dec 2, 2023 · 13 comments
Assignees
Labels
Good first issue An issue that should be easier to solve site-support-request Add extractor(s) for a new domain

Comments

@aflamrip
Copy link

aflamrip commented Dec 2, 2023

Checklist

  • I'm reporting a new site support request
  • I've verified that I'm running youtube-dl version 2021.12.17
  • I've checked that all provided URLs are alive and playable in a browser
  • I've checked that none of provided URLs violate any copyrights
  • I've searched the bugtracker for similar site support requests including closed ones

Example URLs

Description

It is a very popular Russian site and needed to be supported

@aflamrip aflamrip added the site-support-request Add extractor(s) for a new domain label Dec 2, 2023
@dirkf
Copy link
Contributor

dirkf commented Dec 3, 2023

Please review and complete the checklist (why is it there?).

The non-JS page seen by yt-dl contains a gigantic hydration JSON object as a parameter of a JS function that defines the page. Video links can be extracted using the traversal path 'data', ..., 0, 'videoViewer', 'items', ..., 0, 'rawStreams', 'SingleStream', ..., 'StreamInfo', ..., 'OutputStream'. 'Title' and 'Thumbnail' are available alongside 'StreamInfo'. I was able to watch the clip about the fake IKEA (why not ШведГаус?) using one of these links in mpv.

@dirkf dirkf added the Good first issue An issue that should be easier to solve label Dec 3, 2023
@aflamrip
Copy link
Author

aflamrip commented Dec 3, 2023

I have no experience in understanding these matters, but I hope that you will support downloading from this site, as it is a strong competitor to ok.ru and vk.com.

@dirkf
Copy link
Contributor

dirkf commented Dec 3, 2023

I encourage someone who's interested to take this up. Otherwise it's at the end of a long queue.

@Revisto
Copy link

Revisto commented Dec 9, 2023

Hi, can I work on this issue as my first contribution to youtube-dl?

@dirkf dirkf assigned dirkf and Revisto and unassigned dirkf Dec 9, 2023
@dirkf
Copy link
Contributor

dirkf commented Dec 9, 2023

By all means. Are you happy that you know what to do? See #29310 in addition to the manual and FAQ.

@3052
Copy link

3052 commented Dec 12, 2023

note this uBlock Origin rule is needed for the web client:

dzen.ru dzeninfra.ru * noop

@aflamrip

This comment was marked as off-topic.

@dirkf

This comment was marked as resolved.

@abhimessi16
Copy link

abhimessi16 commented Dec 21, 2023

Hi @dirkf, is @Revisto still working on the issue else i would like to work on it

@dirkf
Copy link
Contributor

dirkf commented Dec 21, 2023

There's no PR yet. Unless we hear otherwise, carry on.

@mccarreon
Copy link

Hi, I'm working on this right now but was wondering if there's a certain way we should handle redirects and how you got the page downloaded, @dirkf? I'm running into an issue where it hits these two URL's before the final webpage with the video and JSON object:

So the _download_webpage_handle function is downloading the empty page at the first page it gets redirected to since they aren't sending back 302 for redirect afterwards.

@mccarreon
Copy link

mccarreon commented Jan 29, 2024

Hm looking at some other issues actually, seems like this could be bypassed by grabbing the cookies after visiting those URL's

@dirkf
Copy link
Contributor

dirkf commented Jan 29, 2024

Exactly.

I found that I could just download the second problem video and find the media link. Looking at the first one, it seems to go to the SSO link that you mention, presumably because it needs a login. So the first thing to try is passing cookies from a logged-in browser session that can play that video.

Getting --username ... --password ... to work would be ideal but experience indicates that today's login procedures are either too hard to implement or too transient or both. But maybe passport.yandex.ru has been handled already -- I haven't checked.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Good first issue An issue that should be easier to solve site-support-request Add extractor(s) for a new domain
Projects
None yet
Development

No branches or pull requests

6 participants