Skip to content

Instantly share code, notes, and snippets.

@Tithen-Firion
Last active November 20, 2021 15:14
Show Gist options
  • Select an option

  • Save Tithen-Firion/8b3921d745131837519d5c5b95b86440 to your computer and use it in GitHub Desktop.

Select an option

Save Tithen-Firion/8b3921d745131837519d5c5b95b86440 to your computer and use it in GitHub Desktop.
Openload: extract download URL using PhantomJS
// Usage: phantomjs openload.js <video_url>
// if that doesn't work try: phantomjs --ssl-protocol=any openload.js <video_url>
var separator = ' | ';
var page = require('webpage').create(),
system = require('system'),
id, match;
if(system.args.length < 2) {
console.error('No URL provided');
phantom.exit(1);
}
match = system.args[1].match(
/https?:\/\/(?:openload\.(?:co|io)|oload\.tv)\/(?:f|embed)\/([\w\-]+)/);
if(match === null) {
console.error('Could not find video ID in provided URL');
phantom.exit(2);
}
id = match[1];
// thanks @Mello-Yello :)
page.onInitialized = function() {
page.evaluate(function() {
delete window._phantom;
delete window.callPhantom;
});
};
page.settings.userAgent = "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/56.0.2924.87 Safari/537.36";
page.open('https://openload.co/embed/' + id + '/', function(status) {
var info = page.evaluate(function() {
return {
decoded_id: document.getElementById('streamuri').innerHTML,
title: document.querySelector('meta[name="og:title"],'
+ 'meta[name=description]').content
};
});
var url = 'https://openload.co/stream/' + info.decoded_id + '?mime=true';
console.log(url + separator + info.title);
phantom.exit();
});
@Tithen-Firion

Tithen-Firion commented Apr 9, 2017

Copy link
Copy Markdown
Author

Requires PhantomJS.

The code is being used by Openload extractor for youtube-dl.

@honboubao

honboubao commented May 23, 2017

Copy link
Copy Markdown

I had to call phantomjs with --ssl-protocol=any otherwise page.open would fail.

@Tithen-Firion

Copy link
Copy Markdown
Author

Thanks, I forgot to add it here.

It's already added in this PR. :D

@xEmily911

xEmily911 commented Sep 11, 2017

Copy link
Copy Markdown

Does this have method restricted by API downloads? I plan to use this to mount Openload streams into JWPlayer.

@Tithen-Firion

Copy link
Copy Markdown
Author

It's the same URL as when you watch something on Openload in your browser. That means it won't work for different IP. It will probably expire too but don't know after how long. I was downloading something for 2-3h and it worked fine.

@f0nZ

f0nZ commented Nov 5, 2017

Copy link
Copy Markdown

Just a little question, when a client goes to my website and does "phantomjs openload.js <video_url>" via exec(), the ip shown on openload url is my machine's IP, not my visitor/client IP. How can I fix this? Do you have any idea? Cheers.

@Tithen-Firion

Copy link
Copy Markdown
Author

You've got two options:

  • stream the video through your server; downside: huge traffic on your server,
  • run the code on clients side; downside: client has to download phantomjs and run the code for themselves.

@dammysky

Copy link
Copy Markdown

Hi,
Thanks for this.
I will like to ask a few questions,
I added this line to my edited version of the code page.settings.resourceTimeout = 3000; hoping that should in case the page takes time to load it should timeout.
I use a Python subprocess to execute the JS. it works as expected but in some cases when I call the communicate() function the application just freezes and have to start all over.
Is there a better approach you will advise to handle this?
Thanks a lot.

@Tithen-Firion

Copy link
Copy Markdown
Author

This snippet works fine for me:

import subprocess

try:
   my_input = raw_input
except NameError:
   my_input = input

url = my_input('Enter URL: ')

p = subprocess.Popen(['phantomjs', '--ssl-protocol=any', 'openload.js',
    url], stdout=subprocess.PIPE, stderr=subprocess.PIPE)
out, err = p.communicate()
print(out)

Both in Python 2 and 3.

@DarbyCrash

DarbyCrash commented Jan 6, 2018

Copy link
Copy Markdown

This is not working again. You must change streamuri with streamurj.

@Tithen-Firion

Copy link
Copy Markdown
Author

See, when I update it they will change it again. So I'm not gonna bother.

@gonubana

gonubana commented Jan 17, 2018

Copy link
Copy Markdown

It seems like it does not work anymore even after running
sed -e "32 s/getElementById..streamur./querySelector('*[id^=streamur]/"
on this openload.js file.

I'm of course getting a similar link to "https://openload.co/stream/miEoI5oT8JE~1516281189~184.73.0.0~ImWW-Wbq?mime=true | bunny HTTP/1.1" but GETting this link only gets me a "HTTP/1.1 400 Bad Request" type response.

Is this only me or they somehow changed something else somewhere.

@Tithen-Firion

Copy link
Copy Markdown
Author

@gonubana just tested and
curl -L -k -o <filename> <generated URL>
works perfectly fine.

@Tithen-Firion

Copy link
Copy Markdown
Author

Just read 4 comments above before saying "it doesn't work".

@lawchihon

Copy link
Copy Markdown

I don't think it is working anymore
`
TypeError: null is not an object (evaluating 'document.getElementById('streamuri').innerHTML')

undefined:3
:7
`

@JeelsBoobz

Copy link
Copy Markdown

Not work anymore :(

@aaronzolla

Copy link
Copy Markdown

#streamuri and #streamurl have been empty for a couple of weeks now.

@Narendra-WAL

Copy link
Copy Markdown

Did anyone find a solution for this?

@gonubana

Copy link
Copy Markdown

This is the current magic:
console.log('https://openload.co/stream/' + page.content.match(/\w+\d+\d+.\d+.\d+.\d+~\w+/)[0] + '?mime=true');

@feryw

feryw commented Jul 17, 2018

Copy link
Copy Markdown

it should be
console.log('https://oload.stream/stream/' + page.content.match(/\w+~\d+~\d+.\d+.\d+.\d+~\w+/)[0] + '?mime=true');
for more clear

@giuliano-macedo

Copy link
Copy Markdown

Since there is a domain oload.download, regex should be
/https?:\/\/(?:openload\.(?:co|io)|oload\.tv|oload\.download)\/(?:f|embed)\/([\w\-]+)/)

And, i was looking into the deobfuscated jsnice code of the source and i saw this line

if ("toString" in sin && sin.toString().indexOf("[native code") != -1 && document.getElementById.toString().indexOf("[native code") == -1 || window.callPhantom || /Phantom/.test(navigator.userAgent) || window.__phantomas || next() || window.domAutomation || window.webdriver || document.documentElement.getAttribute("webdriver"))

I guess there is some anti-measure phantomjs in the site, but aparently its not working, because @gonubana comment does captures the download URL...

@feryw

feryw commented Jul 18, 2018

Copy link
Copy Markdown

reduce regex complexity
console.log('https://oload.stream/stream/' + page.content.match(/\w+~\d+~[\d\.]+~\w+/)[0] + '?mime=true');

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment