chromedl

Go library for scraping or downloading files bypassing Cloudflare protection and browser checks

MIT License


========================
 Chrome File Downloader
========================

.. image:: https://pkg.go.dev/badge/github.com/rusq/chromedl.svg
   :alt: Go Reference
   :target: https://pkg.go.dev/github.com/rusq/chromedl

.. contents::
   :depth: 2

The sole purpose of this package is to download files from the Internet with headless Chrome, bypassing Cloudflare and possibly some other annoying browser checks.

It does so by implementing the solutions posted in the `bypass headless chrome detection issue`_ for chromedp_.

This library may help when other download methods, e.g. ``curl`` or the standard ``http.Get()``, don't work.

The implementation is based on this `chromedp example`_.

Thanks to `@ZekeLu`_ for the huge help in getting this going.

Compatibility
=============

Tested with:

- Chrome (stable) v90.0.4430.93
- github.com/chromedp/chromedp v0.6.12
- github.com/chromedp/cdproto v0.0.0-20210323015217-0942afbea50e

Newer versions of Chrome will require some code changes, as described in `this issue`_: to stay compatible with the current stable version of Chrome (see above), the package uses calls that are deprecated in newer versions of the protocol.

When using the headless-shell Docker image, please use the following tag::

  FROM chromedp/headless-shell:90.0.4430.93

LICENCES
========

chromedp_: Copyright (c) 2016-2020 Kenneth Shaw

.. _this issue: https://github.com/chromedp/chromedp/issues/807
.. _chromedp example: https://github.com/chromedp/examples/tree/master/download_file
.. _@ZekeLu: https://github.com/ZekeLu
.. _chromedp: https://github.com/chromedp/chromedp
.. _bypass headless chrome detection issue: https://github.com/chromedp/chromedp/issues/396
