Browserlify - browser as a service

Browserlify is a browser service:

Content API: Retrieve web page content by URL, supporting PDF generation, screenshot capture, text extraction, and HTML dumping.
Headless Chrome: Connect to Headless Chrome via WebSocket and perform testing using Puppeteer.
🔥 Remote Browser: Implement a remote browser based on noVNC, allowing browser access through a web interface.

With Browserlify, you can easily access web content, automate testing with Headless Chrome, and enjoy the convenience of a remote browser.

🚀 They is a cloud browser demo. 👏

Screenshots

How to run (via docker)

mkdir -p -m 777 `pwd`/data
docker run  -ti --rm --privileged  --shm-size=500m\
  -v `pwd`/data:/tmp/browserlify \
  -p 9000:9000 shenjinti/browserlify:latest

visit http://localhost:9000

Run via cargo & puppeteer (local development)

Note: you need to install puppeteer and rust first remote browser Only tested on linux

# for linux dev 
$ sudo apt-get install x11vnc xvfb scrot pkg-config libssl-dev chromium-browser 

# 
$ cd browserlify
$ cargo run

Test via puppeteer

  const browser = await puppeteer.connect({
    browserWSEndpoint: `ws://localhost:9000`,
  });
  const page = await browser.newPage();
  await page.goto('https://example.org');
  await page.screenshot({ path: 'example.png' });

Session API

/list - list all session
/kill/:session_id - kill session by id
/kill_all - kill all sessions

Content API

The parameters:

{
  "url": "https://example.org",

  "file_name": "example.pdf",
  "timeout": 60000, // total timeout: milliseconds
  "wait_load": 1000, // wait for load: milliseconds
  "selector": "#main", // wait for selector: css selector
  "images": true, // wait for images loaded
  "network_idle": 1000, // wait for network idle: milliseconds
  "page_ready": true,
  "scroll_bottom": 1000, // milliseconds
  "scroll_interval": 1000, // milliseconds

  "paper_width": 8.5, //pdf: inches
  "paper_height": 11, //pdf: inches
  "scale": 1,
  "margin_top": 0.4, // inches
  "margin_bottom": 0.4, // inches
  "margin_left": 0.4, // inches
  "margin_right": 0.4, // inches
  "background": true, // pdf: print background
  "landscape": false, // landscape or portrait
  "page_ranges": "1-5",  // pdf: page ranges: 1-5, 1,2,3
  "device": "ipad",  // emulate device: iphone, ipad, 2k, 4k
  "disable_link": true,  // pdf: disable link
  "paper_size": "A4",  // pdf: paper size: A4, A3, Letter, Legal
  "header_template": "<div>Header</div>",
  "footer_template": "<div>Footer</div>",
  "format": "png",    // screenshot: format: png, jpeg, pdf
  "quality": 100,     // screenshot: only used for jpeg format
  "clip": "0,0,800,600",   // screenshot: clip the screenshot to the specified rectangle
  "full_page": true,       // screenshot: capture the full scrollable page, not just the viewport
  "author": "Browserlify", // pdf: author
}

/pdf - generate pdf from url

curl "http://localhost:9000/pdf?url=http://browserlify.com&images=true" > browserlify.pdf

/screenshot - generate screenshot from url

curl "http://localhost:9000/screenshot?url=http://browserlify.com&format=png&full_page=true" > browserlify.png

/text - dump dom text from url

curl "http://localhost:9000/text?url=http://browserlify.com" > browserlify.txt

/html - dump html content from url

curl "http://localhost:9000/html?url=http://browserlify.com" > browserlify.html

Related Projects

puppeteer

Node.js API for Chrome

09 May 2017 86,832

html-pdf-chrome

HTML to PDF or image (jpeg, png, webp) converter via Chrome/Chromium

08 May 2017 775

Recorder

A browser extension that generates Cypress, Playwright and Puppeteer test scripts from your inter...

14 Nov 2021 439

screenshuttle

Capture screenshot of webpage in full or a specific node and return them as JPEG, PNG or even PDFs.

04 Jun 2024 2

surfingkeys-conf

🏄 A SurfingKeys config which adds 180+ key mappings & 50+ search engines

28 Aug 2017 379

WitnessMe

Web Inventory tool, takes screenshots of webpages using Pyppeteer (headless Chrome/Chromium) and ...

06 Jul 2019 729

ferrum

Headless Chrome Ruby API

23 Jul 2019 1,676

puppeteer-sharp

Headless Chrome .NET API

30 Sep 2017 3,141

PhpChromeToPdf

A slim PHP wrapper around Google Chrome for converting URLs to PDFs or taking screenshots. It's e...

06 Jul 2017 147

playwright

Playwright is a framework for Web Testing and Automation. It allows testing Chromium, Firefox and...

15 Nov 2019 66,091

headless-chrome-crawler

Distributed crawler powered by Headless Chrome

02 Dec 2017 5,519

playwright-go

Playwright for Go a browser automation library to control Chromium, Firefox and WebKit with a sin...

16 Aug 2020 1,795

take-screenshots

📸 Take clean website screenshots

30 Dec 2018 2

puppetron

Puppeteer (Headless Chrome Node API)-based rendering solution.

22 Aug 2017 527

serverless-chrome

🌐 Run headless Chrome/Chromium on AWS Lambda

11 Mar 2017 2,837