The Wayback Machine - https://web.archive.org/web/20220819071438/https://github.com/topics/website-scraper
Here are
66 public repositories
matching this topic...
Download website to local directory (including all css, images, js, etc.)
Updated
Jul 1, 2022
JavaScript
Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.
A new web development methodology for JavaScript & C# developers. A super fast and very easy to use CMS.
🕵️♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.
Updated
Jun 22, 2021
TypeScript
Plugin for website-scraper which returns html for dynamic websites using puppeteer
Updated
Aug 8, 2022
JavaScript
A server to collect & archive websites that also supports video downloads
Updated
Apr 29, 2022
TypeScript
Plugin for website-scraper which returns html for dynamic websites using PhantomJS.
Updated
Dec 29, 2021
JavaScript
🕸 Generates and delivers RSS feeds via HTTP. Docker image available! Create your own feeds or get started quickly with the included configs.
Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine.
Evaluate JavaScript on a URL through headless Chrome browser.
Updated
Jun 18, 2021
JavaScript
A spider to crawl webpages
Updated
Feb 16, 2020
Python
JSON collection of scraped file extensions, along with their description and type, from FileInfo.com
Updated
Jul 6, 2022
Python
Now you can keep track of your followers from YouTube, Instagram and Twitter accounts - Followers scraper API on AWS serverless
Updated
Jul 7, 2022
TypeScript
Website Penetration Testing Tool With Dos Attack Feature
Updated
Sep 5, 2020
Python
sᴇᴀʀᴄʜ ᴇɴɢɪɴᴇ sᴄʀᴀᴘᴇʀ ᴛᴏᴏʟ (ʙᴀsʜ)
Updated
Nov 1, 2021
Shell
Scraping websites made easy! A minimalistic yet powerful tool for collecting data from websites.
Updated
Jan 3, 2019
JavaScript
Bandwidth efficient scheduled downloads
Updated
Mar 28, 2018
Shell
Download ALL the images (JPEG/GIF/PNG) from any Tumblr website! This project employs Python3 and BeautifulSoup4 to scrape a Tumblr site (with the url provided by the user) to download, page by page, all the images from the Tumblr site's posts. Ideal for archiving other peoples' Tumblrs <3
Updated
Apr 9, 2018
Python
This is a python based website crawling script equipped with Random time intervals, User Agent switching and IP rotation through proxy server capabilities to trick the website robot and avoid getting blocked.
Updated
Jul 6, 2022
Python
Alexa Bulk Website Rank Checker PHP Script 2020 Latest! you can grab 200+ URL's website ranking at once!
Improve this page
Add a description, image, and links to the
website-scraper
topic page so that developers can more easily learn about it.
Curate this topic
Add this topic to your repo
To associate your repository with the
website-scraper
topic, visit your repo's landing page and select "manage topics."
Learn more
You can’t perform that action at this time.
You signed in with another tab or window. Reload to refresh your session.
You signed out in another tab or window. Reload to refresh your session.