The Wayback Machine - https://web.archive.org/web/20220819071438/https://github.com/topics/website-scraper

#

website-scraper

Here are 66 public repositories matching this topic...

website-scraper / node-website-scraper

Download website to local directory (including all css, images, js, etc.)

nodejs javascript scraper hacktoberfest website-scraper

Updated Jul 1, 2022
JavaScript

imthaghost / goclone

Website Cloner - Utilizes powerful Go routines to clone websites to your computer within seconds.

go golang crawler cloning website-scraper website-cloner

Updated Aug 14, 2022
Go

Kooboo / Kooboo

A new web development methodology for JavaScript & C# developers. A super fast and very easy to use CMS.

javascript cms development website-builder website-scraper kooboo netcoreapp website-development netcore21

Updated Jun 23, 2022
C#

jvandenaardweg / linkedin-profile-scraper

🕵️‍♂️ LinkedIn profile scraper returning structured profile data in JSON. Works in 2020.

Updated Jun 22, 2021
TypeScript

website-scraper / website-scraper-puppeteer

Plugin for website-scraper which returns html for dynamic websites using puppeteer

nodejs javascript chrome scraper chromium hacktoberfest website-scraper puppeteer

Updated Aug 8, 2022
JavaScript

xarantolus / Collect

A server to collect & archive websites that also supports video downloads

self-hosted video-downloader archive webinterface web-archiving website-scraper website-archive

Updated Apr 29, 2022
TypeScript

website-scraper / node-website-scraper-phantom

Plugin for website-scraper which returns html for dynamic websites using PhantomJS.

nodejs javascript scraper phantomjs hacktoberfest website-scraper

Updated Dec 29, 2021
JavaScript

html2rss-web

html2rss / html2rss-web

🕸 Generates and delivers RSS feeds via HTTP. Docker image available! Create your own feeds or get started quickly with the included configs.

ruby docker rss scraper builder feed roda rss-feed rss-aggregator serves rss-feed-scraper website-scraper webfeeds webfeed html2rss rolling-release html2rss-configs feed-configs

Updated Aug 3, 2022
Ruby

erlange / wbm-dl

Wayback Machine Downloader. 🔥 Download your entire archived websites from the Internet Archive Wayback Machine.

console command-line-app csharp internet internet-archive command-line-tool console-application wayback-machine command-line-parser website-scraper console-app internet-wayback-machine wayback-machine-downloader

Updated Aug 5, 2022
C#

yuis-ice / jseval

Evaluate JavaScript on a URL through headless Chrome browser.

Updated Jun 18, 2021
JavaScript

Ashwin-op / Email-Extractor

A spider to crawl webpages

python crawler spider scrapy website-scraper

Updated Feb 16, 2020
Python

faheel / file-extensions

JSON collection of scraped file extensions, along with their description and type, from FileInfo.com

json scraper python3 scraped-data fileinfo file-extensions website-scraper

Updated Jul 6, 2022
Python

jeanrauwers / followers-scraper-serverless

Now you can keep track of your followers from YouTube, Instagram and Twitter accounts - Followers scraper API on AWS serverless

aws instagram lambda scraper youtube typescript twitter aws-lambda webscraper instagram-scraper aws-serverless webscraping twitter-scraper website-scraper webscraper-api followers-scraper nodejs-lambda twittersc instagramscraper

Updated Jul 7, 2022
TypeScript

cometolearnofficial / WebHawk

Website Penetration Testing Tool With Dos Attack Feature

website hacking penetration-testing termux website-scraper ddos-attack webhawk come-to-learn

Updated Sep 5, 2020
Python

orangmuda / SECTOOL

sᴇᴀʀᴄʜ ᴇɴɢɪɴᴇ sᴄʀᴀᴘᴇʀ ᴛᴏᴏʟ (ʙᴀsʜ)

crawler scraper crawling website-scraper

Updated Nov 1, 2021
Shell

epegzz / node-scraper

Scraping websites made easy! A minimalistic yet powerful tool for collecting data from websites.

javascript scraper node cheerio scraping axios website-scraper

Updated Jan 3, 2019
JavaScript

dann1 / ndown

Bandwidth efficient scheduled downloads

scheduler wget bandwidth youtube-downloader aria2 website-scraper

Updated Mar 28, 2018
Shell

nigeld3v / Tumblr_Image_scrape

Download ALL the images (JPEG/GIF/PNG) from any Tumblr website! This project employs Python3 and BeautifulSoup4 to scrape a Tumblr site (with the url provided by the user) to download, page by page, all the images from the Tumblr site's posts. Ideal for archiving other peoples' Tumblrs <3

Updated Apr 9, 2018
Python

MLArtist / web-scraper

This is a python based website crawling script equipped with Random time intervals, User Agent switching and IP rotation through proxy server capabilities to trick the website robot and avoid getting blocked.

crawler scraper user-agent scraping beautiful-soup robots-txt beautifulsoup scrapper website-scraper scrapping-python website-crawler beautifulsoup4 crawling-python iprotation

Updated Jul 6, 2022
Python

Sachinart / alexa-rank-checker

Alexa Bulk Website Rank Checker PHP Script 2020 Latest! you can grab 200+ URL's website ranking at once!

php website script seo rank amazon-alexa website-scraper ranks

Updated Sep 1, 2019
CSS

Improve this page

Add a description, image, and links to the website-scraper topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the website-scraper topic, visit your repo's landing page and select "manage topics."