A collection of awesome web crawlers and spiders in different languages
Updated Sep 20, 2019, 77 commits, 16 contributors
Web Scraper in Go, similar to BeautifulSoup
A framework for creating semi-automatic web content extractors
A list of scrapers from around the web.
A simple browser/client-side web scraper.
HTML metadata scraper and parser for Node.js (supports Promises and callback style)
`scrape_linkedin` is a Python package that scrapes personal LinkedIn profiles and company pages, turning the data into structured JSON.
A collection of awesome web scrapers and crawlers.
PHP Library for detecting CMS
A command-line interface for downloading Bollywood and Punjabi songs
Command-line CSS selector built on the Go cascadia package
Powerful web scraping framework for Crystal
A pluggable, simple and powerful web scraper.
Web Scraping Craigslist's Engineering Jobs in NY with Scrapy
Public web scraping scripts for the University of Toronto.
JSON configurable concurrent scraper
Yellowpages.com web scraper written in Python with LXML that extracts business details for a given category and location.
State-of-the-art real-time object detection using the YOLOv3 algorithm. Augmented with a process that allows easy training of the classifier as a plug-and-play solution. Raises an alert if an item on an alert list is detected.
A fast Web API scraper written in C++ and built on Boost ASIO
A modular template for scraping data from the web to send yourself scheduled email reports
Adult XXX Addons (18+) for the Kodi Media Center - Kodi is a registered trademark of the XBMC Foundation. We are not connected to or in any other way affiliated with Kodi - DMCA: [email protected]
Depth controllable Web scraper and Sitemap Generator in Go
Hepsiburada review/comment and rating scraper. Turkish text dataset creator for data science and NLP projects. Nearly 30M reviews with category and product links can be crawled and used for text classification, sentiment analysis, text mining, NLP models, etc. Multithreaded, written in Python.
Web scraper for DC, Marvel, and other comic book Wikias that downloads comic book covers