The Wayback Machine - https://web.archive.org/web/20200622170702/https://github.com/topics/extract-data
#extract-data

Here are 102 public repositories matching this topic...

Ziinc commented Dec 11, 2019

Currently, the Crawly.Engine APIs are lacking for spider monitoring and management, especially when there is no access to logs.

I think some critical areas are:

  • spider crawl stats (scraped item count, dropped request/item count, scrape speed)
  • stop_all_spiders to stop all running spiders

The stopping of spiders should be easy to implement.
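
To make the two requests above concrete, here is a minimal sketch of per-spider counters plus a stop-all call. It is written in Python purely for illustration and is not Crawly's API: `Engine`, `SpiderStats`, `record_item`, `record_dropped_request`, and `items_per_minute` are all hypothetical names, and only `stop_all_spiders` is taken from the request itself.

```python
import time
from dataclasses import dataclass, field


@dataclass
class SpiderStats:
    """Counters for one running spider (hypothetical, not Crawly's data model)."""
    started_at: float = field(default_factory=time.monotonic)
    scraped_items: int = 0
    dropped_requests: int = 0
    dropped_items: int = 0

    def items_per_minute(self, now=None):
        """Scrape speed: items scraped per minute since the spider started."""
        now = time.monotonic() if now is None else now
        elapsed = now - self.started_at
        return 0.0 if elapsed <= 0 else self.scraped_items * 60.0 / elapsed


class Engine:
    """Toy registry of running spiders, keyed by spider name (an assumption)."""

    def __init__(self):
        self._running = {}

    def start_spider(self, name):
        if name in self._running:
            raise ValueError(f"spider already running: {name}")
        self._running[name] = SpiderStats()

    def running(self):
        return sorted(self._running)

    def record_item(self, name, dropped=False):
        stats = self._running[name]
        if dropped:
            stats.dropped_items += 1
        else:
            stats.scraped_items += 1

    def record_dropped_request(self, name):
        self._running[name].dropped_requests += 1

    def stats(self, name):
        return self._running[name]

    def stop_spider(self, name):
        # Remove the spider and hand back its final counters.
        return self._running.pop(name)

    def stop_all_spiders(self):
        # Iterate over a copy of the keys because stop_spider mutates the dict.
        return {name: self.stop_spider(name) for name in list(self._running)}
```

In this sketch, stopping everything is just a loop over the registry, which matches the expectation that it should be easy to implement; the stats side needs every pipeline stage to report into the counters, which is where most of the design work would likely sit.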

For the spider stats, since so

gr-eventstream is a set of GNU Radio blocks for creating precisely timed events and either inserting them into, or extracting them from, normal data streams. It allows for the definition of high-speed, time-synchronous C++ burst event handlers, as well as bridging to standard GNU Radio Async PDU messages with precise timing.

  • Updated Sep 20, 2017
  • C++
