The Wayback Machine - https://web.archive.org/web/20210123092913/https://github.com/topics/data-streaming
Here are 25 public repositories matching this topic...
Apache Kafka running on Kubernetes
Updated Jan 22, 2021 · Java
An extensible distributed system for reliable nearline data streaming at scale
Updated Jan 20, 2021 · Java
Sample Applications for Pravega.
A simple, time-tested family of random hash functions in Java, based on CRC32, affine transformations, and the Mersenne Twister. 🎲
Updated Dec 27, 2020 · Java
Apache Kafka for the Hybrid IoT
Updated Mar 21, 2019 · JavaScript
Ixian S2 end-to-end data streaming network software
A Tool for Timed Pattern Matching with Automata-Based Acceleration
A lightweight, fast data streaming library for Raspberry Pi in Python.
Updated Aug 27, 2019 · Python
Elastic Stack with Nginx, Logstash and Beats demo
AutoML Techniques for Data Streams - Research Paper
Subscribe to datasets and be notified of changes via webhook
Updated Jan 21, 2021 · Python
Data Stream Generator - a lightweight C# .NET-based application that parses TOML configuration files for time series specifications, generates the series, and streams them successively via MQTT.
Projects developed while learning data streaming
Updated Oct 17, 2020 · Python
HFlow introduces a unified data abstraction for I/O forwarding systems that are managed elastically, dynamically, and actively
Updated Apr 6, 2018 · Scala
Kafka development and production repo for all data streaming
Updated Dec 28, 2020 · Python
AWS Firehose Sender - Sending data securely through a Firehose stream using boto3
Updated Aug 24, 2019 · Python
Maintenance of the GigaVoxels / GigaSpace project from INRIA.
The missing command line interface for Jet:
Fault-tolerant streaming pipeline for real-time soccer match analysis.
Updated Nov 8, 2019 · Scala
Data Streaming Nanodegree (from Udacity) exercises, projects and their solutions
Updated Feb 2, 2020 · Python
Large Scale Test Bed Network Emulator (CURENT @ UTK)
Updated Oct 18, 2019 · Python
Large Scale Test Bed Network Emulator
Updated Sep 12, 2019 · Python
This repository contains a collection of three Data Engineering capstone projects made for the DTU Data Engineering course 02807: Computational Tools for Data Science
Updated Jan 12, 2020 · Jupyter Notebook