Distributed Frequency Count Algorithms for Data Streams
-
Updated
Jun 20, 2022 - Kotlin
Distributed Frequency Count Algorithms for Data Streams
Simulation toolbox for Crimegraph.
Hands on data streaming
DataStream-SQLServer provides real-time data streaming from SQL Server using Zookeeper, Kafka, and Debezium. This repository contains the necessary configurations, Docker setups, and sample code to get you started.
HFlow is a platform for I/O forwarding managed elastically, dynamically, and actively
Tool to approximate the frequency of occurrences of different items in a data stream.
PyFlink data stream processing utilities 🐿
Automated deployment of an Apache Flink cluster in your Grid'5000 reserved nodes.
In a team of 4 people, we implemented a public lighting control and monitoring system for a smart city
Real-time data engineering pipeline for an American hiring platform
A lightweight and polyglot stream-processing library, to be used as a data backplane-, message relay-, or pipeline-subsystem.
Geometric Figure Clasifier program
This project is a data pipeline to stream data from meetup, perform realtime analysis and mapping back to google map.
A collection of exercises and notes to help me better understand SQL and NO-SQL databases. I attempt to touch on common approaches to schema modeling, normalization, and query optimization.
Final project for the course 'Architecture for Large Data Volumes', taught in the Bachelor's program in Data Science at ITAM
Code to find rows of high leverage in a data stream.
BigData knowledge system(大数据知识体系).
Apache Flink boilerplate to build performant data streaming applications from.
Udacity Data Streaming project based on Apache Kafka
Add a description, image, and links to the data-stream-processing topic page so that developers can more easily learn about it.
To associate your repository with the data-stream-processing topic, visit your repo's landing page and select "manage topics."