Scott Morris, Software Engineer at Rockset, wrote this piece about building a real-time dashboard for operational monitoring and analytics on streaming event data from Kafka. Such dashboards often require complex SQL, including filtering, aggregations, and joins with other data sets.
Apache Kafka is a widely used distributed data log built to handle streams of unstructured and semi-structured event data at massive scale. Organizations often use Kafka to track live application events ranging from sensor data to user activity, and the ability to visualize and dig deeper into this data can be essential to understanding business performance.
The article is split into the following sections:
- Streaming data from Reddit
- Connecting Kafka to Rockset
- Connecting Rockset to Tableau
- Creating real-time dashboards
… and more. All the code is included, together with explanatory screenshots. Interesting not just for aspiring data scientists!