Skip to the content.

Analyzing real-time data with Spark Streaming and Kafka

ban

Project Description

     The project deals with the processing the weather data from www.weatherbit.io using Kafka and Spark Streaming. Here we are simulating the streaming data using previous days data. Then we used a PySpark program to run the spark SQL queries to process the data consumed from the kafka topic along with their required dependencies and finally publish the processed data to another Kafka topic. Then we will consume the data into another python program and plotted the real time graph using Matplotlib.

Workflow

Technologies Used

</p>

Features

List of features ready and TODOs for future development

To-do

Getting Started

All the operations below are for Windows OS

Usage

The following are the steps to get the program working.

Output

Contirbutors

License

This project uses the MIT license.

Reference

https://www.weatherbit.io/api

https://www.goavega.com/install-apache-kafka-on-windows/

https://phoenixnap.com/kb/install-spark-on-windows-10

https://matplotlib.org/devdocs/index.html

VISIT