pyspark-realtime-streaming-sentiment-analysis

⏱ Real-Time Sentiment Analysis using PySpark and simulation of Twitter/X API using FastAPI

Stars
2
Committers
1

PYSPARK REALTIME STREAMING SENTIMENT ANALYSIS

⏱ Real-Time Sentiment Analysis using PySpark and FastAPI

PROTOTYPE VIDEO

https://github.com/raghavtwenty/pyspark-realtime-streaming-sentiment-analysis/assets/126254197/6f247698-5d9e-4905-aef7-0e7cf247b94a

HOW TO EXECUTE

Terminal

git clone https://github.com/raghavtwenty/pyspark-realtime-streaming-sentiment-analysis.git
cd code/
pip install -r requirements.txt

Run Fast API Server

uvicorn _1_fastapi_server:app --reload

Run PySpark

python _3_sentiment_analysis_pyspark.py

DOMAIN

Big Data Analytics

OBJECTIVE

Perform sentiment analysis for the realtime streaming data

INTRODUCTION

In the era of big data, the ability to process and analyze real-time data streams is crucial for gaining actionable insights. This project aims to demonstrate a real-time streaming sentiment analysis application using PySpark and FastAPI. The sentiment analysis model processes incoming data in real time, determining the sentiment polarity (positive, negative, or neutral) of each data point. This prototype is particularly beneficial for applications such as social media monitoring, customer feedback analysis, and other domains where timely sentiment information is critical.

I haven't used Twitter/X API, Instead simulated the same using FastAPI. Free of cost.

FEATURES

  • Highly Scalable
  • Real-time
  • Parallel Processing

TECHNOLOGIES USED

  • Spark
  • Fast API

END USERS

  1. Students
  2. Data Analyst
  3. Data Scientists

OUTPUTS

  • Fast API Server 1

  • PySpark 2

  • PySpark 3

END OF README

Related Projects