This project's goal is to collect tweets data from Twitter API and then analyze sentiments. In a high level overview, the steps are highlighted below.
- Data was collected from Twitter API using an EC2 and kinesis firehose instance.
- Sentiments were generated based on the collected tweet data using PySpark.
- The data was further cleaned using SQL Athena.
- Dashboard was developed to support the sentiments analyses, which could be found in the screenshot within one of the folders.
Each folder represents each major step in the project pipeline.