Skip to content

Latest commit

 

History

History
49 lines (49 loc) · 1.79 KB

README.md

File metadata and controls

49 lines (49 loc) · 1.79 KB

CSC 501: Data Models and Algorithms

Group Project 2


Introduction

An exploratory Data Analysis of the Influence and Engagement of Twitter(now X) feeds from a Russian "troll factory," the Internet Research Agency (IRA), that alleged to have deliberately sought to sow political discontent in the US with inflammatory social media content that may have affected the 2016 US presidential election.


Data Source

Data

Pre-requisites

  • Python 3.10 or greater
  • pip as python package manager

Installation

  • Open terminal and Run `pip install virtualenv` (if you don't already have virtualenv installed)
  • Create virtualenv groupproject2 by running `python -m virtualenv groupproject2`
  • `cd /groupproject2`
  • Clone the repo to your virtualenv folder
  • Run `source Scripts/activate`
  • Run `pip install -r requirements.txt`
  • Run `cd /src`
  • Run `mkdir data`
  • Run `cd /data`
  • Run `mkdir raw`
  • Copy the data from source above to the /src/data/raw/ dirctory
  • `cd ..`

Execute

  • Execute `python merge.py`
  • To create normalized tables Execute `python rawToNorm.py`
  • To get Insight1 Execute `python normInsight1.py`
  • To get Insight2 Execute `python normInsight2.py`
  • To get Insight3 Execute `python normInsight3.py`
  • To get Insight4 Execute `python normInsight4.py`
  • To get Visual plots for Insight4 Execute `python insight4Visualize.py`

Contributors

  • Bhan Singh
  • Anuinder Sekhon
  • Vatsala Arora
  • Mohit Kaushik
  • Jitendra Palaparty