This system will provide small businesses with easy-to-understand reports on the platform. Specifically:
- Identify business attributes, cities, and states that lead to positive/negative reviews.
- Uncover emerging trends and customer sentiments.
Initial Datasets Pulling: https://www.yelp.com/dataset
Datasets used: yelp_academic_dataset_business.json, yelp_academic_dataset_review.json
- AWS S3 Bucket
- AWS Sagemaker Notebook
- AWS RDS Postgre SQL
- AWS VPC public & private subnets
- AWS EC2 instance SSH & Flask
- sample_df.csv
- flaskDB.py
- run flaskDB.py in ec2 instance
- ‘Cleaned_sampled_dataset.csv’ is taken 1GB sample of the overall dataset(8GB+)
- ‘sample_df.csv’ is the result from ‘Cleaned_sampled_dataset.csv’ after executing this file ’Sentiment Analysis’.
- ‘sample_df.csv’ has been transformed into RDS Postgre database by running ‘RDSPostgre.py’.
- run flaskDB.py in ec2 instance
- Interacting with the website: 44.197.239.78:5001