GPA: 3.85
Predictive Analytics Summer 2016
- Applied ensemble modeling and data mining techniques in R to build and score a predictive model on the Portuguese Bank Marketing data set.
- Developed a classification model that predicts driver alertness from driver information, using decision tree and random forest algorithms from the scikit-learn library in Python.
Enterprise Data Summer 2016
- Analyzed retail data using Hive, implementing partitioning and bucketing to analyze the top 10 customers.
- Processed the data using Pig and stored the results in the Hadoop File System.
Data Mining Spring 2016
- Applied data mining techniques to a Bolivian schools data set, using R to detect outliers, examine distributions, and perform Principal Component Analysis, and documented all results.
- Used R functions such as glm, lm, and knn to implement supervised learning on a prostate cancer data set.
Business Analytics Fall 2015
- Scraped data from various web sources using Python packages such as Beautiful Soup and urllib2, and developed a regression model to predict the unemployment rate in Philadelphia.
- Collected Twitter stream data using the Twitter API and the oauth2 module in Python, and performed sentiment analysis on the release of the iPhone 6s.