PPP-Analysis

This project analyzes trends in the Paycheck Protection Program (PPP) and builds predictive models that use demographic features to estimate the likelihood of receiving a loan.

Overarching Business Question: What factors played the largest roles in PPP loan distribution, and can I accurately predict maximum loan values given those factors?

Data Source: https://www.kaggle.com/susuwatari/ppp-loan-data-paycheck-protection-program

Packages: tidyverse, ggplot2, tm, car (for VIF), MASS (for stepAIC)

Attempt 2 will use R for data cleaning and scikit-learn for the analysis.
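
For readers unfamiliar with the VIF (variance inflation factor) check named in the package list, here is a minimal sketch of what it measures. It is written in Python because Attempt 2 plans to move the analysis there. It covers only the two-predictor special case, where the VIF reduces to 1 / (1 - r^2), with r the Pearson correlation between the two predictors. The function names and the toy columns are hypothetical and do not come from this project's code or from the PPP dataset.

```python
import math


def pearson_r(x, y):
    """Pearson correlation between two equal-length numeric sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def vif_two_predictors(x1, x2):
    """VIF shared by both predictors in a two-predictor model.

    Regressing one predictor on the other gives R^2 = r^2,
    so VIF = 1 / (1 - r^2).
    """
    r = pearson_r(x1, x2)
    return 1.0 / (1.0 - r ** 2)


# Hypothetical toy columns, not taken from the PPP dataset.
jobs_reported = [1, 2, 3, 4, 5]
payroll_proxy = [2, 1, 4, 3, 5]
print(round(vif_two_predictors(jobs_reported, payroll_proxy), 3))
```

A VIF of 1 means the two predictors are uncorrelated. Larger values mean more shared variance, and values above roughly 5 to 10 are a common rule-of-thumb signal to drop or combine predictors. With more than two predictors, each VIF comes from regressing that predictor on all the others, which is the general case a function such as `vif` in car handles.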