Bank_Fraud_Predictive

A model, written in R, that predicts whether a bank account application is fraudulent.

I used a large, realistic dataset from Kaggle for this project. The data is available here: https://www.kaggle.com/datasets/sgpjesus/bank-account-fraud-dataset-neurips-2022

Base Analysis uses the Base dataset, which represents the real-world, anonymized fraud data. Variant V Analysis uses the Variant V dataset, which has better separability in the training data. Both datasets contain 1,000,000 rows and 32 variables.

I created this project for the course Econ 695: Econometrics for Big Data in Spring 2023. As part of the class, I gave a five-minute presentation on my project and wrote up a findings report. Those files can also be found in this folder.

Files in this folder:

Base Analysis.Rmd
Presentation - Bank Fraud Predicitve Model.pptx
Project Write-up - Bank Fraud Predictive Model.pdf
README.md
Variant V Analysis.Rmd
