Credit Card
ML - randomForest
Last updated
Was this helpful?
ML - randomForest
Last updated
Was this helpful?
The results from TPOT point to using a Decision Tree algorithm.
Once we've selected our algorithm:
Train a randomForest model in R.
Deploy your model.
Predict fraudulent credit card transactions.
The model that will be used is randomForest.
To listen to the videos please copy and paste the website URL into your host Chrome browser, as there's no soundcard in the Lab environment.
Train the randomForest algorithm with the same dataset.
In Spoon, open the following main job:
/home/pentaho/Workshop--Data-Integration/Labs/Module 7 - Workflows/Machine Learning/Credit Card Fraud/solution/jb_fraud_main_job.kjb
Right-click on the train_model transformation and select Open Referenced Object -> Transformation.
Double-click on the rscrpt-train_model_randomForest step to bring up the configuration settings.
Under the Configure tab, ensure the Input Frames points to the step name sv-convert_booleans_to_numbers and the R Frame name: train.
Set Row Handling to Number of Rows to Process: All.
Select the R script tab. Copy and paste the code snippets based on the Comments.