
Workflow Automation Templates

A library of ready-to-use workflow templates to accelerate your data journey

Subset Sampling Method

Split data into training & test sets

Overview

This workflow demonstrates how to perform data sampling using the standard split method, enabling efficient model training and evaluation by dividing data into proportional subsets.

Details

The process begins with the Credit Card Fraud dataset. The Split node divides the dataset into two subsets according to a specified split ratio, for example 70% for training and 30% for testing. The first output edge receives the specified percentage of rows, and the second output edge receives the remainder.
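For readers who want to see the same idea outside the visual editor, the split step can be sketched in plain Python. This is only an illustration under stated assumptions: the function name `split_rows`, its parameters, and the optional shuffle are hypothetical, and the source does not say whether the Split node shuffles rows before dividing them.

```python
import random


def split_rows(rows, ratio=0.7, seed=None):
    """Split rows into two lists; the first receives about `ratio` of them.

    Hypothetical helper, not the Split node's actual implementation.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0 and 1")
    rows = list(rows)
    # Assumption: shuffle only when a seed is given, so the split is reproducible.
    if seed is not None:
        random.Random(seed).shuffle(rows)
    cut = round(len(rows) * ratio)
    # First output gets the specified share, second output gets the remainder.
    return rows[:cut], rows[cut:]


records = list(range(10))  # stand-in for dataset rows
train, test = split_rows(records, ratio=0.7, seed=42)
print(len(train), len(test))
```

With ten rows and a ratio of 0.7, the first list holds seven rows and the second holds three. Every row lands in exactly one of the two subsets.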

Each resulting subset is previewed with a Print N Rows node to verify that the data was partitioned correctly. Keeping the test subset separate from the training subset means the model is evaluated on rows it did not see during training, which supports a more reliable performance estimate.
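The preview step can be approximated in Python as well. The sketch below assumes each subset is a list of rows; `first_n_rows` and `share` are hypothetical helper names, and the sample rows are made-up placeholders rather than values from the Credit Card Fraud dataset.

```python
from itertools import islice


def first_n_rows(rows, n=5):
    """Return up to the first n rows, similar in spirit to a row preview."""
    return list(islice(rows, n))


def share(part, total_count):
    """Fraction of all rows that ended up in this subset."""
    return len(part) / total_count if total_count else 0.0


# Placeholder subsets standing in for the two outputs of a 70/30 split.
train = [{"id": i} for i in range(7)]
test = [{"id": i} for i in range(7, 10)]
total = len(train) + len(test)

for name, subset in (("train", train), ("test", test)):
    print(name, f"{share(subset, total):.0%}", first_n_rows(subset, 2))
```

Checking both the first few rows and each subset's share of the total is a quick way to confirm that the split ratio was applied as intended.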
