Workflow Automation Templates
A library of ready-to-use workflow templates to accelerate your data journey
Subset Sampling Method
Split data into training & test sets

Overview
This workflow demonstrates how to perform data sampling using the standard split method, enabling efficient model training and evaluation by dividing data into proportional subsets.
Details
The process begins with the Credit Card Fraud dataset. The Split node divides the dataset into two subsets based on a specified split ratio, for example 70% for training and 30% for testing. The first output edge receives the specified percentage of rows, while the second receives the remainder.
Each resulting subset is previewed with a Print N Rows node to verify that the data was partitioned as expected. Keeping the test subset separate from the training subset means the model is evaluated on rows it did not see during training, which supports a more reliable performance estimate.
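
For readers who want to see the same idea outside the visual editor, the split and preview steps can be approximated in plain Python. This is a minimal sketch, not the implementation behind the Split or Print N Rows nodes: the function names split_rows and print_n_rows, the optional seed-based shuffle, and the rounding of the cut point are all assumptions made for illustration, and the rows are synthetic stand-ins rather than the Credit Card Fraud dataset.

```python
import random
from typing import List, Optional, Sequence, Tuple, TypeVar

T = TypeVar("T")


def split_rows(
    rows: Sequence[T], ratio: float, seed: Optional[int] = None
) -> Tuple[List[T], List[T]]:
    """Return (first, remainder), where `first` holds about `ratio` of the rows.

    Hypothetical helper: how the actual Split node orders or samples rows
    is not stated in the template description.
    """
    if not 0.0 <= ratio <= 1.0:
        raise ValueError("ratio must be between 0 and 1")
    items = list(rows)
    if seed is not None:
        # Assumption: shuffle reproducibly when a seed is given;
        # with seed=None the original row order is preserved.
        random.Random(seed).shuffle(items)
    cut = round(len(items) * ratio)  # number of rows sent to the first output
    return items[:cut], items[cut:]


def print_n_rows(rows: Sequence[T], n: int = 5) -> None:
    """Print the first n rows, loosely mirroring a preview step."""
    for row in rows[:n]:
        print(row)


if __name__ == "__main__":
    # Synthetic rows for illustration only.
    data = [{"id": i, "amount": 10.0 * i} for i in range(10)]
    train, test = split_rows(data, ratio=0.7, seed=42)
    print(len(train), len(test))  # sizes of the two subsets
    print_n_rows(train, n=3)
    print_n_rows(test, n=3)
```

Comparing the two subset sizes against the requested ratio, and glancing at a few rows from each, is the same check the preview nodes provide in the workflow. Note that a split by ratio alone does not control how fraud and non-fraud rows are distributed between the subsets.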