Workflow Automation Templates
A library of ready-to-use workflow templates to accelerate your data journey
Group By Recency Frequency Mean FE Spark
Generate user insights with RFM metrics

Overview
This workflow aggregates transactional data using Spark to compute Recency, Frequency, and Monetary (RFM) features. It provides user-level insights into purchasing patterns, spending behavior, and engagement for analytics or predictive modeling.
Details
The workflow starts by loading the dataset containing user transactions with details such as purchase date, amount, and category. The Group By RFM Features node groups data by user ID and calculates key metrics including purchase frequency, recency (days since the last transaction), and average intervals between purchases.
Additional attributes like total spend and customer age can be derived for richer behavioral analysis. The output, combining original and computed user-level features, is displayed through the Print N Rows node for validation.
This workflow helps uncover valuable user behavior trends and supports customer segmentation, retention, and lifetime value modeling.