Workflow Automation Templates
A library of ready-to-use workflow templates to accelerate your data journey
FP Growth Prediction
Apply pre-trained Spark ML models

Overview
This workflow applies a pre-trained Spark Machine Learning (ML) model to make predictions on transactional datasets. It enables seamless integration of existing models with new data, allowing users to perform scalable, distributed predictions directly within a Spark environment.
Details
The process begins with loading transactional data from a CSV file, followed by an SQL transformation to filter, aggregate, or prepare the data for prediction. A pre-trained model is then loaded using the Spark ML Model Load node.
The prepared dataset and the model are combined in the Spark Predict node, which applies the ML model to generate predictions efficiently across large-scale data. The Print N Rows node provides a preview of prediction results for quick validation and inspection.
This workflow streamlines the model deployment process, enabling rapid inference, operationalization of trained models, and integration of predictive analytics into existing Spark pipelines.