Installation
AWS
Sparkflows can be installed on AWS. It can be deployed on a standalone EC2 machine. It can then read data from S3, Redshift, etc., process them, and write out the results to S3, Redshift, etc.
Or it can be installed on the edge node of an EMR cluster. In this case, it would submit the jobs to the EMR cluster for processing.
GCP
Sparkflows can be installed on GCP. It can be deployed on a standalone EC2 machine. It can then read data from S3, Redshift, etc., process them, and write out the results to S3, Redshift, etc.
Or it can be installed on the edge node of an EMR cluster. In this case, it would submit the jobs to the EMR cluster for processing.
Azure
Sparkflows can be installed on Azure. It can be deployed on a standalone machine. It can then read data from ADLS, SQL Server, etc., process them, and write out the results to ADLS, SQL Server, etc.
Or it can be installed on the edge node of an HDInsight cluster. In this case, it would submit the jobs to the HDInsight cluster for processing.