top of page

Installation

Seamless Installation Across Cloud, On-Premise & Standalone.

cloudera_logo_darkorange.png
icons8-aws-logo-480.png
image (31) (1).png
Databricks_Logo.png
Azure-Logo-Transparent (1).png
Google-Cloud-Symbol.png
Cloudera.png

Overview

Sparkflows can be installed on the cloud or on-premise. It can be installed on AWS, Incorta, Azure, Google Cloud, Incorta, Databricks, Cloudera, and Hortonworks.

Incorta

Sparkflows can be deployed on Incorta to support end-to-end workflow execution in a fully integrated environment. This setup helps users prepare data, run processes, and access outputs in a streamlined and consistent manner.

sparkflows_azure_databricks.png

Databricks

Sparkflows can be installed on one or more machines. The jobs get submitted to the Databricks cluster.

AWS

Sparkflows can be installed on AWS. It can be deployed on a standalone EC2 machine. It can then read data from S3, Redshift, etc., process them, and write out the results to S3, Redshift, etc.

Or it can be installed on the edge node of an EMR cluster. In this case, it would submit the jobs to the EMR cluster for processing.

AWS_SPARKFLOWS_ARCHITECTURE.png
GCP_architecture.png

GCP

Sparkflows can be installed on GCP. It can be deployed on a standalone EC2 machine. It can then read data from S3, Redshift, etc., process them, and write out the results to S3, Redshift, etc.

Or it can be installed on the edge node of an EMR cluster. In this case, it would submit the jobs to the EMR cluster for processing.

Azure

Sparkflows can be installed on Azure. It can be deployed on a standalone machine. It can then read data from ADLS, SQL Server, etc., process them, and write out the results to ADLS, SQL Server, etc.

Or it can be installed on the edge node of an HDInsight cluster. In this case, it would submit the jobs to the HDInsight cluster for processing.

sparkflows_azure_hdinsights.png
sparkflows_cloudera.png

Cloudera

Sparkflows can be installed on the edge node of a Cloudera Cluster. It then submits the jobs to the Cluster. Sparkflows interact with HIVE, HDFS, Kafka, etc.

Laptop

Sparkflows can be installed on a standalone machine.

SPARKFLOWS-STANDALONE-ARCHITECTURE.png
bottom of page