top of page

Collaborative Self-Serve Advanced
Analytics with Sparkflows + Google Cloud Platform (GCP)

Perform Data Analytics, Data Exploration, and build ML models

and Data Engineering in minutes using the 450+ Processors in Sparkflows

GCP+sparkflows-03-4 (1).png

Sparkflows is deeply integrated with and certified on GCP. It can be installed on an GCP machine, run in standalone mode, or submit the jobs to Google Cloud Dataprep. It can process data from Google Cloud Storage, BigQuery, Google Cloud Data etc.

GCP_architecture.png

Build and Run Analytics and ML jobs on Dataproc or standalone machines

Read and Write data to Google BigQuery

Seamlessly read files from Google Cloud Storage and process them

Read and process streaming

data from Apache Kafka and Google Cloud Dataflow

Send data to and build ML

models on Vertex AI Workbench

Results include data in

Charts, Tables, Text etc

Integration with Google cloud Dataproc

Sparkflows can be easily installed on GCP. It can connect to any Google Dataproc cluster, submit jobs to it and display results. 

Integration with Google Cloud Dataprep

Sparkflows can submit the Analytical Jobs to be run onto Google Dataprep. The results and visualizations are displayed back in Sparkflows.

Integration with BigQuery

Sparkflows is fully integrated with BigQuery. Sparkflows has processors for reading from and writing to BigQuery. Seamlessly browse the BigQuery catalog, run queries and view results. 

Integration with Google Cloud Storage

Sparkflows allows you to access your files on Cloud Storage. The jobs run by Sparkflows can read from and write to files on Cloud Storage. The files can be in various file formats including CSV, JSON, Parquet, Avro, etc.​ Sparkflows also allows you to browse your files on Cloud Storage.

Integration with Vertex AI Workbench

Sparkflows is fully integrated with Vertex AI Workbench. Sparkflows provides a number of processors for doing model building with Vertex AI Workbench. These include :

LinearLearnerBinaryClassifier

LinearLearnerRegressor


​XGBoost Model

 

AutoML

Scikit-learn library Models

 

Benefits of Sparkflows on Google Cloud

Find quick value with Sparkflows and GCP

Enable Business Analysts

Enable Business Analysts to find quick value with GCP clusters.

Self Serve Advanced Analytics

Enable users to do Analytics and Machine Learning in minutes.

Enable 10x more users to build
Data Science use cases.

10x More Users

Makes it easy to build, maintain
and execute.

No code and low code platform

Return on Investment (ROI)

Solve your Data Science use cases 10x faster.

bottom of page