top of page

Templates

Input/Output

Read/Write Files
Workflows
  • Save As JSON Files 

  • Save As Parquet Files

  • Read-CSV

  • Read Multi-line JSON

  • Read Parquet File

  • Read Excel File

  • PDF Image OCR

  • Read PDF File

  • Retail Example

Connectors
Workflows
  • Read File from URL

  • Elastic Search Save

  • Read from HIVE

  • Read JSON from URL

Data Preparation/Engineering

Data Preparation
Workflows
  • Dedup Customers

  • Titanic Data Cleaning Example

  • Data Validation Multiple

  • Data Validation for Email

  • REST - CSV Reader and Parse

  • Convert to Timestamp

  • Date Time Field Extract

  • REST Read and Parse JSON

  • Drop Rows With Null

  • Remove Special Characters

  • Remove Duplicate Rows

  • Column Filter

  • Concat Columns

  • Data Preparation - 1

  • Regex Multi-Example

  • Cast Nodes

  • Select - Drop - Rename Columns

  • Data Wrangling

  • Using Imputeand Case Statements

  • OCR

  • Profiling Housing Dataset

  • Remove Duplicate Rows

  • Select - Drop - Rename Columns

  • String Functions

Application
  • Customers Segmentation App

Data Engineering
  • Titanic Data Cleaning

  • Data Validation - Complex

  • Clickstream Data Pipeline

  • Data Validation

  • Data Validation Address

  • Data Validation Schema

  • Complex Customer ETL

  • Simple Customer ETL

  • Employee - Data Profiling

  • Housing Data Report

  • Multi Join

  • NYC Taxi Average Speed Report

  • Read & Write To Hive Table

  • Remove Duplicate Rows

  • Retail Transactions Pipeline

  • SQL Workflow

  • Scala Workflow

Machine Learning

Recommendations
Workflows
  • Movie Lens Ratings Distribution

  • Music Recommender

  • Ratings

  • Ratings Exploration

Price Elasticity
  • Predict volume and create parabola

  • Calculate profit and create model

  • Data Exploration

NLP
  • OpenNLP Name Finder

Data Science

  • Churn Prediction - RFC - H2O

  • Churn Prediction - RFC

  • ML-Wizard

  • H2O KMeans of Bike Sharing Dataset

  • Churn Data Analysis

  • KMeans Clustering - Save & Load

  • Click-thru Prediction Using GBT

  • Spark ML Pipeline Model Save

  • Car Acceleration

  • Telco Churn Prediction - Data Profile

  • Bike Sharing Analysis

  • H2O DRF

  • Anomaly Detection with Isolation Forests using H2O

  • Correlation and Summary

  • Score Spark ML Model

  • Farmers Market Prediction

  • SMS Spam Detection Using Logistic Regression Model

  • Store wise Retail Stock Prediction Using Linear Regression Model

  • Spark XGBoost - Classification

  • H2O Neural Network

  • GBTRegression

  • Feature Selection Methods

  • Churn Prediction - Decision Tree Classifier

  • Decision Tree Regression

  • K-means clustering using Housing dataset

  • Credit Card Fraud Prediction Using Logistic Regression Model

  • Parameters Passing

  • Chi-Square Test for Feature Selection

  • Churn Prediction - RFC & Logistic Regression

  • Comparative Analysis of Multiple Models

  • Covariance and Correlation

  • CrossTab

  • Dimensionality Reduction - PCA

  • Distribution

  • Feature Decomposition

  • H2O DRF

  • H2O GBM of Bike Sharing Dataset

  • H2O GLM

  • H2O GLRM of Credit card fraud Dataset

  • H20 - Isolation Forest

  • H2O KMeans of Bike Sharing Dataset

  • H2O Neural Network

  • H2O XGBoost

  • Household Power Consumption

  • K-means clustering using Housing dataset

  • House Price Prediction Using Random Forest Regression

  • Imputing Methods

  • Load CSV - JSON

  • Multi-layer Perceptron Classifier - WineQuality

CPG Solutions

Customer Segmentation
Workflows
  • Transactional-Data-EDA

  • Voice Data Customer Call Volume Clustering

  • Voice Data Customer Call Charge Clustering

  • Voice Data Customer Call Duration Clustering

  • Final Profiling

  • Customers Clustering

  • Campaign Data Feature Engineering

  • Transactional Data Enrichment

Reports
  • Customer Voice Data Clusters

  • Customer Segmentation Overview

Application
  • Customers Segmentation App

Product Recommendation
Workflows
  • Customers Invoice Data EDA

  • Cluster Wise FPG Recommendations

  • Customers Clustering and Profiling

  • Data Preparation for clustering

  • Read Parquet File

  • Invoice Data Cleaning

Churn Prediction
Workflows
  • Churn Prediction

  • Churn Classification Model Training

  • Sales Transaction Data EDA

  • Sales Transaction Data Preparation

  • Sales Transaction Data EDA 2

Reports
  • Churn Reports

Application
  • Churn Analytical App

Timeseries

Workflows
  • Monthly Passengers distribution

  • Stock Forecast Graph

  • AirPassengers Forecasting - Prophet Model

  • AirPassengers Forecasting - Prophet Model - With holidays

  • AirPassengers Forecasting - SARIMAX Model

  • AirPassengers Forecasting - Prophet Model - Multivariate

  • Generate the Forecasts - Stocks

  • Temperature Forecast

  • Earthquake Prediction - SQL - RandomForest

  • Feature Engineering Techniques For Time Series Data

  • AirPassengers Forecasting - SARIMAX Multivariate Model

  • AirPassengers Forecasting - ARIMA Model

  • GDP Forecasting - Multivariate VAR Model

  • Safety Stock Calculations for Inventory Using Prophet

Others

Data Quality
Visualization
Streaming
  • Transactional Data EDA

  • Multiple Validation Example

  • Employee Data Cleaning and Data Quality

  • Housing - Data Quality

  • Print n rows Example

  • Housing Data - Graph values

  • Subplot Graph-Example

  • Graph Group by column-Example

  • Gauge Graph Example

  • Bubble chart Example

  • Distribution Graphs

  • Boxplot Example

  • Creating Subplots

  • Churn Data Analysis

  • Creating a Boxplot

  • Graph column values by count

  • NYC Taxi Average Speed

  • Print Rich Text Example

  • Train Data - Graph values

  • Box Plot & Sub Plots

  • Data Visualization 

  • Kafka Streaming - Bike Sharing

  • File Streaming - JSON Events

  • Structured Streaming - File Sink

  • Streaming test

  • Kafka Streaming - Flights Delay

  • Streaming Socket Text Stream

  • Structured Streaming - Join CSV

  • Structured Streaming - Join Hive Table

  • Structured Streaming - Basic

  • Recover Partition - HIVE StreamingSink

Reports
  • Streaming - Bike-Sharing

Healthcare Optimize
Hospital Operation
  • Classify Patient's Number of Stays In Hospital

  • Classify Severity of illness

  • Hospital Patient Data EDA

Reports
  • Health Severity Condition Report

Data Profiling and Visualizations

Data Profile Explore
Visualization
  • Data Exploration of Housing Data

  • Loan - Data Profile

  • Housing Training - Data Profile

  • Sales Example Data - Data Profile

  • Correlation between Columns

  • Detecting Outliers

  • WHO Data Cleaning

  • Loan DataProfile - II

  • Loan DataProfile - I

Reports
  • Profile

  • Housing Dataset Profiling

  • Report

  • Report Test

  • Print n rows Example

  • Housing Data - Graph values

  • Subplot Graph-Example

  • Graph Group by column-Example

  • Gauge Graph Example

  • Bubble chart Example

  • Distribution Graphs

  • Boxplot Example

  • Creating Subplots

  • Churn Data Analysis

  • Creating a Boxplot

  • Graph column values by count

  • NYC Taxi Average Speed

  • Print Rich Text Example

  • Train Data - Graph values

  • Box Plot & Sub Plots

  • Data Visualization 

Retail

Sales Forecasting
Retail
  • ML Model Building

  • Sales Data Cleaning

  • Sales Data Profile

  • Sales Weekly Cumulative for all Depts

  • Stores Data Cleaning

  • Store Type Avg Weekly Sales

  • Store Type Size Bubble Chart

  • Feature Selection and Correlation

  • Data Validation And Indexing

  • Combining All Datasets

Reports
  • Profile

  • Housing Dataset Profiling

  • Report

  • Report Test

  • Complex Customer ETL

  • Dedup Customers

  • Clickstream Data Analysis

  • Retail Data Preparation

  • Retail Data Analysis

Reports
  • Clickstream Data Analysis

  • User Engagement Analysis Report

  • Apache Logs Analysis

  • Isolation Forest - Threshold

  • Auto Encoder - Deep neural network

  • Bank Marketing - Analyse categorical variable

  • Bank Marketing - ML

  • Bank Marketing - Numerical Variables Analysis

  • Data Analysis - Credit Card

  • Stocks Moving Average

Code in Workflows

PySpark
  • JDBC

  • Housing Sklearn Bayesian Ridge Regression

  • GBT Classification - diabetes or not

  • diabetes classification SKlearn logistics classifier

  • German Credit Card - Classification

  • Keras - Example

  • Sarimax - daily female birth

  • Housing SkLearn Lasso Regression

  • Housing gradient boosting regression

  • Join Workflow

  • Multi Input To Multi Output Pyspark  Code

  • Neural Network with Keras

  • Pandas Workflow

  • PySpark Workflow

  • SQL Workflow

  • Sales record SkLearn Ridge Regression

  • Wine Quality Linear Regression

  • diabetes classification  SKlearn GTBC

  • diabetes classification SKlearn Random Forest classifier

  • diabetes classification  SKlearn GTBC

  • house price - ridge regression

  • MultiInputPyspark  Code

  • Regession test pyspark node

  • Sales record SkLearn Random forest Regression

  • Sklearn Model Save - House Price

  • Sklearn Model Save - House Price

  • SplitByExpression

  • Variable Selection - ScoreCardPy

Code
  • Pipe Python with Pandas

  • Keras Anomaly Detection AutoEncoder

  • Jython

  • SQL Workflow

  • Scala Workflow

  • Pivot Example

  • RFM - Calculation

Customer 360

  • Customer Credit Data Agg Data Prep

  • Customer Churn

  • Data Reports

bottom of page