ML Engines

Scikit Learn

Apache Spark

H20

Keras

facebook_prophet_icon.png

Prophet

Arima

ARIMA

Stats Model

ML Algorithms

machine-learning.png

Scikit Learn

Classification

  • Gradient Boosting Classifie

  • Logistic Regression

  • Random Forest Classifier

Regression 

  • Bayesian Ridge Regression

  • Gradient Boosting Regression

  • Lasso Regression

  • Random Forest Regression

  • Ridge Regression

Evaluator 

  • Regression Evaluator

  • Classification Evaluator

  • Custom Metrics

Modeling

  • Model Predict

  • Model Save

  • Model Load

Time Series:

  • ARIMA

  • Prophet

  • VAR

H20

  • Gradient Boosting Machine

  • Generalized Linear Models

  • Generalized Low Rank Models

  • Distributed Random Forest

  • Isolation Forest

  • K-Means

  • Naive Bayes

  • Neural Network

  • PCA

  • Word to Vec

  • XGBoost

Spark ML

Feature Transformers

  • Binarizer

  • IDF

  • Index String

  • N Gram Transformer

  • Normalizer

  • One Hot Encoder

  • Polynominal Expansion

  • Quantile Discretizer

  • SMOTE(Synthetic Minority

  • Over-sampling Technique)

  • SQL Transformer

  • Stop Words Remover

  • String Indexer

  • Tokenizer

  • Vector Assembler

  • Vector Functions

  • Vector Indexer

  • Word To Score Mapping

Feature Extraction

  • Count Vectorizer

  • Hashing TF

  • R Formula

  • Word2 Vec

Feature Scaler

  • Min Max Scaler

  • Standard Scaler

Evaluator

  • Binary Classification Evaluator

  • Multiclass Classification Evaluator

  • Regression Evaluator

  • Spark ML ROC

Modeling 

  • Train Validation Split

  • Cross Validator

  • Spark Pipeline

  • Spark Predict

  • ModelSave

  • ModelLoad

Spark ML

Clustering :

  • K-Means Clustering

  • LDA

  • Gaussian Mixture

Regression

  • AFT Survival Regression

  • Decision Tree Regression

  • GBT Regression

  • Linear Regression

  • Random Forest Regression

  • XGBoost Regression

Classification

  • Decision Tree Classifier

  • GBT Classifier

  • Logistic Regression

  • MultiLayer Perceptron

  • Naive Bayes

  • Random Forest Classifier

  • XGBoost Classifier

Collaborative Filtering:

  • ALS - Alternating Least Squares

FP Growth :

  • FP Growth

Feature Selection:

  • ChiSq Selector

  • Vector Slicer

Split Dataset:

  • Split

  • Split Probability Column

  • Split With Stratified Sampling

Dimensionality Reduction :

  • PCA

  • SVD