top of page

Extending Sparkflows with Custom Processors


Sparkflows now has close to 150 Processors for Machine Learning, NLP, ETL, Entity Resolution, OCR, Handling Unstructured Data etc.

But, the amazing thing about Sparkflows is that any user can add their own Custom Processors. Enterprises are finding that by doing so, they start with the rich set of existing ~150 Processors, and can focus on add those core Processors only which apply to their Business.

Writing New Processors

This github repo guides you through writing New Processors:

The master branch is for Spark 1.6.X and the spark-2.x branch is for Spark 2.X.

It has a number of example Processors. You can take any of them and build your own.

Get started with your own Processors. Why not contribute your Processors to the community by making them public.


106 views1 comment
bottom of page