Sep 27, 2017

Extending Sparkflows with Custom Processors

Sparkflows now has close to 150 Processors for Machine Learning, NLP, ETL, Entity Resolution, OCR, handling unstructured data, and more.

But the amazing thing about Sparkflows is that any user can add their own Custom Processors. Enterprises are finding that they can start with the rich set of ~150 existing Processors and focus on adding only those core Processors that apply to their business.

Writing New Processors

This GitHub repo guides you through writing new Processors:

https://github.com/sparkflows/writing-new-node

The master branch is for Spark 1.6.X and the spark-2.x branch is for Spark 2.X.

The repo includes a number of example Processors. You can take any of them as a starting point and build your own.

https://github.com/sparkflows/writing-new-node/tree/master/src/main/java/fire/nodes/examples
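To give a feel for the general shape of a custom Processor, here is a minimal, self-contained sketch in plain Java. It is an illustration only and does not use the Sparkflows API: the `Processor` interface, the `UpperCaseProcessor` class, and the row-as-`Map` representation are all hypothetical stand-ins chosen so the example runs without Spark. The actual base classes and method signatures to extend are the ones shown in the example Processors in the repo above.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;

public class CustomProcessorSketch {

    // Hypothetical contract for a processing step: rows in, rows out.
    // This is NOT the Sparkflows interface; see the repo examples for the real one.
    interface Processor {
        List<Map<String, Object>> execute(List<Map<String, Object>> rows);
    }

    // Hypothetical custom processor: upper-cases the string values of one column.
    static class UpperCaseProcessor implements Processor {
        private final String column;

        UpperCaseProcessor(String column) {
            this.column = column;
        }

        @Override
        public List<Map<String, Object>> execute(List<Map<String, Object>> rows) {
            List<Map<String, Object>> out = new ArrayList<>();
            for (Map<String, Object> row : rows) {
                // Copy each row so the caller's input is left unmodified.
                Map<String, Object> copy = new HashMap<>(row);
                Object value = copy.get(column);
                if (value instanceof String) {
                    copy.put(column, ((String) value).toUpperCase(Locale.ROOT));
                }
                out.add(copy);
            }
            return out;
        }
    }

    public static void main(String[] args) {
        Map<String, Object> row = new HashMap<>();
        row.put("name", "alice");

        List<Map<String, Object>> input = new ArrayList<>();
        input.add(row);

        Processor processor = new UpperCaseProcessor("name");
        System.out.println(processor.execute(input)); // prints [{name=ALICE}]
    }
}
```

The idea carried over from this sketch is only the pattern: a Processor is configured with a few parameters (here, a column name), receives a dataset, and returns a transformed dataset for the next step in the workflow.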

Get started with your own Processors. And why not contribute them to the community by making them public?
