Oct 14, 20172 minWriting to and Reading from Elastic Search with Apache SparkElastic Search is often used for indexing, searching and analyzing datasets. Sparkflows makes is very smooth to read any data, clean and...
Oct 1, 20171 minSparkflows now available for CDH/HDP/MAPRSparkflows binaries are now available for CDH, HDP and MapR. You can download them from https://www.sparkflows.io/archives
Sep 27, 20171 minExtending Sparkflows with Custom ProcessorsSparkflows now has close to 150 Processors for Machine Learning, NLP, ETL, Entity Resolution, OCR, Handling Unstructured Data etc. But,...
Aug 30, 20171 minSplitting the incoming DataFrame into many based on various conditional expressionsSparkflows has a couple of nodes for splitting the incoming DataFrame. One is to split it into two based on the percentage specified for...