Hey Chris,
In Sparkflows, we can use the ‘Group By’ processor to compute aggregates for each combination of values in the selected ‘Grouping Columns’.
To use the ‘Group By’ processor:
- Select a set of ‘Grouping Columns’. Aggregated values are computed for each distinct combination of values in these columns. In this case, the grouping columns are ‘Location’ and ‘Dept’.
- Select the aggregates to compute. In this case, the ‘Aggregated Columns’ entry is ‘Salary’ and the ‘Aggregated Function’ is ‘Summation’. The output shows the sum of salaries for each combination of location and department.
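To make the result of the two steps above concrete, here is a minimal plain-Python sketch of what a group-by with a sum computes. It is not Sparkflows code and does not use the Sparkflows API; the sample rows and the helper name `group_by_sum` are made up for illustration, and only the column names ‘Location’, ‘Dept’ and ‘Salary’ come from the example above.

```python
from collections import defaultdict

# Hypothetical sample rows; the values are invented for illustration only.
rows = [
    {"Location": "NY", "Dept": "Sales", "Salary": 100},
    {"Location": "NY", "Dept": "Sales", "Salary": 150},
    {"Location": "NY", "Dept": "HR", "Salary": 90},
    {"Location": "SF", "Dept": "Sales", "Salary": 120},
]


def group_by_sum(rows, grouping_columns, aggregate_column):
    """Sum aggregate_column for each distinct combination of grouping_columns.

    Illustrative helper, not part of Sparkflows.
    """
    totals = defaultdict(int)
    for row in rows:
        # The group key is the tuple of values in the grouping columns.
        key = tuple(row[column] for column in grouping_columns)
        totals[key] += row[aggregate_column]
    return dict(totals)


result = group_by_sum(rows, ["Location", "Dept"], "Salary")

# One output row per (Location, Dept) combination.
for (location, dept), total in sorted(result.items()):
    print(location, dept, total)
```

Each distinct (Location, Dept) pair yields exactly one output row, which is the same shape of result the ‘Group By’ processor produces with ‘Summation’ on ‘Salary’.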
For more information, see the Sparkflows documentation:
https://docs.sparkflows.io/en/latest/user-guide/data-preparation/aggregate.html?highlight=group%20by#group-by