...

Creating a scalable solution

  1. Always start with a Batch Scaling Collection Workflow that collects from the original file source and forwards UDRs to Kafka.

  2. The Batch Scaling Processing Workflows can consist of one workflow or a series of workflows. Batch Duplication Check and Aggregation can be part of the same workflow. There can only be one Aggregation agent and one Deduplication agent per workflow.

  3. Decide the maximum number of workflows that should execute in parallel. Think about how you can evenly distribute (shard) your data into different groups. For example….add example here. Finally, select an identifier that the workflow will use to distribute the UDRs. Typically, this is a field based on the record group, such as a customer ID or an account number. You can also create and populate such a field using APL (insert link).
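As an illustration of sharding on an identifier, the sketch below maps an ID field value to one of a fixed number of partitions using a stable hash. It is a generic Python example and not Usage Engine's actual partitioning logic. The function name partition_for, the sample customer IDs, and the choice of SHA-256 are all assumptions made for this illustration.

```python
import hashlib


def partition_for(id_value: str, max_scale_factor: int) -> int:
    """Map an identifier to a partition index in [0, max_scale_factor).

    Hypothetical helper: a stable hash is used so that the same ID
    always lands in the same partition, across runs and processes.
    """
    if max_scale_factor < 1:
        raise ValueError("max_scale_factor must be at least 1")
    digest = hashlib.sha256(id_value.encode("utf-8")).digest()
    # Use the first 8 bytes of the digest as an unsigned integer.
    return int.from_bytes(digest[:8], "big") % max_scale_factor


if __name__ == "__main__":
    # Hypothetical customer IDs; every UDR for one customer goes to one partition.
    for customer_id in ["customer-1001", "customer-1002", "customer-1003"]:
        print(customer_id, "->", partition_for(customer_id, 4))
```

An identifier with many distinct, evenly used values (such as a customer ID) tends to spread records more evenly than a field with only a few values.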

...

Parameter

Comment

ID Field

Defines how to match a UDR to a partition.

Max Scale Factor (set in the Partition Profile configuration)

The number of partitions, which equals the maximum number of workflows that can execute in parallel. Fewer workflows can run, but not more.

Note!

If any of these parameters is changed, the result is considered a new configuration, and the workflows must start with empty topics.

To keep existing data, you must migrate it using the standard Kafka agents.
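To illustrate why the Max Scale Factor caps the number of parallel workflows, the sketch below spreads a fixed set of partitions across a smaller or equal number of workflows. The round-robin scheme and the function name assign_partitions are assumptions for this example. The source does not describe how Usage Engine actually assigns partitions to workflows.

```python
def assign_partitions(num_partitions: int, num_workflows: int) -> dict[int, list[int]]:
    """Distribute partition indexes across workflows round-robin.

    Hypothetical helper: rejects more workflows than partitions, because
    an extra workflow would have no partition to read from.
    """
    if not 1 <= num_workflows <= num_partitions:
        raise ValueError("need 1 <= num_workflows <= num_partitions")
    assignment: dict[int, list[int]] = {w: [] for w in range(num_workflows)}
    for partition in range(num_partitions):
        assignment[partition % num_workflows].append(partition)
    return assignment


if __name__ == "__main__":
    # Max Scale Factor of 6, with 4 workflows currently running.
    print(assign_partitions(6, 4))
    # prints {0: [0, 4], 1: [1, 5], 2: [2], 3: [3]}
```

With fewer workflows than partitions, some workflows handle more than one partition. A request for seven workflows against six partitions raises an error in this sketch.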

...

Usage Engine scales out, scales in, and rebalances scalable batch workflows automatically. You can also schedule when a scale-out or scale-in starts.

...

Deploying/grouping a scale-out configuration with ECDs:

...

Use the regular ECD (Execution Context Deployment) definition with Dynamic Workflows to define how to package a scale-out. You have to manually define when these ECDs will activate.

A Collection Workflow scales by one extra workflow per ECD.

...