Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

Version 1 Next »

Private_Edition_flag.png

Introduction.. Lorem ipsum dolor sit amet, consectetur adipiscing elit. Curabitur accumsan malesuada leo sed fermentum. Nullam non vehicula ligula, ut facilisis nisl. Curabitur at iaculis nisl, sit amet luctus justo. Cras hendrerit quam orci, eget consectetur eros congue eu. Aliquam pulvinar tempor mi, quis efficitur quam cursus a.

Include an image of an example workflow..

Workflow design guide

  1. Always start with a Batch Scaling Collection Workflow that collects from the original file source and forwards UDRs to Kafka.

  2. The Batch Scaling Processing Workflows can be one or a series of workflows. Batch Duplication Check and Aggregation can be part of the same workflow. There can only be ONE Aggregation Agent and ONE Duplication Agent per workflow.

  3. Decide how many maximum workflows that can execute in parallell, i.e. how can you shard your data in an efficient way / distribute evenly in different groups. Then you need to decide how to decide which identifier / sorting parameter the workflow should use to distribute the UDR. Typically a field based on record group / ID / number etc. If there is no such field, use APL to create and populate such a field (round-robin among shards for instance). UI Parameters:

Parameter

Comment

ID Field

Defines how to match a UDR to a partition.

Max Scale Factor

Number of partitions, which is the same as maximum number of workflows that can execute in parallell.

Note that if any of the parameters needs to be changed, it is considered being a new configuration, and hence starting with empty topics.

If you want to use the existing data, you must use the standard Kafka Agents and migrate the data. Or do we even want to mention this?

  • No labels