Versions Compared

Key

  • This line was added.
  • This line was removed.
  • Formatting was changed.

...

Overview

...

Usage Engine Private Edition now supports automatic batch scaling, making it possible to increase or decrease processing capacity as needed , without the need of changing configuration or any manual intervention.

Batch scaling in 9.2 is not easily achieved:

...

Transactions are stored per WF and file.

...

File collection cannot scale using the same WF config (you need to have different reg exp).

...

.

...

Duplicate UDR is a separate storage (file/db). Only one WF at a time can work on one Profile/storage. The only alternative option you have is sharding (additional Profiles).

The solution we implemented The new solution uses Kafka for both transaction/state and working storage for aggregation and duplication data. Mention the new agents - but don’t go into detailThe feature consists of several new agents that work in the same way as their existing counterparts but with an extra Storage state in the profile. These new agents cannot be grouped in the same workflows as regular agents because they have different storage.

Info

Note that you cannot mix standard agents with scaling agents in the same workflow. Workflows with standard agents save the state in PE. Workflows with batch agents save the state in Kafka.

How it works

Assume that you have a batch use case where you collect files, and have to do duplication checks and aggregation. You want to be able to scale. You need 2 or 3 WFs. In the picture below we use 3 WFs.

  • What prerequisites for Kafka

...

  • ?

...

  1. The File collection workflow(s) will use the ID Fields (e.g. customer id?) to determine which shard/partition a UDR belongs to.

  2. The number of partitions is determined using the Max Scale Factor parameter. The number of partitions will be the same for all different storages needed:

    1. Passing of UDRs between workflows.

    2. Duplicate UDR keys.

    3. Aggregation Sessions

  3. The Duplication Check workflow(s) will check for duplicates across all partitions. Checked UDRs are placed in another topic with the same corresponding partitions as the topic the workflow collected from. (The Duplication Keyes are saved in a separate topic with the same number of partitions having the same ID fields.)

  4. The Aggregation workflow(s) will collect from an inter-workflow topic, and work against a separate aggregation session storage topic.

Subsections

This section contains the following subsections:

...

  • Configuration

  • Batch scaling agents