Skip to end of metadata
Go to start of metadata

You are viewing an old version of this page. View the current version.

Compare with Current View Page History

« Previous Version 13 Next »

""""""""""""""""""REMOVE THIS SECTION AS IT'S AN EXACT REPETITION OF THE PREVIOUS, WHICH MAKES IT CONFUSING""""""""""""""""""""""" To create a new Parquet profile configuration, click the New Configuration button in the upper left part of the Desktop window, and then select Parquet Profile from the menu. The contents of the menus in the menu bar may change depending on which configuration type hasĀ  been opened. The Parquet profile uses the standard menu items and buttons that are visible for all configurations, and these are described in 2.1 Menus and Buttons.


The Parquet profile's Schema tab with an example of a defined Schema. You will have to write the Schema for your desired functions.


SettingDescription

Schema

See below.

Validate

Press the Validate button to validate the Schema and make sure it has a correct format.

Defining the Parquet Schema

To be able to define a Schema, it is useful to have knowledge about primitives, nested groups, repetition levels, and logical types, as described below:

Error rendering macro 'excerpt-include' : No link could be created for '9.57.1 Overview and Concepts'.

Example Parquet Schema """""""""""""""I SUGGEST MOVING THE SAME SECTION FROM OVERVIEWS AND CONCEPTS INSTEAD""""""""""""""""""""""

Apache Parquet supports a small set of primitives (integer, floating point, boolean, and byte array). These primitives can be extended using logical type annotations, which are modifiers on primitives. For example, the UTF8 annotation is a modifier to byte arrays that denote string data. Parquet also supports structured data through groups and repetitions (i.e., optional, required, repeated).

Example - Parquet Schema

The structured text block shows an example Parquet schema for company employees:

message employee {
  required group id {
    required group name {
      required binary surname (UTF8);
      required binary firstName (UTF8);
      optional binary preferredFirstName (UTF8);
    }
  required int32 employeeNumber;
  }
  optional group phones (LIST) {
    repeated group list {
      required group element {
        required binary type (ENUM);
        required binary phoneNumber (UTF8);
      }
    }
  }
  required binary email (UTF8);
  optional binary manager (UTF8);
  required binary jobTitle (UTF8);
  required group team {
    required binary country (UTF8);
    required binary businessUnit (UTF8);
    required binary function (UTF8);
    optional binary team (UTF8);
    optional binary department (UTF8);
    required binary legalEntity (UTF8);
  }
  optional int32 birthdate (DATE);
}
  • No labels