To decode or encode XML data, a format definition (an XML Schema syntax) is included in Ultra xml_schema block. Code Block |
---|
| xml_schema {
<?xml version="1.0" encoding="ISO-8859-1"?>
<schema xmlns = " http://www.w3.org/2001/XMLSchema ">
<element name="TRANSACTION_LOG">
<complexType>
<sequence>
<element ref="TRANSACTION" maxOccurs="unbounded"/>
</sequence>
</complexType>
</element>
<element name="TRANSACTION">
<complexType>
<attribute name="TXID" type="string" use="required"/>
<sequence>
<element name="USER" type="string" minOccurs="1" maxOccurs="1"/>
<element name="IP" type="string" minOccurs="1" maxOccurs="1"/>
<element name="ITEM" type="string" minOccurs="1" maxOccurs="1"/>
<element name="VALUE" type="long" minOccurs="1" maxOccurs="1"/>
<element name="TIMESTAMP" type="dateTime" minOccurs="1" maxOccurs="1"/>
<element name="CURRENCY" type="string" minOccurs="1" maxOccurs="1"/>
<element name="MISC" type="string" minOccurs="0" maxOccurs="unbounded"/>
</sequence>
</complexType>
</element>
</schema>
}; |
Collected XML UDRs are often terminated by one or several whitespace characters. When mapping, the whitespace temporary record is identified. Although input data that includes trailing whitespace characters is valid in XML, it is recommended that you eliminate them when decoding the data. To remove any excessive white spaces from the XML UDRs, the external format WhiteSpace is used with the in_map of the deocder. For further information see 11. In-maps and 12. Decoders. Code Block |
---|
| external WhiteSpace : identified_by( value == 0x20 || value == 0xA || value == 0xD ) {
int value: static_size(1);
}; |
The internal and external formats can be mapped automatically but are mapped explicitly in the example. This is to demonstrate how the interpretation of XML types works. Code Block |
---|
| internal TransactionLog {
list<Transaction> Transactions;
};
internal Transaction {
string TxId;
string User;
string IP;
string Item;
long Value;
date Timestamp;
string Currency;
list<string> Misc;
};
in_map inTransactionLog: external(TRANSACTION_LOG), internal(TransactionLog) {
i:Transactions and e:TRANSACTION using in_map inTransaction;
};
in_map inTransaction: external(TRANSACTION), internal(Transaction) {
i:TxId and e:TXID;
i:User and e:USER;
i:IP and e:IP;
i:Item and e:ITEM;
i:Value and e:VALUE;
i:Timestamp and e:TIMESTAMP;
i:Currency and e:CURRENCY;
i:Misc and e:MISC;
};
out_map outTransactionLog: external(TRANSACTION_LOG), internal(TransactionLog) {
i:Transactions and e:TRANSACTION using out_map outTransaction;
};
out_map outTransaction: external(TRANSACTION), internal(Transaction) {
i:TxId and e:TXID;
i:User and e:USER;
i:IP and e:IP;
i:Item and e:ITEM;
i:Value and e:VALUE;
i:Timestamp and e:TIMESTAMP;
i:Currency and e:CURRENCY;
i:Misc and e:MISC;
}; |
To get rid of white spaces, create a <literal>in_map</literal> for the external format WhiteSpace using the discard_output option. For further information, see the 11. In-maps. Code Block |
---|
| in_map WS_map: external(WhiteSpace), discard_output {automatic;}; |
To remove as many white spaces as possible from the processed data, WS_map is set first in the deocder. Code Block |
---|
| encoder encTransactionLog: out_map(outTransactionLog);
decoder decTransactionLog: in_map(WS_map), in_map(inTransactionLog); |
|