XML Column Input Node

DataFlux Data Management Studio 2.5: User Guide

XML Column Input Node

An XML Column Input node reads XML from a column in a row of data as an input in a data job. It enables you to augment rows with columns pulled from a column containing XML. For example, you could use this node in conjunction with the XML Column Output node with Web Service node. You could also use column input and output when you need to pull data out of data sources that are stored as XML.

The XML column nodes support reading XML columns and writing XML output. Because XML is traditionally hierarchical, the nodes select certain portions of the XML presented in the XML column input node and present the information as additional columns, and possibly rows, for processing. Once complete, you can output the information back into XML using the XML Column Output node.

Note Note: For Profiles, use the XML Column Input node to extract your data into a text file or database, and then profile that text file or database.

The properties dialog for this node includes the following elements:

Name - Specifies a name for the node.

Notes - Enables you to open the Notes dialog. You use the dialog to enter optional details or any other relevant information for the node.

XML input field - Specifies the name of the incoming column that contains the XML to process. Select one of the following processing options:

Use XQuery from file - Specifies the location of the XQuery file.

Use XMLMap from file - Specifies the location of the XMLMap file. You can use SAS XMLMapper software or you can write a custom XQuery to convert the source XML into a compatible XML map. This node will work with either XMLMap version 1.9 or XMLMap version 2.1. Version 1.9 XMLMaps can be created by the SAS 9.2 SAS XML Mapper. Version 2.1 XMLMaps can be created by the SAS 9.3 XML Mapper. For information about obtaining the SAS XML Mapper, see How can I obtain a copy of the SAS XML Mapper?. For information about essential settings needed to use SAS XML Mapper, see How can I make the SAS XML Mapper work with the XML data job nodes?.

You also need to specify values for the following related fields:

Table Name - Specifies the name of the table to process. If this field is empty, the first table in the XMLMap is processed. If there are duplicate table names, the first occurrence is used.
XQuery output file - Specifies the full path where the generated XQuery file will be saved.

No XQuery or XMLMap - Specifies that neither an XQuery nor an XMLMap file is used.

Additional Options - Enables you to set more optional parameters for the XML Input node. The following options are available:

Maximum number of rows to return - Specifies the maximum number of rows to process. When left _null, all rows are processed. The default is 0.

Number of rows per RowSet - Specifies the number of rows to place in each returned RowSet. The default value is 100.

Maximum number of queued RowSets - Specifies the maximum number of queued RowSets. The default value is 2.

Additional classpaths - Specifies additional Java© classpaths to search.

The Fields section of the properties dialog includes the following elements:

Fields Table - Enables you to define each field in your text file, including field name, field type, and field length. The following field types are available:

STRING - An ordered sequence of text or symbols

INTEGER - A number from the set of integers

REAL - The set of real numbers expressed as decimals

DATE (MDY) - Date in the order of month, date, and year

DATE (DMY) - Date in the order of date, month, and year

DATE (YMD) - Date in the order of year, month, and date

BOOLEAN - True or false

You can click Add to define a new field and use the up and down arrows to change the order of fields. Finally, you can click Delete to remove field definitions.

Import - Displays the Open dialog and enables you to restore a file layout from a .dfl file that you have exported.

Export - Displays the Save As dialog and enables you to store a given file layout that can be shared for parsing other input files. In this case you can set the text qualifier and the delimiter and store the file layout to create. The Export option lets you save the file layout as a .dfl file.

Additional Outputs - Displays the Additional Outputs dialog. This dialog enables you to specify the fields that you can make available to the next node in your data job.

You can also access the following advanced properties by right-clicking the XML Columns Input node in the data job:

Documentation Feedback: yourturn@sas.com
Note: Always include the Doc ID when providing documentation feedback.

Doc ID: dfDMStd_XML_Col_Input.html