In the directive Sort
and De-Duplicate Data in Hadoop, in the Columns task,
you use the Advanced Editor to add or edit
user-written Hive expressions. The expressions are run by the Sort
and De-Duplicate job to add data to new target columns. The Advanced
Editor enables you to insert column names and Hive function
syntax into your expressions.
Follow these steps to
use the Advanced Editor:
-
As needed in the
Columns task,
click
or
to open the
Advanced Editor.
-
In the
Advanced
Editor, in the
Column Name field,
enter a name for a new column or rename an existing column. The fields
Column
type and
Column length describe
the selected column.
-
To build an expression,
you can start by pasting Hive code from your clipboard. To edit or
build your expression, click the column names and functions in the
Resources box.
Tip
When you select a function,
syntax help is displayed at the bottom of the
Resources box.
-
To save your expression
and return to the
Columns task, click
Save.
To save and create another new column and expression, click
Save
and New. In the
Columns task,
new columns are displayed at the bottom of the
Selected
Columns box.