QKBPUSH.SH: Reference

Overview

The qkbpush.sh script deploys a SAS Quality Knowledge Base in a Hadoop cluster. When executed without options, the script automatically discovers all nodes in the cluster and deploys the QKB on them. The script also generates an index file from the contents of the QKB and pushes this index file to HDFS. Flags are available that enable you to deploy the QKB on individual nodes or on a subset of nodes instead. Flags are also available to enable you to suppress index creation or perform only index creation.
qkbpush.sh should be run as the root user. It becomes the HDFS user in order to detect the nodes in the cluster. It sets the QKB permissions on the Hadoop nodes so that the QKB is owned by the default user name for the SAS Embedded Process. Flags are available to specify the HDFS and SAS Embedded Process user names if user names other than the defaults were configured.
To simplify maintenance, the QKB directory is copied to a fixed location (/opt/qkb/default) on each node. The QKB index file is created in the /sas/qkb directory in HDFS.

Syntax

qkbpush <options> qkb_path

Required Arguments

qkb_path

specifies the path to the source QKB directory.

Optional Arguments

Authentication Options

-s hdfs-user

specifies the user name to associate with HDFS, when the default user name (hdfs) is not used.

-g ep-group

specifies the group name to associate with the SAS Embedded Process, when the default group name (sasep) is not used.

-u ep-user

specifies the user name to associate with the SAS Embedded Process, when the default user name (sasep) is not used.

QKB Index Options

-i

creates and pushes the QKB index only.

-x

suppresses QKB index creation.

Subsetting Options

-h hostname

specifies the host name or IP address of the computer on which to perform the deployment.

-f hostfile

specifies the name of a file that contains a list of the host names or IP addresses on which to perform the deployment.

General Options

-?

prints usage information.

-l logfile

directs status information to a log file, instead of to standard output.

-r

removes the QKB from the Hadoop nodes and the QKB index file from HDFS.

-v

specifies verbose output.