The SIMILARITY Procedure

OUTSEQUENCE= Data Set

The OUTSEQUENCE= data set records the input and target sequences that are associated with each INPUT and TARGET statement variable. This data set records the input and target sequence values for each slide index and for each warp index that is associated with the slide index. The sequence values that are recorded are normalized and scaled based on the NORMALIZE= and SCALE= options. This data set also contains the similarity measure associated with the two sequences.

The OUTSEQUENCE= data set contains the variables specified in the BY statement in addition to the following variables:

_INPUT_

input variable name

_TARGET_

target variable name

_TIMEID_

time ID values

_SLIDE_

slide index

_WARP_

warp index

_INPSEQ_

input sequence values

_TARSEQ_

target sequence values

_SIM_

similarity measure

_STATUS_

sequence status

The sorting of the OUTSEQUENCE= data set depends on the SORTNAMES and the ORDER= option.

The OUTSEQUENCE= data set is ordered by the variables _INPUT_, then _TARGET_, then _TIMEID_ when ORDER=INPUTTARGET or ORDER=INPUT. The OUTSEQUENCE= data set is ordered by the variables _TARGET_, then _INPUT_, then _TIMEID_ when ORDER=TARGETINPUT or ORDER=TARGET.

If there are a large number of slides or warps or both, this data set might be large.