Glossary :: SAS(R) Visual Analytics 5.2: User's Guide

Apache Hadoop

a framework that allows for the distributed processing of large data sets across clusters of computers using a simple programming model.

bar chart

a chart that consists of a grid and some vertical or horizontal columns (bars). Each column represents quantitative data.

box plot

See box-and-whisker plot

box-and-whisker plot

a graphical display of five statistics (the minimum, lower quartile, median, upper quartile, and maximum) that summarize the distribution of a set of data. The lower quartile (25th percentile) is represented by the lower edge of the box, and the upper quartile (75th percentile) is represented by the upper edge of the box. The median (50th percentile) is represented by a central line that divides the box into sections. The extreme values are represented by whiskers that extend out from the edges of the box.

capability

an application feature that is under role-based management. Typically, a capability corresponds to a menu item or button. For example, a Report Creation capability might correspond to a New Report menu item in a reporting application. Capabilities are assigned to roles.

crosstab

See crosstabulation table

crosstabulation table

a two-dimensional table that shows frequency distributions or other aggregate statistics for the intersections of two or more category data items. In a crosstabulation table, categories are displayed on both the columns and rows, and each cell value represents the data result from the intersection of the categories on the specific row and column.

data item

an item in a data source that is either a logical view of a data field or a calculation. The author of a report decides which data items to use in a particular section of a report. There are three types of data items: hierarchies, categories, and measures.

data source

a table, view, or file from which you will extract information. Sources can be in any format that SAS can access, on any supported hardware platform. The metadata for a source is typically an input to a job.

filter

specified criteria that are applied to data in order to identify the subset of data for a subsequent operation, such as continued processing.

heat map

a graphical representation of data where the values taken by a variable in a two-dimensional map are represented as colors.

list table

a two-dimensional representation of data, in which the data values are arranged in rows and columns.

pie chart

a circular chart that is divided into slices by radial lines. Each slice represents the relative contribution of each part to the whole.

progressive chart

See waterfall chart

report

output that is generated by running custom SAS code against the data in your project.

role

a set of capabilities. In some applications, certain actions are available only to users or groups that have a particular role.

scatter plot

a two- or three-dimensional plot that shows the joint variation of two (or three) variables from a group of table rows. The coordinates of each point in the plot correspond to the data values for a single table row (observation).

scatter plot matrix

a grid of scatter plots showing pairwise combinations of multiple numeric variables.

source

See data source

time series

an ordered sequence of values of a variable that are observed at equally spaced time intervals.

treemap

a graph that represents the relative values of data by using rectangular areas. The color of each area represents the value of one measure in the query. The size of each area represents the value of the another measure in the query. (In SAS/GRAPH a treemap is created with the GTILE procedure.)

user role

See role

visual exploration

a metadata object that contains visualizations and data settings that are saved from a session of the SAS Visual Analytics explorer.

visualization

an interactive visual representation of data. A visualization can be a table, a chart, or a geographic map.

waterfall chart

a form of data visualization that is used to understand or explain the cumulative effect on an initial value of sequentially introduced positive or negative values. Usually the initial and the final values are represented by whole columns, while the intermediate values are denoted by floating columns.