This course describes the functionality of SAS Text Miner software, which is a separately licensed component that is available for SAS Enterprise Miner. In this course, you learn to use SAS Text Miner to uncover underlying themes or concepts contained in large document collections, automatically group documents into topical clusters, classify documents into predefined categories, and integrate text data with structured data to enrich predictive modeling endeavors.
This course can help prepare you for the following certification exam(s): SAS Text Analytics, Time Series, Experimentation and Optimization.
Learn how to
- convert documents stored in standard formats (Microsoft Word, Adobe PDF, and so on) into general purpose HTML or TXT formats
- read documents from a variety of sources (web pages, flat files, data elements in a relational database, spreadsheet cells, and so on) into SAS tables
- process textual data for text mining (for example, correct misspellings or recode acronyms and abbreviations)
- convert unstructured text-based character data into structured numeric data
- explore words and phrases in a document collection
- query document collections using keywords (that is, identify documents having specific words or phrases)
- identify topics or concepts that appear in a document collection
- create user-influenced topic tables from scratch or by modifying machine generated topics or concepts using domain knowledge
- use derived topic tables or pre-existing user-influenced topic tables (or both) to enhance information retrieval and document classification
- cluster documents into homogeneous subgroups
- classify documents into predefined categories.
Who should attend
Statisticians, business analysts, and market researchers who incorporate free-format textual information in their analyses; managers of large document collections who must organize and select documents using data mining; and students of data mining who want to learn about text mining
Before attending this course, you should.
- be acquainted with Microsoft Windows and Windows-based software
- have at least an introductory-level familiarity with basic statistics and regression modeling.
Previous SAS software experience, especially SAS Enterprise Miner experience, is helpful but not required.
This course addresses SAS Text Miner software.This course uses SAS Text Miner 14.1 and SAS Enterprise Miner 14.1.