What's New |
SAS Text Miner includes the following new features and enhancements:
Three new nodes have been added in SAS Text Miner:
In SAS Text Miner, you have additional options for controlling how your text is mined by using nodes that focus on a particular text mining step.
Note: You can still use the Text Miner node if you want to collapse all of the text mining steps into a single node in your process flow diagram.
The Text Parsing node enables you to parse a document collection in order to quantify information about the terms that are contained therein. The Text Parsing node provides a standard parsing facility, and enables you to import custom entities as defined in SAS Content Categorization.
In addition to new functionality, the Text Parsing node offers improved parsing performance. By using a Text Parsing node in a process flow diagram, you now need only to parse a document collection one time. This can lead to performance improvements beyond what you can obtain with the Text Miner node. For example, modifications to filtering would require the Text Miner node to reparse all the documents again. Similarly, the Text Topic node does not reparse the document collection.
The Text Filter node enables you to reduce the total number of parsed terms or documents that are analyzed so that you can eliminate extraneous information from your analysis. The Text Filter node enables you to perform spell checking, do full text searches using integrated Teragram search capabilities, view and analyze results with concept linking, and conduct subsetting management of terms and documents.
The Text Topic node enables you to combine terms into topics or provide your own topics that you want to analyze. With the Text Topic node, you can manage topics by mining for multiple topics per document, automatically creating single and multi-word topics, editing automatically generated topics, and defining your own topics. The Interactive Topic Viewer enables you to manage your topics. Results from the Text Topic node provide charts and tables that enable you to analyze—for example—the number of documents by topics and the number of terms by topics.
SAS Text Miner now supports the Solaris for x64 server platform.
In addition to the languages supported in previous releases (Chinese, English, French, German, Italian, Portuguese, and Spanish), SAS Text Miner 4.2 also supports these languages: Arabic, Dutch, Japanese, Korean, Polish, and Swedish. Entity parsing is available for all supported languages.
You can use SAS Concept Creation for SAS Text Miner to help you create custom entities.
Copyright © 2010 by SAS Institute Inc., Cary, NC, USA. All rights reserved.