Problem Note 52298: The HP Text Miner node does not process the created multiterm.txt file
In SAS® Text Miner, the HP Text Miner node generates a multiterm.txt file using UTF-8 encoding. However, the node does not process the generated file, so no multi-word terms appear in the resulting terms table.
To work around the problem, add the following line of code to the SAS Enterprise Miner Project Start Code:
options nobomfile;
This option enables creation of the multiterm file without a byte-order mark, and the multi-word terms are then output in the resulting terms table.
Note: The option does not take effect until you either force the existing HP Text Miner node to rerun or replace the existing HP Text Miner node with a new one.
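As a minimal sketch of the workaround, the Project Start Code could set the option and then print its current value to the SAS log, so you can confirm the setting before you rerun the node. The PROC OPTIONS step is an optional check added here for illustration; it is not part of the original workaround.

```sas
/* Project Start Code: write external files without a byte-order mark */
options nobomfile;

/* Optional check (an addition for illustration): print the current    */
/* BOMFILE/NOBOMFILE setting to the SAS log                            */
proc options option=bomfile;
run;
```

If the log still reports BOMFILE, the start code has not run in the current session.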
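Operating System and Release Information
Product Family: SAS System | Product: SAS Text Miner |
System | Product Release: Reported | Product Release: Fixed* | SAS Release: Reported | SAS Release: Fixed* |
Microsoft® Windows® for x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |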
Microsoft Windows 8 Enterprise x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows 8 Pro x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows 8.1 Enterprise 32-bit | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows 8.1 Enterprise x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows 8.1 Pro | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows 8.1 Pro 32-bit | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2008 R2 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2008 for x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2012 Datacenter | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2012 R2 Datacenter | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2012 R2 Std | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Microsoft Windows Server 2012 Std | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Windows 7 Enterprise x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Windows 7 Professional x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
64-bit Enabled AIX | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
64-bit Enabled Solaris | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
HP-UX IPF | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Linux for x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
Solaris for x64 | 12.3 | 13.1 | 9.4 TS1M0 | 9.4 TS1M1 |
* For software releases that are not yet generally available, the Fixed Release is the software release in which the problem is planned to be fixed.
Type: | Problem Note |
Priority: | high |
Topic: | Analytics ==> Data Mining Analytics ==> Text Mining |
Date Modified: | 2014-03-14 08:02:46 |
Date Created: | 2014-02-10 22:42:06 |