The %TMPUNC Macro

The %TMPUNC macro removes unwanted punctuation from terms in your document collection. If your documents contain terms with run-on punctuation, such as **people or +bags, these punctuation characters become part of the terms when you parse documents in SAS Text Miner. The %TMPUNC macro enables you to convert terms with run-on punctuation by putting spaces before and after the punctuation characters to prevent them from appearing as part of the term. Without the %TMPUNC macro, these two examples would parse out as two terms, **people and +bags. After running the %TMPUNC macro, they would parse as five terms:
  • *
  • *
  • people
  • +
  • bags
See Using the %TEXTSYN and %TMPUNC Macros in the Text Miner node documentation in SAS Text Miner for more information.