Text and Language Processing in Mathematica 11

Mathematica Version 11 enhances the string, text, and natural language processing framework, providing new and more powerful functionality for symbolic manipulation and analysis of texts. New functions are available to identify and extract structured data from unstructured text, explore historical word frequency data, and parse natural language input.

  • New string operations and patterns make string processing easier and more efficient.
  • New normalization functions allow preprocessing of capitalization, diacritics, stopwords, and more.
  • Work with different alphabets, taking into account special character properties.
  • Access properties of words like definitions, stems, or parts of speech.
  • Find historical word frequency data from multilingual corpora of texts.
  • Segment texts programmatically by words or sentences.
  • Find the grammatical structure of a sentence and display it as a dependency graph, a constituent tree, and more.
  • Write flexible natural language parsers to translate from text to actions.
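
As a rough illustration of several of the items above, the following Wolfram Language sketch chains normalization, segmentation, stemming, and structure analysis. The sample sentences are hypothetical and do not come from the original page, and the function choices (RemoveDiacritics, DeleteStopwords, TextSentences, TextWords, WordStem, TextStructure) are one plausible mapping of the listed features to built-in functions, not a definitive one. Outputs are left unstated because they depend on the installed language data.

```wolfram
(* Hypothetical sample text, chosen only for illustration *)
text = "The café reopened in Zürich. Visitors were running to see it.";

(* Normalization: lowercase, strip diacritics, then drop stopwords *)
normalized = DeleteStopwords[RemoveDiacritics[ToLowerCase[text]]]

(* Segmentation: split the original text into sentences and into words *)
sentences = TextSentences[text]
words = TextWords[text]

(* Word properties: reduce an inflected form to its stem *)
WordStem["running"]

(* Grammatical structure of a single sentence, in the default display form *)
TextStructure["The cat sat on the mat."]
```

Each step takes a plain string and can be evaluated on its own, so any one of them can be tried in isolation before being combined into a longer pipeline.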

