Text and Language Processing in Mathematica 11

Mathematica Version 11 enhances the string, text, and natural language processing framework, providing new and more powerful functionality for symbolic manipulation and analysis of texts. New functions are available to identify and extract structured data from unstructured text, explore historical word frequency data, and parse natural language input.

Key Features

  • New string operations and patterns make string processing easier and more efficient.
  • New normalization functions allow preprocessing of capitalization, diacritics, stopwords, and more.
  • Work with different alphabets, taking into account special character properties.
  • Access properties of words such as definitions, stems, or parts of speech.
  • Find historical word frequency data from multilingual corpora of texts.
  • Segment texts programmatically into words or sentences.
  • Find the grammatical structure of a sentence and display it as a dependency graph, a constituent tree, and more.
  • Write flexible natural language parsers to translate from text to actions.
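
As a rough illustration of how several of these features combine, the following Wolfram Language sketch normalizes a short text, segments it, removes stopwords, and stems the remaining words. The sample sentence and the variable names are assumptions made for this example and do not come from the Wolfram documentation; the exact tokens and stems returned may vary between versions.

```wolfram
(* Illustrative sample text; an assumption for this sketch *)
text = "The café owners were running late. They sold the pastries quickly.";

(* Normalization: strip diacritics, then lowercase *)
normalized = ToLowerCase[RemoveDiacritics[text]];

(* Segmentation by sentences and by words *)
sentences = TextSentences[text];
words = TextWords[normalized];

(* Remove stopwords, then reduce the remaining words to their stems *)
contentWords = DeleteStopwords[words];
stems = WordStem /@ contentWords;

(* Position of a letter in the default (English) alphabet *)
LetterNumber["c"]      (* 3 *)

(* Simple palindrome check on a string *)
PalindromeQ["level"]   (* True *)
```

Evaluating sentences, contentWords, or stems individually shows the intermediate results of each step.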

Related Examples

Flexible String Processing

String Decomposition

Palindromes

Frequencies of Letters vs. First Letters

Positions of Letters in Alphabets

Learn about Characters

Computation with Multilingual Word Lists

Zipf's Law

Definitions and Parts of Speech

Generate and Verify Stemmed Words

Sherlock vs. Watson

Word Frequency over Time

Frequency of Common Nouns in Speeches

Find Country Entities in Texts

Explore the Structure of Texts

Compare the Structure of Sentences

Define Grammar Rules

Create a Natural Language Calculator

Create a Flexible Calculator Using a Context-Free Grammar

Query the Wolfram Knowledgebase Using Natural Language