Machine Translation
Machine translation is one of the most important applications of natural language processing. Translation comprises interpreting the meaning of a text in one language (the source text) and producing a new, equivalent text in another language (the target text). Traditionally, translation has been a human activity, although attempts have been made to automate the translation of natural-language texts (machine translation) or to use computers as an aid to human translators (computer-assisted translation).
Information Retrieval
Information retrieval (IR) is the science of searching for information in documents, for documents themselves, for metadata that describe documents, or within databases, whether stand-alone relational databases or hypertext-networked databases such as the Internet or intranets. The items retrieved may be text, sound, images or data.
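As a rough illustration of searching for documents, the following minimal sketch builds an inverted index over a tiny in-memory collection and answers boolean AND queries. The function names (`tokenize`, `build_index`, `search`) and the sample documents are hypothetical, chosen only for this example. Real IR systems add ranking, stemming and much more.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase the text and split it into alphanumeric word tokens."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_index(docs):
    """Map each token to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for token in tokenize(text):
            index[token].add(doc_id)
    return index


def search(index, query):
    """Return the ids of documents that contain every query token (boolean AND)."""
    tokens = tokenize(query)
    if not tokens:
        return set()
    result = set(index.get(tokens[0], set()))
    for token in tokens[1:]:
        result &= index.get(token, set())
    return result


# Hypothetical sample collection, for illustration only.
docs = {
    "d1": "Machine translation of natural-language texts",
    "d2": "Searching for documents in relational databases",
    "d3": "Searching hypertext databases such as intranets",
}
index = build_index(docs)
print(sorted(search(index, "searching databases")))
```

The index is queried with `index.get` rather than `index[...]` so that looking up an unknown term does not add an empty entry to the index.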
Information Extraction
Information extraction (IE) is a type of information retrieval whose goal is to automatically extract structured or semi-structured information from unstructured machine-readable documents. A typical example is the extraction of information on corporate merger events, whereby instances of the relation “MERGE(company1, company2, date)” are extracted from online news (“Yesterday, New York-based Foo Inc. announced their acquisition of Bar Corp.”).
A typical subtask of IE is named entity recognition (NER): the recognition of entity names (for people and organizations), place names, temporal expressions, and certain types of numerical expressions (currency amounts…).
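To make the merger example more concrete, here is a minimal pattern-based sketch that extracts a (company1, company2, date) triple from a sentence modeled on the news example above. The regular expressions, the `extract_merge` name, and the assumption that company names end in "Inc." or "Corp." are all illustrative simplifications. Practical IE systems typically rely on trained named entity recognizers rather than hand-written patterns, and they would normalize a relative expression such as "Yesterday" into a calendar date.

```python
import re

# Assumption for this sketch: a company name is one or more capitalized words
# followed by "Inc." or "Corp.".
COMPANY = r"[A-Z]\w*(?: [A-Z]\w*)* (?:Inc|Corp)\."

MERGE_PATTERN = re.compile(
    rf"(?P<acquirer>{COMPANY}) announced (?:its|their) acquisition of (?P<target>{COMPANY})"
)

# Only two relative temporal expressions are recognized here.
DATE_PATTERN = re.compile(r"\b(?:yesterday|today)\b", re.IGNORECASE)


def extract_merge(sentence):
    """Return (company1, company2, date) for a matching sentence, else None.

    The date is returned as the raw temporal expression found in the
    sentence, or None when no such expression is present.
    """
    match = MERGE_PATTERN.search(sentence)
    if match is None:
        return None
    date = DATE_PATTERN.search(sentence)
    return (
        match.group("acquirer"),
        match.group("target"),
        date.group(0) if date else None,
    )


sentence = "Yesterday, New York-based Foo Inc. announced their acquisition of Bar Corp."
print(extract_merge(sentence))
```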
Automatic Summarization
Automatic summarization is the creation of a shortened version of a text by a computer program. The product of this procedure still contains the most important points of the original text.
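One simple way to produce such a shortened version is extractive summarization: score each sentence and keep the highest-scoring ones in their original order. The sketch below scores a sentence by the average frequency of its non-stopword tokens. The stopword list, the sentence splitter and the `summarize` function are assumptions made for illustration, not a reference method.

```python
import re
from collections import Counter

# A deliberately tiny stopword list, assumed for this sketch.
STOPWORDS = {"the", "a", "an", "of", "is", "in", "to", "and", "by", "this", "it"}


def content_words(text):
    """Return the lowercase alphabetic tokens of text that are not stopwords."""
    return [w for w in re.findall(r"[a-z]+", text.lower()) if w not in STOPWORDS]


def split_sentences(text):
    """Naively split text after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def summarize(text, max_sentences=1):
    """Keep the max_sentences highest-scoring sentences, in original order."""
    sentences = split_sentences(text)
    freq = Counter(content_words(text))

    def score(sentence):
        tokens = content_words(sentence)
        if not tokens:
            return 0.0
        # Average word frequency, so long sentences are not favored by length alone.
        return sum(freq[w] for w in tokens) / len(tokens)

    ranked = sorted(range(len(sentences)), key=lambda i: score(sentences[i]), reverse=True)
    keep = sorted(ranked[:max_sentences])
    return " ".join(sentences[i] for i in keep)


text = "Cats sleep. Cats chase mice and cats purr. Dogs bark."
print(summarize(text, max_sentences=2))
```

Averaging the word frequencies is a design choice: summing them instead would tend to select the longest sentences regardless of how central their words are.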
Speech Recognition
Speech recognition is the process of converting a speech signal (i.e. voice) into a sequence of words by means of an algorithm implemented as a computer program.