Text¶
A large amount of information is available in the form of text. For example, tweets, emails, survey responses, product reviews and so forth contain information that is written in natural language.
The goal of working with text is to convert it into data that can be useful for analysis. Some applications of text analysis include: sentiment analysis, named entity recognition, summarization, and so forth.
The following table lists the plugins currently available for working with text data.
Note
Support level: These plugins are not supported / Tier 2 supported features
Plugin |
Description |
Language coverage |
---|---|---|
Detect languages, correct misspellings and clean text data using open source libraries |
||
Analyze text data with ontology tagging |
||
Estimate sentiment polarity (positive/negative) of text data using open source models |
English |
|
Automatically summarize text data using open source algorithms to extract sentences |
Language-agnostic |
|
Extract information on named entities (people, dates, places, etc.) from text data using open source models |
||
Convert speech to text offline using open-source components |
English |
|
Use the Amazon Transcribe API to convert speech to text |
||
Compute numerical sentence representations for use as feature vectors in a Machine Learning model or for similarity search, using open source models |
English |
|
Use the Amazon Comprehend API for language detection, sentiment analysis, named entity recognition and key phrase extraction |
||
Use the Amazon Comprehend Medical API for Protected Health Information extraction and medical entity recognition |
English |
|
Use the Azure Cognitive Services – Text Analytics API for language detection, sentiment analysis, named entity recognition and key phrase extraction |
||
Use the Crowlingo Multilingual NLP API for language detection, sentiment analysis, summarization and multiple other tasks |
||
Use the Google Cloud NLP API for sentiment analysis, named entity recognition and text classification |
||
Use the Google Cloud Translation API to translate text |
||
Use the Amazon Translation API to translate text |
||
Use the Azure Translation API to translate text |
||
Use the DeepL Translation API to translate text |
||
Translate text offline using open-source components |
||
Use the MeaningCloud API for language detection, sentiment analysis, topic extraction, summarization and text classification |
||
Use the OpenAI API to perform tasks expressed in natural language, such as Zero-shot Classification or Q&A |
English |
|
Perform Optical Character Recognition (OCR) offline using the Tesseract engine |