NLP Building Blocks
These NLP applications are building blocks for larger NLP pipelines and systems. They provide functions integral to text processing such as text language detection, sentence extraction, string tokenization, document classification, and entity extraction. They are implemented as microservices and are stateless, scalable, and each features a REST-like API.
Idyl E3 Entity Extraction Engine
Idyl E3 Entity Extraction Engine is extracts named-entities from natural language text. Idyl E3 supports user-created entity models, has a REST-like API, and can be used in an air-gapped network.
Prose Sentence Detection Engine
Prose Sentence Extraction Engine breaks input text into its individual sentences based on rules or trained models. Many NLP applications operate on individual sentences and Prose provides an efficient means of extracting sentences from input text.
Renku Language Detection Engine
Renku Language Detection Engine detects the language of input natural language text. Capable of identifying over 100 languages, Renku is designed to be lightweight and efficient. Use Renku in your NLP pipelines to route text based on the text’s language.
Sonnet Tokenization Engine
Sonnet Tokenization Engine breaks input text into its individual tokens based on rules or trained models. Many NLP applications require tokenized text as input and Sonnet provides an easy to use means to tokenizing text.
Verso Text Preprocessing Engine
Verso Text Preprocessing Engine performs text preprocessing for NLP pipelines. Use Verso to quickly and easily get text in the proper format for analysis. Remove special characters, punctuation, digits, stem, and more.