ngramdb provides distributed storage and querying of n-grams (or bags of words) organized under contexts. Its REST interface supports inserting n-grams, running “starts with” and “top” queries, and calculating similarity metrics between contexts. Apache Ignite provides the distributed, highly available persistence and powers the queries.
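To make the data model concrete, here is a minimal in-memory sketch of the operations described above: n-gram counts grouped by context, a "starts with" query, a "top" query, and one possible similarity metric. It is only an illustration and is not ngramdb's implementation or its REST API. The class name `NgramStore`, its method names, and the choice of cosine similarity are all assumptions made for this example.

```python
from collections import Counter, defaultdict
from math import sqrt


class NgramStore:
    """Hypothetical in-memory stand-in: n-gram counts grouped by context name."""

    def __init__(self):
        # context name -> Counter mapping each n-gram to its count
        self._contexts = defaultdict(Counter)

    def insert(self, context, ngram, count=1):
        """Add `count` occurrences of `ngram` to `context`."""
        self._contexts[context][ngram] += count

    def starts_with(self, context, prefix):
        """Return (ngram, count) pairs in `context` whose n-gram begins with `prefix`,
        sorted alphabetically."""
        counts = self._contexts.get(context, Counter())
        return sorted((g, c) for g, c in counts.items() if g.startswith(prefix))

    def top(self, context, k):
        """Return the `k` most frequent (ngram, count) pairs in `context`."""
        return self._contexts.get(context, Counter()).most_common(k)

    def cosine_similarity(self, a, b):
        """Cosine similarity of two contexts' count vectors (assumed metric).
        Returns 0.0 if either context is empty or unknown."""
        ca = self._contexts.get(a, Counter())
        cb = self._contexts.get(b, Counter())
        dot = sum(ca[g] * cb[g] for g in ca.keys() & cb.keys())
        norm = sqrt(sum(v * v for v in ca.values())) * sqrt(sum(v * v for v in cb.values()))
        return dot / norm if norm else 0.0


store = NgramStore()
store.insert("doc1", "new york", 3)
store.insert("doc1", "new jersey", 1)
store.insert("doc1", "old town", 2)
store.insert("doc2", "new york", 3)

prefix_matches = store.starts_with("doc1", "new")
most_frequent = store.top("doc1", 2)
similarity = store.cosine_similarity("doc1", "doc2")
```

In ngramdb itself these operations are exposed over REST and backed by Apache Ignite rather than a local dictionary, so the sketch only conveys the shape of the queries, not how they are distributed or persisted.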
ngramdb is experimental, and significant changes are likely. We welcome your feedback and input on its future capabilities.
ngramdb is open source under the Apache License, version 2.0.