Built with advanced search engine technology by Openindex.
Openindex is all about respecting privacy. All data collected is stored anonymously. No data is sold or provided to third parties.
Extract numerous entities from your text
Automatically finds people, brands, locations, time stamps and much more.
Works on any website and text source
It doesn’t matter what technology is behind the websites or data sources, as long as the output is readable by our parser.
Also works on separate source files
Input can be any accessible website, but also text files, PDFs etc.
Support for multiple languages
We currently support seven languages for EntitySearch (no, de, fr, en, da, es, nl), but more additional language models are being built. It supports about forty different entity types and integrates seamlessly with our parser.
Try the online demo below
Enter a URL or text in the text box and see what information is extracted.
EntitySearch uses the following techniques
Apache OpenNLP is a machine learning based toolkit for the processing of natural language text.
Solr is the popular, blazing-fast, open source enterprise search platform built on Apache Lucene™. Openindex has its own highly customized and optimized Solr instance that server as the base of our platform.
A perceptron is a neural network in which the neurons are connected in different layers. A first layer consists of input neurons, where the input signals are applied.
The principle of maximum entropy states that the probability distribution that best reflects the current state of knowledge is the one with the greatest entropy, in the context of accurately stated previous data
In corpus linguistics, part-of-speech tagging, also known as grammatical tagging, is the marking of a word in a text corresponding to a particular word portion, based on both the definition and the context.