Datastreamer lets you connect Tisane Entity Extraction with thousands of the most popular capabilities, so you can accelerate working with web data and focus on your product – no code required.
Working with web data is resource-intensive, slow, and distracting from your product. Companies using Datastreamer are able to accelerate how they work with web data, by using Pipelines to power their workflows.
Pipelines created in the Datastreamer platform simplify how you work with web data, making it faster to ingest, enrich, and deliver insights. Remove complexity from your web data workflows, reduce distractions from your products, and scale effortlessly.
Entities are elements of relevance or interest in the text. Tisane extracts both standard entities and those relevant to trust & safety/law enforcement applications.
Standard entities are names of people, their social roles, organizations, places, and so on. We also extract cryptocurrency addresses, bank accounts, credit card numbers, phone numbers, software package names, and more.
Every entity entry is an object made of:
type
- the type of the entityname
- a standard name, if exists; otherwise, the string that was loggedsubtypes
- more detailed additional typessubtype
- the first subtype (for backward compatibility purposes)mentions
- an array of all detected mentions, with:
offset
length
sentence_index
text
wikidata
- a Wikidata ID, if existsAdd Datastreamer components to your data stack and explore its full capabilities
We’re always happy with any other questions you might have. Send us an email at [email protected]