We’re always happy with any other questions you might have. Send us an email at [email protected]
Integrate Tisane Entity Extraction into AWS S3 Storage
Top companies trust Datastreamer to integrate, enrich, join, and apply their web data needs.
About Tisane Entity Extraction
Detect mentions of people, organizations, locations, filenames, phone numbers, crypto addresses, and more.
Entities are elements of relevance or interest in the text. Tisane extracts both standard entities and those relevant to trust & safety/law enforcement applications.
Standard entities are names of people, their social roles, organizations, places, and so on. We also extract cryptocurrency addresses, bank accounts, credit card numbers, phone numbers, software package names, and more.
Every entity entry is an object made of:
type - the type of the entity
name - a standard name, if exists; otherwise, the string that was logged
subtypes - more detailed additional types
subtype - the first subtype (for backward compatibility purposes)
mentions - an array of all detected mentions, with:
offset
length
sentence_index
text
wikidata - a Wikidata ID, if exists
About AWS S3 Storage
Connect your data pipeline to AWS S3 storage for direct data delivery and archiving.
Quickly connect Tisane Entity Extraction and AWS S3 Storage with a Datstreamer Pipeline.
Step 1
Start your Pipeline with Tisane Entity Extraction
Web data is the starting point for any pipeline. You can use any number of data sources to power your Pipelines. You can use web data from our partner network, your own systems, or any web data.
Step 2
Transform, and then add AWS S3 Storage
Want to do more with your web data? Datastreamer offers hundreds of ready-to-use operations—filter, join, enrich, search, and more—to help you process and transform data at speed.
Step 3
That's it! You have just connected Tisane Entity Extraction and AWS S3 Storage
With Datastreamer, web data integration is effortless. Add new capabilities to your Pipelines dynamically and solve the operational issues that used to complicate your process.