Experience Seamless Data Integration Yourself

Add Datastreamer components to your data stack and explore its full capabilities

Try it Now

Questions?

We’re always happy with any other questions you might have. Send us an email at [email protected]

Enrich Socialgist Blogs with Tisane Entity Extraction

Top companies trust Datastreamer to integrate, enrich, join, and apply their web data needs.

About Socialgist Blogs

Aggregated content from over 2,000 Chinese blogs and over 200,000 diverse English blogs, capturing the pulse of conversations. From niche interest to mainstream topics, Socialgist dataset provides a window into the vast array of perspectives, trend and insights that blog uniquely offer. Harness this rich resource for nuanced understanding of public sentiment, emerging trends, and influential bloggers in various domains.

About Tisane Entity Extraction

Detect mentions of people, organizations, locations, filenames, phone numbers, crypto addresses, and more.

Entities are elements of relevance or interest in the text. Tisane extracts both standard entities and those relevant to trust & safety/law enforcement applications.

Standard entities are names of people, their social roles, organizations, places, and so on. We also extract cryptocurrency addresses, bank accounts, credit card numbers, phone numbers, software package names, and more.

Every entity entry is an object made of:

  • type - the type of the entity
  • name - a standard name, if exists; otherwise, the string that was logged
  • subtypes - more detailed additional types
  • subtype - the first subtype (for backward compatibility purposes)
  • mentions - an array of all detected mentions, with:
    • offset
    • length
    • sentence_index
    • text
  • wikidata - a Wikidata ID, if exists

How Datastreamer works

Quickly enrich Socialgist Blogs with Tisane Entity Extraction with a Datstreamer Pipeline.

Step 1

Start your Pipeline with Socialgist Blogs

For robust enterprise data integration, web data acts as a foundational source. It can be drawn from a variety of channels—including third-party partners, internal applications, and public web repositories.

Step 2

Add Tisane Entity Extraction to enrich

Datastreamer lets you accelerate your data usage by applying operations like structuring, enriching, joining, and filtering—choose from hundreds of prebuilt, plug-and-play options.

Step 3

That's it! You have just connected  Socialgist Blogs and Tisane Entity Extraction

Supercharge your data workflows with Datastreamer. Add flexibility to your Pipelines and put an end to the common bottlenecks in handling web data.

Experience Seamless Data Integration Yourself

Add Datastreamer components to your data stack and explore its full capabilities

Try it Now

Questions?

We’re always happy with any other questions you might have. Send us an email at [email protected]

We look forward to connecting with you.

Let us know if you're an existing customer or a new user, so we can help you get started!