Experience Seamless Data Integration Yourself

Add Datastreamer components to your data stack and explore its full capabilities

Try it Now

Questions?

We’re always happy to answer any questions you might have. Send us an email at [email protected]

Apply Tisane Entity Extraction to Socialgist Boards

Top companies trust Datastreamer to integrate, enrich, and join their web data.

About Tisane Entity Extraction

Detect mentions of people, organizations, locations, filenames, phone numbers, crypto addresses, and more.

Entities are elements of relevance or interest in the text. Tisane extracts both standard entities and those relevant to trust & safety/law enforcement applications.

Standard entities are names of people, their social roles, organizations, places, and so on. We also extract cryptocurrency addresses, bank accounts, credit card numbers, phone numbers, software package names, and more.

Every entity entry is an object made of:

  • type - the type of the entity
  • name - a standard name, if one exists; otherwise, the string that was logged
  • subtypes - more detailed additional types
  • subtype - the first subtype (for backward compatibility purposes)
  • mentions - an array of all detected mentions, with:
    • offset
    • length
    • sentence_index
    • text
  • wikidata - a Wikidata ID, if one exists
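
As a rough illustration of the entity structure described above, the following Python sketch models one entity entry and checks that a mention's offset and length point back at the right span of the source text. The class names, the helper functions, and the sample values are hypothetical and are not real Tisane output. The sketch also assumes that offset is a zero-based character offset and that the fields arrive as a JSON-style dictionary.

```python
from dataclasses import dataclass, field
from typing import Optional


@dataclass
class Mention:
    offset: int          # assumed: zero-based character offset into the text
    length: int          # number of characters the mention covers
    sentence_index: int  # index of the sentence containing the mention
    text: str            # the mention as it appears in the text


@dataclass
class Entity:
    type: str
    name: str
    mentions: list[Mention]
    subtypes: list[str] = field(default_factory=list)
    subtype: Optional[str] = None   # first subtype, kept for backward compatibility
    wikidata: Optional[str] = None  # Wikidata ID, when one exists


def parse_entity(raw: dict) -> Entity:
    """Build an Entity from a dictionary shaped like the field list above."""
    subtypes = raw.get("subtypes", [])
    return Entity(
        type=raw["type"],
        name=raw["name"],
        mentions=[Mention(**m) for m in raw.get("mentions", [])],
        subtypes=subtypes,
        # Fall back to the first subtype if the legacy field is absent.
        subtype=raw.get("subtype", subtypes[0] if subtypes else None),
        wikidata=raw.get("wikidata"),
    )


def mention_matches_text(source: str, mention: Mention) -> bool:
    """Check that offset and length slice out the mention's own text."""
    end = mention.offset + mention.length
    return source[mention.offset:end] == mention.text


# Illustrative sample only; values are invented for this sketch.
SAMPLE_TEXT = "Alice moved to Paris."
SAMPLE_ENTITY = {
    "type": "place",
    "name": "Paris",
    "subtypes": ["city"],
    "subtype": "city",
    "mentions": [
        {"offset": 15, "length": 5, "sentence_index": 0, "text": "Paris"}
    ],
    "wikidata": "Q90",
}

entity = parse_entity(SAMPLE_ENTITY)
print(entity.name, mention_matches_text(SAMPLE_TEXT, entity.mentions[0]))
```

Keeping mentions as their own type makes it easy to highlight or redact every occurrence of an entity in the original post.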

About Socialgist Boards

Socialgist covers more than 200 popular Chinese message boards and forums, including baidu.com, hupu.com, and zhihu.com, along with 3,000 English message boards and forums, offering an unrivalled view into real-time discussions and community sentiment. This message boards dataset is a goldmine for understanding consumer behaviour, tracking viral topics, and gaining insights into niche communities and broader public opinion.

How Datastreamer works

Quickly apply Tisane Entity Extraction to Socialgist Boards with a Datastreamer Pipeline.

Step 1

Start your Pipeline with Socialgist Boards

Web data is the starting point for any Pipeline. You can power your Pipelines with any number of data sources: our partner network, your own systems, or any other web data.

Step 2

Add Tisane Entity Extraction with an Operation

Want to do more with your web data? Datastreamer offers hundreds of ready-to-use operations—filter, join, enrich, search, and more—to help you process and transform data at speed.

Step 3

That's it! You have just connected Tisane Entity Extraction and Socialgist Boards.

With Datastreamer, web data integration is effortless. Add new capabilities to your Pipelines dynamically and solve the operational issues that used to complicate your process.
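
Conceptually, the three steps above amount to a data source feeding documents through a chain of operations. The Python sketch below shows that shape only. It is not the Datastreamer API: run_pipeline, tag_known_names, and the sample documents are hypothetical names invented for illustration, and the stand-in operation is a simple keyword match rather than Tisane Entity Extraction.

```python
from typing import Callable, Iterable, Iterator

Document = dict
Operation = Callable[[Document], Document]


def run_pipeline(source: Iterable[Document],
                 operations: list[Operation]) -> Iterator[Document]:
    """Pass each document from the source through every operation in order."""
    for doc in source:
        for operation in operations:
            doc = operation(doc)
        yield doc


def tag_known_names(doc: Document) -> Document:
    """Stand-in enrichment step (hypothetical): lists watchlist names found in the text."""
    watchlist = ("Paris", "Berlin")
    found = [name for name in watchlist if name in doc["text"]]
    # Return a new document so the source data is left unmodified.
    return {**doc, "entities": found}


# Step 1: a source of documents (invented samples standing in for board posts).
posts = [
    {"text": "Alice moved to Paris."},
    {"text": "Nothing to see here."},
]

# Step 2: add an operation. Step 3: read the enriched results.
for enriched in run_pipeline(posts, [tag_known_names]):
    print(enriched)
```

Because operations share one signature, further steps such as filtering or joining can be appended to the list without changing the source.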
