Download PDF
Gathr > Case Studies > Real-time Multi-lingual Classification and Sentiment Analysis of Text
Gathr Logo

Real-time Multi-lingual Classification and Sentiment Analysis of Text

Technology Category
  • Analytics & Modeling - Big Data Analytics
  • Analytics & Modeling - Natural Language Processing (NLP)
Applicable Industries
  • Telecommunications
Applicable Functions
  • Business Operation
Services
  • Data Science Services
The Challenge
The client, a major telecom company providing nationwide telecom services, was in need of a system that could perform real-time, multi-lingual classification and sentiment analysis of text data. They were looking for a solution that allows storing, indexing, and querying PetaBytes (PBs) of data with a very high throughput. The critical requirements included the ability to ingest and parse a high volume of data [250M (15 TB) records/day] of varied types such as weblogs, email, chat, and files. They also needed to apply real-time multi-lingual classification and sentiment analysis with very high accuracy (four nines), store metadata and raw binary data for querying, and meet a Query SLA of 5s on cold data.
About The Customer
The customer in this case study is a major telecom company that provides nationwide telecom services. They were in need of a system that could perform real-time, multi-lingual classification and sentiment analysis of text data. The company deals with a high volume of data of varied types such as weblogs, email, chat, and files. They required a solution that could ingest and parse this data, apply real-time multi-lingual classification and sentiment analysis with very high accuracy, store metadata and raw binary data for querying, and meet a Query SLA of 5s on cold data.
The Solution
Impetus provided a solution that consisted of three modules. The Analytics Module was responsible for performing text categorization and sentiment analysis. It implemented a matrix decomposition-based text-classification algorithm. The incoming test document had to pass through a series of pre-processing and numerical computations. Impetus designed the classifier to achieve very low latency. The Event Store/Indexer Abstraction Layer was responsible for storing and indexing the information based on the configuration. The Publish Module was responsible for publishing the analytical result or event data to the external system.
Operational Impact
  • Rapid and accurate real-time text categorization and sentiment analysis
  • Adjustable text categorization for domain-specific classes
  • Multi-lingual support
Quantitative Benefit
  • Ingest and parse high volume of data [250M (15 TB) records/day]
  • Achieved very high accuracy (four nines) in real-time multi-lingual classification and sentiment analysis
  • Met Query SLA of 5s on cold data

Related Case Studies.

Contact us

Let's talk!

* Required
* Required
* Required
* Invalid email address
By submitting this form, you agree that IoT ONE may contact you with insights and marketing messaging.
No thanks, I don't want to receive any marketing emails from IoT ONE.
Submit

Thank you for your message!
We will contact you soon.