Deploying Large-Scale Real-Time Predictions with Apache Kafka: A Playtika Case Study
- Analytics & Modeling - Machine Learning
- Analytics & Modeling - Real Time Analytics
- Education
- Equipment & Machinery
- Product Research & Development
- Quality Assurance
- Predictive Maintenance
- Real-Time Location System (RTLS)
- Data Science Services
- System Integration
Playtika, a leading Game-Entertainment company, faced significant challenges in scaling the production of real-time machine learning. With over 10 million daily active users, 10 billion daily events, and over 9TB of daily processed data, the company's existing batch and web services deployment methods were unable to scale to meet their needs or produce predictions in real-time. The REST APIs in their ML Pipelines led to service exhaustion, client starvation, handling failures and retries, and performance tuning of bulk size for batch partitioning. Playtika’s event-driven ecosystem required a solution that could support real-time streaming of their production models and scale without downtime. They also needed a solution that could integrate with various processes including Airflow and Spark, and handle bursts, peaks, and fast creation of new ML Pipelines.
Playtika is a leading Game-Entertainment company that has been leading with top-grossing titles for over five straight years. The company provides audiences around the world with a wide variety of games based on quality, and personalized content. Playtika uses massive amounts of data to reshape the gaming landscape by tailoring UX based on in-game actions. With over 10 million daily active users, 10 billion daily events, and over 9TB of daily processed data, Playtika is able to provide its scientists the data they need to create a continuously changing and adaptive game environment for its users based on their in-game behavior.
Playtika turned to cnvrg.io AI OS to handle their experiments, scaling, and deployments. cnvrg.io provided a one-click streaming endpoint solution with built-in monitoring and MLOps, enabling Playtika to execute real-time predictions with advanced model monitoring features. cnvrg.io organized every stage of Playtika’s data-science projects, including research, information collection, model development, and model optimization at scale. It also bridged the work between their Data Scientists and ML Data Engineers, enabling them to continuously write, train, and deploy machine learning models to various stages in one click. cnvrg.io delivered a scalable solution for streaming endpoints with Apache Kafka, leading to a massive increase in successful throughput, and little to no latency. It also provided Playtika with event-at-a-time processing, exactly once processing, distributed processing and fault-tolerance with fast failover, reprocessing capabilities, and Kubernetes backed autoscaling for Kafka.