下载PDF
实例探究 > Reliability and happy customers with SLOs

Reliability and happy customers with SLOs

技术
  • 分析与建模 - 预测分析
  • 应用基础设施与中间件 - API 集成与管理
  • 应用基础设施与中间件 - 数据交换与集成
适用行业
  • Professional Service
  • Software
适用功能
  • 商业运营
  • 质量保证
用例
  • 预测性维护
服务
  • 系统集成
  • 软件设计与工程服务
挑战
Too many monitoring vendors left Laurel with numerous blindspots and no way to know if they were delivering on customer promises. Laurel is committed to helping their customers automate time-keeping and billing processes. Most lawyers use Microsoft products exclusively, so they primarily interact with Laurel’s system through the Windows desktop application. Nine outside vendors monitored Laurel’s systems, making the reliability of their desktop application and per-customer infrastructure architecture challenging. Even with this battery of solutions, there were a lot of reliability blindspots — they didn’t know if they were delivering on customer expectations to manage time efficiently. With only alerts for CPU and memory on their containers, many poorly enriched alerts were ignored. Their on-call rotation was disorganized and included people who weren’t in the company anymore. Laurel’s product and engineering teams took a hard look at their current workflows — and found them lacking. They had to fix their general approach and understanding of infrastructure monitoring and, in the process, improve the well-being of their engineers through better clarity and less stress. Laurel ran a request for proposal (RFP) with nine vendors, searching for a monitoring solution to provide broader visibility. Laurel’s product and engineering teams also decided to take a service level objective (SLO) approach to reliability and set out to find solutions that aligned with this.
关于客户
Laurel is a company dedicated to helping their customers automate time-keeping and billing processes, primarily catering to lawyers who use Microsoft products. The company faced significant challenges in ensuring the reliability of their desktop application and per-customer infrastructure architecture due to the involvement of nine different monitoring vendors. This led to numerous blindspots and a lack of clarity on whether they were meeting customer expectations. Laurel's product and engineering teams recognized the need to overhaul their infrastructure monitoring approach to improve both customer satisfaction and the well-being of their engineers.
解决方案
Laurel had two primary goals: ensuring customer satisfaction and improving the mental health of their employees by focusing on high-value tasks. They chose Sumo Logic as their monitoring platform, which worked seamlessly with other tools like OpsGenie for on-call rotation and OpenTelemetry for simplifying and broadening data collection. Sumo Logic's Reliability Management solution allowed Laurel to define and track Service Level Objectives (SLOs), providing deeper insights into customer experience beyond basic monitoring. The SLO dashboard within Sumo Logic's UI offered an active view of service health, including service-level indicators, error budgets, and error history. This setup enabled Laurel to create generic SLOs for new services and customers easily, streamlining the monitoring process and reducing unnecessary alerts.
运营影响
  • Laurel's engineering team gained deeper insights into customer experience through SLOs, allowing them to focus on what matters to end users.
  • The SLO dashboard provided a clear view of service health, including service-level indicators, error budgets, and error history.
  • Sumo Logic's integration with other tools like OpsGenie and OpenTelemetry streamlined data collection and monitoring processes.
数量效益
  • Laurel's team only needed to write 90 lines of code to create a few hundred SLOs for their clientele.

相关案例.

联系我们

欢迎与我们交流!

* Required
* Required
* Required
* Invalid email address
提交此表单,即表示您同意 IoT ONE 可以与您联系并分享洞察和营销信息。
不,谢谢,我不想收到来自 IoT ONE 的任何营销电子邮件。
提交

Thank you for your message!
We will contact you soon.