AIOPS

Page 32

AIOPS

It’s all about scale Why a delivery app company needed true cloud-native monitoring. BY CHRONOSPHERE Constant packet loss At first, the delivery app company was using a combination of StatsD monitoring for its cloudnative stack and another solution to monitor its virtual machine environment. As the cloud-native environment scaled and developers delivered new features, however, the monitoring system kept breaking down. “We were experiencing constant packet loss,” explained the observability lead. “Developers could break the whole system by making some benign change that has some weird bug and it crashes the whole system.” There was an extreme noisy neighbor problem – a developers change could easily impact StatsD’s ability to monitor some other, seemingly unrelated application in an entirely unpredictable way. It was so bad, he said, that when he first joined the company no one was actually paying attention to

32

WWW.DIGITALISATIONWORLD.COM

l

ISSUE II 2021

l

COPYRIGHT DIGITALISATION WORLD

the metrics generated by StatsD, instead relying on work-arounds like counting log lines. When the team upgraded StatsD, they discovered that they had been losing metrics because of a bug in the system. “We did the upgrade, and there was an instant change in the pattern,” the observability lead said. “That’s the bigger problem: The pattern changed. That means we had been flying blind for a long time. We had been seeing a zigzag, but then it became a flat line, which was more or less what we expected.” The team had been suspecting that something was off, but hadn’t had a way to verify it. In general, the company thinks of itself as being data driven. Everyone is crazy about numbers, from the CEO down to the newest engineer. Data is used to make better decisions about technology and about the business. If everyone loses observability, it means


Turn static files into dynamic content formats.

Create a flipbook

Articles inside

What is an AI strategy and how do you build one?

4min
pages 86-87

Using the greater data ecosystem to drive great decision making

5min
pages 93-94

The ROI of data-driven development improving how teams work

6min
pages 90-92

How to train your AI algorithm

4min
pages 84-85

How to thrive in the enterprise AI era

5min
pages 82-83

Companies consider work from anywhere

4min
page 78

Why is APM important? Breaking down the benefits

4min
pages 80-81

Three tips for creating outstanding digital customer experiences

3min
pages 76-77

The key to unlocking optimum digital employee experience: Proactivity

5min
pages 74-75

Prioritise, predict and act with BMC Helix

4min
page 73

Catchpoint releases enhanced version of the WebPageTest API

2min
page 72

Understanding legacy infrastructure by looking at the Egyptian pyramids

5min
pages 70-71

Accelerate your automation journey with a Centre of Excellence

5min
pages 68-69

Weeding your digital garden to provide outstanding software experiences

4min
pages 60-61

It’s now or never for more perfect software

4min
pages 62-63

CloudFabrix launches AIOps 2.0

3min
page 64

Telecom: Enabling automaton everywhere through AI and analytics

3min
pages 54-55

The future of service management in the DevOps era

5min
pages 58-59

The importance of AIOps within value stream management

6min
pages 56-57

Tackling cloud complexity with the right tools

5min
pages 52-53

Poor visibility and silos impact business outcomes

3min
page 51

Reducing operational costs and improving productivity in a post-Covid world

11min
pages 46-49

The data-driven IT operations organization

3min
pages 44-45

The importance of visibility in AIOps for digital transformation

3min
pages 28-29

The move to SaaS

5min
pages 34-35

It’s all about scale

4min
pages 32-33

Improving cloud application performance

3min
page 30

Seven KPIs for AIOps

3min
pages 26-27

Netreo acquires Stackify

3min
page 23

How AIOPs is driving digital transformation

3min
pages 10-11

Observability vs. Monitoring for DevOps professionals

6min
pages 20-21

Implementing AIOPs in 5 simple steps

6min
pages 16-19

Visibility across the entire software stack

4min
page 22

Study says Digital Operations Management is essential

3min
page 12

How to get started with AIOps

5min
pages 6-7

ScienceLogic are in the money

3min
page 13

AIOps: Why network monitoring plays an even more important role

5min
pages 8-9
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.