StackState uses Graphite data to create a total unified overview of the IT stack. On this, it supplies functions like investigation, remediation and prevention by using big data analytics on top of its real-time operations model. This will give you faster mean times to repair. It also gives you the ability to know the effect of changes even […]

Read more

Service Level Agreements (SLAs) have long been used for quick and accurate views on overall performance. They provide technical, input-based metrics (uptime counts, downtime, response time, etc.) that highlight areas of strength and weakness. But what do these SLA metrics really mean to your business? Do the outcomes direct actions that result in a happy customer? […]

Read more

While the landscape of infrastructure, DevOps, tools, technologies and processes keeps shifting dramatically, the pressure to keep everything up and running keeps growing. With this increasing complexity, small scale problems can quickly escalate into full-scale issues, reverberating well beyond IT operations into every aspect of your business, impacting your bottom line. Here are five dilemmas […]

Read more

Getting a change approved is hard even if it makes the system more stable As an engineer, again and again I’ve found myself in this position: explaining why some company should invest in rebuilding or refactoring some set of components that at that time do not seem to cause any problems. These discussions were never […]

Read more

The pace of change is increasing. Component sizes are shrinking. All the while monitoring solutions are bombarding us with log data, metrics, status reports and alerts. It all scales, but we don’t. How do we prevent from drowning in run-time data? A lot of companies are facing the same problem. They have such a huge […]

Read more