Skip to content
Projects

Reshaping Observability at EON: New Relic + OpenTelemetry

Reshaping Observability at EON: New Relic + OpenTelemetry
Tungi Dang
October 3, 2025
EnergyObservabilityOT/IT ConvergenceBusiness JourneysMulti-tenant PlatformOpenTelemetryOTel StandardisationPaths & SLOsIncident PlaybooksOnboarding PlaybookInner SourceOpen SourceRoot Cause AnalysisService MappingGDPRKRITIS

ow E.ON modernised critical energy and digital operations with a unified, AI-driven observability backbone across IT, OT, and grid.

E.ON is one of Europe’s largest energy companies, operating 1.6Mio. km of networks for 47 Mio. customers across 17 countries. The goal: reliable, compliant, real-time visibility across IT, OT, and grid operations to support the energy transition.

E.ON set out to standardise telemetry, speed incident response, and align technical signals with business impact across cloud, Kubernetes, and grid assets:

  • Exploding grid complexity from decentralised renewables and volatile demand
  • Fragmented monitoring toolchain and limited end-to-end visibility
  • Cyber-physical threats across IT/OT and IoT devices
  • Regulatory, security, and data-governance requirements
  • Alert fatigue and slow MTTR in distributed systems

E.ON combined New Relic with OpenTelemetry and platform engineering practices to make observability a first-class platform service:

  • OpenTelemetry pipelines for vendor-neutral traces, metrics, and logs
  • New Relic APM, Infrastructure, Logs, and Kubernetes (eBPF/eAPM auto-discovery)
  • Business journey mapping with Pathpoint for revenue and process impact
  • Unified ingest of IT + OT signals for grid state, security, and performance
  • AI-assisted anomaly detection and root-cause analysis to cut MTTR

State estimation, congestion detection, and operational dashboards feed planning and live operations. Customer journeys, order flows, and backend services are monitored with shared KPIs that tie incidents to financial and service impact.

Self-service dashboards and templates scale to thousands of users. Data is shared via standard tooling and collaboration channels to enable faster, evidence-based decisions.

A reusable onboarding playbook standardises collectors, alerts, SLOs, and dashboards across teams and markets. Experiments on alert policies and runbooks are propagated globally.

  • Unified telemetry across cloud, Kubernetes, and grid assets
  • Faster incident detection and reduced MTTR with AI-assisted insights
  • Business-aligned observability via Pathpoint and shared KPIs
  • Lower tool fragmentation and consistent SLOs across teams
  • Compliance-ready data lineage and security monitoring
47M customersAcross 17 countries
IT + OT + GridUnified telemetry
AI-assistedRoot cause analysis

Got a platform problem? I've probably seen it before.

Schedule a call