MathWorks logo

MathWorks

Senior Observability Engineer

🇺🇸 Remote - Natick, MA 🕑 Full-Time 💰 $122K - $190K 💻 Other 🗓️ June 15th, 2026
Kubernetes

Edtech.com's Summary

MathWorks is hiring a Senior Observability Engineer. This role involves defining and driving the strategy, architecture, and implementation of observability capabilities across MathWorks' cloud platform and product engineering teams, focusing on improving reliability, performance, and visibility of online products and services. The engineer will lead initiatives using cloud-native tools to build scalable monitoring solutions and integrate observability into SRE practices.

Highlights
  • Architect and evolve observability systems using cloud-native tools like Prometheus, Thanos, Alertmanager, and OpenTelemetry.
  • Build scalable, multi-tenant observability solutions for Kubernetes microservices at scale.
  • Implement SLOs, SLIs, and error budgets, integrating observability into Site Reliability Engineering practices.
  • Drive instrumentation standards to improve telemetry signal quality and coverage.
  • Develop automation and tooling for telemetry collection, alerting, and correlation.
  • Support application teams with observability patterns, instrumentation frameworks, and dashboards.
  • Collaborate with incident management and SRE teams to enhance root-cause analysis and reduce mean time to recovery.
  • Requires a bachelor's degree with 6 years of professional experience or equivalent advanced education and experience.
  • Expertise in cloud-native observability stacks such as the Prometheus ecosystem, OpenTelemetry, and the Grafana suite.
  • Strong knowledge of Kubernetes internals, SRE principles, multi-cloud environments (AWS, Azure, GCP), and secure telemetry data handling.

Senior Observability Engineer Full Description

Senior Observability Engineer


Job Summary

As a Senior Observability Engineer, you wil contribute to definig and driving the strategy, architecture, and implementation of observability capabilities across MathWorks’ cloud platform and product engineering teams. This role is ideal for someone who brings deep expertise in cloud‑native observability systems, understands how modern distributed systems behave, and can lead cross-functional initiatives that improve reliability, performance, and end‑to‑end visibility for MathWorks' online products and services.

Responsibilities
  • Architect and evolve observability systems using cloud‑native tools such as Prometheus, Thanos, Alertmanager, OpenTelemetry
  • Build scalable, multi‑tenant observability solutions for Kubernetes clusters running microservices at scale.
  • Implement SLOs, SLIs, and error budgets—integrating observability into SRE practices.
  • Improve signal quality and coverage by driving instrumentation standards across teams.
  • Develop automation and tooling to enhance telemetry collection, alerting, and correlation.
  • Support application teams in adopting observability patterns, instrumentation frameworks, and dashboards.
  • Collaborate with incident management and SRE teams to improve root‑cause analysis and reduce MTTR.

Minimum Qualifications
  • A bachelor's degree and 6 years of professional work experience (or a master's degree and 3 years of professional work experience, or equivalent experience) is required.

Additional Qualifications
  • Deep expertise in cloud-native observability stacks (Prometheus ecosystem, OpenTelemetry, Grafana suite).
  • Strong understanding of Kubernetes internals and common cloud‑native patterns (sidecars, operators, CRDs).
  • Experience applying SRE principles: SLO/SLI design, chaos engineering, error budget management.
  • Experience in designing self-service observability capabilities for platform users.
  • Security‑aware approach to telemetry data handling.
  • Knowledge of cloud providers (AWS, Azure, GCP).