Logo of TexanEngineers.com
Fornia.dev
Observability Technical Consultant

thinkahead

fulltime

Posted on: 6/13/2025

Required Skills:

DatadogAWSPython

Job Description:

Observability Engineer

AHEAD builds platforms for digital business. By weaving together advances in cloud infrastructure, automation and analytics, and software delivery, we help enterprises deliver on the promise of digital transformation. At AHEAD, we prioritize creating a culture of belonging, where all perspectives and voices are represented, valued, respected, and heard. We create spaces to empower everyone to speak up, make change, and drive the culture at AHEAD. We are an equal opportunity employer, and do not discriminate based on an individual's race, national origin, color, gender, gender identity, gender expression, sexual orientation, religion, age, disability, marital status, or any other protected characteristic under applicable law, whether actual or perceived. We embrace all candidates that will contribute to the diversification and enrichment of ideas and perspectives at AHEAD.

Responsibilities

  • Experience implementing solutions such as Datadog, Dynatrace, New Relic, Splunk ITSI, Elastic, Grafana, and/or LogicMonitor.
  • Practical knowledge of cloud-native observability solutions like Prometheus, OpenTelemetry, ELK Stack.
  • Familiarity with public cloud monitoring solutions (AWS CloudWatch, Azure Monitor, GCP Operations Suite).
  • Strong understanding of infrastructure fundamentals: distributed systems, networking, databases.
  • Knowledge of ITSM processes, DevOps, and SRE methodologies.
  • Excellent problem-solving, collaboration, and communication skills.

Qualifications

  • Deploy, configure, and maintain observability platforms and solutions, ensuring system scalability, reliability, and performance.
  • Develop monitoring, logging, tracing, and alerting solutions aligned with business requirements and operational KPIs.
  • Collaborate effectively with cross-functional teams to integrate observability into workflows and operational processes.
  • Create and optimize custom dashboards, alerts, and automated AIOps routines.
  • Identify, troubleshoot, and resolve performance issues and anomalies.
  • Document observability systems, solutions, and operational procedures thoroughly.
  • Participate actively in ongoing training, stay updated on technology trends, and continuously improve personal skillsets and certifications.
  • Exposure to container technologies (Docker, Kubernetes).
  • General software engineering and scripting proficiency (Python, Java, .NET).
  • Familiarity with Infrastructure as Code tools (Terraform, Ansible).