Observability and Monitoring Engineer Job at e&e IT Consulting Services, Inc., West Des Moines, IA

  • e&e IT Consulting Services, Inc.
  • West Des Moines, IA

Job Description

e&e is seeking an Observability and Monitoring Engineer for a hybrid contract opportunity in West Des Moines, IA!

The Observability and Monitoring Engineer is responsible for designing, building, and maturing enterprise-wide monitoring, logging, alerting, and observability capabilities across a cloud-based technology environment. This role defines the overall observability strategy, architecture, and implementation standards that enable proactive issue detection, faster troubleshooting, and data-driven operational insights across applications, infrastructure, operating systems, databases, file transfers, and batch processes. The ideal candidate brings strong hands-on engineering experience, architectural leadership, and the ability to integrate and rationalize multiple monitoring tools into a cohesive observability framework.

Responsibilities

  • Define and implement standards for logs, metrics, traces, event correlation, and alerting across multiple environments.
  • Design and build centralized dashboards and alerting policies providing unified visibility across:
      • Applications and services
      • Operating systems
      • Cloud services (e.g., compute, storage, databases, serverless, audit/logging services)
      • Relational databases
      • File transfer platforms and managed transfer tools
      • Batch jobs and scheduled processes
  • Develop actionable, noise-free alerting thresholds, escalation policies, and operational runbooks.
  • Integrate and manage multiple monitoring and logging platforms into a cohesive observability ecosystem.
  • Assess existing tools and recommend consolidation, optimization, or modernization where appropriate.
  • Manage the lifecycle, configuration, tuning, and health of observability platforms.
  • Automate monitoring deployments using Infrastructure as Code and CI/CD pipelines; create reusable templates and standards to enable rapid onboarding of new applications.
  • Build self-service dashboards and reporting for both technical and business stakeholders.
  • Define and maintain SLOs, SLIs, and reliability KPIs for critical services.
  • Partner with application, infrastructure, and security teams to reduce MTTR and improve system reliability.
  • Participate in incident response, root cause analysis, and problem management activities.
  • Provide technical leadership and mentoring, advising teams on observability architecture and best practices.
  • Develop and maintain system documentation and contribute to technical planning and strategy sessions.

Required Qualifications

  • Bachelor’s degree in Computer Science, Information Technology, or a related field.
  • 5+ years of experience implementing monitoring and observability solutions, including extensive hands-on experience with Dynatrace.
  • Experience working with monitoring and logging platforms such as Zabbix, Graylog, Splunk, SolarWinds, or comparable tools.
  • 5+ years of hands-on experience with cloud platforms and services, with strong emphasis on AWS architectures.
  • Deep understanding of observability concepts including metrics, logs, traces, distributed tracing, and event correlation.
  • Proven experience building dashboards and KPIs across application, infrastructure, and database layers.
  • Strong scripting and automation skills (Python, Bash, PowerShell).
  • Experience with Infrastructure as Code tools such as Terraform and/or CloudFormation.
  • Solid understanding of systems architecture, network monitoring, and performance tuning.
  • Familiarity with ITIL incident and problem management processes.

Preferred Qualifications

  • Experience using AI-enabled tools to enhance observability, alerting, and operational insights.
  • Experience with containerized and microservices-based architectures.
  • Hands-on experience with OpenTelemetry, Prometheus, Grafana, or similar observability frameworks.

Required Technical Skills

  • Cloud Services: Compute, storage, databases, serverless, and container services
  • Monitoring & Observability Tools: Dynatrace, CloudWatch, Zabbix, SolarWinds, Graylog, Splunk
  • Configuration Management: Ansible, Puppet, Chef
  • CI/CD Tools: Jenkins, QuickBuild, Bitbucket
  • Scripting Languages: Python, PowerShell, Bash
  • Databases: Microsoft SQL Server, PostgreSQL
  • Infrastructure as Code: Terraform, CloudFormation
  • Containers: Docker, Kubernetes

Job Tags

Contract work
