The Enterprise Monitoring and Operations Support Lead is responsible for engineering, configuring, and overseeing the enterprise monitoring ecosystem, ensuring full visibility into infrastructure, application, network, cloud, and security performance metrics to support operational continuity and SLA compliance.
Location: Rockville, MD
Salary: 128K-150K
Clearance: Secret or L Clearance
Responsibilities:
• Administer and optimize enterprise monitoring platforms overseeing server health, database performance, network throughput, storage utilization, Microsoft 365 service availability, and cloud resource metrics to ensure real-time operational awareness and SLA adherence
• Validate integration of infrastructure, endpoint, network, and identity logs into enterprise SIEM platforms ensuring accurate event correlation, alert threshold tuning, and comprehensive monitoring coverage across support areas
• Develop and maintain performance dashboards and operational reporting packages detailing uptime percentages, incident response times, resource utilization trends, and system health indicators for executive review
• Coordinate 24x7 monitoring activities ensuring alert escalation procedures, incident documentation within ITSM workflows, and response workflows are executed in accordance with operational standards
• Conduct trend analysis on recurring alerts, performance degradation patterns, and capacity constraints, implementing proactive remediation strategies to prevent SLA impact
• Participate in incident response bridge calls and provide technical monitoring data to support root cause analysis and executive briefings
• Ensure monitoring configurations comply with enterprise technical standards and cybersecurity requirements for audit logging and event visibility
• Maintain documentation of monitoring architectures, alert configurations, escalation matrices, and operational runbooks to ensure governance alignment and audit readiness
Requirements
Bachelor’s Degree; 8+ years relevant experience; analyze system performance; monitor application and infrastructure metrics; identify performance bottlenecks; recommend optimization strategies; support troubleshooting and resolution efforts; ensure operational performance standards are maintained.
Master’s Degree with 6+ years or Associate’s Degree with 10+ years of relevant experience may be substituted.
• Experience administering enterprise monitoring platforms supporting infrastructure, network, database, M365, and cloud workloads
• Knowledge of alert tuning, threshold configuration, escalation workflows, and incident documentation processes
• Experience integrating monitoring outputs with SIEM and incident response activities
• Familiarity with performance dashboard development and SLA reporting
• Experience analyzing recurring performance trends and implementing proactive remediation measures
• Experience administering enterprise monitoring and observability platforms (e.g., Splunk, SolarWinds, Dynatrace, Azure Monitor, SCOM, or equivalent tools) supporting infrastructure, network, database, and cloud workloads
• Preferred Certifications: ITIL 4 Foundation; Certified Monitoring Professional or equivalent vendor monitoring certification
• Clearance: Must be able to obtain and maintain the appropriate background investigation and credential required for access to organizational systems and facilities.
Clearance:
SECRET or L Clearance