Job Description
We are seeking experienced and highly skilled IT Solution Architect to join the Monitoring and Event Management Design Team. The successful candidate will be responsible for designing and implementing enterprise monitoring and observability solutions that ensure the performance, health, resiliency, and availability of IT assets. This role involves collaboration with multiple design leads and stakeholders to deliver innovative and scalable monitoring solutions using a wide range of technologies.
Key Responsibilities:
- Lead the design and implementation of enterprise monitoring, observability, and event management solutions.
- Collaborate with MEMD design leads across various domains including Event Management, Function Apps, Azure Monitor, App Dynamics, Grafana, ThousandEyes, and more.
- Develop and sustain observability and monitoring standards aligned with enterprise architecture.
- Design monitoring services for applications across SAAS, IAAS, and PAAS platforms.
- Engage with business units to gather requirements and deliver cost-effective monitoring solutions.
- Perform day-to-day design activities and plan for software and automation upgrades.
- Drive innovation through proof-of-concept initiatives and automation strategies.
Technologies and Tools:
- SRE Practices: Observability and Monitoring
- ServiceNow ITOM Event Management
- JavaScript, Python, PowerShell
- Azure Fundamentals, Azure Monitor, Azure DataLake, Azure Fabric
- API Integration, Ansible
- .NET and C# Programming
- DevOps and Automation Tools
- Grafana, Prometheus, App Dynamics, ThousandEyes, Riverbed, Nobl9
Required Qualifications:
- Bachelor’s degree in IT /Computer science/Engineering, or related field.
- 5+ years of experience in IT monitoring or related technical roles.
- Strong knowledge in Azure Log Analytics, KQL, Telemetry, APM, Ansible, PowerShell, Automation
- Proven ability to analyze and resolve complex application and infrastructure issues.
- Excellent communication and teamwork skills.
Preferred Qualifications:
- 3+ years of experience with IT architecture role and deep understanding of monitoring and application performance management
- Experience with Azure services.
- Experience in DevOps and programming languages: Python, PowerShell, C#, JavaScript, SQL.
- Knowledge of proactive monitoring using Azure services, telemetry, and synthetic transactions.
- Understanding of network architecture and security: WAN/LAN, TCP/IP, PKI.
- Familiarity with ITSM processes and tools (e.g., ServiceNow), and compliance processes
- Have AIOps vision and awareness