Sre Architect

Details of the offer

As an Architect for Site Reliability Engineering , the focus is to ensure that the designed solution responds to non-functional requirements such as reliability, availability, performance, security, and maintainability. You will closely work with the development and other related Release and extended support teams.You will bring a strong engineering focus to operations, putting your leadership to identify methods for preventing incidents, increasing observability, automation frameworks, self-service infrastructure, logging and metrics, and operational reports.
You will be expected to use tools include logging, monitoring, event management, notification, Runbook Automation, ChatOps, Root Cause Analysis.
You will work with Automation Engineers and QA Engineers, development team to ensure seamless delivery of our service offerings.
Build sufficient expertise in the IBM Cloud control plane to create automated monitoring processes

In this role, you will lead the problem resolution process for our clients, from analysis and troubleshooting, to deploying the latest software updates & fixes.

Your primary responsibilities include: 24x7 Observability: Be part of a worldwide team that monitors the health of production systems and services around the clock, ensuring continuous reliability and optimal customer experience.
Cross-Functional Troubleshooting: Collaborate with engineering teams to provide initial assessments and possible workarounds for production issues. Troubleshoot and resolve production issues effectively.
Deployment and Configuration: Leverage Continuous Delivery (CI/CD) tools to deploy services and configuration changes at enterprise scale.
Security and Compliance Implementation: Implementing security measures that meet or exceed industry standards for regulations such as GDPR, SOC2, ISO 27001, PCI, HIPAA, and FBA.
Maintenance and Support
Keeping your assigned site or service up and running or getting it back up and running quickly when failure occurs
Working closely with internal partners and teams to ensure that our infrastructure meets security, SLA, and performance requirements
Writing, updating, and using documentation, including runbooks/playbooks
Automating work including infrastructure needs, testing, failover solutions, failure mitigation, and much more
Debugging complex problems across an entire stack and creating solid solutions
Developing CI/CD processes to improve cadence
Persistent testing of application and infrastructure resiliency over a variety of error conditions.
Partnering with security engineers and developing plans and automation to aggressively and safely respond to new risks and vulnerabilities.
Develop, communicate, and monitor standard processes to promote the long-term health of sustainability and health of operational development tasks.
Standup and maintain pre-production and developer environments to support the entire development organization and improve overall team velocity
Use metrics and analytics to determine reliability issues and remove them through automation and tooling
Be an advocate for our customers, providing them self-diagnosing tools to resolve common issues that arise in the field


Nominal Salary: To be agreed

Source: Brassring

Requirements

Cloud Software Engineer

Job Summary NetApp is uniquely placed in the industry and in an enviable position partnering with major hyper scalers (AWS, GCP and Azure) which adds a new c...


Netapp - Karnataka

Published 8 days ago

Manager Cloud Operations

Remote Work: Hybrid Overview: At Zebra, we are a community of innovators who come together to create new ways of working to make everyday life better. United...


Zebra - Karnataka

Published 8 days ago

Ai Data Engineer

Req ID:303835 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, a...


Nttdata - Karnataka

Published 8 days ago

Ai Data Engineer

Req ID:303832 NTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, a...


Nttdata - Karnataka

Published 8 days ago

Built at: 2024-11-22T13:37:22.374Z