We are seeking a highly skilled Network Software Architect with extensive experience in network monitoring and automation. The ideal candidate will have a proven track record in designing and implementing scalable, high-performance solutions that enhance the reliability, security, and efficiency of complex network infrastructures. In this role, you will play a pivotal role in driving innovation, establishing best practices, and leading the development of cutting-edge technologies that empower our network operations. This position will also involve leveraging Kubernetes for containerized environments and incorporating AI/ML technologies to optimize network monitoring and automation processes, pushing the boundaries of what's possible in modern network management.
Key Responsibilities:Architectural Design:
Design and implement robust network monitoring solutions capable of providing real-time insights into network performance and health.
Develop automation frameworks to streamline network management, including provisioning, configuration, and incident response.
Leverage Kubernetes for deploying and managing containerized network monitoring and automation applications, ensuring scalability and reliability.
AI/ML Integration:
Incorporate AI/ML technologies to enhance network monitoring, including predictive analytics for network performance, anomaly detection, and automated incident response.
Develop and implement machine learning models that can analyze network data and provide actionable insights for optimization.
Technical Leadership:
Lead the development and integration of innovative network monitoring and automation strategies.
Establish and enforce industry standards and best practices for software development within the network domain.
Development and Implementation:
Ensure seamless integration of monitoring and automation tools with existing network infrastructure and third-party systems.
Collaboration and Communication:
Work closely with cross-functional teams, including Network Reliability Engineers (NRE) and Service Reliability Engineers (SRE) to ensure the effective use of monitoring and automation solutions.
Performance Optimization:
Continuously monitor, optimize, and tune network monitoring and automation systems to meet required service levels.
Develop and implement automation scripts and tools to improve network incident response times.
Security and Compliance:
Integrate security considerations into the design of monitoring and automation solutions.
Ensure compliance with industry regulations, including the development of automated auditing and reporting features.
Continuous Improvement:
Provide training and mentorship to team members on the latest technologies and methodologies.