Design and implement an AIOps platform to automate monitoring, anomaly detection, and incident response in Kubernetes clusters.
Key activities:
- Integrate monitoring tools and telemetry pipelines
- Implement ML algorithms for failure prediction and anomaly detection
- Build automated remediation workflows to reduce MTTR
- Aim to improve service availability and incident response time
Email subject to use: RD001
📧 Pour postuler: internship@insomea.com