Mission
As a Senior Site Reliability Engineer at Kakarot, you will be responsible for designing, automating, and maintaining our Zero Knowledge proving infrastructure. You will ensure high availability, security, continuous deployment, and scalability of our proving, blockchain, and micro-services infrastructure. You will work closely with Software Engineers and ZK engineers to improve performance, security, and operational reliability.
Responsibilities
- Architect, deploy, and maintain highly available AWS-based infrastructure.
- Automate the deployment of our ZK proving infrastructure, micro-services, and blockchain nodes, ensuring minimal downtime and fast recovery.
- Implement and optimize CI/CD pipelines using GitOps (ArgoCD/FluxCD) and infrastructure automation.
- Define and implement incident response processes, conduct post-mortems, and continuously improve system resilience.
- Enhance security across our stack, ensuring compliance with security best practices.
- Improve monitoring, logging, and alerting using Prometheus, Grafana, and distributed tracing.
- Mentor and collaborate with engineers to drive SRE best practices and operational excellence across the company.
- Shape our culture: As an early joiner, you will have a significant impact on shaping our engineering culture around our core values.
What’s in it for you
By joining KKRT Labs, you will lead the journey at the frontier of verifiable computing, a new paradigm that will revolutionize the way information systems work.
You will contribute to the scaling of Ethereum, the leading application blockchain, and help build tangible use cases fit for the future of computing.
For more information, visit Working at KKRT Labs.
Requirements
Must Have
- 6+ years of experience as a Senior SRE or DevOps Engineer