Job Description, Responsibilities & Requirements
About the Position
We’re hiring a Senior SRE/DevOps Engineer to own the reliability and infrastructure behind a real-time trading platform processing millions of events with strict latency expectations. You’ll design and operate production systems that engineers depend on to ship safely, with a focus on cloud, Kubernetes, observability, and disciplined infrastructure practices.
Responsibilities
- Design, maintain, and evolve AWS cloud infrastructure, including EKS, networking, IAM, and supporting services, with a focus on reliability, security, and cost control.
- Own Kubernetes workloads end-to-end: deployments, autoscaling, resource management, and cluster health across environments.
- Build and operate CI/CD pipelines with GitHub Actions to ensure fast, repeatable, and auditable releases.
- Maintain and improve the observability stack (Grafana, Loki, alerting) for fast issue detection and investigation.
- Own on-call and incident response processes: configure alerting, escalation policies, and drive post-incident reviews.
- Manage and scale Kafka-based streaming infrastructure supporting real-time data processing.
- Continuously improve system reliability through tuning, automation, and elimination of operational toil.
- Document infrastructure, incidents, and runbooks; ensure operational knowledge is shared and actionable.
Requirements
- Strong hands-on experience with AWS in production: VPC design, EKS, IAM, RDS, S3, CloudWatch.
- Deep expertise in Kubernetes: cluster operations, Helm, RBAC, resource optimization, and troubleshooting real workloads.
- Proven experience building and maintaining CI/CD pipelines with GitHub Actions, including multi-environment promotion, secrets management, and gated releases.
- Experience operating centralized logging solutions such as Grafana Loki (or equivalent), including index management, retention, and log-based alerting.
- Practical experience with Kafka: topic management, consumer monitoring, offsets, and performance tuning.
- Familiarity with ClickHouse or similar columnar databases in high-throughput environments.
- Hands-on use of Grafana for dashboards, alerting, and incident/on-call workflows with runbooks and escalation policies.
- Strong infrastructure-as-code skills (Terraform or equivalent) with version-controlled, peer-reviewed changes.
- Solid understanding of networking fundamentals: DNS, load balancing, TLS termination, ingress patterns, and service mesh basics.
- Experience with latency-sensitive systems and a production ownership mindset.
- Upper-Intermediate English and strong communication skills.
Nice to Have
- Background in fintech, proprietary trading, or other high-stakes environments with strict uptime and data integrity requirements.
- Experience with service mesh technologies such as Cilium, Istio, or Linkerd.
- Familiarity with NATS, Redis, or other low-latency messaging and caching systems.
- Scripting skills in Python, Go, or Bash for automation and tooling.
We Offer
Your time off
- 15 paid vacation days, up to 5 unpaid days off, and 7 paid sick days
- 10 Indian public holidays
Learning & growth
- Sombra University workshops and internal learning programs
- Tech Communities and knowledge sharing sessions
- Language courses and workshops
- Mentorship opportunities
Work & community
- Company-provided work equipment
- Internal referral program
- Sombra events and internal initiatives
About the Company
After you apply, our recruitment team will carefully review your profile, and if we see a good match with the role, we’ll reach out to you shortly.
If you don’t hear from us within 5 business days, it means we’ve decided to continue the process with other candidates for this position. Thanks for understanding.
Galyna Oliyarchyk
Recruitment Partner