Site Reliability Engineer
Sigma360
Software Engineering
New York, NY, USA · Remote
Sigma360 is seeking a Site Reliability Engineer to help safeguard the availability, performance, and resilience of the platform our clients depend on. This is a hands-on engineering role embedded between our DevOps and Security teams: you will own incident response alongside the engineers who build the systems, sharpen our alerting so the right people are paged for the right reasons, and grow the observability and monitoring tooling that lets us understand production behavior at a glance. We are looking for an engineer who treats reliability as a product — proactively hunting down sources of toil and risk before they turn into outages — rather than someone who waits for tickets.
This role is fully remote within Europe. You will work closely with engineers across multiple time zones and participate in a healthy, well-staffed on-call rotation that values learning from incidents over blame.
Key Responsibilities
Partner with our DevOps and Security teams across the full incident lifecycle: drive response during live incidents, run blameless post-incident reviews, and turn each operational surprise into durable improvements in code, runbooks, or tooling. Improve the signal-to-noise ratio of our alerting by tuning thresholds, removing duplicate or low-value alerts, and codifying SLIs/SLOs that reflect actual customer impact. Build and operate the observability stack — metrics, logs, traces, and dashboards — so engineers across the company can answer their own questions about production. Maintain and evolve our AWS infrastructure through Terraform, with a focus on safe, reviewable changes and reproducible environments. Identify reliability and security risks proactively, propose remediations, and follow them through to production rollout.
Qualifications
- Strong production operations experience. You have supported business-critical systems and can stay calm and methodical during a live incident, then translate what you learned into structural fixes.
- Cloud and infrastructure-as-code fluency. You can design, review, and refactor Terraform that manages real AWS workloads, and you understand the trade-offs of the AWS primitives you reach for.
- Observability sensibility. You know what makes an alert actionable, what makes a dashboard useful, and how to instrument a service so the next on-call engineer can debug it without paging the author.
- Collaborative, low-ego communication. You work well with engineers who are not SREs, write clearly in async channels, and can push back on a design without making it personal.
Must Have
- 4+ years of experience in an SRE, Production Engineering, DevOps, or similar role supporting production systems at meaningful scale
- Hands-on experience operating workloads on AWS in production
- Practical experience with Terraform, including reviewing and authoring modules that change real infrastructure
- Experience participating in a 24/7 on-call rotation, including incident command and post-incident review
- Track record of improving observability and monitoring tooling — not just consuming it
- Europe, fully remote, must be an EU citizen
Nice to Have
- Experience writing Go, particularly for reliability tooling, controllers, or internal services
- Exposure to security-adjacent reliability work: secrets management, IAM hygiene, audit logging, vulnerability response
- Experience with docker and CI/CD pipeline tooling
- Background in financial services, payments infrastructure, financial crime compliance, or regulatory technology
What We Offer
- Opportunity to grow with a fast-scaling, product-led company
- Collaborative, low-ego culture focused on solving hard problems
- Flexible work arrangements and generous time-off policies
- Comprehensive health, dental, and commuter benefits
About Sigma360
Sigma360 is a venture-backed data and analytics company building modern software to help organizations manage risk and make better decisions. We value curiosity, clarity of thought, and pragmatic execution, and we believe great products come from strong collaboration across disciplines. Sigma360 is an equal opportunity employer committed to building an inclusive and diverse workplace.