All jobs

Production Engineer

Full-time US - Remote $135k - $195k

At Synadia—the company behind NATS.io—we enable global enterprises and innovative startups to seamlessly connect all their applications and data, no matter where they live or how they’re deployed. Today, end users everywhere expect lower latency and faster access to digital services; with NATS under the hood, distributed applications can finally deliver.

About the Role

You’ll shape how we design, build, and operate the production systems behind our SaaS platform. This is more than a traditional infrastructure role — we’re looking for someone who can architect new approaches to service management, not just maintain existing ones. You’ll own reliability and security while bringing creative, systems-level thinking to how we evolve our platform. Expect to write real code, lead design decisions, and have a direct impact on how we ship and operate software.

What You’ll Do

System Design & Architecture

  • Architect and design scalable approaches to managing our SaaS services — from service topology and deployment strategies to multi-tenant isolation and lifecycle management
  • Evaluate trade-offs across cost, complexity, reliability, and velocity when proposing new infrastructure patterns
  • Produce clear design documents and architecture decision records that communicate the why behind technical choices
  • Lead technical design discussions, solicit input from across the team, and drive decisions to resolution

Security & Authentication

  • Design and implement secure authentication flows and credential management systems
  • Apply cryptographic principles — identity, signing, encryption, and verification — to protect our infrastructure and services

Infrastructure & Deployments

  • Design and build code-driven deployment pipelines across cloud platforms (AWS, Azure, or GCP)
  • Own infrastructure-as-code using Terraform, including virtual networks and configuration management
  • Identify opportunities to simplify or rethink how services are provisioned and managed

Reliability & Observability

  • Implement and own continuous deployment workflows with robust monitoring and validation gates
  • Drive observability using tools like Prometheus, Grafana, and the Elastic Stack
  • Participate in on-call rotations and actively support staging and production environments

Collaboration

  • Work alongside developers and customer success teams to troubleshoot cloud integration issues and inform platform design with real-world feedback
  • Communicate clearly across the organization — in writing, in code, and in conversation

Requirements

  • 5+ years in a DevOps, SRE, platform, or infrastructure engineering role
  • 7+ years of experience in programming or systems management
  • Demonstrated experience designing and implementing production infrastructure or platform architectures — not just operating them
  • Ability to think from first principles about how services should be structured, deployed, and managed
  • Solid understanding of modern web application authentication frameworks
  • Working knowledge of cryptographic identity, signing, encryption, and verification
  • Hands-on experience with at least one major cloud platform (AWS, Azure, or GCP)
  • Proficiency in Go or Python for writing and maintaining production tooling
  • Strong working knowledge of git and CI/CD pipelines
  • Comfortable on the command line in Linux environments (shell, SSH)
  • Clear written and verbal communication skills in English

Nice to Have

  • Experience designing multi-tenant SaaS architectures or service management platforms
  • Track record of producing architecture proposals, design documents, or RFCs in a team setting
  • Experience with Terraform and infrastructure-as-code at scale
  • Familiarity with configuration management systems
  • Hands-on experience with Prometheus, Grafana, and the Elastic Stack
  • A habit of understanding the tools you use one level deeper than required — you read the docs, the source, or both

Who You Are

A creative, systems-minded engineer who sees infrastructure challenges as design problems. You don’t just ask “how do we fix this?” — you ask “how should this work?” You bring clarity to ambiguous problems, propose concrete solutions, and take ownership of seeing them through. You thrive both independently and within a distributed team, and you communicate as well in prose and diagrams as you do in code.

Location

US - Remote

Diversity and Inclusion

Synadia is an equal opportunity employer. Our continuing policy is to recruit and employ the best-qualified individuals without regard to race, color, sex, religion, national origin, disability, age, sexual orientation, gender identity, and/or any other protected characteristic.

Salary Range

The projected salary range for this position is $135,000 to $195,000. The projected salary range is just one component of the total compensation for this position. Total compensation for this position also includes equity, bonuses, and comprehensive benefits.

Apply

Please send a cover letter and resume to cs-jobs@synadia.com

Apply Now
Get the NATS Newsletter

News and content from across the community


© 2026 Synadia Communications, Inc.
Cancel