Software Architect, Reliability Engineering

twilioRemotegreenhouse
Posted Date:

September 30, 2025

Employment Type:

Not specified

Work Arrangement:

Remote

Skills & Technologies

Engineeringpreferred

Contact Information

Job Description

Who we are

At Twilio, we’re shaping the future of communications, all from the comfort of our homes. We deliver innovative solutions to hundreds of thousands of businesses and empower millions of developers worldwide to craft personalized customer experiences.

Our dedication to remote-first work, and strong culture of connection and global inclusion means that no matter your location, you’re part of a vibrant team with diverse experiences making a global impact each day. As we continue to revolutionize how the world interacts, we’re acquiring new skills and experiences that make work feel truly rewarding. Your career at Twilio is in your hands.

See yourself at Twilio

Join the team as Twilio’s next Reliability Architect.

About the job

As an Architect in SRE, you will drive the technical strategy, vision and outcomes for Twilio’s Reliability Engineering organization. You will define and lead solutions and initiatives that ensure Twilio products are reliable worldwide, and you will define standards and guide engineering teams on best practices for designing, building, and operating resilient systems. This role is pivotal to Twilio’s commitment to operational excellence, scalability, and pragmatic, large-scale systems design in the cloud.

Responsibilities

In this role, you’ll:

    • Partner with senior technical leaders across Twilio to set and communicate the reliability strategy, translating business goals into measurable outcomes.
    • Influence company-wide architectural decisions while balancing long-term vision with near-term and compliance needs.
    • Lead the design, implementation, and operation of scalable solutions and paved roads that enable reliable, high-traffic services;
    • Influence company-wide architectural decisions to focus on availability, performance, resilience, and cost efficiency using Kubernetes, AWS, Terraform, and modern observability.
    • Ensure integrity and quality across the service lifecycle; design fault-tolerant architectures, incident response, disaster recovery, and capacity/cost management.
    • Collaborate with product and cross-functional teams to identify reliability risks and convert them into actionable designs, programs, and tooling.
    • Establish and champion reliability practices and drive systemic improvements.
    • Mentor and grow engineers and technical leaders
    • Track and apply emerging SRE, cloud, and large-scale systems best practices; introduce pragmatic innovations that improve reliability at scale.

Qualifications

Twilio values diverse experiences from all kinds of industries, and we encourage everyone who meets the required qualifications to apply. If your career is just starting or hasn't followed a traditional path, don't let that stop you from considering Twilio. We are always looking for people who will bring something new to the table!

*Required:

    • 15+ years of experience in Reliability Engineering, Software Engineering, DevOps roles with a focus on infrastructure, backend systems, and reliability, including as a principal/architect.
    • Strong experience in driving strategic technical decisions and defining long-term technical vision.
    • In-depth understanding of the role of Reliability Engineering in a large and diverse SaaS organization.
    • Experience driving cross-org technical architecture outcomes.
    • Knowledge of cloud architecture, devops practices, and large-scale systems design with microservices.
    • Bachelor's or Master's degree in Computer Science, Engineering, or a related field (or equivalent experience).
    • Strong production experience, including operational management, scaling, partitioning strategies, and tuning for performance and reliability in high-scale environments.
    <