Sr DevOps Engineer at Workday in Boulder, Colorado

Posted in Engineering about 16 hours ago.

Type: Full Time





Job Description:

Your work days are brighter here.

At Workday, it all began with a conversation over breakfast. When our founders met at a sunny California diner, they came up with an idea to revolutionize the enterprise software market. And when we began to rise, one thing that really set us apart was our culture. A culture which was driven by our value of putting our people first. And ever since, the happiness, development, and contribution of every Workmate is central to who we are. Our Workmates believe a healthy employee-centric, collaborative culture is the essential mix of ingredients for success in business. That's why we look after our people, communities and the planet while still being profitable. Feel encouraged to shine, however that manifests: you don't need to hide who you are. You can feel the energy and the passion, it's what makes us unique. Inspired to make a brighter work day for all and transform with us to the next stage of our growth journey? Bring your brightest version of you and have a brighter work day here.

About the Team

As a Senior DevOps Engineer you will be joining one of Workday's most exciting product and technology teams, Core Software. With nearly 750 employees globally, Core Software is responsible for evolving the core technology and runtime components of the Workday Platform and empowering developers to innovate and build on our products. This DevOps Engineer will be reporting to the VP of Core Services, will support our growth and continued success by ensuring our systems are resilient and capable of withstanding various challenges.

We are looking for a Senior Software DevOps Engineer who has proven experience to lead efforts to embed resilience and fault tolerance within our software architecture. Working closely with engineering, operations, and DevOps teams, this role will be responsible for developing and implementing strategies that improve system reliability, reduce Mean Time to Detect (MTTD) and Mean Time to Recovery (MTTR), and ensure our services can withstand disruptions.

About the Role


  • Design and deploy fault-tolerant and resilient architecture patterns to improve system availability and performance


  • Facilitate FMEA workshops to identify and prioritize potential failure points within critical applications and services

  • Establish and track metrics such as MTTD, MTTR, and Service Level Objectives (SLOs) to measure and improve system resilience

  • Work with QA and Perf teams to implement chaos engineering, stress testing, and other resiliency testing methodologies

  • Collaborate with Site Reliability Engineering (SRE) and operations teams to define incident management playbooks and recovery procedures for high-severity incidents

  • Educate development teams on best practices for building resilient applications, providing guidance on design principles, patterns, and tools

  • Lead post-incident reviews to identify root causes and implement changes to prevent recurrence, creating a culture of learning and continuous improvement

  • Provide program management support for short- and long-term work related to resiliency, quality, and security

  • Drive priorities and address backlogs for ongoing technical work that will advance reactions to incidents, security improvements, version updating, observability enhancements, and long-term system stability and scalability

  • Support Engineering and Product leaders within the Core Services pillar in establishing plans, managing dependencies and risks, and providing organizational visibility and team accountability to ensure the initiative moves forward at the pace needed

  • Implement techniques and best practices in processes and methodology so teams stay aligned and on track to achieve their stated goals.

  • Indepth OMS and workday stack knowledge is a big bonus

About You

Basic Qualifications


  • 5+ years in software engineering, architecture, or DevOps with a focus on reliability, resilience, and high availability

  • Proficiency in cloud platforms (AWS, GCP), distributed systems, containerization (Kubernetes, Docker), and infrastructure as code

  • Strong knowledge of FMEA, Chaos Engineering, Disaster Recovery, and redundancy strategies

  • Experience with monitoring tools (e.g., Prometheus, Grafana) and logging/alerting platforms

  • Proven ability to diagnose complex issues in distributed systems and design/influence/advise resilient solutions

  • Ability to synthesize information and drive alignment to a plan through varying levels of ambiguity

Other Qualifications


  • Bachelor's or Master's Degree in Computer Science, Engineering, or a related field (or equivalent work experience)

  • Desire to help others succeed, and the ability to quietly assess when and where to act as needed

  • Outstanding relationship-building and partnership skills

  • A high level of organization and attention to detail

  • Excellent training skills and ability to work hands-on with teams in an advisor role

  • Excellent written and verbal communication skills to document processes and influence engineering best practices

  • A keen desire to gain deeper knowledge of technologies and methodologies that may benefit us in the future through self-paced research, learning and experimentation

Posting End Date: 02/03/2025

If hired in Colorado, click here for information about Workday's comprehensive benefits in Colorado: https://workdaybenefits.com/us/welcome-to-workday-benefits/prospective-workmates.

The application deadline for this role is the same as the posting end date stated.


Workday Pay Transparency Statement

The annualized base salary ranges for the primary location and any additional locations are listed below. Workday pay ranges vary based on work location. As a part of the total compensation package, this role may be eligible for the Workday Bonus Plan or a role-specific commission/bonus, as well as annual refresh stock grants. Recruiters can share more detail during the hiring process. Each candidate's compensation offer will be based on multiple factors including, but not limited to, geography, experience, skills, job duties, and business need, among other things. For more information regarding Workday's comprehensive benefits, please click here.

Primary Location: USA.CO.Boulder


Primary Location Base Pay Range: $114,200 USD - $171,200 USD


Additional US Location(s) Base Pay Range: $108,500 USD - $206,500 USD

The application deadline for this role is the same as the posting end date stated as below:

12/20/2024



Our Approach to Flexible Work

With Flex Work, we're combining the best of both worlds: in-person time and remote. Our approach enables our teams to deepen connections, maintain a strong community, and do their best work. We know that flexibility can take shape in many ways, so rather than a number of required days in-office each week, we simply spend at least half (50%) of our time each quarter in the office or in the field with our customers, prospects, and partners (depending on role). This means you'll have the freedom to create a flexible schedule that caters to your business, team, and personal needs, while being intentional to make the most of time spent together. Those in our remote "home office" roles also have the opportunity to come together in our offices for important moments that matter.

Pursuant to applicable Fair Chance law, Workday will consider for employment qualified applicants with arrest and conviction records.

Workday is an Equal Opportunity Employer including individuals with disabilities and protected veterans.

Are you being referred to one of our roles? If so, ask your connection at Workday about our Employee Referral process!

PDN-9db03d78-9504-47cd-bccc-84958a5f8e14
More jobs in Boulder, Colorado

Health Care
about 21 hours ago

Boulder Post Acute
General Business
1 day ago

King Soopers
$18.50 - $23.85 per hour
More jobs in Engineering

Engineering
14 minutes ago

NEOTech
Engineering
14 minutes ago

Spinx Oil Company Inc
Engineering
30+ days ago

CooperVision, Inc