Incident Management Site Reliability Engineer

Confluent
ON, CA
30+ days ago
Confluent
Confluent
confluent.io

Job Description

Position at Infinitem Canada Ltd.

With Confluent, organizations can harness the full power of continuously flowing data to innovate and win in the modern digital world. We have a purpose that drives us to do better every day – we're creating an entirely new category within data infrastructure - data streaming. This technology will allow every organization to create experiences and use the power of data in ways that profoundly impact the way we all live. This impact is our purpose and drives us to do better every day.

One Confluent. One team. One Data Streaming Platform.

Data Connects Us.

About the Role:

Do you have a passion for data that can turn events into outcomes, enabling intelligent, real-time apps, and empowering teams and systems to be able to act on data instantly? Have you ever dreamt about the opportunity to work with key agencies of the public sector? Confluent's team of Site Reliability Engineers, will allow you to do just that by putting you in the driver seat to deliver highly performant, reliable systems that enable prominent public sector agencies to make real time decisions with their data to solve real time problems through Confluent Cloud. Confluent Cloud delivers a complete end-to-end streaming experience as a Software as a Service (SaaS) model.

What You Will Do:

  • Partner with our Cloud Architecture and Engineering teams to build upon the operational resiliency of the Confluent Cloud systems
  • Collaborate broadly across teams to verify and deploy production changes to Confluent Cloud systems and infrastructure
  • Be an active partner with peer engineering teams, engaging during incidents and driving towards positive outcomes for our customers
  • Maintain critical monitoring used for triage and escalations in the federal space and improve upon automated recovery
  • Adhere to established change and incident management processes and help drive continuous improvements through root cause analysis
  • Engage across all teams, infrastructure, and services to identify and close gaps in SLAs, SLOs, and SLIs

What You Will Bring:

  • 5+ years of relevant experience
  • Expertise in Cloud Native technologies with experience operating production services in the cloud
  • Strong fundamentals of Distributed Systems and their design
  • Deep knowledge of Kubernetes and containerization
  • Experience with telemetry tooling to monitor production systems
  • Confidence with problem-solving and troubleshooting critical services
  • Proficiency with scripting and automation (e.g Go, Java, Python, Bash)
  • Working knowledge of infrastructure as code (e.g Terraform, Cloudformation, AWS CDK, Pulumi)
  • Strong written and verbal skills, with experience in communicating with stakeholders and Enterprise Customers
  • Exceptional teamwork, collaboration skills, and the ability to act critically with minimal supervision at times in a remote first environment
  • Experience with a rotating on-call schedule to provide 24/7 support
  • BS Degree in Computer Science, Engineering, or equivalent experience

Come As You Are

At Confluent, equality is a core tenet of our culture. We are committed to building an inclusive global team that represents a variety of backgrounds, perspectives, beliefs, and experiences. The more diverse we are, the richer our community and the broader our impact. Employment decisions are made on the basis of job-related criteria without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or any other classification protected by applicable law.

Visit Original Source:

http://ca.indeed.com/viewjob

Other Jobs

Trusscore

Who We Are Trusscore is a material science company focused on developing sustainable building materials. We're starting a journey to change the way people build buildings and the environmental

 
Kitchener ON
StackAdapt

StackAdapt is a self-serve advertising platform that specializes in multi-channel solutions including native, display, video, connected TV, audio, in-game, and digital out-of-home ads. We empower hund

 
CA
Benevity

MEET BENEVITY Benevity is the way the world does good, providing companies (and their employees) with technology to take social action on the issues they care about. Through giving, volunteering, gra

 
Toronto ON