Database Reliability Engineer
Okta, Inc.
  • Location: Lakemba, NSW
  • Salary: Not disclosed
  • Full-time
  • 24 February 2021
Job Description

At Okta some of the common catchphrases are "Always On" and "No Mysteries", and nowhere do we embrace that more than in Site/Data Reliability Operations. We are looking for a Database Reliability Engineer who has several years of experience working on large-scale environments with zero downtime. The job is to deliver an extremely reliable, performant, and secure database infrastructure through the skillful use of automation, and, when something inevitably breaks, to hunt down the root cause and fix it.

If you like to be challenged and have a passion for solving problems at scale with automation, testing, and tuning, then we would love to hear from you. The ideal candidate is someone who exemplifies the ethic of "If you have to do something more than once, automate it" and who can rapidly self-educate on new concepts and tools.

Job Duties and Responsibilities:

As a Database Reliability Engineer, you will have ownership of all technical aspects of our data services tier. Reporting to the Manager of Site Reliability Engineering, you will partner with our core product engineers, performance engineers, site reliability engineers, and growing DBRE team, and work towards scaling, securing and tuning our MySQL clusters. Additionally, you will play a key role as we evolve our architecture to meet the demands of Okta's enormous growth and the hundreds of millions of users who rely on us to provide uninterrupted access to business-critical enterprise and consumer applications.

  • Ensure effective performance and 24x7 availability of the production database systems
  • Design, automate and document operational processes, tasks, and configuration management
  • Lead efforts on performance tuning, scaling, and benchmarking the data services infrastructure
  • Work closely with performance engineers and core product engineers on a myriad of topics
  • Contribute to automation, such as configuration automation using Chef and launching infrastructure using Terraform and in-house tooling, and automate any other repetitive tasks
  • Track resource usage trends and take preventative actions to restore full health
  • Monitor security and database operation related alerts, and take preventive or corrective action to resolve issues
  • Participate in on-call rotation and occasional off-hour activities

Minimum Required Knowledge, Skills, Abilities, and Qualities:

  • 5+ years of experience managing MySQL / Percona Server 5.7 / 8, Aurora at scale
  • 2+ years of experience using AWS/GCP or any other cloud provider
  • 1+ years of experience managing Vitess in production
  • 2+ years of experience automating systems and infrastructure using Terraform
  • Proficient in using and developing Chef cookbooks and recipes to manage configuration
  • Proficient in a Linux environment, including Linux internals and tuning
  • Experience as a first responder for the data tier on a high-traffic site
  • Experience working in AWS (EC2 / EBS / S3 Snapshots / Aurora / RDS)
  • Identify with: security conscious, self-motivated, accountable, collaborative, reliable, and a team player
  • Proficiency in automating administrative tasks using Ruby, Python, Shell, Ansible, or Go
  • Community tech blogging / open source project contributions a plus

  • Industry:
    Information and Communication Technology