Sr Site Reliability Engineer
Job Description
At Yugabyte, we are on a mission to become the default transactional database for enterprises building cloud-native applications. YugabyteDB is our PostgreSQL-compatible distributed database for cloud-native apps. Resilient, scalable, and flexible, it runs on any cloud and enables developers to become instantly productive using well-known APIs. We are looking for talented and driven people to join us on our ambitious mission and help us build a lasting and impactful company. YugabyteDB is cloud-native by design, has on-demand horizontal scalability, and supports geographical distribution of data using built-in replication. This means that we are well-positioned to meet market demand for geo-distributed, high-scale, high-performance workloads.
Join the Database Revolution at Yugabyte.
Modern applications need a cloud-native database that eliminates tradeoffs and silos. YugabyteDB retains the power and familiarity of PostgreSQL by pairing its trusted API with a precision-engineered, distributed, cloud-native architecture. Even better, it’s 100% open source. Many of the world's leading enterprises are migrating from legacy RDBMSs (like Oracle, SQL Server, and DB2) to YugabyteDB, to meet their mission-critical app demands.
We are looking for strong engineers who exemplify collaboration, teamwork, empathy and like to lead by example. We enjoy working with people who are driven and thrive in a fast-paced startup environment, and who have a strong desire to build an internet-scale, extensible control plane with strong emphasis on simplicity and user experience.
Role and Responsibilities
- Design, develop, test, debug and maintain components of the cloud infrastructure
- Manage operational priorities of the DBaaS
- Establish process for handling and leading response to security new vulnerabilities
- Lead certification efforts from the security perspective
- Participate in penetration testing efforts
- Design and build DBaaS processes for key management, rotation storage, encryption, and password management
- Strong software design and implementation skills in building infrastructure frameworks
- Experience building and operating extensible, scalable resilient data systems
- Working knowledge of Java and Python Experience using public cloud infrastructure (AWS, GCP, or Azure)
- Containerization tooling (Docker, EKS, Kubernetes)
- Infrastructure as Code Tooling (Example: Terraform, Cloudformation, Etc.)
- Configuration Management Tooling (Ansible, Chef, Etc.)
- Automation Scripting (Python preferred)
- Solid understanding of basic systems operations (disk, network etc)
- Willingness and ability to learn new languages and concepts
- 6+ years of relevant experience