Position Overview:
As a Database Reliability Engineer Lead, you will be defining SRE best practices being the point person for management. You will maintain operational coverage of Databases residing on our cloud AWS platforms, with a focus on Health and Availability. Develop automation tools to streamline operations and provide vital support to our engineering teams. Additionally, improving automation, scale, process improvement, metric collection, security, and visibility into non-production and production databases. Leveraging various DevOps approaches which includes but not limited to CI/CD processes, working closely with development teams to ensure a manageable and secure migration of change into the production databases.
Essential Job Functions:
- Ensuring the availability, scalability, and performance of database systems
- Analyzing and implementing best practices for database performance, reliability, and scalability.
- Providing database expertise to engineering and SRE teams
- Develop and maintain database infrastructure that supports thousands of concurrent users & manage databases capacity
- Work on the observability of database metrics to achieve operational objectives.
- Create tools and automation to simplify database operations.
- Enabling the development teams with Enterprise platform and CI/CD tools
- Coordinate with Development, QA, and Platform teams
- Monitoring environment to identify opportunities for improvement.
- Other activities as may be assigned by your manager
Qualifications/ Requirements:
- Bachelor’s degree or equivalent combination of education and experience
- 8+ years cumulative experience with relational databases in production environments – with 60% Admin 40% Development efforts preferred
- Proficiency in data modeling, structure design, and SQL, with deep knowledge of PostgreSQL internals.
- Experience in shell scripting and one scripting language (python preferred)
- Enterprise Architecture Experience preferred
- Strong Experience with AWS Cloud Database services such as Aurora, RDS-SQL, DynamoDb, mySQL
- Experience with CI/CD Processes and Helm chart deployments preferred.
- Knowledge of Windows and Linux operating systems
- Experience working in Azure DevOps- Git Repo preferred
- Experience with Monitoring tools, Dynatrace, troubleshooting issues and be the escalation point preferred
- Strong Network Management skills
- Excellent written and verbal communication skills
- Knowledge of development methodologies
- Experience working in Jira ticketing system
- Ability to prioritize
- The employee may be required to report to a different local office as a normal, contemplated, and mandated incident of their employment
Working Conditions:
- Office environment with frequent computer, mouse, keyboard use
- Alternating between sitting or standing as needed
- Hearing, talking, reaching, grasping
Note: This job description is not intended to be all inclusive or exclusive. At any time, employees may perform other related duties as required to meet the ongoing needs of the organization and participate in additional trainings.