Oracle Principal Site Reliability Engineer in Overland Park, Kansas

Design, develop, troubleshoot, and debug software programs for databases, applications, tools, networks, etc.

As a member of the software engineering division, you will take an active role in the definition and evolution of standard practices and procedures. You will be responsible for defining and developing software for tasks associated with developing, designing, and debugging software applications or operating systems.

Work is non-routine and very complex, involving the application of advanced technical/business skills in the area of specialization. Leading contributor individually and as a team member, providing direction and mentoring to others. BS or MS degree or equivalent experience relevant to the functional area. 7 years of software engineering or related experience.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, protected veteran status, or any other characteristic protected by law.

Oracle Corporation - Corporate Architecture Team

Cloud Site Reliability Engineer - Cloud Infrastructure / Linux

We are looking for a highly self-motivated engineer to join our team and improve our internal services on Oracle Cloud Infrastructure. As an SRE, you will be part of a distributed team of motivated members, with a high level of impact on the services delivered and ownership of the services deployed. Management opportunities are possible for those demonstrating aptitude in this area.

Responsibilities

  • Architecture and design: create and improve current build system deployment infrastructure using the latest cloud computing techniques to improve agility, reliability, and observability.

  • Ownership: understand internal development and build processes end-to-end in order to streamline CI/CD pipelines.

  • Migration: move/reimplement existing services to Oracle Cloud Infrastructure in a manner that is secure and leverages the latest cloud services.

  • Troubleshooting: have a deep understanding of our products and dependencies in order to efficiently debug incidents, minimize service disruptions, and restore service when they occur. Identify the root cause of issues so that improvements can be made.

Required Experience and Skills

  • BS degree in CS, EE, or equivalent

  • 5 years running distributed services in production on Linux

  • Experience with Linux distros, open source, GitHub, Jenkins, REST APIs

  • Strong technical background in Linux, Virtualization, Cloud

  • Strong technical background in cloud networking, storage, and performance

  • Strong technical knowledge of Jenkins, Kubernetes, Docker, and containers in general

  • Strong technical knowledge of monitoring and building dashboards (e.g., Kibana, Prometheus, Grafana)

  • Experience with building RPMs and container images

  • Languages: Python, Go, C, Java, Bash

  • Excellent problem solving and analytical skills

  • Excellent written and verbal communication skills

  • Handles hard problems with a positive "can do" attitude

  • Team player and able to work with others at all skill levels

As part of Oracle's employment process, candidates will be required to complete a pre-employment screening process prior to an offer being made. This will involve identity and employment verification, professional references, education verification, and professional qualifications and memberships (if applicable).

Job: Product Development

Organization: Oracle

Title: Principal Site Reliability Engineer

Location: CA,California-Redwood City

Requisition ID: 18001478

Other Locations: United States