Principal Software Engineer, Platform Reliability and DevOps

Date: Jun 17, 2022

Location: Charlotte, NC, US, 28216

Company: Corning

Requisition Number: 49304





Corning is the world leader in specialty glass and ceramics, building and manufacturing keystone components that enable high-technology systems. Corning’s history is filled with breakthrough technologies that have played a meaningful role in the way the world works. We thrive on solving difficult, commercially relevant problems through a creative and collaborative research and development process.


Data, automation and advanced analytics technologies are drastically transforming industrial manufacturers, moving them beyond point process automation to systemic, highly contextualized and data-driven systems. Corning is building the foundational digital infrastructure for these company-wide efforts and is looking for passionate, hard-working, and talented staff-level software engineers who will design that foundation for reuse, velocity and scale.




The Principal Software Engineer, Platform Reliability and DevOps is a key member of our core platform teams and is responsible for continually improving the automation and practices that ensure the overall reliability and availability of our four digital application platforms. The successful candidate will work closely with the developers and product owners for our native cloud, advanced analytics, toolchain and application development platforms, with responsibility for:


  1. defining and establishing world-class DevOps and reliability engineering tools and practices for each of our native and hybrid cloud-based digital platforms,
  2. identifying process and technology gaps in current capability and working with the development teams to address them,
  3. piloting these new capabilities technically, documenting and hardening them for large-scale adoption, and working with technical teams across the company to get them adopted.


Proven, world-class DevOps and SRE skills are foundational to this role, including software engineering, reliability engineering, toolchains, pipelines, runners, containers & container orchestration, automation, infrastructure as code, continuous integration and continuous deployment, application performance monitoring and change management.


Responsibilities:  As our Principal Software Engineer, Platform Reliability and DevOps, your main responsibilities will be to:


  • Design automated processes for implementation, security, monitoring, alerting and operations using Terraform, Ansible, YAML, Python and environment-specific scripting languages
  • Define and drive team efforts to adopt and enforce service level objectives for our platforms and system deployments
  • Work with development teams across the company to help them adopt DevOps and CI/CD practices including automated unit, integration, regression and smoke testing
  • Lead DevOps inner-sourcing efforts by developing and supporting generalized DevOps templates and actively participating in information sharing efforts with the broader developer community
  • Design and build re-usable templates to accomplish specific workflow use cases
  • Continually document all actions and work to define repeatable actions that can be automated
  • Design, implement, and support system monitoring infrastructure
  • Continually improve our monitoring, metrics and automated response capabilities
  • Collaborate with team members to tackle complex technological infrastructure, security, and development problems
  • Provide direct and responsive support for availability incidents and other urgent analytic, development or operational needs
  • Debug production issues across services and at all component levels, and respond to after-hours escalations
  • Use our toolchain service offering for your day-to-day activities and work with the toolchain team to continually improve it


Education, Experience, and Certifications: 

  • Graduate degree in Computer Science or equivalent area of technical study
  • 10 years of hands-on production programming experience with agile software development practices and tools in a variety of languages, such as Python, .NET/C#, Go, Java, JavaScript/Node.js
  • 8-10 years of production site reliability engineering experience across both cloud and on-premises environments
  • AWS Solutions Architect certification within the last 12 months, with a strong preference for Professional-level certification
  • Additional DevOps certifications by one or more major cloud infrastructure providers and/or CNCF-related technology providers within the last 24 months


Technical:

  • Minimum 10 years of infrastructure and application platform automation experience, with at least 8 of those years managing large-scale production environments
  • A demonstrated deep understanding of the AWS Well-Architected Framework
  • A demonstrated deep understanding of Kubernetes and Docker internals
  • A minimum of 5 years hands-on staging and production experience in each of the following areas:
      • Infrastructure and configuration management, including Terraform and Ansible
      • Administering and deploying development CI/CD tools such as GitLab, Jenkins, or Azure DevOps
      • Containers and container orchestration services, especially Docker and Kubernetes
      • Systems administration scripting methods such as Python, Bash or PowerShell
      • Supporting Windows and Linux operating system environments
      • Network architecture, including load-balancing, DNS, BGP, IPSec VPNs


Other Qualifications:


  • Demonstrated systems perspective when analyzing problems, thinking about overall operation, failure modes and how to address these problems proactively
  • A strong sense of the importance of documentation and of not having to learn things twice
  • Ability to work in an agile product team environment and balance a diverse set of stakeholder requests
  • Excellent oral and written communication skills with an ability to break down complex technical systems to help business partners understand the value
  • Strong technical collaboration and communication skills as well as the ability to drive cultural change and adoption of best practices through community participation
  • Ability to collaborate with other teams across the company, defining technology roadmaps, sharing experiences and lessons learned for continual improvement
  • Excellent problem-solving and troubleshooting skills
  • Process-oriented with great documentation skills



What sets us apart? Corning’s unwavering commitment to Diversity. Diversity is integral to Corning’s belief in the fundamental dignity of the individual – one of Corning’s seven Values. We are committed to providing an environment where all employees can thrive. This begins with an understanding that our global workforce consists of a rich mixture of diverse people. This diversity will continue to be a source of our strength as well as a competitive advantage. If you have a passionate belief in the power of innovation to change the world; and if you are up to the challenge of working for a world-class organization that makes real, profitable advanced materials, then visit Corning’s website at



Does this sound like you? Apply today!



We prohibit discrimination on the basis of race, color, gender, age, religion, national origin, sexual orientation, gender identity or expression, disability, veteran status or any other legally protected status.


We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Nearest Major Market: Charlotte