Data Engineer, Analytics

Date: Nov 17, 2021

Location: Tewksbury, MA, US, 01876

Company: Corning

Requisition Number: 50348

 

Corning is one of the world’s leading innovators in materials science. For more than 160 years, Corning has applied its unparalleled expertise in specialty glass, ceramics, and optical physics to develop products that have created new industries and transformed people’s lives.

Corning succeeds through sustained investment in R&D, a unique combination of material and process innovation, and close collaboration with customers to solve tough technology challenges.

As a leading developer, manufacturer, and global supplier of scientific laboratory products for 100 years, Corning’s Life Sciences segment collaborates with researchers seeking new approaches to increase efficiencies, reduce costs and compress timelines in the drug discovery process. Using unique expertise in the fields of materials science, surface science, optics, biochemistry and biology, the segment provides innovative solutions that improve productivity and enable breakthrough discoveries.

Scope of Position:

The Data Engineer, Analytics will be responsible for the development and continuous improvement of Corning Life Sciences’ data lake, which supports the division’s centralized analytics platform. You will be joining an exciting, newly formed analytics center of excellence for the Corning Life Sciences division. Our mission is to add value to the business by driving key metrics, analyses, and insights for all areas of the business globally, including, but not limited to, sales, marketing, manufacturing, customer service, and operations.

 

A successful candidate will have a track record of developing reliable data ingestion pipelines from multiple process and operational data stores using both on-premises and cloud-based technologies. These pipelines will require automated data validation and data profiling, managed under version control, to ensure the ongoing resiliency and maintainability of the inbound data flows that support both business intelligence and advanced analytics projects. Embedded within the business intelligence and data science team, this role will be a key partner in advancing how the division capitalizes on its data.

 

Description of Work:

  • Develop, test, deploy and maintain production big-data ingestion pipelines using established frameworks, patterns of practice, agile software development, and continuous integration and continuous delivery/deployment (CI/CD) practices, collaborating closely with the advanced analytics platform team
  • Work with cross-organizational data source teams to define data ingestion requirements for structured, unstructured and semi-structured data, pilot their implementation, and ensure user acceptance
  • Define and implement automated validation and profiling capabilities needed to ensure reliable data delivery, using agile software development and CI/CD practices
  • Work with data source teams, domain experts, analysts and data scientists to define and develop data cleansing and data enrichment processes to ensure the final data sets are usable without additional processing effort
  • Actively participate in code reviews and technical information sharing with your team members and the broader software engineering community at Corning
  • Support data governance processes to maintain a robust and well-documented data lake environment
  • Stay up to date with industry standards and technological advancements that will improve the quality, productivity, and performance of your work
  • Functional/Technical
    • Proficient with MS and Oracle SQL

 

Experience / Education:

  • Bachelor's degree in Computer Science, Engineering, Math, Finance, or related discipline

 

Required:

  • 3+ years of experience in big data engineering roles, developing and maintaining ETL and ELT pipelines for data warehousing and for on-premises and cloud data lake environments
  • 3+ years of experience with an interpreted programming language such as Python
  • 1+ years of experience developing batch, micro-batch and streaming ingestion pipelines using high-level Apache Spark APIs (PySpark, SparkR, and Spark SQL)
  • Familiarity with the architecture, technologies, and tools of Apache Spark, S3, Parquet, and Delta Lake
  • Experience with agile software development and continuous integration/continuous deployment (CI/CD) methodologies, along with supporting tools such as Git (GitLab), Jira, Terraform, and New Relic
  • Excellent organizational skills including prioritization of multiple concurrent projects while still delivering timely and accurate results

 

Desired:

  • Prior full-stack app development experience (front-end, back-end, microservices)
  • Familiarity with Oracle, Microsoft SQL Server, SSIS, and SSRS data technologies
  • Experience with established enterprise ETL and integration tools such as Informatica and MuleSoft
  • Familiarity with reporting and analysis tools such as Power BI, Tableau, or similar
  • Experience in Python or similar languages 

 

This position does not support immigration sponsorship.

Corning offers a competitive salary, vacation, pension, and family medical leave.

Are you ready to start an exciting career with Corning, Inc.? Apply today!

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, disability status or any other legally protected status.


Nearest Major Market: Boston