Senior Data Engineer, Analytics

Date: Sep 3, 2022

Location: Tewksbury, MA, US, 01876

Company: Corning

Requisition Number: 54529


Corning is one of the world’s leading innovators in materials science. For more than 160 years, Corning has applied its unparalleled expertise in specialty glass, ceramics, and optical physics to develop products that have created new industries and transformed people’s lives.

Corning succeeds through sustained investment in R&D, a unique combination of material and process innovation, and close collaboration with customers to solve tough technology challenges.

As a leading developer, manufacturer, and global supplier of scientific laboratory products for 100 years, Corning’s Life Sciences segment collaborates with researchers seeking new approaches to increase efficiencies, reduce costs and compress timelines in the drug discovery process. Using unique expertise in the fields of materials science, surface science, optics, biochemistry and biology, the segment provides innovative solutions that improve productivity and enable breakthrough discoveries.


Title: Senior Data Engineer, Analytics

Job Number: 54529

Location: Tewksbury, MA


Scope of Position:

The Senior Data Engineer, Analytics will be responsible for the architecture, implementation, and governance of Corning Life Sciences’ data lake, which supports the division’s centralized analytics platform. You will be joining an exciting, newly formed analytics center of excellence for the Corning Life Sciences division. Our mission is to add value to the business by driving key metrics, analyses, and insights for all areas of the business globally, including, but not limited to, sales, marketing, manufacturing, customer service, and operations.


A successful candidate will have a track record of designing and developing reliable data ingestion pipelines from multiple process and operational data stores using both on-premises and cloud-based technologies. These pipelines will require automated data validation and data profiling, managed under version control, to ensure ongoing resiliency and maintainability of the inbound data flows supporting both business intelligence and advanced analytics projects. Embedded within the business intelligence and data science team, this role will be a key partner in advancing how the division capitalizes on its data.


Day to Day Responsibilities:

  • Design, test, deploy, and maintain production big-data ingestion pipelines using established frameworks, patterns of practice, agile software development, and continuous integration and continuous delivery/deployment (CI/CD) practices, collaborating closely with the advanced analytics platform team
  • Work with cross-organizational data source teams to define data ingestion requirements for structured, unstructured and semi-structured data, pilot their implementation, and ensure user acceptance
  • Define and implement automated validation and profiling capabilities needed to ensure reliable data delivery, using agile software development and CI/CD practices
  • Work with data source teams, domain experts, analysts and data scientists to define and develop data cleansing and data enrichment processes to ensure the final data sets are usable without additional processing effort
  • Actively participate in code reviews and technical information sharing with your team members and the broader software engineering community at Corning
  • Develop and implement data governance processes to support a robust and well documented data lake environment
  • Stay up to date with industry standards and technological advancements that will improve the quality, productivity and performance of your work
  • Provide support in a DevOps environment to monitor tokens, jobs and overall system performance


Travel Requirements:

  • Negligible


Hours of work/work schedule/flex-time:

  • Standard business hours; Monday – Friday.


Required Education:

  • Bachelor's degree in Computer Science, Engineering, Math, Finance, or related discipline


Required Years and Area of Experience:

  • 5+ years of demonstrated production programming proficiency in at least one modern JVM language such as Java, as well as an interpreted programming language such as Python
  • 3+ years of experience developing batch, micro-batch, and streaming ingestion pipelines using high-level Apache Spark APIs (PySpark, SparkR, and Spark SQL)
  • 3+ years of production experience using SQL and DDL
  • 2+ years of DevOps experience with AWS platform services, including S3, EC2, Database Migration Service (DMS), RDS, EMR, Redshift, Lambda, DynamoDB, CloudWatch, and CloudTrail


Required Skills:

  • Strong, hands-on technical familiarity with Apache Spark, S3, Parquet, and Delta Lake architectures, technologies, and tools
  • Expert level proficiency with both traditional relational and polyglot persistence technologies
  • Experience with agile software development and continuous integration and continuous deployment (CI/CD) methodologies, along with supporting tools such as Git (GitLab), Jira, Terraform, and New Relic
  • Strong, hands-on familiarity with notebook environments including Jupyter
  • Proven success in communicating with users, other technical teams, and senior management to collect requirements, describe data modeling decisions and data engineering strategy
  • Excellent organizational skills including prioritization of multiple concurrent projects while still delivering timely and accurate results


Desired Experience / Qualifications / Skills:

  • Prior full-stack app development experience (front-end, back-end, microservices)
  • Familiarity with Oracle, Microsoft SQL Server, SSIS, SSRS data technologies
  • Experience with established enterprise ETL and integration tools, including Informatica and MuleSoft
  • Experience with data sources and integration solutions commonly used in manufacturing, such as PI Integrator and Maximo
  • Familiarity with reporting and analysis tools such as Power BI, Tableau, or SAS JMP


Soft Skills:

  • People Skills:
    • Collaboration and Influencing
    • Communication Skills
    • Drive for Results


  • Business Skills:
    • Ability to see beyond the numbers
    • Business Acumen
    • Creativity
    • Customer Focus
    • Perspective
    • Strategic Agility
  • Functional/Technical Skills:
    • Experience in Python or similar languages is preferred
    • Experience with Microsoft Power BI
    • Proficient with data discovery / statistical analysis tools
    • Proficient with MS and Oracle SQL


We prohibit discrimination on the basis of race, color, gender, age, religion, national origin, sexual orientation, gender identity or expression, disability, veteran status or any other legally protected status.


We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Nearest Major Market: Boston