Senior Data Engineer @ Amgen

Job Information

Job Description:

  • Determine which tools and architectures are available for pipeline development based on the data source format and current technologies
  • Develop data flow pipelines to extract, transform, and load data from various data sources in various forms, including custom ETL pipelines that enable model and product development
  • Build data products and service processes which perform data transformation, metadata extraction, workload management and error processing management to ensure high quality data
  • Collaborate with Data Scientists to normalize and clean data in order to produce robust and high-quality data assets
  • Support Data Science team with model design by anticipating and engineering features needed to drive model development and insight generation
  • Provide clear documentation for delivered solutions and processes, integrating documentation
  • Ensure that data engineering applications are aligned with the overall architectural and development guidelines
  • Identify and implement internal process improvements for data management (automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability)
  • Partner with business stakeholders and other technical team members to identify and acquire data sources that are most relevant to business needs and goals
  • Demonstrate deep technical and domain knowledge in order to propose potential solutions
  • Mentor junior data engineers on best practices in the industry and in the Amgen data landscape
Basic Qualifications:

Doctorate degree
Master’s degree and 3 years of Information Systems experience
Bachelor’s degree and 5 years of Information Systems experience
Associate degree and 10 years of Information Systems experience
High school diploma / GED and 12 years of Information Systems experience
Preferred Qualifications:
  • Hands-on experience writing SQL using any RDBMS (Redshift, Postgres, MySQL, Teradata, Oracle, etc.).
  • Proficiency in one of the following coding languages: Python, Java, or Scala
  • Experience with Spark, Hive, Kafka, Kinesis, Spark Streaming, and Airflow.
  • Hands-on experience using Databricks/Jupyter or a similar notebook environment.
Preferred Qualifications:
  • Experience working with teams of data scientists, software engineers and business experts to drive insights
  • Strong understanding of relevant data standards and industry trends
  • Ability to understand new business requirements and prioritize them for delivery
  • Experience working in the biopharma/life sciences industry
  • Experience with Schema Design & Dimensional data modeling.
  • Experience with AWS Services such as EC2, S3, Redshift/Spectrum, Glue, Athena, RDS, Lambda, and API gateway.
  • Experience with software DevOps CI/CD tools, such as Git, Jenkins, Linux, and Shell Script
  • Experience working in an agile environment (e.g. user stories, iterative development, etc.)
  • Experience working with test-driven development and software test automation
  • Experience working in a Product Team environment

Experience Level: Senior
Work From: Remote from Region

