探花合集

Lead Data Engineer - Computational Discovery

Highly Competitive
  1. Contract
  2. Manufacturing
  3. United States
Wilmington, USA
Posting date: 13 Jan 2026
68147

Lead Data Engineer - Computational Discovery - Contract - Wilmington DE

Ready to engineer the future of data in pharma? Join us as a Lead Data Engineer and turn complex challenges into groundbreaking solutions.

探花合集 is seeking a Lead Data Engineer to play a pivotal role in designing, implementing, and maintaining scalable data pipelines and structures.

Primary Responsibilities:

This position focuses on integrating complex scientific datasets into modern cloud architectures, supporting translational research initiatives. You will collaborate with cross-functional teams to ensure robust, scalable, and FAIR-compliant solutions.

Skills & Requirements:

  • Proficiency in Python, including libraries such as Pandas, PySpark, Dask, and SQLAlchemy.
  • Advanced knowledge of SQL and workflow orchestration tools like Airflow, Dagster, or Prefect.
  • Experience with modern cloud architectures (e.g., Azure Fabric, Databricks, Snowflake).
  • Strong understanding of data modeling, ETL processes, and schema design for complex datasets.
  • Expertise in API development for data access.
  • Familiarity with FAIR principles and metadata standards for scientific data.
  • Excellent communication and collaboration skills to bridge IT and scientific teams.
  • Preferred: Knowledge of clinical data standards (e.g., SDTM, ADaM, CDISC) and biomarker data formats (e.g., NGS, flow cytometry, proteomics).

The Lead Data Engineer's responsibilities will be:

  • Act as a hands-on technical lead, defining architecture and coding scalable ETL pipelines and data structures.
  • Oversee the ingestion of complex datasets (e.g., genomics, proteomics, imaging, lab data) into cloud-based data lakes.
  • Lead data engineering projects, designing integration solutions for diverse scientific data sources.
  • Develop automated procedures to normalize unformatted external vendor data into a structured Common Data Model (CDM).
  • Collaborate with research and IT teams to align infrastructure with scientific needs.
  • Architect and implement scalable ETL processes, APIs, and visualization tools for data access.
  • Engage stakeholders to gather requirements and incorporate feedback into designs.
  • Lead user acceptance testing (UAT) to ensure high-quality deliverables.
  • Promote FAIR principles and interoperability across translational and clinical programs.

If you are having difficulty in applying or if you have any questions, please contact Anderson Maldonado at a.maldonado@proclinical.com

If you are interested in applying to this exciting opportunity, then please click 'Apply' or to speak to one of our specialists please request a call back at the top of this page.

探花合集 is a leading life sciences recruiter focused on finding exceptional people and matching them with the finest positions across the globe. 探花合集 is acting as an Employment Agency in relation to this vacancy.

By submitting this application, you confirm that you've read and understood our privacy policy, which informs you how we process and safeguard your data - /privacy-policy

close