A fast-growing healthtech startup developing solutions to help healthcare organizations safely implement generative AI. Their platform ensures compliance, enhances model reliability, and provides ongoing monitoring to support responsible AI adoption.
THE ROLE
As a Data Engineer, you will design and optimize scalable ETL pipelines, standardize data processing workflows, and integrate healthcare data across various platforms. You'll play a key role in automating data ingestion, enhancing system reliability, and ensuring high-quality datasets for AI applications. This position requires problem-solving ability, adaptability, and a strong interest in the intersection of healthcare and AI.
ROLE RESPONSIBILITIES
Develop and maintain ETL pipelines that extract data from FHIR endpoints.
Build and enhance data ingestion processes to streamline healthcare system integration.
Automate data workflows for seamless interaction with Epic and other EHR platforms.
Establish repeatable data transformation patterns to ensure consistency across systems.
Deploy Infrastructure-as-Code (IaC) solutions for efficient scaling.
Work closely with AI/ML teams to provide clean, structured data for model development.
SKILLS & EXPERIENCE
4+ years of experience working with healthcare datasets.
Strong programming skills in Python.
Hands-on experience with FHIR & HL7v2 data standards.
Cloud expertise in Azure (preferred), GCP, or AWS.
Experience working with Snowflake & Databricks is a plus.
Proven ability to scale ETL pipelines in live production environments.
Familiarity with Infrastructure-as-Code (IaC) and CI/CD pipelines.
Exposure to AI/ML data engineering is a plus, particularly in model training workflows.
BENEFITS
$165,000 - $185,000 base salary + equity.
Flexible remote or hybrid work model.
Opportunity to shape the future of AI governance in healthcare.
If you're a data engineer passionate about AI in healthcare, we'd love to connect!