Our client is tackling some of the most critical challenges in healthcare AI. Generative AI (GenAI) solutions offer significant potential but remain difficult to evaluate and control. From biased outputs to inaccurate information and protected health information (PHI) leakage, healthcare systems face growing risks.
Current approaches fail to address core safety and compliance issues, whether they are slow electronic health record (EHR) vendor rollouts, outright bans on commercial AI tools, or limited point solutions. With increasing regulatory mandates such as the 2023 Executive Order on AI and oversight from CMS and the Office for Civil Rights (OCR), health systems need robust solutions to ensure AI safety and accountability.
About the Role
As a Senior Data Engineer, you will play a pivotal role in building scalable ETL pipelines, integrating healthcare data systems, and ensuring data standardization. You will develop infrastructure that automates workflows and supports real-time data ingestion from FHIR endpoints. While deep AI expertise is not required, familiarity with AI/ML in data engineering contexts is a plus.
This is a hands-on, generalist data engineering role ideal for someone who thrives in fast-paced, ambiguous environments.
Key Responsibilities
Design, build, and scale ETL pipelines to source data from FHIR endpoints.
Develop standardized data transformation patterns for repeatability across multiple healthcare systems.
Automate data workflows and integrate with health systems like Epic.
Implement Infrastructure-as-Code (IaC) for scalable deployments.
Collaborate with AI/ML teams to support data engineering needs for model training.
Ensure data accuracy, security, and compliance with healthcare regulations.
Qualifications
4+ years of experience in data engineering with healthcare data.
Proficient in Python for building ETL pipelines.
Familiarity with healthcare data standards such as FHIR and HL7 v2.
Hands-on experience with cloud platforms (Azure preferred; GCP or AWS also acceptable).
Experience with Snowflake and Databricks is a plus.
Proven track record of scaling ETL pipelines in production environments.
Experience with Infrastructure-as-Code (IaC) and CI/CD pipelines.
AI/ML exposure is a bonus.
Team Structure & Culture
Small but growing team, currently a full-time data integration specialist and a contractor.
Fast-paced startup environment: candidates should be adaptable and comfortable with ambiguity.
Passion for healthtech is essential: candidates must be able to articulate why they want to contribute to this space.
Work Environment & Benefits
Free breakfast, lunch, and dinner.
Access to Equinox and onsite personal training sessions.
Onsite car washing services.
Silicon Valley coworking space close to major tech companies.
Why Join?
Work on AI-driven healthcare solutions with real-world impact.
Gain broad technical exposure across data engineering, infrastructure, and AI/ML.
Thrive in a fast-paced, high-growth startup environment.