Pioneer novel software systems for the rapidly growing field of data-centric AI. Our tools enable data scientists and engineers across all industries to diagnose and fix issues in their datasets, thus improving the quality of their business's core asset.
Determine how best to leverage new Generative AI advances and infrastructure to build better tools that automatically find and fix issues in datasets. The north star of our company is to use AI to increase the value of data.
Work with text, structured/tabular, and image/video datasets from companies across diverse industries, and set up large-scale modeling infrastructure at a dynamic startup (0 to 1).
What we're looking for
Strong software engineering skills and experience productionizing code/models in cloud environments while maintaining the necessary data/API infrastructure. You should be comfortable with large software systems.
Experience working in MLOps: processing data, deploying and monitoring models, and setting up the necessary infrastructure for Data/AI projects.
Responsibilities
Spin up cloud infrastructure for ML projects spanning text, image, and structured/tabular datasets.
Work on large-scale Data/AI projects with Cleanlab enterprise customers, from inception to production.
Work as an individual-contributor tech lead on an ML team of around 5.
Develop new algorithmic techniques for improving datasets, leveraging the newest foundation model advances.
Qualifications
This is a senior role! Candidates must have at least 5 years of work experience in ML. Schoolwork or general data science work does not count here; you should have tackled hard MLOps challenges at companies dealing with massive datasets, reliable model serving, cloud infrastructure, etc.
Python (pandas, scikit-learn, numpy, Jupyter)
PyTorch/PyTorch Lightning, Hugging Face
Relational databases
AWS
Docker
Git
CI/Testing, e.g. Jenkins
Bonus:
PhD in Machine Learning
Strong research publications or open-source contributions
SageMaker, MLflow, Ray
ELT and data cleaning tools
Cleanlab or other data-centric AI tools
LangChain and LLMOps stack
Learn more about the role and benefits here: https://cleanlab.ai/careers/ml-engineer/