A fast-growing healthtech startup at the intersection of machine learning and scientific research is looking for a
Data Engineer
to help scale our data infrastructure.
What You’ll Be Working On
- Develop and maintain scalable data pipelines to process scientific literature
- Work closely with ML and backend engineers to transform unstructured text into structured datasets
- Optimize performance and reliability of pipelines to handle large-scale data ingestion (targeting millions of entries)
- Support the integration of processed data into machine learning models
- Collaborate with scientists and product teams to ensure data accuracy and usability
What We’re Looking For
Must-Haves:
- Solid Python development skills
- Experience building and maintaining data pipelines
- Familiarity with working in fast-paced, startup-like environments
- Strong understanding of data processing and performance optimisation
Nice-to-Haves:
- Exposure to scientific or research data
- Experience with machine learning workflows or pipelines
- Interest in health, biology, or scientific domains