Principal Data Engineer
(3+ years exp)Altis Labs
Job Type
Full TimeVisa Sponsorship
Not AvailableRemote Work Policy
Onsite or remoteHires remotely
Relocation
AllowedSkills
The Role
About Altis Labs
Altis Labs is the computational imaging company advancing precision medicine with AI.
Legacy imaging interpretation methods have confined researchers to slow, risky, and expensive drug development requiring more than $2 billion and 10 years to get a new cancer treatment to patients in need.
We believe that medical imaging is the richest data modality to generate clinical insight. Scientists use our AI-powered software platform, Nota, to accelerate clinical research by more accurately measuring the effect of novel treatments. Trained on over 222 million images with associated clinical information, our deep learning models hosted on Nota predict clinically meaningful outcomes.
Our multi-disciplinary team of machine learning scientists, engineers, clinicians, biostatisticians, and business operators is on a mission to help get the most effective treatment to patients sooner.
Founded in 2019, Altis is a venture-backed AI company headquartered in Toronto.
We are actively growing our team in Canada and the US across functional areas. We are open to candidates who prefer to work remotely, in our Toronto offices, or a hybrid version of the two.
About the Position
Altis is seeking an experienced Senior Data Engineer who will play a crucial role on our team. He/she should bring multiple years of industry experience building scalable, production-grade data services products that power machine learning (ML) applications. The successful candidate will be responsible for building data infrastructure and services that operationalizes petabytes of medical imaging data and associated tabular clinical data. The data service will enable our team to query data, train computer vision models, and deploy models in a scalable way. He/she ideally has experience working with clinical and oncology data and will work closely with our software engineering team, ML scientists, and external stakeholders.
Responsibilities & Expectations:
Build and maintain a scalable data service to enable AI model training and deployment
Collaborate closely with our expert team of ML scientists, software engineers, and clinicians
Present your work to team members, clients, and at conferences
Qualifications:
- 3+ years of building scalable, production-grade imaging data services products that power machine learning (ML) applications.
- 3+ years of working with large electronic medical records (EMR), DICOM, or other clinical data
- Demonstrated experience with ETL processes, data modeling, and data warehousing
- Knowledge of SQL, RDBMS, and NoSQL databases, including design, implementation, and optimization
- Strong programing skills in languages including Python and C++
- Experience with cloud based frameworks including AWS, GCP or Azure and architecting cloud solutions
- Experience with Git, or equivalent version control system
- Experience with distributed computing
- Experience with test-driven development and common software test frameworks
- Excellent written and verbal communication skills in English
Nice to have:
- Experience in the medical domain including DICOM and other imaging modalities and experience working in the field of oncology
- Experience with Kubernetes, Kubeflow or other associated MLOps tools
- Experience building deep learning models on 3D data, such as in autonomous driving
- Experience using statistical tools such as R
Benefits
- Competitive pay and generous equity participation
- Coverage for medical, vision, and dental insurance
- 4 weeks of vacation per year
- Flexible work organization and access to remote work