Case Studies Stanford Medicine Uses Snorkel to Revolutionize Medical Imaging Data Labeling
Edit This Case Study Record

Stanford Medicine Uses Snorkel to Revolutionize Medical Imaging Data Labeling

Analytics & Modeling - Machine Learning
Analytics & Modeling - Predictive Analytics
Healthcare & Hospitals
Life Sciences
Product Research & Development
Quality Assurance
Automated Disease Diagnosis
Clinical Image Analysis
Remote Patient Monitoring
Data Science Services
System Integration
Labeling training data for triaging models in medical imaging is a time-consuming process, often requiring person-months to person-years of radiologist time. This manual labeling is not only labor-intensive but also prone to human error, which can affect the accuracy and reliability of the models. The challenge was to find a more efficient and accurate method to label large datasets of medical images, which are crucial for developing and training machine learning models for disease diagnosis and patient monitoring.
Read More
Stanford Medicine is a leading academic medical center that integrates research, medical education, and healthcare. Known for its cutting-edge research and innovative approaches to medical challenges, Stanford Medicine collaborates with various institutions to advance the field of medicine. The institution is committed to improving patient care through the development and application of new technologies. In this case, Stanford Medicine partnered with Snorkel to enhance their data labeling processes for medical imaging, aiming to improve the efficiency and accuracy of their machine learning models used in disease diagnosis and patient monitoring.
Read More
Stanford Medicine deployed a cross-modal Snorkel pipeline to automate the labeling of medical imaging datasets. This innovative approach allowed them to replace the traditional manual labeling process, which took person-months to person-years, with a more efficient method that completed the task in just a few hours. The Snorkel pipeline was able to match or even exceed the performance of manually gathered labels, ensuring high accuracy and reliability. This solution not only saved significant time and resources but also improved the overall quality of the labeled data, which is essential for training effective machine learning models.
Read More
The deployment of the Snorkel pipeline significantly reduced the time required for labeling medical imaging datasets, replacing 8 person-months of manual labeling with just a few hours of automated processing.
The solution is currently being tested for deployment in Stanford and Department of Veteran Affairs (VA) hospital systems, indicating its potential for broader application and impact in the healthcare sector.
The automated labeling process ensured high accuracy and reliability, matching or exceeding the performance of manually gathered labels, which is crucial for developing effective machine learning models for disease diagnosis and patient monitoring.
8 Person-months of labeling replaced
94% ROC AUC Performance
50K+ Images labeled in minutes
Download PDF Version
test test