Scale AI Case Studies Yuka's Rapid Product Database Expansion with Scale Rapid
Edit This Case Study Record
Scale AI Logo

Yuka's Rapid Product Database Expansion with Scale Rapid

Scale AI
Analytics & Modeling - Machine Learning
Application Infrastructure & Middleware - Event-Driven Application
Food & Beverage
Healthcare & Hospitals
Onsite Human Safety Management
Tamper Detection
System Integration
Yuka, a mobile application that provides health impact information for food products and cosmetics, faced a significant challenge in managing its rapidly growing database. The database, which already contained over 4 million products, was expanding at a rate of approximately 1,200 new products daily. Yuka's small team was unable to manually review each new product added to the platform, a process that often required multiple transcription tasks. The application initially used OCR to scan product images for nutritional information and ingredients, but this process was not always accurate. OCR struggled with images featuring inconsistent lighting, obstructions, or irregular text surfaces. As a result, about 60% of the images submitted to Yuka needed to be outsourced to a human annotator. This was a daunting task for Yuka's small team, especially considering their goal to provide a product's health score within 2-3 hours of its addition to the database.
Read More
Yuka is a mobile application that allows users to scan foods and cosmetics to get the product’s associated health impact. The application uses OCR and machine learning to extract data from uploaded images of the ingredients list and, in the case of food products, the nutritional table. This information is then used to calculate a health score for a given product. Yuka's database is extensive, containing over 4 million products, and is growing rapidly with approximately 1,200 new products added daily. The company aims to provide the health impact of new products in real-time, or within 2-3 hours of a product being added to the database.
Read More
Yuka turned to Scale Rapid to overcome the challenge of quickly and accurately transcribing product information. When OCR failed to achieve a sufficient detection rate on an image, Yuka sent the image to Scale Rapid for manual transcription by a human annotator. Yuka typically sent hundreds or thousands of these images to Scale Rapid each day. Once the transcribed data was returned, Yuka compared the text against their existing dictionary of known ingredients and nutritional information. If the error rate was low enough, the product was integrated into their database and a health rating was calculated. Scale Rapid's ability to handle massive amounts of data within a short period of time was crucial for Yuka. The service consistently provided accurate transcriptions within 2-3 hours of each request, in multiple languages including English, French, German, Italian, and Spanish.
Read More
With the help of Scale Rapid, Yuka has been able to sustainably manage its rapidly growing database while maintaining a high-quality user experience. The transcriptions received from Scale Rapid are easily incorporated into Yuka's application. When new products are added, Yuka automatically sends the necessary data requests to Scale Rapid using the Scale API. Once the transcriptions are returned, they are verified by the application and immediately added to the database. This efficient pipeline allows Yuka to provide users with product health scores in just hours, ensuring the database can continue to grow quickly and sustainably.
Yuka's database is growing rapidly with approximately 1,200 new products added daily.
Scale Rapid consistently provides Yuka with transcriptions within 2-3 hours of each request.
About 60% of the images submitted to Yuka, which could be as many as 500 to 1000 images daily, are outsourced to a human annotator via Scale Rapid.
Download PDF Version
test test