Datameer Case Studies Using Big Data Analytics to Create Better Outcomes for Cancer Patients
Edit This Case Study Record
Datameer Logo

Using Big Data Analytics to Create Better Outcomes for Cancer Patients

Datameer
Analytics & Modeling - Big Data Analytics
Infrastructure as a Service (IaaS) - Cloud Computing
Healthcare & Hospitals
Product Research & Development
Clinical Image Analysis
Predictive Maintenance
Data Science Services
Cancer diagnosis is complicated due to the uniqueness of each case, and treatment outcomes vary greatly from patient to patient. DKFZ, the largest biomedical research Institute in Germany, is working to understand the mechanisms of cancer, identify risk factors, and find new ways to prevent people from getting cancer. A key focus of DKFZ’s medical researchers is genomic data research. However, due to the massive volumes of genomic data involved in this research, DKFZ faced huge challenges on the data and analytics front. Their analytic systems were overwhelmed by many petabytes of data, and analyzing an entire patient data set took weeks and even months to complete. These huge bottlenecks greatly slowed research and frustrated staff.
Read More
DKFZ is the largest biomedical research Institute in Germany. They are working to understand the mechanisms of cancer, identify risk factors, and find new ways to prevent people from getting cancer. A key focus of DKFZ’s medical researchers is genomic data research. By analyzing human genomes, they can identify DNA problems which are the root cause of cancer in an individual — information that can be used to personalize cancer treatment — as well as understand the genetic evolution of disease and monitor patient responses to different treatments to better understand their efficacy.
Read More
DKFZ collaborated with Fujitsu to deploy the Fujitsu Prime Flex Integrated System for Hadoop, a powerful and scalable server cluster that uses Datameer’s analytic platform for parallel processing power and incredibly fast, actionable analytics. With Datameer, researchers can analyze the complete, raw genomic data sets of multiple patients in parallel, along with patient data records, detail selection data, and reference genome data. Analysis on the complete, unreduced data set can be performed by block of data and by patient. Using this new parallelization technique the analysis on the complete data set can be completed between five and twenty minutes. By comparisons, the analysis on the greatly reduced data set used to take 24-48 hours to complete.
Read More
DKFZ can now analyze 10 TB of raw data per day – the equivalent of 140 billion records looking at 900,000 thousand positions in each genome.
They can analyze complete data sets in minutes, eliminating the need to reduce data and risk missing out on key insights.
Vastly faster processing enables the DKFZ to more quickly identify specific, optimal cancer therapies for each patient, as well as further their overall research on correlations between cancer and genetics.
Analysis on the complete data set can be completed between five and twenty minutes, compared to the 24-48 hours it used to take.
DKFZ can now analyze 10 TB of raw data per day – the equivalent of 140 billion records looking at 900,000 thousand positions in each genome.
Download PDF Version
test test