ClickHouse
Case Studies
ClickHouse: Powering Darwinium's Security and Fraud Analytics
Overview
ClickHouse: Powering Darwinium's Security and Fraud AnalyticsClickHouse |
Application Infrastructure & Middleware - Database Management & Storage Infrastructure as a Service (IaaS) - Cloud Databases | |
National Security & Defense Oil & Gas | |
Logistics & Transportation Product Research & Development | |
Cybersecurity Fraud Detection | |
Cloud Planning, Design & Implementation Services System Integration | |
Operational Impact
ClickHouse's mutable engine simplified the data ingestion pipeline complexity, and its choice of table engine led to simpler data pipelines. It also supported complex data types, which was a fundamental requirement for Darwinium to build upon for interactive analytics. ClickHouse's recent addition of JSON type support further met the analytics requirements of Darwinium. ClickHouse also supported the concept of tumbling data retention windows, where “hot” data could be initially placed on a fast access medium like a local SSD, with the ability to subsequently move the data to a relatively slower but cheaper storage system like S3. ClickHouse was also cloud-native, and could be run from a low-end laptop, an on-premise cluster of nodes, or on any of the myriad hardware configurations possible even on a single cloud provider like AWS. This made ClickHouse a cloud native offering for Darwinium use cases. | |
Quantitative Benefit
ClickHouse could easily handle a few thousand writes per second, with multiple data pipeline writers writing to a single ClickHouse server at any given instant. | |
ClickHouse server side metrics showed a consistent and very low (<5%) user space and system space core usage while the write operations alone were being executed. | |
ClickHouse can utilise all of the cores available on a single node that it is running on to process a query. | |