Druid (open-source data store)

Druid

Original author(s)	Eric Tschetter, Fangjin Yang
Developer(s)	The Druid community

Stable release	0.9.2 / 1 December 2016
Repository	github.com/druid-io/druid/
Development status	Active
Written in	Java
Operating system	Cross-platform
Type	distributed, real-time, column-oriented data store
License	Apache License 2.0
Website	druid.io

Druid is a column-oriented, open-source, distributed data store written in Java. It is designed to ingest massive quantities of event data quickly and to serve low-latency queries on that data. The name Druid comes from the shapeshifting Druid class found in many role-playing games, reflecting the idea that the system's architecture can shift to solve different types of data problems.

Druid is commonly used in business intelligence/OLAP applications to analyze high volumes of real-time and historical data. Druid is used in production by technology companies such as Alibaba, Airbnb, Cisco, eBay, Netflix, PayPal, and Yahoo.

Druid was started in 2011 to power the analytics product of a company named Metamarkets. The project was open-sourced under the GPL in October 2012 and moved to the Apache License in February 2015.

Over time, a number of organizations and companies have integrated Druid into their backend technology, and committers from many different organizations have joined the project.

In October 2015, the commercial company Imply launched to provide enterprise-level support and professional services for Druid.

Fully deployed, Druid runs as a cluster of specialized processes (called nodes in Druid) that together form a fault-tolerant architecture: data is stored redundantly, and there is no single point of failure. The cluster relies on external dependencies for coordination (Apache ZooKeeper), metadata storage (e.g. MySQL, PostgreSQL, or Derby), and deep storage (e.g. HDFS or Amazon S3) for permanent data backup.
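The idea of redundant storage with no single point of failure can be illustrated with a minimal Python sketch. This is not Druid's API or its actual replication logic: the `Cluster` class, the node names, and the segment names are all hypothetical and exist only for illustration. The sketch checks whether every piece of data is still served after any one node is lost.

```python
from dataclasses import dataclass, field
from typing import Dict, Set


@dataclass
class Cluster:
    """Hypothetical toy model of a cluster; not part of Druid."""

    # Maps a node name to the set of data segments that node serves.
    nodes: Dict[str, Set[str]] = field(default_factory=dict)

    def segments(self) -> Set[str]:
        """Return every segment served by at least one node."""
        return set().union(*self.nodes.values())

    def survives_any_single_failure(self) -> bool:
        """Return True if all segments stay available after any one node fails."""
        all_segments = self.segments()
        for failed in self.nodes:
            remaining: Set[str] = set()
            for name, served in self.nodes.items():
                if name != failed:
                    remaining |= served
            if remaining != all_segments:
                return False
        return True


# Each segment is held by two nodes, so losing one node loses no data.
replicated = Cluster({
    "node-a": {"seg-1", "seg-2"},
    "node-b": {"seg-2", "seg-3"},
    "node-c": {"seg-1", "seg-3"},
})

# Each segment is held by exactly one node, so every node is a single point of failure.
unreplicated = Cluster({
    "node-a": {"seg-1"},
    "node-b": {"seg-2"},
})

print(replicated.survives_any_single_failure())
print(unreplicated.survives_any_single_failure())
```

In this toy model the first cluster tolerates the loss of any one node and the second does not. A real deployment would additionally depend on the coordination, metadata, and deep storage services described above, which the sketch does not model.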

...