
Apache Storm

Distributed and fault-tolerant realtime computation
Developer(s): BackType, Twitter
Stable release: 1.0.2 / 10 August 2016
Development status: Active
Written in: Clojure & Java
Operating system: Cross-platform
Type: Distributed stream processing
License: Apache License 2.0
Website: storm.apache.org

Apache Storm is a distributed stream processing computation framework written predominantly in the Clojure programming language. Originally created by Nathan Marz and his team at BackType, the project was open sourced after BackType was acquired by Twitter. It uses custom-created "spouts" and "bolts" to define information sources and manipulations, allowing batch, distributed processing of streaming data. The initial release was on 17 September 2011.

A Storm application is designed as a "topology" in the shape of a directed acyclic graph (DAG), with spouts and bolts acting as the graph vertices. The edges of the graph are called streams and direct data from one node to another. Taken as a whole, the topology acts as a data transformation pipeline. At a superficial level, the general topology structure is similar to a MapReduce job; the main difference is that data is processed in real time rather than in individual batches. Additionally, a Storm topology runs indefinitely until it is killed, while a MapReduce job DAG must eventually end.
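
The topology structure described above can be illustrated with a small plain-Java model. This is a minimal sketch and deliberately does not use the Storm API: the class TopologySketch and its methods addBolt, connect, emit and demo are hypothetical names chosen for this illustration. It assumes tuples are plain strings, that the graph is acyclic, and that the spout is a finite list, whereas a real Storm spout keeps emitting until the topology is killed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.function.Function;

// Plain-Java model of a topology. NOT the Storm API; all names are hypothetical.
final class TopologySketch {
    // Bolt id -> bolt logic: one input tuple in, zero or more tuples out.
    private final Map<String, Function<String, List<String>>> bolts = new LinkedHashMap<>();
    // Edges ("streams"): source vertex id -> ids of the downstream bolts.
    private final Map<String, List<String>> streams = new LinkedHashMap<>();

    TopologySketch addBolt(String id, Function<String, List<String>> logic) {
        bolts.put(id, logic);
        return this;
    }

    // Adds a stream from a spout or bolt to an already registered bolt.
    TopologySketch connect(String from, String to) {
        if (!bolts.containsKey(to)) {
            throw new IllegalArgumentException("unknown bolt: " + to);
        }
        streams.computeIfAbsent(from, k -> new ArrayList<>()).add(to);
        return this;
    }

    // Pushes one tuple emitted by vertex `from` along its outgoing streams.
    // Tuples emitted by a vertex with no outgoing stream end up in `sink`.
    // The recursion terminates only because the graph is assumed acyclic.
    void emit(String from, String tuple, List<String> sink) {
        List<String> targets = streams.get(from);
        if (targets == null) {
            sink.add(tuple);
            return;
        }
        for (String to : targets) {
            for (String out : bolts.get(to).apply(tuple)) {
                emit(to, out, sink);
            }
        }
    }

    // Example pipeline: spout "sentences" -> bolt "split" -> bolt "upper".
    static List<String> demo(List<String> sentences) {
        TopologySketch topology = new TopologySketch()
            .addBolt("split", s -> Arrays.asList(s.split(" ")))
            .addBolt("upper", w -> List.of(w.toUpperCase(Locale.ROOT)))
            .connect("sentences", "split")
            .connect("split", "upper");
        List<String> sink = new ArrayList<>();
        // Stand-in for a spout: a finite list instead of an endless source.
        for (String sentence : sentences) {
            topology.emit("sentences", sentence, sink);
        }
        return sink;
    }

    public static void main(String[] args) {
        System.out.println(demo(List.of("storm runs topologies", "bolts transform tuples")));
    }
}
```

Each tuple is handled as soon as the spout emits it instead of being collected into a batch first, which is the contrast with MapReduce drawn above. In Storm itself the vertices run in parallel across a cluster; the sketch runs them sequentially in one thread to keep the graph structure visible.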

Storm became an Apache Top-Level Project in September 2014, having been in incubation since September 2013.

Apache Storm is released under the Apache License 2.0, a permissive license that allows commercial use. The project uses Git for version control and Atlassian JIRA for issue tracking, as set up under the Apache Incubator program.

Storm is one of dozens of stream processing engines; for a more complete list, see Stream processing. On June 2, 2015, Twitter announced Heron, an event processor that is API-compatible with Storm. Other comparable streaming data engines include Spark Streaming and Flink.


Wikipedia
