Storm

Estimated reading time: 5 minutes

Apache Storm is a free and open source distributed realtime computation system.

GitHub repo: https://github.com/31z4/storm-docker

Library reference

This content is imported from the official Docker Library docs, and is provided by the original uploader. You can view the Docker Store page for this repo at https://store.docker.com/images/storm.

Supported tags and respective Dockerfile links

For detailed information about the published artifacts of each of the above supported tags (image metadata, transfer size, etc), please see the repos/storm directory in the docker-library/repo-info GitHub repo.

For more information about this image and its history, please see the relevant manifest file (library/storm). This image is updated via pull requests to the docker-library/official-images GitHub repo.

What is Apache Storm?

Apache Storm is a distributed computation framework written predominantly in the Clojure programming language. Originally created by Nathan Marz and team at BackType, the project was open sourced after being acquired by Twitter. It uses custom created “spouts” and “bolts” to define information sources and manipulations to allow batch, distributed processing of streaming data. The initial release was on 17 September 2011.

wikipedia.org/wiki/Storm_(event_processor)

logo

How to use this image

Running topologies in local mode

Assuming you have topology.jar in the current directory.

$ docker run -it -v $(pwd)/topology.jar:/topology.jar storm storm jar /topology.jar org.apache.storm.starter.ExclamationTopology

Setting up a minimal Storm cluster

  1. Apache Zookeeper is a must for running a Storm cluster. Start it first. Since the Zookeeper “fails fast” it’s better to always restart it.

    $ docker run -d --restart always --name some-zookeeper zookeeper
    
  2. The Nimbus daemon has to be connected with the Zookeeper. It’s also a “fail fast” system.

    $ docker run -d --restart always --name some-nimbus --link some-zookeeper:zookeeper storm storm nimbus
    
  3. Finally start a single Supervisor node. It will talk to the Nimbus and Zookeeper.

    $ docker run -d --restart always --name supervisor --link some-zookeeper:zookeeper --link some-nimbus:nimbus storm storm supervisor
    
  4. Now you can submit a topology to our cluster.

    $ docker run --link some-nimbus:nimbus -it --rm -v $(pwd)/topology.jar:/topology.jar storm storm jar /topology.jar org.apache.storm.starter.WordCountTopology topology
    
  5. Optionally, you can start the Storm UI.

    $ docker run -d -p 8080:8080 --restart always --name ui --link some-nimbus:nimbus storm storm ui
    

… via docker-compose

Example docker-compose.yml for storm:

version: '2'
services:
    zookeeper:
        image: zookeeper
        container_name: zookeeper
        restart: always

    nimbus:
        image: storm
        container_name: nimbus
        command: storm nimbus
        depends_on:
            - zookeeper
        links:
            - zookeeper
        restart: always
        ports:
            - 6627:6627

    supervisor:
        image: storm
        container_name: supervisor
        command: storm supervisor
        depends_on:
            - nimbus
            - zookeeper
        links:
            - nimbus
            - zookeeper
        restart: always

Run docker-compose up and wait for it to initialize completely. The Nimbus will be available at your host and port 6627.

Configuration

This image uses default configuration of the Apache Storm. There are two main ways to change it.

  1. Using command line arguments.

    $ docker run -d --restart always --name nimbus storm storm nimbus -c storm.zookeeper.servers='["zookeeper"]'
    
  2. Assuming you have storm.yaml in the current directory you can mount it as a volume.

    $ docker run -it -v $(pwd)/storm.yaml:/conf/storm.yaml storm storm nimbus
    

Logging

This image uses default logging configuration. All logs go to the /logs directory by default.

Data persistence

No data are persisted by default. For convenience there are /data and /logs directories in the image owned by storm user. Use them accordingly to persist data and logs using volumes.

$ docker run -it -v /logs -v /data storm storm nimbus

Please be noticed that using paths other than those predefined is likely to cause permission denied errors. It’s because for security reasons the Storm is running under the non-root storm user.

License

View license information for the software contained in this image.

Supported Docker versions

This image is officially supported on Docker version 17.04.0-ce.

Support for older versions (down to 1.6) is provided on a best-effort basis.

Please see the Docker installation documentation for details on how to upgrade your Docker daemon.

User Feedback

Issues

If you have any problems with or questions about this image, please contact us through a GitHub issue. If the issue is related to a CVE, please check for a cve-tracker issue on the official-images repository first.

You can also reach many of the official image maintainers via the #docker-library IRC channel on Freenode.

Contributing

You are invited to contribute new features, fixes, or updates, large or small; we are always thrilled to receive pull requests, and do our best to process them as fast as we can.

Before you start to code, we recommend discussing your plans through a GitHub issue, especially for more ambitious contributions. This gives other contributors a chance to point you in the right direction, give you feedback on your design, and help you find out if someone else is working on the same thing.

Documentation

Documentation for this image is stored in the storm/ directory of the docker-library/docs GitHub repo. Be sure to familiarize yourself with the repository’s README.md file before attempting a pull request.

library, sample, Storm