Skip to content

edrevo/spark-ec2-docker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 
 
 
 
 

Repository files navigation

SparkEC2 docker

This is a dockerized version of the Spark-EC2 script, which will allow you to run the script in Windows (without Cygwin) or from any other environment as long as you have Docker.

Installation

The Docker image is part of the Docker Hub (it can be found here) so you only need to pull the image from docker:

docker pull edrevo/spark-ec2-docker

Usage

The hard way

The manual way of running the docker is with the following command-line:

docker run -t -i -v /c/Users/<% PATH TO YOUR SSH KEY %>:/id_rsa -e AWS_ACCESS_KEY_ID="<% YOUR AWS ID%>" -e AWS_SECRET_ACCESS_KEY="<% YOUR AWS SECRET %>" edrevo/spark-ec2-docker <% ARGUMENTS FOR SCRIPT %>

The easy way

In order to simplify the execution, you can download the batch script, edit it to add your SSH key path and place it in your PATH.

Once you have done that, you can start using the spark-ec2 script directly:

.\spark-ec2.bat launch -s 4 -t c3.4xlarge -r eu-west-1 -k test "my-test-cluster"

The only difference is you don't need to pass the SSH key path or configure any AWS environment variables, since that was already configured in the batch file.

About

A dockerized version of the Spark-EC2 script

Resources

Stars

Watchers

Forks

Packages

No packages published