---
layout: "guides"
page_title: "Running Apache Spark on Nomad"
sidebar_current: "guides-analytical-workloads-spark-intro"
description: |-
  Learn how to run Apache Spark on a Nomad cluster.
---
# Running Apache Spark on Nomad
Apache Spark is a popular data processing engine/framework that has been
architected to use third-party schedulers. The Nomad ecosystem includes a
[fork of Apache Spark](https://github.com/hashicorp/nomad-spark) that natively
integrates Nomad as a cluster manager and scheduler for Spark. When running on
Nomad, the Spark executors that run Spark tasks for your application, and
optionally the application driver itself, run as Nomad tasks in a Nomad job.
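
As a sketch of what this looks like in practice, an application can be submitted with the fork's `spark-submit` using `nomad` as the master. The URLs below are placeholders, and the exact `spark.nomad.*` properties available depend on the version of the fork:

```shell
# Submit the SparkPi example to a Nomad cluster. With --deploy-mode
# cluster, the driver as well as the executors run as Nomad tasks.
spark-submit \
  --class org.apache.spark.examples.SparkPi \
  --master nomad \
  --deploy-mode cluster \
  --conf spark.executor.instances=4 \
  --conf spark.nomad.sparkDistribution=https://example.com/spark-dist.tgz \
  https://example.com/spark-examples.jar 100
```

The `spark.nomad.sparkDistribution` setting points Nomad clients at a Spark distribution to download, since the cluster nodes are not assumed to have Spark preinstalled.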
## Next Steps
The links in the sidebar contain detailed information about specific aspects of
the integration, beginning with [Getting Started](/guides/spark/pre.html).