open-nomad/website/source/guides/spark/spark.html.md

1.0 KiB

layout page_title sidebar_current description
guides Running Apache Spark on Nomad guides-spark-spark Learn how to run Apache Spark on a Nomad cluster.

Running Apache Spark on Nomad

Nomad is well-suited for analytical workloads, given its performance characteristics and first-class support for batch scheduling. Apache Spark is a popular data processing engine/framework that has been architected to use third-party schedulers. The Nomad ecosystem includes a fork of Apache Spark that natively integrates Nomad as a cluster manager and scheduler for Spark. When running on Nomad, the Spark executors that run Spark tasks for your application, and optionally the application driver itself, run as Nomad tasks in a Nomad job.

Next Steps

The links in the sidebar contain detailed information about specific aspects of the integration, beginning with Getting Started.