In the third in a series of blogs, Anandraj Jagadeesan talks us through downloading Apache Spark on Windows 10 using the new Ubuntu environment.
Apache Spark is a fast, general-purpose cluster computing system.
It provides high-level APIs in Java, Scala, Python, and R, and an optimized engine that supports general execution graphs.
It also supports a rich set of higher-level tools, including Spark SQL for SQL and structured data processing, MLlib for machine learning, GraphX for graph processing, and Spark Streaming for stream processing.