By using this website you agree to our use of cookies. Read’s privacy policy.

The best Machine Learning on Spark

Download Documentation

Combine the fast, scalable machine learning algorithms of H2O with the capabilities of Spark. Drive computation from Scala/R/Python and utilize the H2O Flow UI.


Seamlessly transition between Spark and H2O. Data mining in Spark plus Machine Learning in H2O.


Enjoy MLib support in H2O Flow, run Scala code in Flow and export pipelines as executable java code for easy deployment.


Deep learning, ensembles, GBM, GLM and DRF for accuracy. In memory processing and distributed for speed. R, Python, Flow, Tableau and Excel interfaces.

Get Started

With its elegant APIs, SQL, RDD - along with H2O’s speed, columnar-compression and fully featured machine learning algorithms, Sparkling Water was designed to allow users to get the best of Apache Spark.

Download Sparkling Water

Join the Conversation

Used by over 129,000 data scientists and more than 12,000 organizations around the world

Sign up for the H2O Community Newsletter

Get the latest on products updates, community events and other news.