August 26th, 2014

Introducing H2O Lagrange ( to R

RSS icon RSS Category: Uncategorized
Fallback Featured Image

From my perspective the most important event that happened atuseR! 2014 was that I got to meetthe 0xdata team and now, long story short,here I am introducing the latest version of H2O, labeledLagrange (,to the R and greater data science communities. Beforejoining 0xdata, I was working at a competitor on a rival project and wasrepeatedly asked why my generalized linear model analytic didn’t run as fast asH2O’s GLM. The answer then as it is now is the same – becauseH2O has a cutting edge distributed in-memory parallel computingarchitecture – but I no longer receive an electric shock every time I say so.

For those hearing about H2O for the first time, it is an open-sourcedistributed in-memory data analysis tool designed for extremely large data setsand the H2O Lagrange ( release provides scalable solutionsfor the followinganalysis techniques:

In my first blog post at 0xdata, I wanted to keep it simple and make sure Rusers know how to get the h2o package, which is cross-referenced on theHigh-Performance and Parallel ComputingandMachine and Statistical LearningCRAN Task Views, up and running on theircomputers. To so do, open an R console of your choice and type

# Download, install, and initialize the H2O package
                 repos = c("", getOption("repos")))
localH2O <- h2o.init()
# List and run some demos to see H2O at work
demo(package = "h2o")

After you are done experimenting with the demos in R, you can open up a webbrowser to http://localhost:54321/ to give the H2O web interface aonce over and then hop over to0xdata’s YouTube channel for somein-depth talks.
Over the coming weeks we at 0xdata will continue toblog about how to use H2Othrough R and other interfaces. If there is a particular use case you would liketo see addressed, join ourh2ostream Google Groupsconversation or e-mail us at Until then, happy analyzing.
Related Blogs

Leave a Reply

AI-Driven Predictive Maintenance with H2O Hybrid Cloud

According to a study conducted by Wall Street Journal, unplanned downtime costs industrial manufacturers an

August 2, 2021 - by Parul Pandey
What are we buying today?

Note: this is a guest blog post by Shrinidhi Narasimhan. It’s 2021 and recommendation engines are

July 5, 2021 - by Rohan Rao
The Emergence of Automated Machine Learning in Industry

This post was originally published by K-Tech, Centre of Excellence for Data Science and AI,

June 30, 2021 - by Parul Pandey
What does it take to win a Kaggle competition? Let’s hear it from the winner himself.

In this series of interviews, I present the stories of established Data Scientists and Kaggle

June 14, 2021 - by Parul Pandey
Snowflake on
H2O Integrates with Snowflake Snowpark/Java UDFs: How to better leverage the Snowflake Data Marketplace and deploy In-Database

One of the goals of machine learning is to find unknown predictive features, even hidden

June 9, 2021 - by Eric Gudgion
Getting the best out of’s academic program

“ provides impressively scalable implementations of many of the important machine learning tools in a

May 19, 2021 - by Ana Visneski and Jo-Fai Chow

Start your 14-day free trial today