August 28th, 2013

Hands on Workshop: Hack Data With Math

RSS icon RSS Category: Uncategorized
Fallback Featured Image

Thursday night (August 29) at 7, resident math hacker Spencer A. is leading a hands on workshop on using H2O to analyze real-world data. For those of you who are new to the math side of H2O, we have notes below to help you get prepared.

H2O is a distributed math platform featuring a set of analytical tools that can be accessed through an HTML based UI or through R. It’s built to handle really big data sets by analyzing subsets of big data in parallel. If you would like to read more about our performance, I highly recommend looking at Spence’s blog post on performance: http://0xdata.com/blog/2013/08/rf_on_mnist/
You can access instructions and resources for running H2O from your laptop here: http://docs.0xdata.com/quickstart/quickstart_jar.html, and you can download the .jar file here: http://0xdata.com/h2O/. There are tutorials on the documents page as well, if you are inclined to take a deeper look at some of the algorithms we will be working with.
Specifically, Spence will be walking through the famous MNIST data set and Allstate data similar to that presented in a recent Kaggle competition. If you would like to download those data sets before you get here, you can find all of the information you need at: http://www.meetup.com/H2Omeetup/events/124513412/
For the MNIST data, we will walk through the data to develop a better intuitive understand of the Random Forest (RF) algorithm. Generalized linear modeling (GLM) and it’s parallelization will be discussed through a hands on analysis of the Allstate data. By the end of the workshop you will be prepared to analyze and interpret data of your own.

Leave a Reply

What are we buying today?

Note: this is a guest blog post by Shrinidhi Narasimhan. It’s 2021 and recommendation engines are

July 5, 2021 - by Rohan Rao
The Emergence of Automated Machine Learning in Industry

This post was originally published by K-Tech, Centre of Excellence for Data Science and AI,

June 30, 2021 - by Parul Pandey
What does it take to win a Kaggle competition? Let’s hear it from the winner himself.

In this series of interviews, I present the stories of established Data Scientists and Kaggle

June 14, 2021 - by Parul Pandey
Snowflake on H2O.ai
H2O Integrates with Snowflake Snowpark/Java UDFs: How to better leverage the Snowflake Data Marketplace and deploy In-Database

One of the goals of machine learning is to find unknown predictive features, even hidden

June 9, 2021 - by Eric Gudgion
Getting the best out of H2O.ai’s academic program

“H2O.ai provides impressively scalable implementations of many of the important machine learning tools in a

May 19, 2021 - by Ana Visneski and Jo-Fai Chow
Regístrese para su prueba gratuita y podrá explorar H2O AI Hybrid Cloud

Recientemente, lanzamos nuestra prueba gratuita de 14 días de H2O AI Hybrid Cloud, lo que

May 17, 2021 - by Ana Visneski and Jo-Fai Chow

Start your 14-day free trial today