June 25th, 2018

H2O-3 on FfDL: Bringing deep learning and machine learning closer together

Category: Community, Deep Learning, H2O, Technical

This post originally appeared on the IBM Developer blog.
This post is co-authored by Animesh Singh, Nicholas Png, Tommy Li, and Vinod Iyengar.
Deep learning frameworks like TensorFlow, PyTorch, Caffe, MXNet, and Chainer have reduced the effort and skills needed to train and use deep learning models. But for AI developers and data scientists, it’s still a challenge to set up and use these frameworks in a consistent manner for distributed model training and serving.
The open source Fabric for Deep Learning (FfDL) project provides a consistent way for AI developers and data scientists to use deep learning as a service on Kubernetes and to use Jupyter notebooks to execute distributed deep learning training for models written with these multiple frameworks.
Now, FfDL is announcing a new addition that brings together that deep learning training capability with state-of-the-art machine learning methods.

Augment deep learning with best-of-breed machine learning capabilities

For anyone who wants to try machine learning algorithms with FfDL, we are excited to introduce H2O.ai as the newest member of the FfDL stack. H2O-3 is H2O.ai’s open source, in-memory, distributed, and scalable machine learning and predictive analytics platform, which enables you to build machine learning models on big data. H2O-3 offers an expansive library of algorithms, such as Distributed Random Forest, XGBoost, and Stacked Ensembles, as well as AutoML, a powerful tool for users with less experience in data science and machine learning.
After data cleansing, or “munging,” one of the most fundamental parts of training a powerful and predictive model is tuning it properly. Deep neural networks, for example, are notoriously difficult for a non-expert to tune. This is where AutoML becomes an extremely valuable tool. It provides an intuitive interface that automates the process of training a large number of candidate models and selecting the highest-performing model based on the user’s preferred scoring method.
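To make the AutoML workflow concrete, here is a minimal sketch using the H2O-3 Python client. It assumes the `h2o` package and a Java runtime are installed, that the data is a CSV file, and that the task is classification. The helper names `automl_settings` and `run_automl`, and the default values, are hypothetical and chosen only for illustration.

```python
from typing import Any, Dict, Optional


def automl_settings(max_models: int = 20,
                    sort_metric: str = "AUTO",
                    seed: int = 1,
                    max_runtime_secs: Optional[int] = None) -> Dict[str, Any]:
    """Collect the H2OAutoML constructor arguments used in this sketch."""
    settings: Dict[str, Any] = {
        "max_models": max_models,    # cap on the number of candidate models
        "sort_metric": sort_metric,  # metric used to rank the leaderboard
        "seed": seed,                # fixed seed for repeatable runs
    }
    if max_runtime_secs is not None:
        settings["max_runtime_secs"] = max_runtime_secs
    return settings


def run_automl(csv_path: str, target: str, **overrides: Any):
    """Train candidate models with AutoML and return (leader, leaderboard)."""
    # Imported lazily so this module loads even where h2o is not installed.
    import h2o
    from h2o.automl import H2OAutoML

    h2o.init()  # start or connect to an H2O cluster
    frame = h2o.import_file(csv_path)
    # Assumption: classification task, so the target is made categorical.
    frame[target] = frame[target].asfactor()

    aml = H2OAutoML(**automl_settings(**overrides))
    # x is omitted, so every column except the target is used as a predictor.
    aml.train(y=target, training_frame=frame)
    return aml.leader, aml.leaderboard
```

Calling `run_automl("train.csv", "label", sort_metric="AUC")` (both the file name and the column name are placeholders) would rank the candidates by AUC and return the top model along with the full leaderboard.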
In combination with FfDL, H2O-3 makes data science highly accessible to users of all levels of experience. You can simply deploy FfDL to your Kubernetes cluster and submit a training job to FfDL. Behind the scenes, FfDL sets up the H2O-3 environment, runs your training job, and streams the training logs so that you can monitor and debug your model. Since FfDL also supports multi-node clusters with H2O-3, you can horizontally scale your H2O-3 training job seamlessly across all your Kubernetes nodes. When model training is complete, you can save your model locally to FfDL or to a cloud object store, where it can be retrieved later to serve inference.
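The final step, saving the trained model, might look like the following in a training script. This is a sketch under stated assumptions: the environment variable name `RESULT_DIR`, the `./models` fallback, and the helper names are illustrative, not taken from this post, so check the FfDL guide for the actual output location. `h2o.save_model` is the H2O-3 Python call for writing a model to disk.

```python
import os
from typing import Mapping, Optional


def resolve_output_dir(env: Optional[Mapping[str, str]] = None,
                       default: str = "./models") -> str:
    """Pick the directory where the trained model should be written.

    RESULT_DIR is an assumed variable name; verify it against your job setup.
    """
    env = os.environ if env is None else env
    # Fall back to the default when the variable is unset or empty.
    return env.get("RESULT_DIR") or default


def save_trained_model(model, output_dir: Optional[str] = None) -> str:
    """Write an H2O model to disk and return the path of the saved file."""
    # Imported lazily so this module loads even where h2o is not installed.
    import h2o

    target = output_dir or resolve_output_dir()
    os.makedirs(target, exist_ok=True)
    # force=True overwrites an existing model file with the same name.
    return h2o.save_model(model=model, path=target, force=True)
```

Note that H2O writes the file from the cluster side, so on a multi-node cluster the path should point to storage the H2O nodes can reach, such as a mounted object store.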

Try H2O on FfDL today!

You can find the details on how to train H2O models on FfDL in the open source FfDL readme file and guide. Deploy, use, and extend them with any of the capabilities that you find helpful. We’re waiting for your feedback and pull requests!

About the Author

Vinod Iyengar

Vinod comes with over 7 years of marketing and data science experience in multiple startups. He was the founding employee at his previous startup, Activehours, where he helped build the product and bootstrap user acquisition with growth hacking. He has seen the user base of his companies grow from scratch to millions of customers. He has built models to score leads, reduce churn, increase conversion, prevent fraud, and address many other use cases. He brings a strong analytical side and a metrics-driven approach to marketing. He is responsible for all of our demand generation and growth efforts. When he is not busy hacking, Vinod loves painting and reading. He is a huge foodie and will eat anything that doesn’t crawl, swim, or move.
