Search Button
RSS icon Sort by:
1st Place Winner’s Blog – Kaggle 2021 Data Science and Machine Learning Survey
by Jo-fai Chow January 4, 2022 Data Journalism Data Science Kaggle

Kaggle, the largest global community of data scientists, conducted the 5th annual industry-wide survey that presented a truly comprehensive view of the state of data science and machine learning. A total of 25,973 responses were collected from participants from over 60 countries. Kaggle also launched the Data Science Survey Challenge in which the goal was […]

Read More
AI/ML Projects — Don’t get stymied in the last mile
by Saurabh Kumar May 3, 2019 Community Data Journalism Data Science Demos H2O Driverless AI

Data Scientists build AI/ML models from data, and then deploy it to production – in addition to a plethora of tasks around data insights, data cleansing etc., Part of the Data Scientist job description/requirement is making models available for transparency, auditability as well as explainability for both regulators as well as internal business use. While […]

Read More
Building AI/ML models on Lending Club Data, with H2O.ai — Part 2
by Saurabh Kumar April 15, 2019 AutoML Data Journalism Data Science H2O Driverless AI

In Part 1 of this series earlier, we looked at how to download data from Lending Club using Jupyter/Python and create a training and test data set, after dropping some target leakage cols. The data preparation code to create the data sets for classification is available in GitHub at: https://git.io/fjTqb In this blog post, we […]

Read More
Building AI/ML models on Lending Club Data, with H2O.ai — Part 1
by Saurabh Kumar March 28, 2019 Beginners Community Data Journalism Data Science Technical Posts Tutorials

Lending Club publishes its basic loan databases to the public and a full version to its customers — anonymized of course. You can find the download page from this link (screenshot below): The publicly downloadable loan data has various attributes — roughly 150+ columns that have categorical, numeric, text and date fields. It also has a ‘loan_status’ text column […]

Read More
What Business Leaders Need to Know About AI
by Saurabh Kumar January 11, 2019 Beginners Community Data Journalism Data Science

The interest around artificial intelligence (AI) is at an all-time fevered pitch right now, and it’s important to understand why. AI can solve real business problems and address very complex situations. Organizations and business leaders should start with the idea of how AI can help by identifying a business problem or use case that they […]

Read More
Launching the Academic Program … OR … What Made My First Four Weeks at H2O.ai so Special!
Launching the Academic Program … OR … What Made My First Four Weeks at H2O.ai so Special!
by Conrad October 30, 2018 Academic Program AutoML Community Data Journalism Data Science H2O Driverless AI H2O Release H2O World Use Cases

We just launched the H2O.ai Academic Program at our sold-out H2O World London. With nearly 1000 people in attendance, we received the first online sign-up forms submitted by professors and students alike. This program will massively democratize AI in academia, increasing the number of AI-skilled graduates – with both technical and business degrees. A short […]

Read More
How This AI Tool Breathes New Life Into Data Science
How This AI Tool Breathes New Life Into Data Science
by Saurabh Kumar October 16, 2018 Beginners Data Journalism Data Science Deep Learning Driverless Explainable AI GPU H2O Driverless AI Machine Learning NLP Python R Technical

Ask any data scientist in your workplace. Any Data Science Supervised Learning ML/AI project will go through many steps and iterations before it can be put in production. Starting with the question of “Are we solving for a regression or classification problem?” Data Collection & Curation Are there Outliers? What is the Distribution? What do […]

Read More
prediction_comparisons
Using Sentiment Analysis to Measure Election Surprise
by h2oai December 1, 2016 Data Journalism

Sentiment Analysis is a powerful Natural Language Processing technique that can be used to compute and quantify the emotions associated with a body of text. One of the reasons that Sentiment Analysis is so powerful is because its results are easy to interpret and can give you a big-picture metric for your dataset. One recent […]

Read More
code
Creating a Binary Classifier to Sort Trump vs. Clinton Tweets Using NLP
by h2oai October 17, 2016 Community Data Journalism Flow Python

The problem: Can we determine if a tweet came from the Donald Trump Twitter account (@realDonaldTrump) or the Hillary Clinton Twitter account (@HillaryClinton) using text analysis and Natural Language Processing (NLP) alone? The Solution: Yes! We’ll divide this tutorial into three parts, the first on how to gather the necessary data, the second on data […]

Read More
chart
When is the Best Time to Look for Apartments on Craigslist?
by h2oai October 6, 2016 Data Journalism

A while ago I was looking for an apartment in San Francisco. There are a lot of problems with finding housing in San Francisco, mostly stemming from the fierce competition. I was checking Craigslist every single day. It still took me (and my girlfriend) a few months to find a place — and we had […]

Read More
1 2