May 10th, 2013

Big Data Science Practice + Algo Implementation


In this double header we present a practitioner's close view of the science and an engineer's close view of the design and implementation of a distributed algorithm.

Day in the Life of a Data Scientist – Chris Pouliot
In this session, Netflix analytics leader Chris Pouliot shares his experience building a large team of data scientists at Netflix and describes what a typical day in the life of a data scientist looks like. From extracting and exploring data to posing good questions about it and matching those questions with the right algorithms, Chris goes through the lifecycle of data science in practice.
Chris built a central, horizontal team that spans all of the company's business verticals. He shares insights and stories covering the pitfalls, the successes, and the impact the team has had at Netflix.
Distributed Generalized Linear Modeling (GLM) – Tomas Nykodym

In this session, 0xdata engineers Tomas Nykodym and Cliff Click explain how to build a distributed GLM (logistic and Poisson regression). The generalized linear model is one of the most popular tools in a good data scientist's hands. Two powerful mathematical approaches, Stephen Boyd's ADMM and generalized gradients, are analyzed along with implementation choices. A live demo and performance comparisons between the two approaches on Big Data applications will be presented.
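To give a feel for what "distributed" means for a GLM, here is a minimal sketch of logistic regression in which each data chunk computes its own gradient and the partial results are summed. It is not the ADMM or generalized-gradient method from the talk, and it is not 0xdata's implementation. It uses plain gradient descent, and the function names, the toy data, the learning rate, and the iteration count are all assumptions made for illustration.

```python
import math

def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)

def partial_gradient(rows, labels, beta):
    # "Map" step: gradient of the negative log-likelihood on one chunk.
    grad = [0.0] * len(beta)
    for x, y in zip(rows, labels):
        p = sigmoid(sum(b * xi for b, xi in zip(beta, x)))
        for j, xi in enumerate(x):
            grad[j] += (p - y) * xi
    return grad

def combine(grads):
    # "Reduce" step: element-wise sum of the per-chunk gradients.
    return [sum(parts) for parts in zip(*grads)]

def fit_logistic(chunks, n_features, lr=0.1, n_iter=500):
    # Plain gradient descent; lr and n_iter are illustrative, not tuned.
    beta = [0.0] * n_features
    n = sum(len(rows) for rows, _ in chunks)
    for _ in range(n_iter):
        grad = combine([partial_gradient(r, l, beta) for r, l in chunks])
        beta = [b - lr * g / n for b, g in zip(beta, grad)]
    return beta

# Hypothetical toy data split into two chunks, as if held on two nodes.
# The first column is the intercept term.
chunks = [
    ([[1.0, 0.0], [1.0, 1.0], [1.0, 2.0]], [0, 0, 1]),
    ([[1.0, 1.0], [1.0, 2.0], [1.0, 3.0]], [1, 0, 1]),
]
beta = fit_logistic(chunks, n_features=2)
```

Because the gradient is a sum over rows, the chunks never need to exchange raw data, only one vector of length `n_features` per iteration. In this sketch the chunks are processed in a single process; a real system would run the map step on separate nodes.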
