October 31st, 2013

0xdata and Yelp – Machine Learning for Relevance and Serendipity/Distributed Gradient Boosting

Join us and Yelp for a chat on Machine Learning, and make sure not to miss Sri's lightning talk on Distributed Gradient Boosting!

Main Talk: Machine Learning for Relevance and Serendipity
Speaker: Aria Haghighi (Prismatic)
Abstract: 
Careful use of well-designed machine learning systems can transform products by providing highly personalized user experiences. Unlike hand-tuned or heuristic-based personalization systems, machine learning allows for the use of millions of different potential indicators when making a decision, and is robust to many types of noise. In this talk, I will discuss our deeply integrated use of machine learning and natural language processing for content discovery at Prismatic. Our real-time personalization engine is designed to give our users not just the content they expect, but also a healthy dose of targeted serendipity, all based on relevance models learned from users’ interactions with the site. We use sophisticated machine learning techniques to classify stories by topic, determine story similarity, make topic suggestions, rate the value of different social connections, and ultimately determine the relevance of a particular story for a particular user. I will describe our personalized relevance model in detail, starting with our problem formulation, then discussing feature design, model design, evaluation metrics, and our experimental setup, which allows quick offline prototyping without forcing users into the role of guinea pig. Our model’s combination of social cues, topical classification, publisher information, and analysis of the user’s prior interactions produces highly relevant and often delightfully serendipitous content for our users to consume.
Lightning Talk: Distributed Gradient Boosting
Speaker: SriSatish Ambati (0xdata)
Abstract: 
Boosting is a simple yet powerful technique for learning algorithms. We present a distributed gradient boosting algorithm that is accessible from R, along with a simple API for rolling your own distributed machine learning algorithms for big data.
Tentative Schedule:
6:30-7:00 – socializing
7:00-7:20 – lightning talk
7:20-8:30 – main presentation
8:30-9:00 – socializing
 
Learn more and sign up at http://www.meetup.com/SF-Bayarea-Machine-Learning/events/146775042/?joinFrom=event
