Practical Problems and Solutions for Machine Learning

Hilary Mason (Accel Partners)

This presentation will review and discuss common data problems encountered with web-sourced data, such as content cleaning, duplicate detection, clustering, and classification and describe the algorithms that work best as the volume of data increases, along with hacks for getting high-quality results as quickly as possible.

Photo of Hilary Mason

Hilary Mason

Accel Partners