Hilary Mason

Hilary Mason
Data Scientist in Residence, Accel Partners

Sessions

Hilary Mason (Accel Partners)
This presentation will review and discuss common data problems encountered with web-sourced data, such as content cleaning, duplicate detection, clustering, and classification and describe the algorithms that work best as the volume of data increases, along with hacks for getting high-quality results as quickly as possible.