Sofie’s Blog
Poster of the talk, mentioning the title, presenter, venue and showing a large bowl of Linguine Bolognese

Data doesn’t lie — but it can mislead: How to ensure integrity of your Machine Learning applications

A meaningful evaluation framework is crucial to the success of any Machine Learning (ML) application, yet structural biases often creep in unknowingly. When this happens, the results may not be as reliable as they initially appear.

This talk identifies common pitfalls and illustrates them with real-world examples from nearly two decades of experience in the data science field. We explore the hidden story behind the performance metrics, moving beyond a single F-measure or accuracy score to delve into the intricacies of the data set and its domain. We discuss how to identify artificial biases in your data and offer strategies for preventing them through rigorous design of your data collection and annotation processes.

Ultimately, this talk provides a list of practical recommendations for building ML projects on solid foundations. Because amid the current AI boom and hype, we urgently need high-quality datasets, meaningful evaluations, and robust algorithms to ensure we are not just building elaborate sandcastles with GPUs.

→  Venue: PyCon Italia (Bologna)

→  Video: Recording of presentation, in raw stream (8:09:01 – 8:55:25)

→  Slides: Speakerdeck