Datasets through the Lđź‘€king-Glass

About

We are focusing on the data aspects of learning-based methods. Our aim is to build a community of scientists interested in understanding how the data we use affects the algorithms and society as a whole, instead of only optimizing for a performance metric. We draw inspiration from a variety of topics, such as data curation to build datasets, meta-data, shortcuts, fairness, ethics and philosophy in AI.

The best way to get a sense of our topics of interest is to look at our webinar series and the publications discussed during our workshop.

News

Jan 31, 2025 After our in-person workshop, we have written a joint paper In the Picture: Medical Imaging Datasets, Artifacts, and their Living Review. We propose a living review that continuously tracks public datasets and their associated research artifacts across multiple medical imaging applications.
Sep 19, 2024 We had the first edition of our workshop “In the Picture: Medical Imaging Datasets” at Nyborg Strand Hotel (Denmark)! It was focused on challenges within medical imaging datasets that hinder the development of fair and robust AI algorithms. Check the presentations’ slides and videos
Mar 25, 2024 We had the fifth edition of our webinar! This edition was focused on dataset construction and metrics with talk from Hubert Dariusz ZajÄ…c and Natalia-Rozalia Avlona, Dr. Annika Reinke and Alceu Bissoto and Dr. Sandra Avila.
Dec 04, 2023 We had the forth edition of our webinar! This edition was focused on shortcuts and biases with talk from Dr. Enzo Ferrante, Rhys Compton and Lily Zhang.
Sep 18, 2023 We had the third edition of our webinar! This edition was focused on rethinking annotations with talk from Dr. Thijs Kooi and Dr. Andre Pacheco.
Jun 05, 2023 We had the second edition of our webinar! This edition was focused on datasets from medical image challenges with talk from Prof. Amber Simpson, Prof. Esther E. Bron and Prof. Spyridon Bakas.
Feb 27, 2023 We had the first edition of our webinar! This edition was focused on skin lesions datasets with talk from Dr. Roxana Daneshjou, Dr. David Wen and Prof. dr. Colin Fleming.

Selected publications

  1. livingreview.png
    Amelia Jiménez-Sánchez, Natalia-Rozalia Avlona , Sarah Boer , Víctor M. Campello , Aasa Feragen , and 24 more authors
    arXiv preprint arXiv:2501.10727, 2025
  2. dropshortcuts.png
    Ryan Wang , Po-Chih Kuo , Li-Ching Chen , Kenneth Patrick Seastedt , Judy Wawira Gichoya , and 1 more author
    EBioMedicine, 2024
  3. chexmask.png
    Nicolás Gaggion , Candelaria Mosquera , Lucas Mansilla , Julia Mariel Saidman , Martina Aineseder , and 2 more authors
    Scientific Data, 2024
  4. qualityassured.png
    Tim Rädsch , Annika Reinke , Vivienn Weru , Minu D Tizabi , Nicholas Heller , and 3 more authors
    arXiv preprint arXiv:2407.17596, 2024
  5. actionability.png
    Amelia Jiménez-Sánchez, Natalia-Rozalia Avlona , Dovile Juodelyte, Théo Sourget, Caroline Vang-Larsen , and 3 more authors
    In The Thirty-eight Conference on Neural Information Processing Systems Datasets and Benchmarks Track , 2024
  6. source.png
    Dovile Juodelyte, Yucheng Lu, Amelia Jiménez-Sánchez, Sabrina Bottazzi , Enzo Ferrante , and 1 more author
    arXiv preprint arXiv:2403.04484, 2024
  7. citation.png
    Théo Sourget, Ahmet Akkoç , Stinna Winther , Christine Lyngbye Galsgaard , Amelia Jiménez-Sánchez, and 3 more authors
    In Medical Imaging with Deep Learning , 2024
  8. shortcutbias.png
    Imon Banerjee , Kamanasish Bhattacharjee , John L Burns , Hari Trivedi , Saptarshi Purkayastha , and 4 more authors
    Journal of the American College of Radiology, 2023
  9. fastdime.png
    Nina Weng , Paraskevas Pegios , Aasa Feragen , Eike Petersen , and Siavash Bigdeli
    arXiv preprint arXiv:2312.14223, 2023
  10. groundtruthordare.png
    Hubert Dariusz Zając , Natalia Rozalia Avlona , Finn Kensing , Tariq Osman Andersen , and Irina Shklovski
    In Proceedings of the 2023 AAAI/ACM Conference on AI, Ethics, and Society , 2023