Seminar on Data Science by Paul Groth (University of Amsterdam)

Paul Groth's seminar: Thinking about the Making of Data

  • Date: 09 MAY 2019  from 14:30 to 16:30

  • Event location: Aula Affreschi, Via Zamboni 34, Piano Terra

  • Type: Seminars

Paul Groth 
Thinking about the Making of Data

Abstract

(This will be an interactive session. Please do me a favor and come prepared with examples of your favorite datasets and recipes of how it was made.)

A central challenge in our modern information environment is how to use, integrate and repurpose data that stem from a multitude of diverse sources. Within data science, ~60-70% of the time is spent gathering, preparing, integrating, and munging data. In science, there is, for instance, the need to know which of the thousands of prior experimental records are reliable, applicable and can be reused for an experiment. In this talk, I discuss the goal of developing intelligent systems that work with people to combine and reuse data.  I give examples from my work on flexible knowledge graph construction and contextualize this in the context of recent work looking at how scientist search for data. 

Bio:
Paul Groth is Professor of Algorithmic Data Science at the University of Amsterdam where he leads the Intelligent Data Engineering Lab (INDElab). His research focuses on intelligent systems for dealing with large amounts of diverse contextualized knowledge with a particular focus on web and science applications. This includes research in data provenance, data integration and knowledge sharing.
Paul led the design of a number of large scale data integration and knowledge graph construction efforts in the biomedical domain. Paul was co-chair of the W3C Provenance Working Group that created a standard for provenance interchange. He has also contributed to the emergence of community initiatives to build a better scholarly ecosystem including altmetrics and the FAIR data principles.