The 7th International Workshop on Dataset PROFIling & Search (PROFILES’20 @ ISWC 2020)
Co-located with The 19th International Semantic Web Conference (ISWC 2020).
Venue: virtual conference.
Date: November 2/3, 2020.
- Workshop paper submissions due: August 17, 2020
- Workshop paper notifications sent: September 11, 2020
- Camera-ready copies due: September 21, 2020
- Publication of workshop proceedings: October 2, 2020
- PROFILES Workshop @ ISWC: November 2/3, 2020 (half-day)
Prof. Felix Neumann from the Hasso Plattner Institute will give a keynote “Data Profiling in the Relational World” at PROFILES 2020!
The Web of Data has seen tremendous growth recently. New forms of structured data have emerged in the form of knowledge graphs, Web markup, such as schema.org, as well as entity-centric data in Web tables. Considering these rich, heterogeneous, and evolving data sources that cover a wide variety of domains, exploitation of Web Data becomes increasingly important in the context of various applications, including dataset search, question answering, and fact verification. These applications require reliable information on dataset characteristics, including general metadata, quality features, statistical data, dynamics, licensing, and provenance. Lack of a thorough understanding of the nature, scope, and characteristics of data from particular sources limits their take-up and reuse, such that applications are often limited and focused on well-known reference datasets.
The goal of the PROFILES’20 workshop is to bring together researchers and practitioners interested in the development of techniques for dataset profiling and deriving quality analytics, as well as performing dataset search and dataset retrieval on the Web while taking dataset profiles into account. We are interested in approaches to analyse, characterise and discover data sources. We aim to discuss technologies addressing data profiling and search – including semantics, information retrieval for Web Data (ranking algorithms and indexing), in particular in the context of decentralised and distributed systems, such as the Web. We want to facilitate a discussion around data search across formats and domain-specific applications.
PROFILES offers a highly interactive forum for researchers and practitioners, bringing together experts in the fields of the Web, Semantic Web, Web Data, Semantic Search, Databases, NLP, IR, and application domains. We envision the workshop as a forum for researchers and practitioners to come together and discuss common challenges and identify synergies for joint initiatives.
The topics of interest of PROFILES’20 include:
- Dataset profile representation (vocabularies, schemas)
- Profiling and assessment of novel forms of entity-centric web data
- Data summarisation
- Data quality analysis for search
- Topic profiling of datasets
- Dataset indexing and profiling approaches
- Novel applications using dataset profiles
- Dataset retrieval
- Search results presentation for datasets
- Semantic dataset search
- Evaluation of dataset search tools and algorithms
- Decentralised and distributed architectures and algorithms in data search
- Fusing, cleaning, ranking and refining dataset search results
- Approaches to personalisation in dataset search
- Scalability and performance of distributed data queries
- Usage of dataset profiles in search
In the PROFILES’20 edition we are also interested in application of dataset profiles in the areas of
- Machine Learning
- Multilingual data
- Historical and contemporary events
We welcome the following types of contributions:
- Short (up to 6 pages) and full (up to 15 pages) research papers
- Poster abstracts and system demonstrations should not exceed 4 pages
All submissions must be written in English and must be formatted according to the Springer LNCS proceedings style. Each submission will be reviewed by at least 3 members of the PC. Papers will be evaluated according to their significance, originality, technical content, style, clarity and relevance to the workshop.
Please submit your contributions electronically in PDF format via the Easychair system (https://easychair.org/conferences/?conf=profiles2020).
The complete set of papers will be published with the CEUR Workshop Proceedings (CEUR-WS.org), listed by the DBLP.
- Elena Demidova (L3S Research Center, Germany)
- Stefan Dietze (GESIS Leibniz Institute for the Social Sciences, Germany)
- John Breslin (National University of Ireland, Galway)
- Simon Gottschalk (L3S Research Center, Germany)
- Maribel Acosta, Karlsruher Institut für Technologie
- Enrico Daga, The Open University
- Sherzod Hakimov, Technische Informationsbibliothek
- Dmitry Mouromtsev, NRU ITMO
- Andreas Nürnberger, Universität Magdeburg
- Bernardo Pereira Nunes, Australian National University
- Anisa Rula, University of Milano–Bicocca
- Nicolas Tempelmeier, L3S Research Center
- Konstantin Todorov, University of Montpellier
- Raquel Trillo-Lado, Universidad de Zaragoza
- Ran Yu, GESIS Leibniz Institute for the Social Sciences