Submissions/So, you have data. Now, what do you do?

From Wikimania

This is an accepted submission for Wikimania 2017.


Submission no. 1012 - TWD
Title
So, you have data. Now, what do you do?
Type of submission
lecture
Author of the submission
Harmonia Amanda
Language of presentation
English OR French, as the organizers want (will make the material in both languages either way)
E-mail address
harmonia.amanda@gmail.com
Username
Harmonia Amanda
Country of origin
France
Affiliation
none
Abstract

English: Many people are coming to Wikidata with one topic of interest, or want to import a specific database. I have worked on very different topics these last few years, with very different datasets to import (from sled dog racing to books editions, and include Broadway musicals…). When the topic is really undercovered on Wikidata, it seems like an impossible task to take on. I want to explain, step by step, how you can go from "there is some data on the web" to "Wikidata has really good coverage about this". I will talk about :

  • how to create a Wikiproject
  • how to adapt an existing ontology to your need or create a new one
  • how to ask for properties
  • how to go from whatever your original dataset is to something Wikidata-compatible; how to break your original dataset in clean parts to import it, in several steps
  • how to find specifics tools and gadgets to do each step
  • how to create maintenance SPARQL queries
  • how to check for consistency
  • how to ask for help
  • how to use your shiny new data so that more people will volunteer to help you

I will do a quick presentation of several tools, which have really helped me, but not as a workshop on how to exactly use these tools but as an example of how to think about dealing with the data. The key objective is in the way we think. It doesn't matter if the tool already exists or not, if it doesn't, someone can help ; what matter is being able to break the always complex problem of importing very different datasets in Wikidata in manageable steps and knowing either how to do or who to ask for help for each step.

French : De nombreuses personnes arrivent sur Wikidata avec un sujet d'intérêt, ou veulent importer une base de donnée précise. J'ai travaillé sur des domaines très différents ces dernières années, et importé des jeux de données très différents également (allant des courses de chiens de traîneau aux éditions de livres en passant par les comédies musicales de Broadway). Quand le sujet est très peu traité sur Wikidata, la tâche semble impossible. Je souhaite expliquer comment passer de "il y a des données sur le Web" à "Wikidata couvre bien ce sujet" étape par étape. Je vais parler de :

  • comment créer un Wikiproject
  • comment adapter une ontologie existante ou en créer une nouvelle
  • comment demander de nouvelles propriétés
  • comment transformer le jeu de données d'origine en quelque chose de compatible avec Wikidata ; comment le diviser en parties importables proprement, par étapes
  • comment trouver les gadgets et outils spécialisés pour chaque étape
  • comment créer des requêtes SPARQL pour la maintenance
  • comment vérifier la cohérence des données
  • comment demander de l'aide
  • comment utiliser toutes ces nouvelles données pour les gens aient envie de vous aider

Je ferai une présentation rapide de plusieurs outils qui m'ont vraiment aidée, mais non pas comme un atelier qui montrerait comment les utiliser précisément mais plutôt comme des exemples sur la manière de penser la façon de gérer les données. L'objectif principal est dans la manière de réfléchir. Que l'outil existe ou pas est en réalité secondaire, il peut être créé ; ce qui importe, c'est de savoir comment réduire le toujours très complexe problème de l'import de jeux de données divers en étapes faisables, soit en sachant le faire soi-même soit en sachant comment demander de l'aide.

What will attendees take away from this session?
attendees should learn new skills and understand better how Wikidata work, to solve more easily complex data import problems
Theme of presentation
Technology, Interface & Infrastructure
For workshops and discussions, what level is the intended audience?
intermediate to advanced; attendees should already know what a database is and how Wikidata very roughly works (understanding all the basic help pages is needed, but no more than that)
Length of session (if other than 25 minutes, specify how long)
25 minutes
Will you attend Wikimania if your submission is not accepted?
if I have the scholarship, yes
Slides or further information (optional)
Special requests
Is this Submission a Draft or Final?

Interested attendees

If you are interested in attending this session, please sign with your username below. This will help reviewers to decide which sessions are of high interest. Sign with a hash and four tildes. (# ~~~~).

  1. Daniel Mietchen (talk) 00:14, 11 April 2017 (UTC)[reply]
  2. Tpt (talk) 12:06, 11 April 2017 (UTC)[reply]
  3. VIGNERON (talk) 18:58, 11 April 2017 (UTC)[reply]
  4. Saguaromelee (talk) 13:08, 12 April 2017 (UTC)[reply]
  5. Amir É. Aharoni (talk) 13:43, 12 April 2017 (UTC)[reply]
  6. Birgit Müller (WMDE) (talk) 18:30, 12 April 2017 (UTC)[reply]
  7. John Andersson (WMSE) (talk) 18:49, 12 April 2017 (UTC)[reply]
  8. Stefanos Stefanos (talk) 02:49, 15 July 2017 (UTC)[reply]
  9. Camelia (talk) 10:41, 26 July 2017 (UTC)[reply]
  10. Beat Estermann (talk) 08:46, 28 July 2017 (UTC)[reply]
  11. Mlemusrojas (talk) 14:37, 1 August 2017 (UTC)[reply]
  12. Peaceray (talk) 13:29, 11 August 2017 (UTC)[reply]
  13. Ouvrard (talk) 21:04, 11 August 2017
  14. NickK (talk) 13:39, 12 August 2017 (UTC)[reply]