Workshop on Open & Citizen Data Science  (WOCD) Session 1

Schedule Top Page

Time and Date: 10:00 - 12:30 on 20th Sep 2016

Room: C - Veilingzaal

Chair: Thomas Maillart

33000 Opening & Participant Introduction
33001 War is the father of all things: Creative Destruction on Wikipedia, and Implications for Citizen Science [abstract]
Abstract: From the courtroom to the marketplace, the parliament building to the newsroom, some of our most important institutions are adversarial. We expect that placing individuals in conflict will help solve complex problems in information-dense environments, and will drive long-term social evolution through the creation of of new questions to answer. Yet we understand very little about the general principles that make conflict creative. This leaves a major gap in our understanding of human social development, and makes it difficult for us to mitigate the negative effects of conflict while retaining its benefits. To help remedy this, I present a new framework for the quantitative study of creativity and conflict. I apply it to a new study on the relationship between conflict and information creation in the online encyclopedia Wikipedia. Fifteen years of high-resolution records allow to track the long and tumultuous process by which Wikipedia articles are written, re-written, torn-apart and reconstructed. We can see not only how the introduction of new information into an article leads to conflict, but how conflict can often precede the creation of unexpected configurations whose long-term persistence suggests adaptive success. This work contradicts simple accounts that oppose conflict and cooperation, and suggests that popular conflict-suppression mechanisms may actually undermine Wikipedia's success. It provides new evidence for the central role of conflict in creative evolution, new methods to measure and quantify it across a wide variety of systems. For those engaged in the design of systems for citizen science It suggest that the correct management ? and even the selective promotion ? of conflict is crucial to the success of citizen science.
Simon Dedeo
33002 Citizen Science Practices for Computational Social Science Research [abstract]
Abstract: Under the name of Citizen Science, many innovative practices in which volunteers partner up with scientists to pose and answer real-world questions are growing rapidly worldwide. Citizen Science can furnish ready-made solutions with citizens playing an active role. However, this framework is still far from being well established as a standard tool for computational social science research. Here, we present our experience in bridging gap between computational social science and the philosophy underlying Citizen Science, which in our case has taken the form of what we call ?pop-up experiments.? These are non-permanent, highly participatory collective experiments which blend features developed by big data methodologies and behavioral experimental protocols with the ideals of Citizen Science. The main issues to take into account whenever planning experiments of this type are classified, discussed and grouped into three categories: infrastructure, public engagement, and the knowledge return for citizens. We explain the solutions we have implemented, providing practical examples grounded in our own experience in an urban context (Barcelona, Spain). Our aim here is that this work will serve as a guideline for groups willing to adopt and expand such in vivo practices and we hope it opens up the debate regarding the possibilities (and also the limitations) that the Citizen Science framework can offer the study of social phenomena.
Oleguer Sagarra, Mario Gutiérrez-Roig, Isabelle Bonhoure, Josep Perelló
33003 The Open Science Company Manifesto [abstract]
Abstract: This manifesto aims at defining fundamental statements every Company running with the Open Science initiative in mind must respect and promote.
William Durand :
33004 A Decentralized Approach to Dissemination, Retrieval, and Archiving of Data [abstract]
Abstract: Making available and archiving scientific results is for the most part still considered the task of classical publishing companies, despite the fact that classical forms of publishing centered around printed narrative articles no longer seem well-suited in the digital age. In particular, there exist currently no efficient, reliable, and agreed-upon methods for publishing scientific datasets, which have become increasingly important for science. Here we propose to design scientific data publishing as a Web-based bottom-up process, without top-down control of central authorities such as publishing companies. Based on a novel combination of existing concepts and technologies, we present a server network to decentrally store and archive data in the form of nanopublications, an RDF-based format to represent scientific data. We show how this approach allows researchers to publish, retrieve, verify, and recombine datasets of nanopublications in a reliable and trustworthy manner, and we argue that this architecture could be used for the Semantic Web in general. Evaluation of the current small network shows that this system is efficient and reliable.
Tobias Kuehn
33005 Un-conference Breakout Session
33006 Presentations from breakout sessions

Workshop on Open & Citizen Data Science  (WOCD) Session 2

Schedule Top Page

Time and Date: 14:15 - 18:00 on 20th Sep 2016

Room: C - Veilingzaal

Chair: Thomas Maillart

33007 Open Infrastructure for Open Science [abstract]
Abstract: Opening the scientific process for creating knowledge needs opening the access to a number of diverse resources like scientific instruments, scientific data, digital services, software tools, knowledge and expertise, all needed in some form to conduct research. These elements can be regarded as infrastructural resources that are essential inputs to the research process. Making these resources open and shareable require the adoption of standards, the right legal frameworks and license, and clear rules for access. There is also the crucial aspect of defining the appropriate governance and management mechanisms that ensure their long term maintenance and availability. This presentation tackles commoning as the social practice suitable to create systems to manage these shared resources and provides examples in the area of open science. It also identifies some of the current challenges with particular focus on the digital infrastructures.
Sergio Andreozzi
33008 Overview of Citizen Science Models, Practices and Impacts [abstract]
Abstract: Ibercivis (www.ibercivis.es) is the national foundation of Citizen Science in Spain that promotes and supports Citizen Science experiments and studies, delivering services to the communities as well as deploying our own tools and experiences. Our main approach to public engagement is to promote the uptake of Citizen Science tools to enrich research by changing current approaches to scientific challenges and by incorporating knowledge from outside the academia. In the last years, we are facing a boom of Citizen Science practices all around the world. Only from Ibercivis we have deployed more than 50 experiments with over 40 different research groups from different areas of knowledge, reaching over 50.000 volunteers. Our set of experiments include volunteer computing (e.g. simulation of nuclear fusion devices using http://boinc.berkeley.edu), volunteer sensing (e.g. odour nuisances reports http://digitalearthlab.jrc.ec.europa.eu/mygeoss/results3.cfm), volunteer thinking (e.g. stem cells images analysis cellspotting.socientize.eu), participatory experiments (e.g. human behavior in dilemma dilema.ibercivis.es), or collective intelligence projects (e.g. collectivemusicexperiment.eu) among others. One of the main drivers of this scenario is the digitally enabled transformation of the interactions between science and society, facing an unprecedent scale of nature and range of collaborators. We find several models of public engagement in science and contributions occur individually as well as collectively, in all the steps of the scientific workflow. There is a need of having a clear picture of the situation in Europe. Addressing this at national scale, we created the Observatory of Citizen Science in Spain (www.ciencia-ciudadana.es) to monitor the growth of such initiatives, catalogue them and analyze different impacts. Ibercivis was promoted from the BIFI Institute of the University of Zaragoza and includes research institutions and science funders, namely MINECO, CSIC, CIEMAT, Gobierno de Arag?n and Fundaci?n Zaragoza Ciudad del Conocimiento. We are part of the steering committee of the European Citizen Science Association, linking with responsible research and innovation. Since 2012 we coordinate the European project Socientize (www.socientize.eu) which delivered the White Paper on Citizen Science for Europe, referenced as a flagship document for the Citizen Science policy making. In this presentation we will present a significant set of these projects and we will present their outcomes from scientific, educational, political and technological perspectives.
Fermin Serrano, Jesus Clemente, Mari Carmen Ibañez, Eduardo Lostal, Francisco Sanz
33009 Herding Cats in Gentoo Linux [abstract]
Abstract: Between 2001 and 2005, I had the privilege and honour of leading and guiding Gentoo Linux's development teams. That was a period of high growth for Gentoo -- our userbase went from less than a thousand users (about ~750 in the #gentoo IRC channel), to over 1.5 million by 2003. This massive mushrooming of the userbase made its effects felt throughout the project. Due to the nature of the technology and the ethos of the project, Gentoo rapidly started to become the Linux distribution for every kind of situation. Several projects launched within Gentoo -- many of them around supporting different hardware, including: PPC, Sparc, IA64, Arm, and others. Communities formed to help people speaking specific languages (Spanish, French, and Polish were amongst the pioneer communities). Each group had developers within the project, and users interacting with those developers. Interactions (user-user, dev-dev, user-dev) happened in many fora: IRC, email lists, Gentoo Forums, as well as a number of third party online destinations for Linux geeks. Gentoo-fever was all around. We got made fun of: in forums comments everywhere, satirical websites popped up, and of course, our users were die-hard defenders of The Gentoo Way. The challenge of guiding this growth, and considering the perspectives of (at our peak) 250 developers, fell onto the leadership team. As part of that team, I underwent my own growth -- as a developer, a colleague, a friend, and a human being. I would like to tell you my story.
Seemant Kullen
33010 Enhancing Online Community Building & Long-Term Production with Co-Located Events [abstract]
Abstract: Nearly all online communities organize co-located meetings, such as conferences, un-conferences, and hackathons. These events are short, fast-paced, yet they are intended to enable social interactions and fast-circulation of informal knowledge between attendants. There is however a dearth of knowledge on the contribution of co-located events to community enhancement and long-term online production. Here, I study a community of astrophysicists involved in open and reproducible data science. Over the span of data collected (4 years), five co-located meetings were organized. Each meeting triggered contrasted immediate effects regarding collaboration, but all of them had significant long-term enhancing effects on community building and online knowledge production. These results illustrate how punctual co-located meetings change the way contributors engage with their community once they have resumed their routine work online.
Thomas Maillart
33011 Using the Blockchain for Reproducible, Transparent and Trustworthy Science [abstract]
Abstract: Scientific studies are often not reproducible and trustworthy due to false or exaggerated research findings. This current issue creates a bias in which successful studies are often only published through the use of dishonest scientific practices. To approach this issue, a research project has been set up to investigate how the reproducibility and trustworthiness of scientific studies can be improved with the use of the blockchain technology. The blockchain technology has shown adequate results in terms of progressing and logging transactions for the Bitcoin protocol and will be examined within this project for the purpose of logging and tracking preannounced studies. This is the thesis design for the master project ?Using the Blockchain for Reproducible and Trustworthy Science?. First will be explained what the problem statement and context is and how the project is related to other work. Secondly, the research question and the subquestions will be defined, followed by a research methodology that will be used during the research phase. The last section introduces a planning, i.e. a roadmap, that is developed for the project.
Ilias Elmzouri
33012 Un-conference Breakout Session
33013 Presentations from breakout sessions
33014 Wrap-up