Category Archives: OpenSym 2013

A Graphical User Interface for SILK Data Link Discovery Framework

This presentation is part of the WikiSym + OpenSym 2013 program.

Rina Singh; Jan Hidders; Feng Xia; Jialiang Kang

In the field of linked data, interlinking previously unlinked datasets that are available on the linked open cloud is still a big challenge. Silk is one of the tools that allow one to do interlinking between data items within different linked data sources. The main goal of our work is to simplify the process of specifying linking conditions using Silk. Specifying the correct  conditions in the Silk-LSL is a complex task where a helpful interface can make a large difference. In this work, we propose Silk Magic as a useful tool that is capable of guiding the users through the process of specifying linking conditions during the creation of the Silk LSL program and hence simplifies the task of writing Silk-LSL programs. The tool for example allows to display the conditions as an interactive tree, and offers suggestions for class selection conditions and step expressions in path expressions.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

A Preliminary Study on the Effects of Barnstars on Wikipedia Editing

This presentation is part of the WikiSym + OpenSym 2013 program.

Kwan Hui Lim, Amitava Datta and Michael Wise

This paper presents a preliminary study into the awarding of barnstars among Wikipedia editors to better understand their motivations in contributing to Wikipedia articles. We performed a comprehensive web-crawl on the entire Wikipedia and retrieved 21,299 barnstars that were awarded among 14,074 editors. In particular, we found that editors do not reciprocate with (awarding) a barnstar upon receiving one. Also, editors were more active in editing articles before awarding or receiving barnstars.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Interest Classification of Twitter Users using Wikipedia

This presentation is part of the WikiSym + OpenSym 2013 program.

Kwan Hui Lim, Amitava Datta

We present a framework for (automatically) classifying the relative interest of Twitter users using information from Wikipedia. Our proposed framework first uses Wikipedia to automatically classify a user’s celebrity followings into various interest categories, followed by determining the relative interest of the user with a weighting compared to his/her other interests. Our preliminary evaluation on Twitter shows that this framework is able to correctly classify users’ interests and that these users frequently converse about topics that both reflect their (detected) interest and a related real-life event.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Metadata Aggregation at GovData.de – An Experience Report

This presentation is part of the WikiSym + OpenSym 2013 program.

Florian Marienfeld; Ina Schieferdecker; Evanela Lapi; Nikolay Tcholtchev

A key challenge for open data portals is the aggregation of metadata from various data catalogs (on different administrative level or from different application fields) also knownas metadata harvesting.  This paper describes harvesting at the pilot of the German open government portal Gov-Data.de, which is scheduled to become the data portal forall German public administration levels. At the launch of the pilot portal in February 2013, eleven federal, state and local data catalogs where integrated and produced about 2,000 open data sets. In the meantime, the data sets were increased to over 3,100 data sets mainly due to improved harvesting capabilities of the portal. This paper discusses the central meta data schema and experiences with the different harvesting techniques that are in use at Gov-Data.de: CKAN-Harvest, CKAN-API, CSW-Harvest and JSON-Dump.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Video Co-creation in Collaborative Online Communities

This presentation is part of the WikiSym + OpenSym 2013 program.

Andrew Lih

This tutorial session addresses issues of multimedia and video co-creation in a wiki environment, using Wikipedia and Wikimedia Commons as an example.

Did you know that while English Wikipedia has over 4 million articles, only 0.1% of them have video? In 2013, the Wiki Makes Video project was started by Andrew Lih to encourage more video content creation. It was designed to further the work of earlier projects such as Lights Camera Wiki and Kaltura’s open source collaborative video editor work that had been started before 2010, but has stalled in recent years.

We demonstrate the challenges and best practices for video creation within a collaborative community, and how this project has used the idea of “”video patterns”” as an analog to “”programming patterns”” that originally spurred the development of wikis in the 1990s.

Topics include:
– Measuring the quantity and quality of video in Wikipedia’s articles
– How better video can be generated by contributors through teaching visual literacy using video patterns
– Overview of open technical standards being employed in Wikimedia Commons, and how to use them
– Researching and transcoding content from existing video repositories (Internet Archive, Library of Congress, et al)
– What barriers to participation might be, and how to encourage more visual content

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Seamless Sharing in a Seemingly Divided World – A Glimpse of the Challenges Faced by Creative Commons

This presentation is part of the WikiSym + OpenSym 2013 program.

Poorna Mysoor

A broad based adoption of Creative Commons licenses is bound to face challenges. Some of the challenges arise from the way copyright laws in different jurisdictions are designed. Other challenges arise from either the way Creative Commons licenses are structured or due to the underlying policy considerations of information society as a whole.  This paper discusses these challenges and suggests ways of dealing with them.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Temporal Analysis of Activity Patterns of Editors in Collaborative Mapping Project of OpenStreetMap

This presentation is part of the WikiSym + OpenSym 2013 program.

Taha Yasseri, Giovanni Quattrone, Afra Mashhadi

In the recent years wikis have become an attractive platform for social studies of the human behaviour. Containing millions records of edits across the globe, collaborative systems such as Wikipedia have allowed researchers to gain a better understanding of editors participation and their activity patterns. However, contributions made to Geo-wikis — wiki-based collaborative mapping projects — differ from systems such as Wikipedia in a fundamental way due to spatial dimension of the content that limits the contributors to a set of those who posses local knowledge about a specic area and therefore cross-platform studies and comparisons are required to build a comprehensive image of online open collaboration phenomena. In this work, we study the temporal behavioural pattern of OpenStreetMap editors, a successful example of geo-wiki, for two European capital cities. We categorise dierent type of temporal patterns and report on the historical trend within a period of 7 years of the project age. We also draw a comparison with the previously observed editing activity patterns of Wikipedia.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

User Evolvable Tools in the Web

This presentation is part of the WikiSym + OpenSym 2013 program.

Jens Lincke; Robert Hirschfeld

Self-supporting development environments like Smalltalk and Emacs can be used to evolve the environment itself from within very direct way. In Web-based software development environments users can collaborate in creating software without having to install the environment locally. Bringing these two together and making Web-based environments self-supportive is challenging, since users have to take care not to break the system, as they are not the only ones using it.

In our previous work, we showed how context-oriented programming (COP) can be used to develop such a system completely from within itself. COP allows for dynamically adapting the base system and scope the changes as needed.
Environments aimed at end-users usually provide a scripting level above the base system. In our system, the Lively Kernel, the core is developed using modules and classes, and users create active content by direct manipulating and scripting objects on top of it. By leveraging the scripting level for the development of tools themselves, we allow users to adapt their tools, without the need to learn and adapt to the concepts of the core level.

In this paper we show how the development tools in Lively are collaboratively evolved. The publishing of objects in a shared repository of parts creates a feeling of extreme open source, as tools can be directly inspected, adapted, and republished while they are being used.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

The Empirical Commit Frequency Distribution of Open Source Projects

This presentation is part of the WikiSym + OpenSym 2013 program.

Carsten Kolassa; Dirk Riehle; Michel Salim

A fundamental unit of work in programming is the code contribution (“”commit””) that a developer makes to the code base of the project in work. An author’s commit frequency describes how often that author commits. Knowing the distribution of all commit frequencies is a fundamental part of understanding software development processes. This paper presents a detailed quantitative analysis of commit frequencies in open-source software development. The analysis is based on a large sample of open source projects, and presents the overall distribution of commit frequencies. We analyze the data to show the differences between authors and projects by project size we also includes a comparison of successful and non successful projects and we derive an activity indicator from these analyses. By measuring a fundamental dimension of programming we help improve software development tools and our understanding of software development. We also validate that some fundamental assumptions about software development.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.

Are Memory Institutions Ready for Open Data and Crowdsourcing? Results of a pilot survey from Switzerland

This presentation is part of the WikiSym + OpenSym 2013 program.

Beat Estermann

Since the advent of the World Wide Web, the cultural heritage sector has undergone a series of changes. In a pilot survey among memory institutions (galleries, libraries, archives, museums) in Switzerland we have focused on two recent trends – open data and crowdsourcing – asking to what extent heritage institutions are ready to adopt open data policies and to embrace crowdsourcing strategies. The results suggest that so far, only very few institutions have adopted an open data policy. There are, however, signs that this may soon change: a majority of the surveyed institutions considers open data as important and believes that the opportunities prevail over the risks. Some obstacles, however, still need to be overcome, in particular the institutions’ reservations with regard to “free” licensing and their fear of losing control. With regard to crowdsourcing the data suggest that the adoption process will be slower than for open data. Although approximately 10% of the responding institutions seem already to experiment with crowdsourcing, there is no general breakthrough in sight, as a majority of respondents remain sceptical with regard to the benefits.

A PDF file will be made available on August 5, 2013, through the WikiSym + OpenSym 2013 conference proceedings.