About Semantic CorA

The project "A Virtual Research Environment for the History of Education based on a Semantic Wiki Technology (Semantic MediaWiki for Collaborative Corpora Analysis: Semantic CorA)" develops a virtual research environment (VRE) based on Semantic MediaWiki (SMW) for the collaborative analysis of large digitized data corpora, and embeds it sustainably, as an exemplar, in the research community of the history of education. The project also aims to let researchers share their enrichments and analyses and, in the long term, to make Semantic CorA available as infrastructure to other disciplines.

Because it has a concrete need for collaborative means of analysing pedagogical reference books, the history of education offers a good starting point for realizing an exemplary virtual research environment. Well-established cooperation exists in this community among researchers, librarians and technicians. Such collaborations have, for instance, led to several digitization projects and a substantial body of research data for this domain. Semantic CorA permits the integration of digitized documents along with their bibliographic metadata, collaborative quantitative and qualitative analysis, and the connection of linked data with practical research in the digital humanities. Libraries will be able to integrate the products of their digitization projects (primary data) into professional discourse and to generate added scientific value by semantically linking digitized resources with analytic results, as well as by enabling integrated archiving.

Semantic CorA ties in with concrete research projects in the history of education that aim at discourse and field analyses of pedagogical reference works. Dictionaries from Scripta Paedagogica Online (SPO) (1774-1942), hosted by the Library for Research on Educational History (BBF) at the German Institute for International Educational Research (DIPF), are integrated. SPO indexes references to relevant pedagogical works at the level of articles and makes them accessible online as image files. The corpus started with 25 lexica and a total of nearly 22,000 articles. The researchers have since extended the corpus to more than 80 lexica.

The VRE is designed in a participatory way: the researchers are consulted and empowered to take an active part in development. Development is thereby adjusted to the research process, and an agile approach with step-by-step open-source releases is realized.


Motivation

Interrelation between communities in the design of Semantic CorA

Semantic CorA focuses on the social sciences and humanities and establishes VREs in the research community of historical research in education. Collaborative work should be possible in the maintenance and analysis of research data, with special attention paid to the re-use of research data at the beginning and at the end of the research process. Semantic CorA therefore relies on Semantic MediaWiki, which ensures a certain degree of interoperability for newly created data through its RDF export feature (and other export formats such as CSV and JSON).

Our project aims to connect three different communities:

  • researchers in the social sciences and humanities
  • developers
  • digital libraries.

Semantic MediaWiki

Semantic MediaWiki was chosen because it is a lightweight system with a broad community. As its development is open source, the results of our project, in the form of extensions and forms, can easily be reused and adapted by others. As Semantic CorA does not aim at developing a (technically) new VRE, the modular system architecture of MediaWiki (and thus of Semantic MediaWiki) offers a basis which can be adapted to the actual needs. Semantic CorA aims at the management and analysis of large corpora but clearly does not claim to be a large-scale solution for the totality of research fields in the humanities, as TextGrid, for example, does. The RDF support of Semantic MediaWiki, with its promising interoperability, is therefore a fundamental criterion: it makes it possible to reuse the new data in other RDF-based systems. This interoperability at the data level makes it possible to think in "smaller scales" in the context of VREs.

Furthermore, wikis are well known as a tool for collaborative work on the web. Even though there are some usability problems, for example regarding the syntax, using a wiki system means using software that is already familiar. Editorial tasks in a wiki system are far less trivial for non-technical users than is often assumed, but a basic familiarity with wiki systems can be presupposed.

As another positive effect in Semantic CorA, we observed that after some time the users were able to construct their own queries and templates in order to gain more information from their data and to interact with it more flexibly. This development is clearly due to the openness of the wiki system, which offers great flexibility, especially compared to most out-of-the-box desktop environments, which define a fixed range of possibilities for analysing data. This made a more qualitative development possible, in which requirements on the system were defined step by step through observation and collaboration.
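To illustrate what such a user-built query can look like, here is a minimal Python sketch that assembles a Semantic MediaWiki ask query and the corresponding URL for the `action=ask` API module. The category `Article`, the properties `Has lexicon` and `Has author`, the lexicon name and the endpoint `https://wiki.example.org/api.php` are hypothetical placeholders, not the actual Semantic CorA data model or address.

```python
from urllib.parse import urlencode


def build_ask_query(conditions, printouts, limit=50):
    """Assemble an SMW ask query: [[condition]]...|?printout|...|limit=N."""
    # Each condition is wrapped in double brackets, as in inline #ask queries.
    parts = ["".join(f"[[{c}]]" for c in conditions)]
    # Each printout statement names a property whose values should be returned.
    parts += [f"?{p}" for p in printouts]
    parts.append(f"limit={limit}")
    return "|".join(parts)


def build_ask_url(api_endpoint, ask_query):
    """Build the request URL for the 'ask' API module with JSON output."""
    params = {"action": "ask", "query": ask_query, "format": "json"}
    return f"{api_endpoint}?{urlencode(params)}"


# Hypothetical category, property and lexicon names, for illustration only.
query = build_ask_query(
    conditions=["Category:Article", "Has lexicon::Example Lexicon"],
    printouts=["Has author"],
    limit=10,
)
url = build_ask_url("https://wiki.example.org/api.php", query)
print(query)
print(url)
```

The sketch only builds the query string and URL; sending the request and evaluating the JSON result are left out, since they depend on the concrete wiki and its access settings.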

Funding

This project is funded by the German Research Foundation (DFG) under the title "Entwicklung einer Virtuellen Forschungsumgebung für die Historische Bildungsforschung mit Semantischer Wiki-Technologie - Semantic MediaWiki for Collaborative Corpora Analysis (INST 367/5-1, INST 5580/1-1)" in the domain of Scientific Library Services and Information Systems (LIS). It is realized in cooperation between the German Institute for International Educational Research (DIPF), the Karlsruhe Institute of Technology (KIT), the Library for Research on Educational History (BBF), and researchers in the history of education, mainly from the Georg-August-University Göttingen. We are grateful to Rudi Studer, Denny Vrandecic, Elena Simperl, Cornelia Veja, Klaus-Peter Horn, Anne Hild, Anna Stisser, Benedikt Kämpgen, Martin Wünsch, Sabine Liebmann, Stefan Cramme, and Gwen Schulte for actively supporting our endeavor.