Aktuelles – Analysen – Austausch zu Software und Services für die persönliche Literatur- und Wissensorganisation

BibSoup – Für die Organisation offener bibliographischer Daten

In diesen Tagen macht ein weiteres Softwareprojekt zur Literaturverwaltung mit einem Beta-Launch auf sich aufmerksam. BibSoup ist eine britische, von JISC geförderte, OpenSource-Webapplikation der Open Knowledge Foundation. Das Projekt ist insofern besonders interessant, da es sich mit Bibserver nicht nur einem – extra dafür entwickelten – offenen und damit nachutzbaren Softwareframefork für das Teilen bibliographischer Daten bedient, sondern die Daten selbst in einem (fast) neuen freien Format (BibJSON – modifiziertes JSON) organisiert werden, welche den Prinzipien zu offenen bibliographischen Daten zugrundeliegen. Auf der „About“-Website heißt es so poetisch wie pragmatisch:

The BibSoup is our metaphor for an ocean of open bibliographic records represented in BibJSON, and freely available in bulk for reuse.

The use of lightweight technology has allowed us to create a radically new approach towards the collection of Open bibliography. Conventional wisdom would suggest that all bibliographic records should be normalised and validated by a central authority. In BibJSON, we take the view that any Open bibliographic record (with its provenance) is potentially valuable, even though there may be duplicates referring to “the same bibliographic object”. The question of determining whether two records relate to “the same object” is difficult and controversial and BibSoup deliberately avoids this. It consists of a number of collections of bibliography (initially in STM areas) united by a common syntax. It is left to humans and machines to develop annotations and equalities between the components of these collections. Thus, for example, various records for “the same paper” may be found in arXiv, DBLP and possibly even Medline.

The BibSoup approach encourages the contribution of Open bibliography without the overhead of de-duplication at contribution time. We expect that, as it grows, services will develop that help users and maintainers to manage the information. De-duplication into a central repository may be one solution (with the presumed platonic identity of STM bibliographic entries), but we also expect that software based on RDF will allow tools to manage alternative representations of bibliographic data, leaving the choice to the user as to what strategy they take. In short, current STM bibliography is a distributed mess. BibSoup takes this as a starting point and, where the political will and financial support is available, offers methods for tidying this up.

So a BibSoup is just a collection of bibliographic records represented in BibJSON and made available to other people. It may be on one instance of BibServer, in a file, or all of these combined; it is just a matter of scope.


Eine Entwicklung, die es sich uneingeschränkt zu beobachten bzw. auszuprobieren lohnt! Im FAQ-Bereich steht bereits allerlei Informationsmaterial, darunter Videos, bereit.


Matti Stöhr

Filed under: Software & Tools, , , , , , ,

Neueste Tweets

Blogbeiträge zu:


%d Bloggern gefällt das: