| Digital Libraries as Phenotypes for Digital Societies | | BIBA | Full-Text | 1 | |
| Gary Marchionini | |||
| The research and development community has been actively creating and deploying digital libraries for more than two decades and many digital libraries have become indispensable tools in the daily life of people around the world. Today's digital libraries include interactive multimedia and powerful tools for searching and sharing content and experience. As such, digital libraries are moving beyond personal intellectual prostheses to become much more participative and reflective of social history. Digital libraries not only acquire, preserve, and make available informational objects, but also invite annotation, interaction, and leverage usage patterns to better serve patron needs. These various kinds of usage patterns serve two purposes: first, they serve as context for finding and understanding content, and second, they themselves become content that digital libraries must manage and preserve. Thus, digital library research has expanded beyond technical and informational challenges to consider new opportunities for recommendations, support of affinity groups, social awareness, and cross-cultural understanding, as well as new challenges related to personal and group identity, privacy and trust, and curating and preserving ephemeral interactions. This trend makes digital libraries cultural institutions that reveal and hopefully preserve the phenotypes of societies as they evolve. This talk will illustrate this theoretical perspective with examples from our experience with the Open Video Digital Library over the past decade and with recent extensions (VidArch Project) that harvest YouTube video as a strategy for preserving cultural context for digital collections. | |||
| Curated Databases | | BIBA | Full-Text | 2 | |
| Peter Buneman | |||
| Most of our research and scholarship now depends on curated databases. A curated database is any kind of structured repository such as a traditional database, an ontology or an XML file, that is created and updated with a great deal of human effort. For example, most reference works (dictionaries, encyclopaedias, gazetteers, etc.) that we used to find on the reference shelves of libraries are now curated databases; and because it is now so easy to publish databases on the web, there has been an explosion in the number of new curated databases used in scientific research. Curated databases are of particular importance to digital librarians because the central component of a digital library -- its catalogue or metadata -- is very likely to be a curated database. The value of curated databases lies in the organisation, the annotation and the quality of the data they contain. Like the paper reference works they have replaced, they usually represent the efforts of a dedicated group of people to produce a definitive description of enterprise or some subject area. | |||
| Leveraging the Legacy of Conventional Libraries for Organizing Digital Libraries | | BIBAK | Full-Text | 3-14 | |
| Arash Joorabchi; Abdulhussain E. Mahdi | |||
| With the significant growth in the number of available electronic documents
on the Internet, intranets, and digital libraries, the need for developing
effective methods and systems to index and organize E-documents is felt more
than ever. In this paper we introduce a new method for automatic text
classification for categorizing E-documents by utilizing classification
metadata of books, journals and other library holdings, that already exists in
online catalogues of libraries. The method is based on identifying all
references cited in a given document and, using the classification metadata of
these references as catalogued in a physical library, devising an appropriate
class for the document itself according to a standard library classification
scheme with the help of a weighting mechanism. We have demonstrated the
application of the proposed method and assessed its performance by developing a
prototype classification system for classifying electronic syllabus documents
archived in the Irish National Syllabus Repository according to the well-known
Dewey Decimal Classification (DDC) scheme. Keywords: Digital library organization; text classification; collective
classification; library classification schemes; bibliography | |||
| Annotation Search: The FAST Way | | BIBA | Full-Text | 15-26 | |
| Nicola Ferro | |||
| This paper discusses how annotations can be exploited to develop information access and retrieval algorithms that take them into account. The paper proposes a general framework for developing such algorithms that specifically deals with the problem of accessing and retrieving topical information from annotations and annotated documents. | |||
| wikiSearch -- From Access to Use | | BIBAK | Full-Text | 27-38 | |
| Elaine G. Toms; Lori McCay-Peet; Tayze Mackenzie | |||
| A digital library (DL) facilitates a search workflow process. Yet many DLs
hide much of the user activity involved in the process from the user. In this
research we developed an interface, wikiSearch, to support that process. This
interface flattened the typical multi-page implementation into a single layer
that provided multiple memory aids. The interface was tested by 96 people who
used the system in a laboratory to resolve multiple tasks. Assessment was
through use, usability testing and closed and open perception questions. In
general participants found that the interface enabled them to stay on track
with their task providing a bird's eye view of the events -- queries entered,
pages viewed, and pertinent pages identified. Keywords: Digital libraries; user interface; information task; search; BookBag;
interactivity tools; search workflow | |||
| Adding Quality-Awareness to Evaluate Migration Web-Services and Remote Emulation for Digital Preservation | | BIBA | Full-Text | 39-50 | |
| Christoph Becker; Hannes Kulovits; Michael Kraxner; Riccardo Gottardi; Andreas Rauber; Randolph Welte | |||
| Digital libraries are increasingly relying on distributed services to
support increasingly complex tasks such as retrieval or preservation. While
there is a growing body of services for migrating digital objects into safer
formats to ensure their long-term accessibility, the quality of these services
is often unknown. Moreover, emulation as the major alternative preservation
strategy is often neglected due to the complex setup procedures that are
necessary for testing emulation. However, thorough evaluation of the complete
set of potential strategies in a quantified and repeatable way is considered of
vital importance for trustworthy decision making in digital preservation
planning.
This paper presents a preservation action monitoring infrastructure that combines provider-side service instrumentation and quality measurement of migration web services with remote access to emulation. Tools are monitored during execution, and both their runtime characteristics and the quality of their results are measured transparently. We present the architecture of the presented framework and discuss results from experiments on migration and emulation services. | |||
| Functional Adaptivity for Digital Library Services in e-Infrastructures: The gCube Approach | | BIBA | Full-Text | 51-62 | |
| Fabio Simeoni; Leonardo Candela; David Lievens; Pasquale Pagano; Manuele Simi | |||
| We consider the problem of e-Infrastructures that wish to reconcile the
generality of their services with the bespoke requirements of diverse user
communities. We motivate the requirement of functional adaptivity in the
context of gCube, a service-based system that integrates Grid and Digital
Library technologies to deploy, operate, and monitor Virtual Research
Environments defined over infrastructural resources.
We argue that adaptivity requires mapping service interfaces onto multiple implementations, truly alternative interpretations of the same functionality. We then analyse two design solutions in which the alternative implementations are, respectively, full-fledged services and local components of a single service. We associate the latter with lower development costs and increased binding flexibility, and outline a strategy to deploy them dynamically as the payload of service plugins. The result is an infrastructure in which services exhibit multiple behaviours, know how to select the most appropriate behaviour, and can seamlessly learn new behaviours. | |||
| Managing the Knowledge Creation Process of Large-Scale Evaluation Campaigns | | BIBA | Full-Text | 63-74 | |
| Marco Dussin; Nicola Ferro | |||
| This paper discusses the evolution of large-scale evaluation campaigns and the corresponding evaluation infrastructures needed to carry them out. We present the next challenges for these initiatives and show how digital library systems can play a relevant role in supporting the research conducted in these fora by acting as virtual research environments. | |||
| Hear It Is: Enhancing Rapid Document Browsing with Sound Cues | | BIBA | Full-Text | 75-86 | |
| Parisa Eslambolchilar; George Buchanan; Fernando Loizides | |||
| Document navigation has become increasingly commonplace as the use of electronic documents has grown. Speed-Dependent Automatic Zooming (SDAZ) is one popular method for providing rapid movement within a digital text. However, there is evidence that details of the document are overlooked as the pace of navigation rises. We produced a document reader software where sound is used to complement the visual cues that a user searches for visually. This software was then evaluated in a user study that provides strong supportive evidence that non-visual cues can improve user performance in visual seeking tasks. | |||
| Creating Visualisations for Digital Document Indexing | | BIBAK | Full-Text | 87-93 | |
| Jennifer Pearson; George Buchanan; Harold W. Thimbleby | |||
| Indexes are a well established method of locating information in printed
literature just as find is a popular technique when searching in digital
documents. However, document reader software has seldom adopted the concept of
an index in a systematic manner. This paper describes an implemented system
that not only facilitates user created digital indexes but also uses colour and
size as key factors in their visual presentation. We report a pilot study that
was conducted to test the validity of each visualisation and analyses the
results of both the quantitative analysis and subjective user reviews. Keywords: Document Triage; Indexing; Information Visualisation | |||
| Document Word Clouds: Visualising Web Documents as Tag Clouds to Aid Users in Relevance Decisions | | BIBA | Full-Text | 94-105 | |
| Thomas Gottron | |||
| Information Retrieval systems spend a great effort on determining the significant terms in a document. When, instead, a user is looking at a document he cannot benefit from such information. He has to read the text to understand which words are important. In this paper we take a look at the idea of enhancing the perception of web documents with visualisation techniques borrowed from the tag clouds of Web 2.0. Highlighting the important words in a document by using a larger font size allows to get a quick impression of the relevant concepts in a text. As this process does not depend on a user query it can also be used for explorative search. A user study showed, that already simple TF-IDF values used as notion of word importance helped the users to decide quicker, whether or not a document is relevant to a topic. | |||
| Exploratory Web Searching with Dynamic Taxonomies and Results Clustering | | BIBA | Full-Text | 106-118 | |
| Panagiotis Papadakos; Stella Kopidaki; Nikos Armenatzoglou; Yannis Tzitzikas | |||
| This paper proposes exploiting both explicit and mined metadata for enriching Web searching with exploration services. On-line results clustering is useful for providing users with overviews of the results and thus allowing them to restrict their focus to the desired parts. On the other hand, the various metadata that are available to a WSE (Web Search Engine), e.g. domain/language/date/filetype, are commonly exploited only through the advanced (form-based) search facilities that some WSEs offer (and users rarely use). We propose an approach that combines both kinds of metadata by adopting the interaction paradigm of dynamic taxonomies and faceted exploration. This combination results to an effective, flexible and efficient exploration experience. | |||
| Developing Query Patterns | | BIBA | Full-Text | 119-124 | |
| Panos Constantopoulos; Vicky Dritsou; Eugénie Foustoucos | |||
| Query patterns enable effective information tools and provide guidance to users interested in posing complex questions about objects. Semantically, query patterns represent important questions, while syntactically they impose the correct formulation of queries. In this paper we address the development of query patterns at successive representation layers so as to expose dominant information requirements on one hand, and structures that can support effective user interaction and efficient implementation of query processing on the other. An empirical study for the domain of cultural heritage reveals an initial set of recurrent questions, which are then reduced to a modestly sized set of query patterns. A set of Datalog rules is developed in order to formally define these patterns which are also expressed as SPARQL queries. | |||
| Matching Multi-lingual Subject Vocabularies | | BIBA | Full-Text | 125-137 | |
| Shenghui Wang; Antoine Isaac; Balthasar A. C. Schopman; Stefan Schlobach; Lourens van der Meij | |||
| Most libraries and other cultural heritage institutions use controlled
knowledge organisation systems, such as thesauri, to describe their
collections. Unfortunately, as most of these institutions use different such
systems, unified access to heterogeneous collections is difficult. Things are
even worse in an international context when concepts have labels in different
languages. In order to overcome the multilingual interoperability problem
between European Libraries, extensive work has been done to manually map
concepts from different knowledge organisation systems, which is a tedious and
expensive process.
Within the TELplus project, we developed and evaluated methods to automatically discover these mappings, using different ontology matching techniques. In experiments on major French, English and German subject heading lists Rameau, LCSH and SWD, we show that we can automatically produce mappings of surprisingly good quality, even when using relatively naive translation and matching methods. | |||
| An Empirical Study of User Navigation during Document Triage | | BIBA | Full-Text | 138-149 | |
| Fernando Loizides; George Buchanan | |||
| Document triage is the moment in the information seeking process when the user first decides the relevance of a document to their information need [17]. This paper reports a study of user behaviour during document triage. The study reveals two main findings: first, that there is a small set of common navigational patterns; second, that certain document features strongly influence users' navigation. | |||
| A Visualization Technique for Quality Control of Massive Digitization Programs | | BIBAK | Full-Text | 150-155 | |
| Rodrigo Andrade de Almeida; Pedro Alessio; Alexandre Topol; Pierre Cubaud | |||
| Massive digitization programs need massive visualization techniques for
quality control. We describe the functional prototype of a 3D interactive
environment enabling a rapid inspection of pages conformity for large batches
of digitized books. Keywords: visualization techniques; massive digitization; image control quality | |||
| Improving OCR Accuracy for Classical Critical Editions | | BIBA | Full-Text | 156-167 | |
| Federico Boschetti; Matteo Romanello; Alison Babeu; David Bamman; Gregory Crane | |||
| This paper describes a work-flow designed to populate a digital library of ancient Greek critical editions with highly accurate OCR scanned text. While the most recently available OCR engines are now able after suitable training to deal with the polytonic Greek fonts used in 19th and 20th century editions, further improvements can also be achieved with postprocessing. In particular, the progressive multiple alignment method applied to different OCR outputs based on the same images is discussed in this paper. | |||
| Using Semantic Technologies in Digital Libraries -- A Roadmap to Quality Evaluation | | BIBAK | Full-Text | 168-179 | |
| Sascha Tönnies; Wolf-Tilo Balke | |||
| In digital libraries semantic techniques are often deployed to reduce the
expensive manual overhead for indexing documents, maintaining metadata, or
caching for future search. However, using such techniques may cause a decrease
in a collection's quality due to their statistical nature. Since data quality
is a major concern in digital libraries, it is important to be able to measure
the (loss of) quality of metadata automatically generated by semantic
techniques. In this paper we present a user study based on a typical semantic
technique used for automatic metadata creation, namely taxonomies of author
keywords and tag clouds. We observed experts assessing typical relations
between keywords and documents over a small corpus in the field of chemistry.
Based on the evaluation of this experiment, we focused on communalities between
the experts' perception and thus draw a first roadmap on how to evaluate
semantic techniques by proposing some preliminary metrics. Keywords: Digital Libraries; Information Quality; Semantic Technologies | |||
| Supporting the Creation of Scholarly Bibliographies by Communities through Online Reputation Based Social Collaboration | | BIBAK | Full-Text | 180-191 | |
| Hamed Alhoori; Omar Alvarez; Richard Furuta; Miguel Muñiz; Eduardo Urbina | |||
| Bibliographic digital libraries play a significant role in conducting
research and, in the past few years, have started to move from closed to more
open social platforms. However, in this, they have faced challenges (e.g., from
Web spam) in maintaining the level of scholarly precision -- the ratio of
relevant citations retrieved by search. This paper describes a hybrid approach
that uses online social collaboration and reputation based social moderation to
reduce the cost and to speed up the construction of scholarly bibliographies
that are comprehensive, have better quality citations and higher precision. We
implemented selected social features for an established digital humanities
project (the Cervantes Project) and compared the results with a number of
closed and open current bibliographies. We found this can help in building
scholarly bibliographies and significantly improve precision outcomes. Keywords: Social collaboration; social moderation; social reputation; scholarly
bibliography; digital libraries; digital humanities | |||
| Chance Encounters in the Digital Library | | BIBAK | Full-Text | 192-202 | |
| Elaine G. Toms; Lori McCay-Peet | |||
| While many digital libraries focus on supporting defined tasks that require
targeted searching, there is potential for enabling serendipitous discovery
that can serve multiple purposes from aiding with the targeted search to
suggesting new approaches, methods and ideas. In this research we embedded a
tool in a novel interface to suggest other pages to examine in order to assess
how that tool might be used while doing focused searching. While only 40% of
the participants used the tool, all assessed its usefulness or perceived
usefulness. Most participants used it as a source of new terms and concepts to
support their current tasks; a few noted the novelty and perceived its
potential value in serving as a stimulant. Keywords: Digital libraries; serendipitous discovery | |||
| Stress-Testing General Purpose Digital Library Software | | BIBA | Full-Text | 203-214 | |
| David Bainbridge; Ian H. Witten; Stefan J. Boddie; John Thompson | |||
| DSpace, Fedora, and Greenstone are three widely used open source digital library systems. In this paper we report on scalability tests performed on these tools by ourselves and others. These range from repositories populated with synthetically produced data to real world deployment with content measured in millions of items. A case study is presented that details how one of the systems performed when used to produce fully-searchable newspaper collections containing in excess of 20 GB of raw text (2 billion words, with 60 million unique terms), 50 GB of metadata, and 570 GB of images. | |||
| The NESTOR Framework: How to Handle Hierarchical Data Structures | | BIBA | Full-Text | 215-226 | |
| Nicola Ferro; Gianmaria Silvello | |||
| In this paper we study the problem of representing, managing and exchanging hierarchically structured data in the context of a Digital Library (DL). We present the NEsted SeTs for Object hieRarchies (NESTOR) framework defining two set data models that we call: the "Nested Set Model (NS-M)" and the "Inverse Nested Set Model (INS-M)" based on the organization of nested sets which enable the representation of hierarchical data structures. We present the mapping between the tree data structure to NS-M and to INS-M. Furthermore, we shall show how these set data models can be used in conjunction with Open Archives Initiative Protocol for Metadata Harvesting (OAI-PMH) adding new functionalities to the protocol without any change to its basic functioning. At the end we shall present how the couple OAI-PMH and the set data models can be used to represent and exchange archival metadata in a distributed environment. | |||
| eSciDoc Infrastructure: A Fedora-Based e-Research Framework | | BIBA | Full-Text | 227-238 | |
| Matthias Razum; Frank Schwichtenberg; Steffen Wagner; Michael Hoppe | |||
| eSciDoc is the open-source e-Research framework jointly developed by the German Max Planck Society and FIZ Karlsruhe. It consists of a generic set of basic services ("eSciDoc Infrastructure") and various applications built on top of this infrastructure ("eSciDoc Solutions"). This paper focuses on the eSciDoc Infrastructure, highlights the differences to the underlying Fedora repository, and demonstrates its powerful und application-centric programming model. Further on, we discuss challenges for e-Research Infrastructures and how we addressed them with the eSciDoc Infrastructure. | |||
| Collaborative Ownership in Cross-Cultural Educational Digital Library Design | | BIBAK | Full-Text | 239-249 | |
| Pauline Ngimwa; Anne Adams; Josh Underwood | |||
| This paper details research into building a Collaborative Educational
Resource Design model by investigating two contrasting Kenyan / UK design
case-studies and an evaluation of end-users and designers' perceptions of
digital libraries and their usage patterns. The two case-studies compared are;
case study 1 based on formal learning in an African university digital library.
Case study 2 is centered on informal learning in an ongoing rural community
digital library system which has a collaborative design model that is being
designed, developed and reviewed within the UK and Africa. A small scale
in-depth evaluation was done with 21 participants in case-study 1 but related
to and with implications for the second case-study. In-depth user issues of
access, ownership, control and collaboration are detailed and reviewed in
relation to design implications. Adams & Blandford's 'information journey'
framework is used to evaluate high-level design effects on usage patterns.
Digital library design support roles and cultural issues are discussed further. Keywords: Digital library design; Educational digital libraries; African Context of
Use; Cross-cultural usability | |||
| A Hybrid Distributed Architecture for Indexing | | BIBA | Full-Text | 250-260 | |
| Ndapandula Nakashole; Hussein Suleman | |||
| This paper presents a hybrid scavenger grid as an underlying hardware architecture for search services within digital libraries. The hybrid scavenger grid consists of both dedicated servers and dynamic resources in the form of idle workstations to handle medium- to large-scale search engine workloads. The dedicated resources are expected to have reliable and predictable behaviour. The dynamic resources are used opportunistically without any guarantees of availability. Test results confirmed that indexing performance is directly related to the size of the hybrid grid and intranet networking does not play a major role. A system-efficiency and cost-effectiveness comparison of a grid and a multiprocessor machine showed that for workloads of modest to large sizes, the grid architecture delivers better throughput per unit cost than the multiprocessor, at a system efficiency that is comparable to that of the multiprocessor. | |||
| A Concept for Using Combined Multimodal Queries in Digital Music Libraries | | BIBA | Full-Text | 261-272 | |
| David Damm; Frank Kurth; Christian Fremerey; Michael Clausen | |||
| In this paper, we propose a concept for using combined multimodal queries in the context of digital music libraries. Whereas usual mechanisms for content-based music retrieval only consider a single query mode, such as query-by-humming, full-text lyrics-search or query-by-example using short audio snippets, our proposed concept allows to combine those different modalities into one integrated query. Our particular contributions consist of concepts for query formulation, combined content-based retrieval and presentation of a suitably ranked result list. The proposed concepts have been realized within the context of the PROBADO Music Repository and allow for music retrieval based on combining full-text lyrics search and score-based query-by-example search. | |||
| A Compressed Self-indexed Representation of XML Documents | | BIBA | Full-Text | 273-284 | |
| Nieves R. Brisaboa; Ana Cerdeira-Pena; Gonzalo Navarro | |||
| This paper presents a structure we call XML Wavelet Tree (XWT) to represent any XML document in a compressed and self-indexed form. Therefore, any query or procedure that could be performed over the original document can be performed more efficiently over the XWT representation because it is shorter and has some indexing properties. In fact, XWT permits to answer XPath queries more efficiently than using the uncompressed version of the documents. XWT is also competitive when comparing it with inverted indexes over the XML document (if both structures use the same space). | |||
| Superimposed Image Description and Retrieval for Fish Species Identification | | BIBAK | Full-Text | 285-296 | |
| Uma Murthy; Edward A. Fox; Yinlin Chen; Eric Hallerman; Ricardo da Silva Torres; Evandro J. Ramos; Tiago R. C. Falcão | |||
| Fish species identification is critical to the study of fish ecology and
management of fisheries. Traditionally, dichotomous keys are used for fish
identification. The keys consist of questions about the observed specimen.
Answers to these questions lead to more questions till the reader identifies
the specimen. However, such keys are incapable of adapting or changing to meet
different fish identification approaches, and often do not focus upon
distinguishing characteristics favored by many field ecologists and more
user-friendly field guides. This makes learning to identify fish difficult for
Ichthyology students. Students usually supplement the use of the key with other
methods such as making personal notes, drawings, annotated fish images, and
more recently, fish information websites, such as Fishbase. Although these
approaches provide useful additional content, it is dispersed across
heterogeneous sources and can be tedious to access. Also, most of the existing
electronic tools have limited support to manage user created content,
especially that related to parts of images such as markings on drawings and
images and associated notes. We present SuperIDR, a superimposed image
description and retrieval tool, developed to address some of these issues. It
allows users to associate parts of images with text annotations. Later, they
can retrieve images, parts of images, annotations, and image descriptions
through text- and content-based image retrieval. We evaluated SuperIDR in an
undergraduate Ichthyology class as an aid to fish species identification and
found that the use of SuperIDR yielded a higher likelihood of success in
species identification than using traditional methods, including the
dichotomous key, fish web sites, notes, etc. Keywords: superimposed information; image annotation; image retrieval; fish; species
identification; biodiversity; user study | |||
| Significance Is in the Eye of the Stakeholder | | BIBAK | Full-Text | 297-308 | |
| Angela Dappert; Adam Farquhar | |||
| Custodians of digital content take action when the material that they are
responsible for is threatened by, for example, obsolescence or deterioration.
At first glance, ideal preservation actions retain every aspect of the original
objects with the highest level of fidelity. Achieving this goal can, however,
be costly, infeasible, and sometimes even undesirable. As a result, custodians
must focus their attention on preserving the most significant characteristics
of the content, even at the cost of sacrificing less important ones. The
concept of significant characteristics has become prominent within the digital
preservation community to capture this key goal. As is often the case in an
emerging field, however, the term has become over-loaded and remains
ill-defined. In this paper, we unpack the meaning that lies behind the phrase,
analyze the domain, and introduce clear terminology. Keywords: Digital preservation; properties; characteristics; significant properties;
significant characteristics; applicable properties; requirements | |||
| User Engagement in Research Data Curation | | BIBAK | Full-Text | 309-314 | |
| Luis Martinez-Uribe; Stuart Macdonald | |||
| In recent years information systems such as digital repositories, built to
support research practice, have struggled to encourage participation partly due
to inadequate analysis of the requirements of the user communities. This paper
argues that engagement of users in research data curation through an
understanding of their processes, constraints and culture is a key component in
the development of the data repositories that will ultimately serve them. In
order to maximize the effectiveness of such technologies curation activities
need to start early in the research lifecycle and therefore strong links with
researchers are necessary. Moreover, this paper promotes the adoption of a
pragmatic approach with the result that the use of open data as a mechanism to
engage researchers may not be appropriate for all disciplinary research
environments. Keywords: digital curation; research data management; open data; digital repository
services; user engagement | |||
| Just One Bit in a Million: On the Effects of Data Corruption in Files | | BIBAK | Full-Text | 315-326 | |
| Volker Heydegger | |||
| So far little attention has been paid to file format robustness, i.e., a
file formats capability for keeping its information as safe as possible in
spite of data corruption. The paper on hand reports on the first comprehensive
research on this topic. The research work is based on a study on the status quo
of file format robustness for various file formats from the image domain. A
controlled test corpus was built which comprises files with different format
characteristics. The files are the basis for data corruption experiments which
are reported on and discussed. Keywords: digital preservation; file format; file format robustness; data integrity;
data corruption; bit error; error resilience | |||
| Formalising a Model for Digital Rights Clearance | | BIBAK | Full-Text | 327-338 | |
| Claudio Prandoni; Marlis Valentini; Martin Doerr | |||
| Due to the increasing complexity and world-wide distribution of digital
objects, identification and enforcement of digital rights have become too
complex to be carried out manually. It is necessary to take into account the
case-specific applicable laws, the complete creation history of a work and the
existing licenses. However, no formal generic model has been presented so far
integrating these aspects. This paper presents an innovative domain ontology of
the Intellectual Property Rights. It distinguishes four levels of abstraction
or control: (1) the legal framework, (2) the individual rights people hold, (3)
the individual usage agreements right holders and others may issue, and (4) the
particular actions that are restricted by IPR regulations or bring particular
rights into existence. The ontology has the potential to enable wide semantic
interoperability of digital repositories for identifying existing rights on
digital objects and tracing the impact of particular actions on rights and
regulations. Keywords: Intellectual Property Rights; Digital Provenance; Digital Rights;
Information Integration; Interoperability; Ontology; Domain Models; Ontology
Engineering | |||
| Evaluation in Context | | BIBA | Full-Text | 339-351 | |
| Jaap Kamps; Mounia Lalmas; Birger Larsen | |||
| All search happens in a particular context -- such as the particular collection of a digital library, its associated search tasks, and its associated users. Information retrieval researchers usually agree on the importance of context, but they rarely address the issue. In particular, evaluation in the Cranfield tradition requires abstracting away from individual differences between users. This paper investigates if we can bring some of this context into the Cranfield paradigm. Our approach is the following: we will attempt to record the "context" of the humans already in the loop -- the topic authors/assessors -- by designing targeted questionnaires. The questionnaire data becomes part of the evaluation test-suite as valuable data on the context of the search requests. We have experimented with this questionnaire approach during the evaluation campaign of the INitiative for the Evaluation of XML Retrieval (INEX). The results of this case study demonstrate the viability of the questionnaire approach as a means to capture context in evaluation. This can help explain and control some of the user or topic variation in the test collection. Moreover, it allows to break down the set of topics in various meaningful categories, e.g. those that suit a particular task scenario, and zoom in on the relative performance for such a group of topics. | |||
| Comparing Google to Ask-a-Librarian Service for Answering Factual and Topical Questions | | BIBA | Full-Text | 352-363 | |
| Pertti Vakkari; Mari Taneli | |||
| This paper evaluates to which extent Google retrieved correct answers as responses to queries inferred from factual and topical requests in a digital Ask-a-Librarian service. 100 factual and 100 topical questions were picked from a digital reference service run by public libraries. The queries inferred simulated average Web queries. The top 10 retrieval results were observed for the answer. The inspection was stopped when the first correct answer was identified. Google retrieved correct answers to 42% of the topical questions and 29% of factual questions. Results concerning the characteristics of queries and retrieval effectiveness are also presented. Evaluations indicate that public libraries' reference services answer correctly 55% of the questions. Thus, Google is not outperforming Ask-a-Librarian service, although it seems to perform relatively satisfactory in retrieving answers to topical questions. | |||
| How People Read Books Online: Mining and Visualizing Web Logs for Use Information | | BIBAK | Full-Text | 364-369 | |
| Rong Chen; Anne Rose; Benjamin B. Bederson | |||
| This paper explores how people read books online using the International
Children's Digital Library (ICDL). We analyzed usage of the ICDL in an attempt
to understand how people read books from websites. We propose a definition of
reading a book (in contrast to others who visit the website), and report a
number of observations about the use of the library in question. Keywords: Web Log Analysis; Information Visualization; Web Usage Mining; ICDL; Reading
Online | |||
| Usability Evaluation of a Multimedia Archive: B@bele | | BIBAK | Full-Text | 370-376 | |
| Roberta Caccialupi; Licia Calvi; Maria Cassella; Georgia Conte | |||
| In institutional repositories, simple discovery and submission interfaces
help increase documents deposit as scholars have very little time to
self-archive. So far, however, usability evaluation of such interfaces has been
limited. In this paper, we present the usability evaluation of a repository
interface, i.e., the interface of B@bele, the DSpace installation of the
Multimedia Production Centre (CPM) of the University Milano-Bicocca. The
results of this evaluation point out the most important shortcomings of the
present DSpace interface: difficulties with browsing within communities and
collections; problems with the submission interface due to scarcely familiar
terminology (metadata) or terms that are not relevant in the specific academic
context (community); problems in the submission process due to some ambiguous
buttons, to the lack of authority files, and to the lack of clearly marked
compulsory fields. In this way, this study will help improve not only B@bele,
but also all other installations of DSpace currently available. Keywords: User interfaces; institutional repositories; DSpace; usability evaluation | |||
| Digital Libraries, Personalisation, and Network Effects -- Unpicking the Paradoxes | | BIBA | Full-Text | 377 | |
| Joy Palmer; Caroline Williams; Paul Walk; David Kay; Dean Rehberger; Bill Hart-Davidson | |||
| The focus of this panel presentation is on personalisation (including adaptive personalisation) and the constructions of 'digital societies' around digital libraries and collections. Panelists will represent a variety of perspectives -- NEH (USA) JISC (UK) & EU -- ranging from developers of highly specialised academic digital libraries, to directors of national digital libraries that aim to achieve system-wide aims. | |||
| DL Education in the EU and in the US: Where Are We?, Where Are We Going? | | BIBA | Full-Text | 378-379 | |
| Vittore Casarosa; Anna Maria Tammaro; Tatjana Aparac-Jelusic; Stefan Gradman; Tefko Saracevic; Ron Larsen | |||
| The EU i2010 policy framework to build the European Information Society has positioned digital libraries as a critical component for its realization. The i2010 Digital Libraries initiative sets out to make all Europe's cultural resources and scientific records -- books, journals, films, maps, photographs, music, etc. -- accessible to all, and preserve it for future generations. In the US, digital libraries have been indicated as a critical component in the long-term realization of the promise of cyberinfrastructure, as tools that will change how science and humanities research is organized, stored, disseminated, and curated (Blue Ribbon Advisory Panel on Cyberinfrastructure). Since the advent of digital libraries in the early nineties, a consistent focus of DL research and development, beyond Computer Science, has been the application of DLs in educational contexts. Many projects have explored how best to teach WITH digital libraries, but only more recently has research been conducted on the best way to teach ABOUT digital libraries. | |||
| Conceptual Discovery of Educational Resources through Learning Objectives | | BIBAK | Full-Text | 380-383 | |
| Stuart A. Sutton; Diny Golder | |||
| This poster reports on current work with the NSF-funded Achievement
Standards Network (ASN) to support discovery of educational resources in
digital libraries using conceptual graphs of officially promulgated achievement
standards statements. Conceptual graphs or knowledge maps of achievement
standards reveal the macrostructure of the learning domain modeled by those
standards and support higher-level understanding by teachers and students. The
work builds on the conceptual framework of the AAAS knowledge maps by providing
the means to flexibly define and deploy new relationship schemas to fit the
disparate modeling needs of the nearly 740 learning standards documents in the
ASN repository. Using an RDF-based, node-link representation of learning goals
and the relationships among them, the ASN Knowledge Map Service will provide
the framework to correlate educational resources to nodes in conceptual models
in order to augment more conventional mechanisms of discovery and retrieval in
digital libraries. Keywords: digital libraries; content and curriculum standards; knowledge maps;
conceptual discovery | |||
| Organizing Learning Objects for Personalized eLearning Services | | BIBA | Full-Text | 384-387 | |
| Naimdjon Takhirov; Ingeborg Sølvberg; Trond Aalberg | |||
| In this paper we present a way to organize Learning Objects to achieve personalized eLearning. PEDAL-NG is a system that supports personalization based on the user's prior knowledge and the learning style in an existing and operational eLearning environment. The prior knowledge assessment and the learning style questionnaire proved to be simple and useful tools to gather necessary information about the user in order to deliver personalized eLearning experience. | |||
| Gaining Access to Decentralised Library Resources Using Location-Aware Services | | BIBAK | Full-Text | 388-391 | |
| Bjarne Sletten Olsen; Ingeborg T. Sølvberg | |||
| The paper describes a prototype that enables library users to use their
mobile devices to find the physical location of specific services or objects in
a branch of a distributed library. It guides the users to this location using
external map services, location-awareness and navigational tools. The
architecture of the system is briefly described together with the integrated
services. Keywords: Location-aware applications; mobile applications; library resources | |||
| Thematic Digital Libraries at the University of Porto: Metadata Integration over a Repository Infrastructure | | BIBAK | Full-Text | 392-395 | |
| Isabel Barroso; Marta Azevedo; Cristina Ribeiro | |||
| The University of Porto has a well-established set of specialized libraries
serving the research and student population of its 14 schools. Thematic digital
libraries can be valuable for organizing specific collections and for
supporting emergent communities. This work focuses on two case studies, one in
the area of the Fine Arts and the other in the area of Food and Nutrition. For
building both digital libraries we propose to use the existing university
repository infrastructure and to establish a metadata workflow that makes use
of available descriptions in the library catalogues and in the university
information system. We expect that such an approach, which takes into account
the institutional context and resources, can be used in other collections at
our university and inspire similar initiatives elsewhere. Keywords: metadata integration; institutional repositories; DSpace; digital libraries;
description and cataloguing | |||
| Recollection: Integrating Data through Access | | BIBAK | Full-Text | 396-397 | |
| Laura E. Campbell | |||
| This demonstration of the Recollection project of the National Digital
Information Infrastructure and Preservation Program at the Library of Congress
will showcase a prototype platform, tools and environment for sharing and
access to diverse born-digital collections. As the Program has addressed the
development of distributed preservation through a national community of partner
institutions, the challenge of access and interoperability has become more
urgent. The network needs to be able to strategically bring collections under
stewardship and keep an inventory without excessive burden on the collecting
organizations. The data under stewardship is very diverse and follows standards
acceptable within each content domain. These circumstances require an
infrastructure that enables the community of NDIIPP Partners to share their
collections and data on an ongoing basis. This allows NDIIPP to maintain the
benefits of a distributed network of partners and also take advantage of the
collections speaking to one another. Keywords: User Interfaces; Interoperability and Data Integration; Digital Archiving
and Preservation; Collection Development and Management and Policies; Knowledge
Organization Systems; Semantic Web Issues in Digital Libraries | |||
| A Gateway to the Knowledge in Taiwan's Digital Archives | | BIBAK | Full-Text | 398-401 | |
| Tzu-Yen Hsu; Ting-Hua Chen; Chung-Hsi Hung; Keh-Jiann Chen | |||
| Taiwan's digital archives cover a broad range of cultural and natural
assets. More than 2 million objects have been accumulated since the project was
launched in 2002. As the number and diversity of digitised objects have
increased rapidly, it has become increasingly difficult for people to gain a
clear picture of the contents of the archives. To disseminate the abundant and
diverse resources to the public, we are building a knowledge structure that
consists of categorized keywords extracted from objects' metadata, and
developing a function called "Tagging Tool" to facilitate fast and efficient
mining of resources. For example, users who want to read an article enriched
with archive collections can utilize the tool to identify archive specific
keywords in the text automatically and annotate them with references to
relevant resources. As a result, users can save a great deal of time on keyword
searching, and contextualize various entities, such as historical events and
people's names. Keywords: keyword extraction; text annotation; digital library | |||
| Developing a Digital Libraries Master's Programme | | BIBAK | Full-Text | 402-407 | |
| Elena Maceviciute; Thomas D. Wilson; Helena Francke | |||
| The changes in Swedish education following the Bologna requirements resulted
in the first Master's programmes in Library and Information Science. Two of
them target information professionals working with digital resources and
services and seeking to develop and update their knowledge. One programme is
oriented to foreign students from all over the world, another to Swedish
students. A Venn diagram illustrating the relationships among the elements of
LIS was used to develop the curriculum for the international Master's
programmes in Digital Libraries and Information Services. As this programme is
delivered in distance learning mode there was a need to find ways of organizing
the study process and deliver the study materials so that it suited this mode
of education. The poster describes the content and design of the programme, as
well as student reactions. Keywords: Digital libraries; curriculum; Master's programme; Sweden; international
students | |||
| The DiSCmap Project: Overview and First Results | | BIBAK | Full-Text | 408-411 | |
| Duncan Birrell; Milena Dobreva; Gordon Dunsire; Jillian Griffiths; Richard Hartley; Kathleen Menzies | |||
| Traditionally, digitisation of cultural and scientific heritage material for
use by the scholarly community has been led by supply rather than demand. The
DiSCmap project commissioned by JISC in 2008, aimed to study what re-focussing
of digitisation efforts will suit best the users of digitised materials,
especially in the context of the research and teaching in the higher education
institutions in the UK. The paper presents some of its initial outcomes based
on quantitative and qualitative analysis of 945 special collections nominated
for digitisation by intermediary users (librarians, archivist and museum
curators), as well as end users' study involving a combination of online
survey, focus groups and in-depth interviews. The criteria for prioritising
digitisation advanced by intermediaries and end users were analysed and
cross-mapped to a range of existing digitisation frameworks. A user-driven
prioritisation framework which synthesises the findings of the project is
presented. Keywords: selection; appraisal; user-defined criteria; digitization; special
collections | |||
| Securing the iRODS Metadata Catalog for Digital Preservation | | BIBAK | Full-Text | 412-415 | |
| Gonçalo Antunes; José Barateiro | |||
| Digital preservation is the ability to retrieve, access, and use digital
objects through time, while ensuring the authenticity and integrity properties
of these objects. Data grids represent a model of storage systems designed for
data management and sharing, which concept also has been proposed for digital
preservation. However, since data grids are not specifically designed for this
purpose, they present weaknesses that have to be handled. This poster will
present a set of services to address a problem in the metadata catalogue of the
iRODS data grid, strengthening that platform for digital preservation purposes. Keywords: Digital Libraries; Digital Preservation; Data Grids | |||
| JSTOR -- Data for Research | | BIBAK | Full-Text | 416-419 | |
| John Burns; Alan Brenner; Keith Kiser; Michael Krot; Clare Llewellyn; Ronald Snyder | |||
| JSTOR is a not for profit organization dedicated to helping the scholarly
community discover, use and build upon a large range of intellectual content in
a trusted digital archive. JSTOR has created a new tool called "Data for
Research" that allows users to interact with the corpus in new ways. Using DfR
researchers can now explore the content visually, analyze the text and the
references, and download complex datasets for offline analysis. Keywords: JSTOR; text analysis; data; research; users; dataset; corpus | |||
| Improving Information Retrieval Effectiveness in Peer-to-Peer Networks through Query Piggybacking | | BIBA | Full-Text | 420-424 | |
| Emanuele Di Buccio; Ivano Masiero; Massimo Melucci | |||
| This work describes an algorithm which aims at increasing the quantity of relevant documents retrieved from a Peer-To-Peer (P2P) network. The algorithm is based on a statistical model used for ranking documents, peers and ultra-peers, and on a "piggybacking" technique performed when the query is routed across the network. The algorithm "amplifies" the statistical information about the neighborhood stored in each ultra-peer. The preliminary experiments provided encouraging results as the quantity of relevant documents retrieved through the network almost doubles once query piggybacking is exploited. | |||
| The Planets Interoperability Framework | | BIBA | Full-Text | 425-428 | |
| Ross King; Rainer Schmidt; Andrew N. Jackson; Carl Wilson; Fabian Steeg | |||
| We report on the implementation of a software infrastructure for preservation actions, carried out in the context of the European Integrated Project Planets -- the Planets Interoperability Framework (IF). The design of the framework was driven by the requirements of logical preservation in the domain of libraries and archives. The IF is a Java-based software suite built on a number of open source components and Java standards. Specific features of interest are a web service architecture including specified preservation service interfaces for the integration of new and existing preservation tools and a workflow engine for the execution of service-based preservation plans. | |||
| Improving Annotations in Digital Documents | | BIBAK | Full-Text | 429-432 | |
| Jennifer Pearson; George Buchanan; Harold W. Thimbleby | |||
| Annotation plays a major role in a user's reading of a document: from
elementary school students making notes on text books to professors marking up
their latest research papers. A common place for annotations to appear is in
the margin of a document. Surprisingly, there is little systematic knowledge of
how, why and when annotations are written in margins or over the main text.
This paper investigates how margin size impacts the ease with which documents
can be annotated, and user annotation behavior. The research comprises of a two
part investigation: first, a paper study that examines margins and their use in
physical documents; secondly, we evaluate document reader software that
supports an extended margin for annotation in digital documents. Keywords: Annotation; digital documents; document triage | |||
| Workspace Narrative Exploration: Overcoming Interruption-Caused Context Loss in Information Seeking Tasks | | BIBAK | Full-Text | 433-436 | |
| YoungJoo Park; Richard Furuta | |||
| As digital libraries become more prevalent and as the amount of accessible
online information grows, users often must consult diverse information
collections in carrying out tasks. Simultaneously, the impact of frequent
interruptions on task performance gets more severe. To manage the negative
effects of interruptions on work performance, workers often engage in task
management activities to ensure they are better prepared to resume suspended
tasks. However, managing tasks causes additional cognitive burden and incurs a
time cost to users who already are experiencing demands on their attention and
time. We describe a system that allows users to browse their previous workspace
intuitively, and enhances continuity of their tasks by supporting them to
retrieve desired work context more quickly and easily. Keywords: Context browser; narrative; task; contextual cue; discontinuity; user
interfaces | |||
| SyGAR -- A Synthetic Data Generator for Evaluating Name Disambiguation Methods | | BIBA | Full-Text | 437-441 | |
| Anderson A. Ferreira; Marcos André Gonçalves; Jussara M. Almeida; Alberto H. F. Laender; Adriano Veloso | |||
| Name ambiguity in the context of bibliographic citations is one of the hardest problems currently faced by the digital library community. Several methods have been proposed in the literature, but none of them provides the perfect solution for the problem. More importantly, basically all of these methods were tested in limited and restricted scenarios, which raises concerns about their practical applicability. In this work, we deal with these limitations by proposing a synthetic generator of ambiguous authorship records called SyGAR. The generator was validated against a gold standard collection of disambiguated records, and applied to evaluate three disambiguation methods in a relevant scenario. | |||
| Searching in a Book | | BIBAK | Full-Text | 442-446 | |
| Veronica Liesaputra; Ian H. Witten; David Bainbridge | |||
| Information has no value unless it is accessible. With physical books, most
people rely on the table of contents and subject index to find what they want.
But what if they are reading a book in a digital library and have access to a
full-text search tool?.
The paper describes a search interface to Realistic Books, and investigates the influence of document format and search result presentation on information finding. We compare searching in Realistic Books with searching in HTML and PDF files, and with physical books. Keywords: Within-Document search; Electronic book; Flash application | |||
| Searching Archival Finding Aids: Retrieval in Original Order? | | BIBA | Full-Text | 447-450 | |
| Junte Zhang; Jaap Kamps | |||
| Archival principles as Provenance (keeping material from the same creator together) and its corollary Original Order (keeping the order of creation intact) could help improve access to the archival materials. We investigate the importance of relevance ranking and 'Original Order' when searching finding aids in EAD using XML Retrieval. Our experiment shows that relevance ranking is of paramount importance, although Original Order may help the retrieval of the first few results because these tend to cluster within the original order. | |||
| Digital Preservation and Access of Audio Heritage: A Case Study for Phonographic Discs | | BIBAK | Full-Text | 451-454 | |
| Sergio Canazza; Nicola Orio | |||
| We investigate differences among the approaches to the digitization of
phonographic discs, using two novel methods developed by the authors: a system
for synthesizing audio signals from still images of phonographic discs and a
tool for the automatic alignment of audio signals. Results point out that this
combined approach can be used as an effective tool for the preservation of and
access to the audio documents. Keywords: Audio cultural heritage; Audio documents access; Audio alignment | |||
| Data Recovery from Distributed Personal Repositories | | BIBA | Full-Text | 455-458 | |
| Rudolf Mayer; Robert Neumayer; Andreas Rauber | |||
| We present an approach to personal disaster recovery, e.g. after a hard-disk crash, based not on an explicitly ex-ante defined recovery plan with a rigid backup regime, but rather on naturally accumulated and distributed sources of personal data, such as e-mails and their attachments. We aim to restore as much data as possible, and to provide means to organise it in a meaningful folder structure. Employing information retrieval techniques, we semi-automatically establish the context of and relations between the data objects along several different dimensions, to identify relations and groups. Different views at multiple levels of granularity then allow an interactive organisation into folders. | |||
| A Web-Based Demo to Interactive Multimodal Transcription of Historic Text Images | | BIBAK | Full-Text | 459-460 | |
| Verónica Romero; Luis A. Leiva; Vicente Alabau; Alejandro Hector Toselli; Enrique Vidal | |||
| Paleography experts spend many hours transcribing historic documents, and
state-of-the-art handwritten text recognition systems are not suitable for
performing this task automatically. In this paper we present the modifications
on a previously developed interactive framework for transcription of
handwritten text. This system, rather than full automation, aimed at assisting
the user with the recognition-transcription process. Keywords: Handwritten recognition; Interactive framework; Web; HCI | |||
| Geographic Information Retrieval and Digital Libraries | | BIBA | Full-Text | 461-464 | |
| Ray R. Larson | |||
| In this demonstration we will examine the effectiveness of Geographic Information Retrieval (GIR) methods in digital library interfaces. We will show how various types of information may benefit from explicit geographic search, and where text-based place name search may be sufficient. We will also show how implicit geographic search (or geographic browsing) can be used to dynamically generate geographic searches in geographic interfaces like Google Earth. In this demonstration we will show the algorithms used for Geographic search and how these may be combined with text search. In addition we will show results from the GeoCLEF IR evaluation for text-based search. | |||
| Active Preservation | | BIBAK | Full-Text | 465-468 | |
| Robert Sharpe; Adrian Brown | |||
| In order to perform long-term digital preservation it is necessary to be (i)
understand the technology of the material being stored, (ii) be able to decide
whether this technology is obsolete (and, if so, what to do about it) and (iii)
perform verifiable actions to remove the causes of this obsolescence (e.g., via
format migration). This demonstration will show a real-life solution for
dealing with these challenges. It is based off pioneering work performed mainly
in conjunction with the UK National Archives' Seamless Flow programme and the
Planets project and is now deployed in a variety of national libraries and
archives around the world. Keywords: Digital Archiving and Preservation; Characterization; Preservation Planning;
Migration | |||
| Cultural Heritage Digital Libraries on Data Grids | | BIBAK | Full-Text | 469-472 | |
| Antonio Calanducci; Jorge Sevilla; Roberto Barbera; Giuseppe Andronico; Monica Saso; Alessandro De Filippo; Stefania Iannizzotto; Domenico Vicinanza; Francesco De Mattia | |||
| Data Grids offer redundant and huge distributed storage capabilities,
providing an ideal and secure place for the long-term preservation of digitized
literary works and documents of artistic and historical relevance. In this
demo, we are going to show how we deployed some digital repositories of ancient
manuscripts making use of gLibrary, a grid-based system to host and manage
digital libraries. Keywords: Digital Libraries; Grid Computing; Data Grid; Cultural Heritage; Digital
Preservation | |||
| GROBID: Combining Automatic Bibliographic Data Recognition and Term Extraction for Scholarship Publications | | BIBA | Full-Text | 473-474 | |
| Patrice Lopez | |||
| Based on state of the art machine learning techniques, GROBID (GeneRation Of BIbliographic Data) performs reliable bibliographic data extractions from scholar articles combined with multi-level term extractions. These two types of extraction present synergies and correspond to complementary descriptions of an article. This tool is viewed as a component for enhancing the existing and the future large repositories of technical and scientific publications. | |||
| Building Standardized Digital Collections: ResCarta Tools, a Demo | | BIBAK | Full-Text | 475-476 | |
| John Sarnowski; Samuel Kessel | |||
| ResCarta Tools are a suite of open source software applications which can
assist in the creation of standardized digital objects. ResCarta Tools are open
and modular in their design. Modules for creating digital objects store the
metadata in Library of Congress METS/ MODS/MIX XML formats. Collection and
indexing modules create LUCENE indexes for high speed fielded and full text
retrieval of objects. The tools have been used to create digital collections
from a variety of analog and digital sources. Collections can be hosted on the
web using Apache TOMCAT and the ResCarta WEB application, which provides inline
metadata using COINS. Integrating the use of DLESE OAI is done using the
Collection Manager METS XML data. The tools have been used by small public
libraries to host a dozen pamphlets and aerospace manufactures to host tens of
thousands of documents and millions of pages. Keywords: Search; open source; METS; MODS; MIX | |||
| Digital Libraries -- New Landscapes for Lifelong Learning? The "InfoLitGlobal"-Project | | BIBAK | Full-Text | 477-478 | |
| Heike vom Orde | |||
| "InfoLitGlobal" is an international educational digital library which was
created by the Information Literacy Section of the International Federation of
Library Association and Institutions (IFLA) on behalf of UNESCO.
"InfoLitGlobal" was conceived as a best practice and collaboration tool for
information professionals who want to share their IL resources and materials
with an international community and who are interested in learning from
colleagues all over the world. Keywords: educational digital libraries; information literacy; lifelong learning | |||
| REPOX -- A Framework for Metadata Interchange | | BIBA | Full-Text | 479-480 | |
| Diogo Reis; Nuno Freire; Hugo Manguinhas; Gilberto Pedrosa | |||
| This demonstration presents an XML framework for metadata interchange. REPOX has two goals: to be a means for libraries and other cultural institutions to provide OAI-PMH access to their metadata records, independently of their original format, with a tool that is easy to install, use and deploy; and to be used as an aggregator of OAI-PMH Data Sources. The records are stored internally in XML and there is a metadata transformation service that allows for translation to desired formats. This demonstration will show the usage scenarios, technologies and current results. | |||
| A Visualization Tool of Probabilistic Models for Information Access Components | | BIBA | Full-Text | 481-482 | |
| Lorenzo De Stefani; Giorgio Maria Di Nunzio; Giorgio Vezzaro | |||
| Since massive collections of textual documents become more and more available in digital format, the organization and classification of these documents in Digital Library Management System (DLMS) becomes an important issue. Information access components of a DLMS, such as automatic categorization and retrieval components of digital objects, allow users to interact with the system in order to browse, explore, and retrieve resources from collections of objects. The demonstration presents a two-dimensional visualization tool of Naïve Bayes (NB) probabilistic models for Automated Text Categorization (ATC) and Information Retrieval (IR) useful to explore raw data and interpret results. | |||
| User Interface for a Geo-Temporal Search Service Using DIGMAP Components | | BIBAK | Full-Text | 483-486 | |
| Jorge Machado; Gilberto Pedrosa; Nuno Freire; Bruno Martins; Hugo Manguinhas | |||
| This demo presents a user interface for a Geo-Temporal search service built
in the sequence of DIGMAP project. DIGMAP was a co-funded European Union
project on old digitized maps and deals with resources rich in geographic and
temporal information. This search interface followed a mashup approach using
existing DIGMAP components: a metadata repository, a text mining tool, a
Gazetteer, and a service to generate geographic contextual thumbnails. Google
Maps API is used to provide a friendly and interactive user interface. This
demo will present the resulting geo-temporal search engine functionalities,
whose interface uses WEB 2.0 capabilities to provide contextualization in time
and space and text clustering. Keywords: User Interfaces; GeoTemporal Retrieval; Software Architectures; Metadata;
Information Retrieval; Lucene | |||
| Information Environment Metadata Schema Registry | | BIBAK | Full-Text | 487-488 | |
| Emma Tonkin; Alexey Strelnikov | |||
| Several metadata schemas focusing on Dublin Core metadata are now in
operation around the world. In this demonstration, we support participants in
exploring various components that make up the IEMSR metadata schema registry: a
desktop tool enabling creation of simple DC vocabularies and application
profiles and their addition to the registry; a Web client that enables the
registry to be browsed and searched; a series of prototype tools that
demonstrate the use of the machine-to-machine SPARQL endpoint for practical
scenarios such as internationalization and complex application profile creation
using an explicitly stated entity-relationship model. Keywords: Metadata schema registry; application profile; Dublin Core | |||
| Digital Mechanism and Gear Library -- Multimedia Collection of Text, Pictures and Physical Models | | BIBAK | Full-Text | 489-490 | |
| Rike Brecht; Torsten Brix; Ulf Döring; Veit Henkel; Heidi Krömker; Michael Reeßing | |||
| We are presenting a digital engineering library -- the Digital Mechanism and
Gear Library (DMG-Lib). The existing worldwide knowledge in form of books,
drawings, physical models etc. is mostly scattered, difficult to access and
does not comply with today's requirements concerning a rapid information
retrieval. Therefore the development of a digital, internet-based library for
mechanisms and gears is necessary, which makes the worldwide knowledge about
mechanisms and gears available: http://www.dmg-lib.org. The Digital Mechanism
and Gear Library is of particular importance not only for engineers, product
designers and researchers, but also for teachers, students and historians. Keywords: Multimedia; interactive collection; digital engineering library | |||
| Demonstration of User Interfaces for Querying in 3D Architectural Content in PROBADO3D | | BIBA | Full-Text | 491-492 | |
| René Berndt; Ina Blümel; Harald Krottmaier; Raoul Wessel; Tobias Schreck | |||
| The PROBADO project is a research effort to develop Digital Library support for non-textual documents. The main goal is to contribute to all parts of the Digital Library workflow from content acquisition over semi-automatic indexing to search and presentation. PROBADO3D is a part of the PROBADO framework designed to support 3D documents, with a focus on the Architectural domain. This demonstration will present a set of specialized user interfaces that were developed for content-based querying in this document domain. | |||
| Hoppla -- Digital Preservation Support for Small Institutions | | BIBA | Full-Text | 493-494 | |
| Stephan Strodl; Florian Motlik; Andreas Rauber | |||
| Digital information is of crucial value to a range of institutions, from
memory institutions of all sizes, via industry and SME down to private home
computers containing office documents, valuable memories, and family
photographs. While professional memory institutions make dedicated expertise
and resources available to care for their digital assets, SMEs and private
users lack both the expertise as well as the means to perform digital
preservation activities to keep their assets available and usable for the
future.
This demo presents the Hoppla archiving system providing a digital preservation solution specifically for small institutions and small home/office settings. The system combines bit-stream preservation with logical preservation. It hides the technical complexity and outsource required knowledge and expertise in digital preservation. | |||