| Digital Libraries in the Media Industry | | BIB | 1 | |
| Jim Reimer | |||
| Finding Text in Images | | BIBAK | PDF | 3-12 | |
| Victor Wu; R. Manmatha; Edward M. Riseman | |||
| There are many applications in which the automatic detection and recognition
of text embedded in images is useful. These applications include digital
libraries, multimedia systems, and Geographical Information Systems. When
machine generated text is printed against clean backgrounds it can be converted
to a computer readable form (ASCII) using current Optical Character Recognition
(OCR) technology. However, text is often printed against shaded or textured
backgrounds or is embedded in images. Examples include maps, advertisements,
photographs, videos and stock certificates. Current document segmentation and
recognition technologies cannot handle these situations well.
In this paper, a four-step system which automatically detects and extracts text in images is proposed. First, a texture segmentation scheme is used to focus attention on regions where text may occur. Second, strokes are extracted from the segmented text regions. Using reasonable heuristics on text strings such as height similarity, spacing and alignment, the extracted strokes are then processed to form rectangular boxes surrounding the corresponding text strings. To detect text over a wide range of font sizes, the above steps are first applied to a pyramid of images generated from the input image, and then the boxes formed at each resolution level of the pyramid are fused at the image in the original resolution level. Third, text is extracted by cleaning up the background and binarizing the detected text strings. Finally, better text bounding boxes are generated by using the binarized text as strokes. Text is then cleaned and binarized from these new boxes, and can then be passed through a commercial OCR engine for recognition if the text is of an OCR-recognizable font. The system is stable, robust, and works well on images (with or without structured layouts) from a wide variety of sources, including digitized video frames, photographs, newspapers, advertisements, stock certificates, and personal checks. All parameters remain the same for all the experiments. Keywords: Text reading system, Character recognition, Multimedia indexing, Text
detection, Texture segmentation, Filters, Hierarchical processing,
Binarization, Background removal, Connected-components analysis | |||
| Organization and Retrieval in a Pictorial Digital Library | | BIBA | PDF | 13-20 | |
| Yuri Quintana | |||
| This paper describes a knowledge based approach to organizing and retrieving pictures. Methods are described for using the human perceptions of pictures to create a frame knowledge base that represents the semantic content of pictures. A pictorial knowledge base management system is described that uses a conceptual clustering algorithm to learn new conceptual categories of pictures. A comparison is done between machine and human created conceptual categories. The paper shows how a frame knowledge based system can be used for intelligent picture retrieval using learned categories. | |||
| Multimedia Abstractions for a Digital Video Library | | BIBAK | PDF | 21-29 | |
| Michael G. Christel; David B. Winkler; C. Roy Taylor | |||
| Multimedia abstractions form essential components of digital video libraries
because they enable a user to determine a video's distinguishing content
without investing long viewing times or requiring high network-transfer speeds.
This paper presents usage and evaluation data for abstractions implemented the
Informedia Digital Video Library, and discusses implications for video delivery
over the Web. Keywords: Multimedia abstraction, Digital video library, Video abstraction, Video
browsing | |||
| Using Words and Phonetic Strings for Efficient Information Retrieval from Imperfectly Transcribed Spoken Documents | | BIBA | PDF | 30-35 | |
| Michael J. Witbrock; Alexander G. Hauptmann | |||
| Searching for relevant material in documents containing transcription errors presents new challenges for Information Retrieval. This paper examines information retrieval effectiveness on a corpus of spoken broadcast news documents. For documents transcribed using speech recognition, a substantial number of retrieval errors are due to query terms that occur in the spoken document, but are not transcribed because they are not within the speech recognition system's lexicon, even if that lexicon contains twenty thousand words. It has been shown that a phonetic lattice search in conjunction with full word search regains some of the information lost due to out-of-vocabulary words. In this paper an efficient alternative to this search is proposed that does not require a complete search of the phoneme lattices for all documents at run-time. By using fixed length strings of phonemes instead of phonetic lattices, an information retrieval system can search the phoneme space of a spoken document just as efficiently as a normal word document collection. Experimental evidence is presented that this technique permits the system to recapture some of the information lost due to out-of-vocabulary words in the speech recognition transcripts. | |||
| Evaluating Dewey Concepts as a Knowledge Base for Automatic Subject Assignment | | BIBA | PDF | 37-46 | |
| Roger Thompson; Keith Shafer; Diane Vizine-Goetz | |||
| This article presents the results of our exploration of the Dewey Decimal Classification (Dewey) as a concept definition source for the Scorpion project. Particularly, we show that Dewey demonstrates a high degree of class integrity and thus is a good knowledge base for an automatic subject assignment tool. | |||
| Metadata for Digital Libraries: Architecture and Design Rationale | | BIBAK | PDF | 47-56 | |
| Michelle Baldonado; Chen-Chuan K. Chang; Luis Gravano; Andreas Paepcke | |||
| In a distributed, heterogeneous, proxy-based digital library, autonomous
services and collections are accessed indirectly via proxies. To facilitate
metadata compatibility and interoperability in such a digital library, we have
designed a metadata architecture that includes four basic component classes:
attribute model proxies, attribute model translators, metadata facilities for
search proxies, and metadata repositories. Attribute model proxies elevate
both attribute sets and the attributes they define to first-class objects.
They also allow relationships among attributes to be captured. Attribute model
translators map attributes and attribute values from one attribute model to
another (where possible). Metadata facilities for search proxies provide
structured descriptions both of the collections to which the search proxies
provide access and of the search capabilities of the proxies. Finally,
metadata repositories accumulate selected metadata from local instances of the
other three component classes in order to facilitate global metadata queries
and local metadata caching. In this paper, we outline further the roles of
these component classes, discuss our design rationale, and analyze related
work. Keywords: Metadata architecture, Interoperability, Attribute model, Attribute model
translation, Metadata Repository, InfoBus, Proxy architecture, Heterogeneity,
Digital libraries, CORBA | |||
| Hypertext Construction using Statistical and Semantic Similarity | | BIBA | PDF | 57-63 | |
| Dongwook Shin; Sejin Nam; Munseok Kim | |||
| Automatic construction of hypertext has been gaining growing attention
recently in that a number of documents being produced is beginning to be made
in the form of hypertext, which calls for an enormous amount of intellectual
work by experts. In this decade, several studies have been carried out,
employing techniques mainly developed for retrieving relevant documents to user
needs. Among these, most studies underlie the vector space model and
well-known weighting schemes, from which the notion of similarity (statistical
similarity) has been devised and applied for creating hypertexts.
However, in order to create well-organized hypertexts, semantics of the contents should be also investigated, since generating hypertexts involves highly intellectual works -- understanding contents, splitting them into nodes, finding keywords, and making links between entities that are supposed to be related. This paper addresses how to create good hypertexts, combining the notion of statistical and semantic similarity in an appropriate manner. The notion of statistical similarity is based on a weighting scheme by tf x idf and inner vector product, whereas the notion of semantic similarity underlies thesaurus and partial match. We carry out an experiment with several theses and technical reports written in Korean, measuring how the method proposed here creates hypertext well, compared to the result made by human experts. The result shows that the method makes hypertexts closer to those by human experts than that using only statistical method does. | |||
| Shopping Models: A Flexible Architecture for Information Commerce | | BIBA | PDF | 65-74 | |
| Steven P. Ketchpel; Hector Garcia-Molina; Andreas Paepcke | |||
| In a digital library, there are many different interaction models between customers and information providers or merchants. Subscriptions, sessions, pay-per-view, shareware, and pre-paid vouchers are different models that each have different properties. A single merchant may use several of them. Yet if a merchant wants to support multiple models, there is a substantial amount of work to implement each one. In this paper, we formalize the shopping models which represent these different modes of consumer to merchant interaction. In addition to developing the overall architecture, we define the application program interfaces (API) to interact with the models. We show how a small number of primitives can be used to construct a wide range of shopping models that a digital library can support, and provide examples of the shopping models in operation, demonstrating their flexibility. Two models have been implemented as part of the Stanford Digital Library Project, to begin validating re-usability of key architectural components. | |||
| AGS: Introducing Agents as Services Provided by Digital Libraries | | BIBAK | PDF | 75-82 | |
| J. Alfredo Sanchez; John J. Leggett; John L. Schnase | |||
| This paper presents an architecture for digital libraries that introduces
user agents as one of the services available to publishers, librarians and
patrons. User agents are the fundamental component of an emerging style of
human-computer interaction based on the concept of delegation and indirect
management of tasks. In the agent-enabled digital library architecture, termed
"AGS", service providers define classes of agents that describe helpful tasks
for patrons. Patrons, in turn, delegate work by selecting agents from the
available agent classes and assigning specific tasks to be performed. AGS
enables the development of agents that rely on a wide variety of construction
approaches while maintaining a unified view of an active environment. AGS is
intended to serve as a testbed to investigate alternative user interfaces to
digital libraries and, in particular, a host of unexplored issues raised by the
introduction of user agents. Keywords: User agents, Interface agents, Digital library interfaces, Digital library
architectures, Open architectures, AGS, TAGS | |||
| Seed Ontologies: Growing Digital Libraries as Distributed, Intelligent Systems | | BIBA | PDF | 83-91 | |
| Peter Weinstein; Gene Alloway | |||
| Ontologies are more than a particularly elaborate approach to the description and classification of information. They can be used to support the operation and growth of a new kind of digital library, implemented as a distributed, intelligent system. We describe the design and use of ontologies in the University of Michigan Digital Library. These ontologies will model all aspects of the digital library, including content, services, and licenses. We have refined and extended the IFLA hierarchy for the realization of work, and are starting to use ontologies to support reasoning about content search. We have also used the ontologies to classify the capabilities of computational elements of the system (agents), in a dynamic way that sustains functionality as new agents are added to the system. | |||
| Multiple Search Engines in Database Merging | | BIBA | PDF | 93-102 | |
| Ellen M. Voorhees; Richard M. Tong | |||
| A database merging technique is a strategy for combining the results of
multiple independent searches into a single cohesive response. While a variety
of techniques have been developed to address a range of problem
characteristics, our work focuses on environments in which search engines work
in isolation. This paper shows that the behavior of two previously developed
isolated techniques is indeed independent of the particular search engines that
participate in the search. Two very different search engines, SMART and TOPIC,
were each used to retrieve documents from five subcollections. The relative
effectiveness of the merged result compared to the effectiveness of a
corresponding single collection run is comparable for both engines.
The effectiveness of the merged result is improved when both search engines search the same five subcollections but participate in a single merging. The improvement is such that this 10-collection merge is sometimes more effective than the single collection run. This last finding suggests that these methods may be able to improve the effectiveness of World Wide Web searches by merging the output from several engines. | |||
| Evaluating the Cost of Boolean Query Mapping | | BIBA | PDF | 103-112 | |
| Chen-Chuan K. Chang; Hector Garcia-Molina | |||
| Non-uniform query languages make searching over heterogeneous information sources difficult. Our approach is to allow a user to compose Boolean queries in one rich front-end language. For each user query and target source, we transform the user query into a subsuming query that can be supported by the source but that may return extra documents. The results are then processed by a filter query to yield the correct final results. This post-filtering approach may involve significant cost because the documents that the users will not see may have to be retrieved and filtered. There are generally two ways to implement post-filtering: batch post-filtering and incremental post-filtering. In this paper we evaluate the costs of both methods for different search features such as proximity operators. The experimental results show that in many cases incremental post-filtering cost may be acceptable, while the batch post-filtering cost may sometimes be extremely large. | |||
| Copyright and Digital Libraries | | BIB | 113 | |
| Pamela Samuelson | |||
| Citation Linking: Improving Access to Online Journals | | BIBAK | PDF | 115-122 | |
| S. Hitchcock; L. Carr; S. Harris; J. M. N. Hey; W. Hall | |||
| The most innovative online journals are maturing rapidly and distinctive new
features are emerging. Foremost among these features is the hypertext link,
popularised by the World Wide Web and which will form the basis of a new,
highly integrated scholarly literature. Journal integration in this instance
seeks to recognise, extend and exploit relationships at the level of journal
content -- the papers -- while maintaining some of the familiar contexts, in
some cases journal identities, that define the content hierarchy and inform
decision-making by readers. Links are a powerful tool for journal integration,
most immediately in the form of citation linking. The paper reviews examples
of citation linking in practice, and describes a new system, a link service,
which is being developed to support novel and flexible linking mechanisms on
the Web. One application of this link service is the Open Journal project,
which is working with journal publishers to investigate the most effective ways
of applying these powerful link types to enhance online journals. Keywords: Electronic journals, Hypertext, Hypermedia, Citation linking, Link services | |||
| Secure Distribution of Watermarked Images for a Digital Library of Ancient Papers | | BIBA | PDF | 123-130 | |
| Christian Rauber; Joe O. Ruanaidh; Thierry Pun | |||
| The electronic publishing, storage and distribution of documents is growing
increasingly important and will have profound implications for our economy,
culture and society. The multimedia digitalisation of libraries and the
distribution of the contents of museums is revolutionising these organisations
and will make these resources available to a much wider audience than was
previously possible.
The main goal of our MEDIA project (Mobile Electronic Documents with Interacting Agents) is the development of a system for the archival, retrieval, and distribution of electronic documents. For this purpose, a mobile agent platform is used to securely distribute these documents. Information is accessed by a search mechanism that allows the retrieval of text and images according to their content. An important feature of the system is a digital watermarking tool which embeds hidden signatures in images. This provides copyright protection and helps to ensure that the image will not be copied and sold and without proper authorisation. The management of the database of documents and images is accomplished by an extensible object relational database management system. In addition, documents and data can be accessed through the World Wide Web network. | |||
| Annotation: From Paper Books to the Digital Library | | BIBAK | PDF | PDF | 131-140 | |
| Catherine C. Marshall | |||
| Readers annotate paper books as a routine part of their engagement with the
materials; it is a useful practice, manifested through a wide variety of
markings made in service of very different purposes. This paper examines the
practice of annotation in a particular situation: the markings students make in
university-level textbooks. The study focuses on the form and function of
these annotations, and their status within a community of fellow textbook
readers. Using this study as a basis, I discuss issues and implications for
the design of annotation tools for a digital library setting. Keywords: Annotation, Markings, Study, Digital library reading tools, Annotation
systems design | |||
| The Digital Library Integrated Task Environment (DLITE) | | BIBAK | PDF | 142-151 | |
| Steve B. Cousins; Andreas Paepcke; Terry Winograd; Eric A. Bier; Ken Pier | |||
| We describe a case study in the design of a user interface to a digital
library. Our design stems from a vision of a library as a channel to the vast
array of digital information and document services that are becoming available.
Based on published studies of library use and on scenarios, we developed a
metaphor called workcenters, which are customized for users' tasks. Due to our
scenarios and to prior work in the CHI community, we chose a
direct-manipulation realization of the metaphor. Our system, called DLITE, is
designed to make it easy for users to interact with many different services
while focusing on a task. Users have reacted favorably to the interface design
in pilot testing. We conclude by describing our approaches to this problem. Keywords: Digital library, User interface, Direct-manipulation, World-Wide Web,
Holophrasting | |||
| Web-Based Collaborative Library Research | | BIBA | PDF | 152-160 | |
| Scott Robertson; Sherif Jitan; Kathy Reese | |||
| The U S WEST Research & Information Group, the corporate research library, has recently moved many of its resources and services to the company's intranet. Principle among the group's functions is conducting information searches and research analyses for employees. This paper describes a web-based system that employees can use to interact with library researchers. The system also automates tracking of research service usage and indexing and archiving of research requests and actions. Library clients initiate research requests using a personal web page. Each request generates its own web page on which interaction between client and researcher takes place. Researchers and clients can post comments, record actions, use e-mail, and upload and download files through the request web page. When the interaction is over, the client may record an evaluation using the same web page and all actions are saved for administrative purposes. Research interactions are maintained in a searchable archive which can be viewed by all employees. | |||
| Understanding Complex Information Environments: A Social Analysis of Watershed Planning | | BIBA | PDF | 161-168 | |
| Lisa R. Schiff; Nancy A. Van House; Mark H. Butler | |||
| This paper presents an approach to social analysis for the development of digital libraries. If digital libraries are viewed as both social and technological artifacts, then effective design requires that we must understand the social world in which each functions. The theoretical framework of Pierre Bourdieu and the situated action approach are suggested as sound bases for this understanding. Initial findings of our work in the arena of watershed planning, as part of the UC Berkeley Digital Library Project, are reported. | |||
| LDC Online: A Digital Library for Linguistic Research and Development | | BIBA | PDF | 170-174 | |
| Zhibiao Wu; Mark Liberman | |||
| The Linguistic Data Consortium (LDC), an open consortium of universities, companies and government research laboratories, creates, collects and distributes speech and text databases, lexicons, and other resources for research and development purposes. The LDC has published more than 200 CD-ROMs for use by speech recognition engineers, natural language processing researchers, and linguists, at more than 130 member organizations and 300 non-member sites. The volume of LDC data roughly doubles every year. Few organizations have been able to afford to store and index all LDC data, or to develop the software needed for efficient search and retrieval. Therefore, we have established a web-accessible digital library (accessible via http://www.ldc.upenn.edu), containing all LDC data, including data not yet published on CD-ROM, accessible through a variety of sophisticated and efficient search and display algorithms. | |||
| Managing a Digital Library of Legislation | | BIBA | PDF | 175-183 | |
| Timothy Arnold-Moore; Philip Anderson; Ron Sacks-Davis | |||
| We provide an overview of the Themis system, a commercial implementation of a digital library of legislation. Themis uses SGML to store legislation. This allows a single source document to be exported in a number of different formats and presentations. Themis also allows access to different versions of legislation by specifying a point-in-time at which the law is required. We discuss how this is achieved in Themis and how versioning impacts the storage of fragments of documents and management of references within and between documents. | |||
| CONTENT: A Practical, Scalable, High-Performance Multimedia Database | | BIBA | PDF | 185-192 | |
| Lawrence Yapp; Craig Yamashita; Gregory Zick | |||
| This paper describes Content, a practical, scalable, and high-performance text-indexed multimedia database system. The novelty of Content is in its approach of integrating high-volume storage, fast searching and browsing, easy multimedia acquisition, effective updates, scalability, extendibility, and an API based on HTTP. Content is also a low-cost solution for large multimedia databases that is available today. Standard Web-based browsers such as Netscape can query the Content server. The API is flexible so that different and unique Content clients on multiple platforms can be built to access multiple Content servers. The Content architecture permits any multimedia type to be stored. Text descriptions are used as indices for images and videos. Content includes an easy-to-use Windows-based acquisition station for acquiring images and video. Currently, Content is being used in a real library setting and contains more than 25,000 multimedia objects that span two different collections of valuable historical photographs. In terms of performance, Content can access a single image in a database of over one million images in less than a second. | |||
| TINTIN: A System for Retrieval in Text Tables | | BIBA | PDF | 193-200 | |
| Pallavi Pyreddy; W. Bruce Croft | |||
| Tables form an important kind of data element in text retrieval. Often, the gist of an entire news article or other exposition can be concisely captured in tabular form. In this paper, we examine the utility of exploiting information other than the key words in a digital document to provide the users with more flexible and powerful query capabilities. More specifically, we exploit the structural information in a document to identify tables and their component fields and let the users query based on these fields. Our empirical results have demonstrated that heuristic method based table extraction and component tagging can be performed effectively and efficiently. Moreover, our experiments in retrieval using the TINTIN system have strongly indicated that such structural decomposition can facilitate better representation of user's information needs and hence more effective retrieval of tables. | |||
| I Read the News Today, Oh Boy: Reading and Attention in Digital Libraries | | BIBAK | PDF | 202-211 | |
| David M. Levy | |||
| This is a paper at the intersection of two topics now receiving considerable
attention. The question of reading -- of what it is to read and how reading
has changed over time -- has been attracting some interest in recent days, no
doubt due in part to the very visible transformation of technology now under
way. To a lesser but still substantial extent, the topic of human attention is
also the subject of increasing discussion. There is growing awareness of
attention as a highly limited resource, stemming in part from the realization
that an abundance of information, good though it is in many ways, is also a tax
on our attention. This paper examines current assumptions about what it means
(or will mean) to read digital documents and to read "in" digital libraries.
It suggests that current work in digital library design and development is
participating in a general societal trend toward shallower, more fragmented,
and less concentrated reading and, by calling attention to this phenomenon,
offers an opportunity to question this movement. Keywords: Digital library, Reading, Attention, Hypertext | |||
| Content + Connectivity => Community: Digital Resources for a Learning Community | | BIBA | PDF | 212-220 | |
| Gary Marchionini; Victor Nolet; Hunter Williams; Wei Ding; Josephus, Jr. Beale; Anne Rose; Allison Gordon; Ernestine Enomoto; Lynn Harbinson | |||
| Digital libraries offer new opportunities to provide access to diverse resources beyond those held in school buildings and to allow teachers and learners to reach beyond classroom walls to other people to build distributed learning communities. Creating learning communities requires that teachers change their behaviors and the Baltimore Learning Community Project described here is based on the premise that access to resources should be tied to the assessment outcomes that increasingly drive curricula and classroom activity. Based on examination of curriculum guides and discussions with project teachers, an interface for the BLC digital library was prototyped. Three components (explore, construct, and present) of this user interface that allows teachers to find text, video, images, web sites, and instructional modules and create their own modules are described. Although the technological challenges of building learning communities are significant, the greater challenges are mainly social and political. | |||
| Talking in the Library: Implications for the Design of Digital Libraries | | BIBA | PDF | 221-228 | |
| Andy Crabtree; Michael B. Twidale; Jon O'Brien; David M. Nichols | |||
| We describe the use of ethnomethodologically-informed ethnography as a means of informing the requirements elicitation, design, development and evaluation of digital libraries. We present the case for the contribution of such studies to the development of digital library technology to support the practices of information-searching. This is illustrated by a particular study of the help desk at a university library, examining the implications it has for designing appropriate functionality for a digital library. This requires us to address the problems of using ethnographic data in systems design. | |||
| Browsing in Digital Libraries: A Phrase-Based Approach | | BIBA | PDF | 230-236 | |
| Craig G. Nevill-Manning; Ian H. Witten; Gordon W. Paynter | |||
| The thrust of this research is to build systems that let users become
familiar with the content of a digital library by browsing a hierarchical
structure of phrases that are repeated frequently within the collection.
Despite our syntactic approach to phrase identification, the structures that
are obtained in practice frequently correspond to plausible conceptual
hierarchies. This permits large corpora of text to be browsed efficiently,
with access to a particular document requiring a number of steps that varies
with the logarithm of the size of the corpus.
The method shows promise for collections of up to 50 Mbyte, but still poses significant practical problems before it can be adopted on a wider scale. We plan to investigate ways to scale the hierarchical inference by building multiple hierarchies and merging them. On the browsing side, we are developing a disk -- rather than memory-based system that can run efficiently on client-class machines. We believe that in the context of large information bases such as the New Zealand Digital Library, this interface will obviate the "query and hope" approach to browsing, and allow users to develop an intuition that would otherwise be very difficult to acquire. | |||
| Lexical Navigation: Visually Prompted Query Expansion and Refinement | | BIBA | PDF | 237-246 | |
| James W. Cooper; Roy J. Byrd | |||
| We have designed a document search and retrieval system, termed Lexical Navigation, which provides an interface allowing a user to expand or refine a query based on the actual content of the collection. In this work we have designed a client-server system written in Java to allow users to issue queries, have additional terms suggested to them, explore lexical relationships, and view documents based on keywords they contain. Lexical networks containing domain-specific vocabularies and relationships are automatically extracted from the collection and play an important role in this navigation process. The Lexical Navigation methodology constitutes a powerful set of tools for searching large text collections. | |||
| Auto-Adaptive Illustration through Conceptual Evocation | | BIBAK | PDF | 247-254 | |
| Michel Crampes | |||
| When engaged in reading, a reader is permanently building up associations of
ideas either freely, or guided by the evocative power of the text and his
imagination. This paper finds its inspiration in this natural mechanism to
consider a formal model of Conceptual Evocation that could be used for
automatic adaptative illustration or, more generally, dynamic and
auto-adaptative hypernavigation in hypermedia applications. We borrow from
Sowa's Conceptual Graphs a theoretical framework for node conceptual modelling.
In search of more creative mechanisms, we introduce Conceptual State Vectors to
tag the nodes, and a Conceptual Evocative Engine to dynamically create
Conceptual Evocative Links between nodes. Finally a mock-up is presented that
shows the operationality of all the concepts in the context of a TV program
composition. Keywords: Hypertext, Conceptual graphs, Conceptual evocation, Conceptual evocative
links, Conceptual evocative engine, Auto-adaptivity | |||
| Quantitative Analysis and Visualization Regarding Interactive Learning with a Digital Library in Computer Science | | BIB | 256 | |
| Ghaleb Abdulla; Winfield S. Heagy; Edward A. Fox | |||
| GeoScience Self-Organizing Map and Concept Space | | BIB | PDF | 257 | |
| Hsinchun Chen; Terry R. Smith; Tobun Dorbin Ng | |||
| Internet Browsing and Searching: User Evaluations of Category Map and Concept Space Techniques | | BIB | PDF | 257 | |
| Hsinchun Chen; Bruce R. Schatz; Andrea L. Houston; Robin R. Sewell; Tobun Dorbin Ng; Chienting Lin | |||
| The Itsy Bitsy Spider | | BIB | PDF | 258 | |
| Hsinchun Chen; Bruce R. Schatz; Marshall Ramsey; Y-Ming Chung | |||
| Medical Information Retrieval | | BIB | PDF | 258 | |
| Hsinchun Chen; Bruce R. Schatz; Susan M. Hubbard; Tamas E. Doszkocs; Andrea L. Houston; Robin R. Sewell; Kristin M. Tolle; Tobun D. Ng | |||
| LIBClient: A Tool for Legal Research on the Web | | BIB | PDF | 258-259 | |
| Bert J. Dempsey; Robert C. Vreeland | |||
| An Information Retrieval Application for Simulated Annealing | | BIB | PDF | PDF | 259-260 | |
| Bernard J. Jansen | |||
| Cognitive Load Effects and Design of Computer Based Instruction | | BIB | PDF | 260 | |
| Slava Kalyuga | |||
| Packaging Digitized Images for Access from an OPAC | | BIB | PDF | PDF | 260-261 | |
| Virginia Kerr; Claire Dougherty | |||
| Storage Structures in Digital Libraries: JSTOR | | BIB | PDF | 261 | |
| Amy J. Kirchhoff; Mark Ratliff | |||
| Branch Libraries for Multimedia Repositories | | BIB | PDF | PDF | 261-262 | |
| Michael Kozuch; Wayne Wolf; Andrew Wolfe; Don McKay | |||
| Exporting a BRS/Search Structured Database to an Access Database on a Web Accessible Windows NT Server | | BIB | PDF | 262 | |
| Jeffrey Meyer; Charles Cooper; Stephanie Normann | |||
| Building Multi-Discipline Digital Libraries | | BIB | PDF | PDF | 262-263 | |
| Michael L. Nelson; Kurt Maly; Stewart N. T. Shen | |||
| Distinguishing the Picture Captions on the World Wide Web | | BIB | PDF | 263 | |
| Neil C. Rowe; Brian Frew | |||
| On Page Coherence for Dynamic HTML Pages | | BIB | PDF | PDF | 263-264 | |
| Antonio Si; Hong Va Leong | |||
| Neighboring Graphs as Alternative Organizations for Information Retrieval | | BIB | PDF | 264 | |
| Fei Song | |||
| Bibliographic Integration in Digital Document Libraries | | BIB | PDF | PDF | 264-265 | |
| Atsuhiro Takasu | |||
| Connecting Citizens to the National Spatial Data Infrastructure via Local Libraries | | BIB | PDF | 265 | |
| Derek Thompson; Jeffrey Burka; Gary Marchionini | |||
| Multilingual Newspaper Clippings Image Database | | BIB | PDF | 266 | |
| Susanna S. Y. Tsang | |||
| HELIOS | | BIB | 267 | |
| Edward Galloway | |||
| Lexical Navigation System | | BIB | 267 | |
| James Cooper | |||
| USIA Digital Library | | BIB | 267 | |
| Cynthia Borys | |||
| Accessing Statistical Information Via the Internet | | BIB | 267 | |
| Cathryn Dipp | |||
| Digital Libraries in Museums and Galleries | | BIBA | 268 | |
| Joseph A. Busch; Jim Blackaby; Robin Dowden; Joseph Busch; Beth Sandore | |||
| Museums and galleries have been investigating a range of technical
challenges in user interface design and information retrieval that are key
items on the digital libraries research agenda. Presentations by this panel
will include discussion of issues related to:
* access to content-rich photographic archive and related source materials;
* contextualization and re-purposing of research materials;
* evaluation of instructional uses of images;
* relevance judgments based on image properties such as color, shape, and
texture; * authentication and control of intellectual property; * distributed participation in data collection; and * search redirection and knowledge discovery. This panel will illustrate how these challenges are being met today through demonstrations of exemplary applications. | |||
| NCSTRL: Experience with a Global Digital Library | | BIB | 269 | |
| Carl Lagoze | |||
| NDLF: A Shared Architecture for Research Libraries | | BIB | 269 | |
| Issues in Metadata | | BIB | 269 | |
| Stuart Weibel | |||
| Infobus: Experience in Linking Heterogeneous Systems | | BIB | 269 | |
| Thesauri and Metadata | | BIBA | 271 | |
| Joseph A. Busch | |||
| During the past two years consensus has been growing on the Dublin Core, a core set of extensible metadata elements to describe document-like objects. The impetus for reaching this agreement has been to improve the performance of Inter- and intranet searching generally through the use of proprietary text search engines such as AltaVista, Lycos, etc. This workshop will focus on the development and application of thesauri and related tools (such as cross-reference and authority files) as mechanisms for coordinating resource discovery through the searching and processing of metadata attribute data values. The goal of the workshop will be to develop an agenda of issues related to metadata attribute data values including, for example, a proposed mechanism for developing guidelines on the creation and use of thesauri for networked resource discovery. | |||
| Education and Curriculum Development for Multimedia, Hypertext and Information Access: Focus on Digital Libraries and Information Retrieval | | BIBA | 271 | |
| Edward Fox | |||
| This workshop is part of a series of meetings that began in 1995 to develop guidelines for curricula and courses in the broad area of "information"; see about Multimedia, Hypertext and Information Access at http://ei.cs.vt.edu/~fox/MHIA/. Attendees will help draft guidelines (similar to those by SIGGRAPH, SIGCHI) for curricula, courses and training programs in this area. Educators will present syllabi and describe courseware for courses or training programs about digital libraries or information retrieval. Employers will describe knowledge and skills they seek when recruiting in these areas. Researchers will explain testbeds that can be used by learners. Workshop results will be disseminated over WWW and later through ACM publications, and also will be made available through online courseware for undergraduate and graduate students. | |||
| Collaboration in the Digital Library | | BIBA | 272 | |
| Michael Twidale; David Nichols; Jon O'Brien; Bob Sandusky | |||
| This workshop will explore collaboration in digital libraries. It will: assess and compare studies of collaborative activity in information searching; attempt to spot emerging themes across the different studies; and consider recommendations for improving collaboration. | |||