Journal of the American Society for Information Science (JASIS) -- Table of Contents

Contributed by
Richard Hill
American Society for Information Science
Silver Spring, Maryland, USA
rhill@asis.org

VOLUME 50, NUMBER 13
November 1999

CONTENTS

Editorial

  • In This Issue
    Bert R. Boyce

Special Topic Issue: Integrating Multiple Overlapping Metadata Standards
Guest Editor: Zorana Ercegovac

  • Introduction
    Zorana Ercegovac

    Metadata has remained one of the critical components in the context of knowledge representation and data mining in digital libraries as it had traditionally been in the context of the pre-Web libraries. Today in the digital libraries environment in which individual collections of massive heterogenous objects need to be unified and linked in a single resource, we have witnessed both the growth of different metadata and the attempts to reconcile the common attributes in the existing overlapping standards. The ultimate goal is to make it possible to access relevant information seamlessly regardless of its type (e.g., visual and museum objects, historical data, cultural heritage, scientific data), location, and scholarly tradition (e.g., librarians, archivists, scientists).

    This Special Issue of JASIS addresses different applications of metadata standards in geospatial collections, education, historical costume collection, data management, and information retrieval, and explores the future thinking of metadata standards for digital libraries.

  • Metadata Elements for Object Description and Representation: A Case Report from a Digitized Historical Fashion Collection Project
    Marcia Lei Zeng

    Zeng examines the fitness of three existing metadata formats (USMARC, The Dublin Core Element Set, and the Visual Resources Association) to support a collection of historical fashion objects held at the Kent State University Fashion Museum. Zeng adopted and modified the VRA metadata format to catalog the entire digitized historical fashion collection.

  • A Comparison of the Two Traditions of Metadata Development
    Kathleen Burnett, Kwong Bor Ng, and Soyeon Park

    Burnett, Ng and Park discuss two different approaches: the bibliographic approach that has origins in cataloging (the library community), and the data management approach that has roots in computer processing (the computer science profession). The article compares element sets between and among six different metadata (i.e., USMARC, The Dublin Core, TEI, Semantic Header, IAFA Templates and URC) and supports a proposal for an integrated approach to metadata.

  • Use of Metadata Vocabularies in Data Retrieval
    Edwin M. Cortez

    Finally, in the context of Information Retrieval and the Internet, Cortez considers a metadata vocabulary as a negotiator between a set of 39 different databases (disparate by structure, vocabulary, use and purpose) and equally diverse user populations. The proposed metadata vocabulary relates to the domain of food, agriculture, natural resources and rural development; it attempts to normalize semantic and hierarchical distinctions between and among different databases and to act as a front-end unified language to the prototype Database Catalog.

Research

  • The Ecological Approach to Text Visualization
    James A. Wise

    The Spatial Paradigm for Information Retrieval and Exploration, SPIRE, converts digitized text documents into vector space document representations using 280 element vectors whose elements were produced by a neural net trained on the domain of the documents. These are clustered with a similarity measure and projected onto a two-dimensional plane using a modification of multidimensional scaling that uses document-to-centroid distances rather than pairwise document distances. The visualization shows the reoccurrence of a concept as a height on a projection that resembles a terrain map.

  • A Hybrid Method for Abstracting Newspaper Articles
    James Liu, Yan Wu, and Lina Zhou

    Liu, Zhou, and Wu begin their abstract extraction from Chinese text by comparing character pairs with user chosen keywords for exact, partial, or variable character matches. Word frequency of all words is compared to a standard word frequency table, where nouns and verbs of frequency at variance with the standard are extracted . High variance words are used to select sentences until the required length of text is extracted. In a combined method, matching is supplemented by weighted extraction. An additional level uses parts of speech, pronoun referents, and syntactic rules as well as syntactic markers explicit in Chinese text.

    Thirty five users were surveyed and 60% found keyword and percentage extraction to be useful. The extraction of summaries was not well received.

  • Formal Features of Cyberspace: Relationships between Web Page Complexity and Site Traffic
    Erik P. Bucy, Annie Lang, Robert F. Potter, and Maria Elizabeth Grabe

    Using a sample of 5,000 Web sites top ranked by hits using 100hot's InSite Pro service, Bucy, Lang, Potter, and Grabe randomly selected 500 sites, and 496 home pages were coded to reflect domain name, rank, average number of page views over six weeks, and the banner, body, and advertisements were analyzed for features and links. Banners occur on 75% of sites and are most commonly white. One-fifth featured movement. Home pages averaged 2.4 screens in length and 79% used one or more frames. The dominant background color was white. A graphical element occurred on 95% of the pages, with a logo being the most common. Movement was present in about one-third of the pages. Asynchronous elements--links, surveys, contact information--occurred in 98.9% of pages with an average of 27.1 such elements per page. Just 15.9% used real-time interactive elements, like audio or video links, or chat rooms (which were the most common of these). Over half the pages exhibited advertisements of some kind but less than one-third of these had dynamic features.

    For commercial sites, high visitation correlates with high graphics use and less strongly with asynchronous interactive elements. In noncommercial sites, there is a strong correlation between visits and asynchronous interactive elements. Real time interactive elements are rare. Advertising is prominent, but pages are not generally over-designed.

Book Reviews

  • Understanding Information Retrieval Interactions: Theoretical and Practical Implications, by Carol A. Hert
    Sue Myburgh
  • Information Literacy: Essential Skills for the Information Age, by Kathleen L. Spitzer with Michael B. Eisenberg and Carrie A. Lowe
    Cheryl Knott Malone
  • Scholarly Book Reviewing in the Social Sciences and Humanities. The Flow of Ideas Within and Among Disciplines, by Ylva Lindholm-Romantschuk
    Jack Andersen

Click here to return to the D-Lib Magazine clips column.