ALA   American Library Association Search ALA      Contact ALA      Login     
ACRL home contact us search ACRL sitemap home join acrl
50 East Huron Street, Chicago, IL 60611, T. 800-545-2433 ext. 2523, F. 312-280-2520
 
 
About ACRL Issues & Advocacy Events & Conferences Professional Tools Publications
Standards & Guidelines Awards Give to ACRL President's Page
 
 Publications
 ACRLog
 College & Research Libraries News
 College and Research Libraries
 CHOICE
 Academic Library Statistics
 Books/Monographs
 Downloadables
 RBM
 White Papers and Reports
                         



College and Research Libraries
July 2002, Vol. 63, No. 4

Book Review

Hockey, Susan. Electronic Texts in the Humanities: Principles and Practice. Oxford: Oxford University Pr., 2000. 216p. $70, cloth (ISBN 0198711948); $19.95, paper (ISBN: 0198711956).

I recently had a meeting with a freshly minted MLIS who had been hired to bring up a “digital library” for a local consortium of schools. Among the long list of tasks that comprised her agenda was a request to mark up sets of transcribed documents from the eighteenth and nineteenth centuries. I asked her why they wanted them “marked up.” So they could be searched, she replied. Searched for what, I came back. That was not clear to her—nor, I suspect, to those who made the request in the first place. I should have given her my copy of Susan Hockey’s impressive review of humanities computing. There, she would have received a detailed introduction to the history and practice of text encoding. Reading Hockey’s thick descriptions of various encoding practices and projects may have induced her into vertigo, but at least she would never again be naive about Hockey’s topic.

Despite its subtitle, Hockey’s book is more of a conspectus of humanities computing projects since the later 1960s. Herself an accomplished member of the tribe of encoders, Hockey has a sure grip on the field, its achievements, and its limitations. Her book, she asserts, fills a niche: “It explains how computers can help researchers and teachers in the humanities, particularly those who work with text-based material.” It is resolutely NOT, she continues, a book about the Web or the Internet, relative latecomers to the field of the electronic text. Although she is confident that more and more projects will migrate to Web-based formats and that Web tools will improve to facilitate this process, her book focuses more on earlier, stand-alone projects than on recent ones. Thus, readers expecting an extended look at, say, the Rossetti or Blake electronic archives at the University of Virginia will not find it in Hockey. On the other hand, she is superb in sketching out the theoretical and technical prehistory of contemporary examples.

Text encoding is a relatively simple, straightforward process. It involves marking or tagging words and/or phrases according to some descriptive convention such that the words will be recognizable and retrievable by a computer programmed accordingly. That said, nothing else is simple or straightforward about electronic texts. What does one encode, and why? What text, texts, or corpus? Which words? How does one interpret the retrieved series? At every point in the encoding process, decisions are made and interpretations rendered. Far from being reifications of objectivity, Hockey reminds us, encoding projects are really layers of interpretation and judgment. Retrieving words and phrases may seem neat and clean, but on analysis they are anything but that. In fact, studies based on encoded texts have always been, in and of themselves, quite problematic. Encoding participates in Heisenberg’s uncertainty principle: To study the object, you necessarily change the object. The encoded text has been invaded, violated. It has been preinterpreted.

Hockey reviews many examples of this phenomenon as she tours us through the large concordance projects—the Bible, the Bard, dictionaries, et alii—and national corpus projects that have accounted for a substantial chunk of electronic text projects. She also examines major editing projects (e.g. Gabler’s Ulysses) and various attempts at stylometric analyses and attribution studies. Through each of these genres, Hockey proves to be both a sympathetic and a critical guide. She is sympathetic with the aspirations of the projects, the nuances of encoding practices, and the discipline and rigor of the work. She is less happy with the results of these projects, the limitations of available tools, and the fact that many of the best databases are now legacy systems with uncertain futures. Most readers should find her an even-handed and judicious guide who is careful to stress that electronic texts and corpora represent but one approach to a variety of linguistic and stylistic issues and that they should always be used in conjunction with other tools and approaches. Moreover, she is frank about the labor-intensive nature of encoding: It is not an inexpensive textual tool by any means. And this makes it all the more crucial to be able to evaluate the benefits of text-encoding projects in comparison with costs. I cannot resist sharing at this point my wish that the people who imposed EAD on us had as much foresight. But that is another conversation.

As much as I learned from Hockey’s book, I thought it flawed in one respect. Readers coming to it without sufficient background in the subject will be put off and probably discouraged by its long passages on encoding practices. For Hockey, this sort of thing is probably second nature. For many readers, however, it may only confirm their worst suspicions of text encoding as an arcane, nerdish pastime, best avoided by serious scholars who know how to think, read, and write. If the book has a second wind, perhaps Professor Hockey could edit out the more tedious examples of encoding practices from an otherwise admirable and commendable text.—Michael Ryan, University of Pennsylvania.





ACRL is a division of the American Library Association
© 2008 American Library Association. Copyright Statement
Last Revised: May 21, 2007