Archives for the Month of January 2010 on Bibliographic/Metadata Services

Advanced TEI Seminar

Advanced TEI Seminar, UMD, January 20-22

-- Issue: integrating manuscripts with print text.
-- Issue: typed manuscripts, with handwritten additions, notes, etc.: how to mark these features
-- Issue of commentary and notation for textual nuances.
-- Encoding Model for Genetic Editions
-- open annotation collaboration
-- XSLT Reference
-- encoding to the stylesheet is bad practice
-- revision narrative is important and necessary

1.
The Beck Center for Electronic Collections
Alice Hickcox, Emory University
Woodruff Library, Emory University
The Beck Center, a part of Woodruff Library at Emory University, has long had an interest in the Lady Gregory portion of the Gregory Family papers in the Manuscript, Archives, and Rare Book Library (MARBL) in Woodruff Library. There are several items in the collection that are unpublished, and we look forward to working with these.
http://beck.library.emory.edu/
http://womenwriters.library.emory.edu/
xml example

2.
Margaret Sanger Papers
Esther Katz and Cathy Moran Hajo, New York University
The Margaret Sanger Papers is a scholarly editing project located at New York University’s History Department, with staff also working at the Sophia Smith Collection at Smith College. Founded in 1987 to gather, preserve, identify and publish the papers of America’s best-known birth control activist, the project is working with a collection of over 95,000 letters, speeches, diaries, pamphlets, reports and other documents ranging in date from 1911-1966. The project completed a two-series microfilm edition of 101 reels and indexed Sanger materials on a third microfilm collection of 145 reels made by the Library of Congress, storing general metadata on the documents in a database.
Our goal is to develop digital version of our publications that can work with one another.
We also hope to digitally publish thematic selections of documents that provide in-depth text analysis and interpretation or link the documents with analysis or texts from other collections.
Search created on:
title
date
document type (e.g. article, document, interview, review, autograph, typed draft, etc.)
subject index
full text
-- Papers of the War Department
-- Omeka interacts with WordPress
-- Public Writings of Margaret Sanger

3.
Melville Electronic Library
John Bryant, Hofstra University
Typee: Fluid Edition
-- diplomatic transcription (typography to represent what you see on screen), base version (final reading text of the manuscript, sans all the crossouts, additions, and so on) -- in dual up-down frames
-- shows revision sequence. Revision narrative then tells the story of the sequence.
-- how to encode false starts versus

4.
Dickinson Electronic Archive
Julie Enszer, University of Maryland
Dickinson, Cartoonist

5.
Folger Shakespeare Library
Jim Kuhn and Heather Wolfe, Folger Shakespeare Library
These advanced seminars will assist Folger staff in properly evaluating TEI-based project proposals that require access to Folger collection material, and in planning for future transcription of Folger manuscript collections that have been digitized. We are also in the process of expanding upon our in-house online paleography transcription program, in the hopes of making it a publicly accessible site. We anticipate that TEI-encoding will be an important component of this project.
Guidelines here: http://quartos.org/info/encoding.html

Personography:
http://www.tei-c.org/Activities/Workgroups/PERS/

image markup tool: http://tapor.uvic.ca/~mholmes/image_markup/
(very cool open-source program)

-- issue: how to show time sequence in manuscript encoding

6.
American Memory Project
Susan Garfinkel, Jurretta Jordan Heckscher, Library of Congress
We are engaged in a demonstration project to present items from the Library of Congress’s American Memory Historical Collections, http://memory.loc.gov, in alternative formats that expand their potential uses for digital scholarship and education.
Our project’s present focus is the markup of a single digitized source, the manuscript Civil War diaries of Horatio Nelson Taft, volume 1. We are interested in exploring multiple tagging schema for this document that enable users to sort its contents by theme or topic.
Human Relations Area Files
-- Interestingly, they have the same problem I do. These 2 librarians alone are doing the encoding, relying on interns on occasion. Extremely difficult (sometimes frustrating) to devote time to such work in addition to other duties and responsibilities. Staffing is crucial.

EXTRA:
http://beck.library.emory.edu/greatwar/postcards/index.php
x include file
interp data in separate file. each XML file would would point to that interp file.
-- eXist. XML database system. You can use stylesheets with it.
-- django
-- http://www.wwp.brown.edu/encoding/seminars/umd2010/handouts/index.html

Diplomatic and Normalized
-- There are two versions of the transcription, diplomatic and normalized. The diplomatic version attempts, as far as is practicable, to present the manuscript as it actually appears. The normalized transcription is a more readable version, which eschews many of the conventions of the original manuscript. The diplomatic version is to be considered definitive

-- ODD file and schema. Schema defines validity. ODD (one document does it all) is the documentation expressing relationship between schema and XML document, _and_ customizes the schema. ODD file is like a TEI document. It has TEI header, then documentation discussing the schema, then it builds a schema that is related to the TEI schema. shows what elements are being acted upon (e.g. deleted from schema)
-- why use this rather than the full schema? Answer: stacking overflow error. Your system may not be able to handle all the elements.
-- http://sourceforge.net/projects/tei/files/
-- http://www.tei-c.org/release/xml/tei/stylesheet/xsltdoc/
-- http://www.lgpn.ox.ac.uk/online/documents/TEIXML
-- I just figured out how to transform a TEI file to HTML and PDF. It is at first glance easy, but later, extremely challenging.
-- Personography and placeography. Should point to external XML file (Cleveland Encyclopedia should be converted to XML).
-- LC geographical name authority file. You can use this for placeography eventually. http://authorities.loc.gov/

THATCamp Notes

Presentations from The Humanities and Technology Camp, Columbus, OH (January 2010):
1.
Big Brother? In MY Kindle?
Faith Van Horne. The future of digital print rights
print vs. electronic rights
cyberlawcases.com/2009/08/31/the-copy-ownership-cases/

2.
Laura Mandell
Chair, MLA division no Information Technology
-- Open source programming language and environment for people who want to program images, animation, and interactions: http://processing.org/
-- Collation software for scholars: juxtasoftware
-- T generate word clouds: wordle
-- Visualization data analysis tool: many eyes
-- Franco Moretti, The valve: a literary organ.

3.
Cleveland Project: Teaching + Learning Cleveland. Using Omeka
120 exhibits
exhibits unfinished (albeit they're cool)
Euclid Corridor Project. Done by students.

4.
wordpress multi-user: http://mu.wordpress.org/
http://commons.gc.cuny.edu/

5.
annotators workbench at Indiana University

6.
Michigan State University, The Quilt Index cms: cora, developed at Matrix

7.
Mapping DC

8.
Digital Archive of Literacy Narratives, Ohio State University

9.
Preserving Digital Humanities
Melanie Schlosser and Lewis Ulman, OSU
-- Evaluation of Digital Work:
-- Samual Sullivan Cox project
-- METS document to capture all the different parties involved with Cox project. Almost all done by hand, except for some scrpt for repetitive parts. DSpace has tech metadata, but it does not capture the relationships or details among assets. METS does this. They rely both on what DSpace captures and the METS document
-- Open source curation tool (good for project workflow): media manager. Also used for manuscript projects
-- library couldn't host this project
-- DSpace used at OSU for institutional repository.
-- article on institutional repositories, "inkeeper at the roach motel"
-- See Knowledge Bank for archival version of Cox project
-- Ask how XSLT was created for this project. Answer: Lewis Ulman did it, using Seaman's XSLT (formerly of UVa), and by attending his workshop
-- See OSU Digital Humanities
-- Fedora: delivery and preservation simultaneous (vs. DSpace)
-- Article on digital preservation fallacies

10.
DH Projects and Collaboration
Columbus neighborhoods
-- how users interact with media. Helene Blowers from Columbus Metro Library. Action words at top of site. "big concepts, keep it clean." Major contact:
Phone: (614) 479-3029
Email: hblowers@columbuslibrary.org
www.columbusneighborhoods.org
-- users will be able to geotag, upload their stories, media. 2-6 minute video clips will be part of it. They can use flicker, etc., common tools most are familiar with.
-- WOSU 10-member committee, subcommittees.
-- Metadata supplied by users is loose, mostly free-forming. For example, if no date is supplied, so be it. However, users do have to register (no anonymous submissions), and there is a drop-down menu for users to pick the neighborhood.
-- See: city of memory web site

11.
Ohio Historical Society
Angela O'Neal
-- new audiences and partnerships via technology
-- has audience changed as a result of technology?
-- have organizations changed as a result of technology?
how does a museum shift its shape to maintain connections to a virtual audience?
Howard Stevens article on the evolution of the media
-- Pew Internet & American Life Project -- use this for statistics; good for grant applications
-- book: What Consumers Really Want by B.J. Pine and J. Gilmore, Harvard Business Review
-- David Thelen: http://chnm.gmu.edu/survey/afterdave.html. He found that 39% of Americans have hobbies related to the past, for example. Also, U.S. Gov't and history profs are least trusted.
-- collaboration, cms, branding. Collaboration: loaded term. Share link to your site, versus share your material to be part of coherent collection housed elsewhere

December 2009 statistics

Here are our December 2009 cataloging statistics:
December 2009 Statistics