Workshop on
The Digitization of Language Data:
The Need for Standards

Advance Preparation










New: Working Group Responses

We hope the working groups will be able to provide concrete suggestions which we can implement in The LINGUIST List E-MELD Project. To that end, we are asking each group to do some advance preparation, testing our current proposals against their own data. We would be most grateful if you would check the assignment for your group and write the brief reports requested prior to coming to the Workshop.

Working Group Assignments:

 

The following material provides useful background information.

Essential Reading

  • Requirements on the Infrastructure for Open Language Archiving.
    This document was written by Steven Bird and Gary Simons for the conference on Web-Based Language Documentation and Description in Philadelphia, Pennsylvania, in December 2000. This version is currently under revision. But it gives such a good orientation to the infrastructure enterprise that we would like participants to read it now, even in an interim version.

  • Overview of the Dublin Core metadata initiative. A useful introduction for those unfamiliar with Dublin Core.

Other Useful Reading

  • A primer on Unicode, the new font-encoding standard which ensures that you need never again worry about whether the symbols you use stay the same, no matter what machine you see them on.
  • A useful tutorial on XML, the successor to HTML.


Workshop homepage | Workshop Proposal | Contact the Organizers