"Living Documents" BOF

Co-Chairs:  Peter Deutsch (peterd@cc.mcgill.ca)                 March, 1992
            Alan Emtage	  (bajan@cc.mcgill.ca)


          ----------------------------------------


The Living Documents BOF met Tuesday, March <blat> 7pm-10pm. The
preliminary agenda called for discussion on a wide range of topics
related to the creation and implementation of Living Documents but in
practice the majority of the discussion revolved around data
representation issues for network-based information discovery and
delivery systems.

Much of the discussion centred upon the characteristics needed to
implement a practical scheme for Universal Document Identifiers,
contrasting these with a proposal for Unique Document Serial Numbers.
UDIs have been proposed to allow multiple information systems to
communicate location and access information. Initial proposals that
had been circulated by Tim Berners-Lee, Brewster Kahle and others were
discussed and these were compared to the information needed and
currently provided by such systems as Prospero, WWW, WAIS and others.
No firm conclusions were reached, but it was agreed that a mailing
list (nir@cc.mcgill.ca) would be created to pursue this issue with a
goal of producing a document standardizing UDIs for Internet use.
Initially, all attendees of this BOF are to be placed on the list, and
existance of the list is to be announced to the Internet community.

Discussion concerning Unique Document Serial Numbers centred around the
perceived need to identify and compare the _contents_ (in contrast to
the location) of documents in an internet environment. Ideally, we would
have a means for:

		a) Identifying the contents of a document
		   and comparing it with other documents without
		   copying and comparing them directly.

		b) Identifying derivative works and ancestral links
		   between documents.

		c) Identifying documents that contain the same
		   information despite representational changes
		   that do not add or delete information contents.

It was generally accepted that the first of these could probably be met
with relatively straightforward signature schemes, but that the last two
would be difficult or impossible using strictly syntactic means. At
least one archive site administrator (Mark Baushe "mdb@nsd.3com.com")
has subsequently implemented such an MD5-based signature scheme at his
site (ftp.3com.com) for testing purposes. Details on accessing these
signatures will be posted to the nir@cc.mcgill.ca list.

The discussion continued across a range of topics, examining the other
issues to be addressed in implementing Living Documents and
network-based information systems. The following list was drawn up
outlining some of the issues to be addressed in subsequent work:

Univeral Document Identifiers:
	- design, documentation and deployment. Issues involved include
	  the need to encode individual access methods and specific
	  location information within a specified access method. An
	  initial proposal for such a scheme had been circulated by Tim
	  Berners-Lee prior to the meeting. A copy is available by
	  anonymous FTP from info.cern.ch in the subdirectory
	  "/pub/www/doc/udi1.[ps|txt]".

Unique Document Serial Numbers:
	- design, documentation and deployment. Issues involved include
	  identifying specific documents, version control and derivation
	  information.

Cataloguing Information:
	- Librarians already make use of far more cataloguing
	  information than any of the experimental systems currently 
	  in use on the Internet. Work with those directly involved in
	  library science working with extending MARC records, ISBN
	  and ISSN numbers is called for.
	  
Discovery mechanisms:
	- There remains a large open problem in rapidly and
	  efficiently discovering the existance and location
	  of information in a large distributed computing environment.
	  The proposed UDIs and UDSNs may enable such systems to be built
	  but additional wrk is still needezVCd. There are problems both
	  in locating individual service providers and specific pieces
	  of information.

Authentication and Access Control:
	- Security issues were not discussed in depth, but it was agreed
	  that such issues would become more important as large-scale
	  systems are developed and deployed.


Editorial Control:
	- Again a topic touched upon only briefly, it was suggested by
	  one participant that true Living Document systems would have
	  to include some method of imposing editorial control.

Mailing List:

			nir@cc.mcgill.ca
			nir-request@cc.mcgill.ca


Mailing List Archive:
	anonymous FTP to:	archives.cc.mcgill.ca
	subdirectory:		"/pub/mailing-lists/nir-archive"