IRC log of simile on 2004-01-22

Timestamps are in UTC.

16:03:51 [RRSAgent]
RRSAgent has joined #simile
16:04:13 [mickBass]
mickBass has joined #simile
16:04:18 [stefanom]
won't be able to join on the phone, sorry
16:04:24 [marbut]
Agenda
16:04:24 [marbut]
1. Round table update from Pis
16:04:24 [marbut]
2. Logistics for arrival of new hires
16:04:24 [marbut]
3. Review project task list - see below
16:04:24 [marbut]
1. ARTSTOR AND OCW DATASETS
16:04:26 [marbut]
Progress update from Mark:
16:04:27 [marbut]
- Now possible transform OCW data just with XSLT, no need to use Perl. The
16:04:29 [marbut]
transform does not do everything the Perl transform did, but this avoids all
16:04:31 [marbut]
the team installing Perl on their machines.
16:04:35 [marbut]
- Artstor topic, geographic and subject fields are now organized
16:04:37 [marbut]
hierarchically where appropriate.
16:04:39 [marbut]
- New ANT automatic build script in CVS allows all team members to rebuild
16:04:41 [marbut]
datasets, avoids need to upload entire datasets to CVS, team members just
16:04:43 [marbut]
need to retrieve updated XSLT scripts from CVS.
16:04:45 [marbut]
- Added type information to OCW dataset.
16:04:47 [marbut]
To do:
16:04:49 [marbut]
- Adopt a common way of displaying names in Artstor and OCW.
16:04:51 [marbut]
- Need to change the way typing is done in OCW so it is more compatible with
16:04:53 [marbut]
the existing LOM schemas.
16:04:55 [marbut]
2. HAYSTACK / SIMILE
16:04:57 [marbut]
Progress update from Steve
16:04:59 [marbut]
Progress update from Andy on Joseki / Haystack integration
16:05:01 [marbut]
To do:
16:05:05 [marbut]
- Need to update simile.ad file so Haystack can display revised OCW dataset.
16:05:07 [marbut]
- Now hierarchical information has been added to datasets, Haystack needs to
16:05:09 [marbut]
process this in faceted browser.
16:05:11 [marbut]
3. CUSTOM BROWSER
16:05:13 [marbut]
Progress update from Mark:
16:05:15 [marbut]
- Can now display both Artstor and OCW data
16:05:17 [marbut]
- Can display hierarchical facets e.g. geographic, subject and topic
16:05:19 [marbut]
- Uploaded to CVS, Rob has been able to retrieve and run on both Linux and
16:05:21 [marbut]
Windows.
16:05:23 [marbut]
To do:
16:05:25 [marbut]
- Write interface so it is possible to switch between using Lucene for
16:05:27 [marbut]
queries and RDQL.
16:05:29 [marbut]
- Add text search boxes to facets when it is not possible to display all
16:05:33 [marbut]
facet values.
16:05:35 [marbut]
- Add paging to facets when it is not possible to display all facet values.
16:05:37 [marbut]
- Fix Ant / XSLT / MTXSLT / Saxon 7 / ENTITY / DOCTYPE bug.
16:05:39 [marbut]
- Need to fix Ant build script so tasks that call XSLT, unzip or Jena
16:05:41 [marbut]
Schemagen only rebuild when necessary.
16:05:43 [marbut]
- Add Jena persistant model support.
16:05:45 [marbut]
- Demonstrate inferencing between the two datasets.
16:05:47 [marbut]
- Display facet frequency.
16:05:49 [marbut]
- Allow both alphabetic sorting of facets and sorti
16:05:51 [marbut]
roll call: andy, steve, rob, mick on the phone
16:05:53 [marbut]
mackenzie joins
16:05:56 [marbut]
stefano on irc
16:05:57 [marbut]
eric sends his regrets
16:06:18 [Rob]
Hires update
16:06:47 [Rob]
Ryan Lee offer out, likely to start early next week (not definite)
16:07:16 [stefanom]
I'm arriving in boston on Saturday night
16:07:29 [stefanom]
will start operational on monday
16:07:41 [Rob]
Good progress on SIMILE funding from review
16:07:51 [Rob]
No cost extension approved
16:08:40 [Rob]
Mick in Cambridge 28-30 Jan
16:08:52 [stefanom]
what happened to the hackaton?
16:09:25 [Rob]
demo critical milestone
16:09:42 [marbut]
(david karger joins)
16:09:55 [marbut]
(Stefano, we'll address you question in the next agenda item)
16:10:29 [stefanom]
ok
16:10:31 [Rob]
next milestone to achieve demo with scale
16:10:57 [stefanom]
sorry for not being able to call in
16:12:05 [Rob]
decided to stick with OCW & ARTstor despite lack of public accessibility
16:12:26 [stefanom]
+1
16:12:38 [Rob]
OK for hires to start at CRL; office, access OK, will need kit (PC etc)
16:13:02 [marbut_]
marbut_ has joined #simile
16:13:17 [marbut_]
Mackenzie has the thumbnails for artstor
16:13:29 [marbut_]
Mackenzie: But they will not let us make it publically available
16:13:48 [marbut_]
SteveGarland: Isn't there no copyright on thumbnails?
16:14:03 [marbut_]
Mackenzie: The Mellon Foundation, don't have that position, we are not the copyright owners in this case
16:14:16 [marbut_]
I've written a letter to say we will not make them publically available
16:14:34 [marbut_]
so I'm looking for an alternative corpus we can use later without such restrictions
16:15:15 [marbut_]
David: It's been a slow week, alround, I'm working us through the list of haystack items
16:15:24 [marbut_]
that we put together last week
16:15:45 [mickBass]
logistics for new hires:
16:15:58 [mickBass]
Offices and passes no prob for Stefano and Ryan (for Monday)
16:16:11 [mickBass]
Kit (PC) is maybe issue
16:16:39 [stefanom]
no prob for me, my laptop is with me and I don't need anything else
16:16:59 [stefanom]
maybe I'll need a windows machine later on, but I can go virtualPC for now
16:17:41 [marbut_]
mickBass: there are threads on three clients going: based on Haystack, Haystack-web, and the custom client
16:17:43 [Rob]
3 clients.. Haystack rich/Web and Mark B's Web client
16:18:09 [marbut_]
(oops, sorry rob)
16:18:43 [Rob]
need to review each client and associated issue lists with core tech team
16:20:17 [Rob]
when each is at the stage they can view the dataset
16:21:05 [Rob]
next thursday PI call time, can walk through each client and issue list with tech team (many in Cambridge, some remote via netmeeting)
16:21:24 [stefanom]
good for me
16:22:31 [Rob]
Steve G and Rob to sort out logistics for that (PCs etc)
16:22:50 [Rob]
ACTION: Steve G and Rob to sort out logistics for that (PCs etc)
16:23:30 [Rob]
dataset: Mark has enabled all SIMILE people to build dataset from CVS (with Java + Ant installed)
16:24:12 [Rob]
can build locally and XSLTs can be updated to fix probs in the RDF serialisations etc
16:24:35 [Rob]
(This is IPSSources CVS, not Haystack CVS)
16:25:49 [Rob]
Have an ant target that recognises if HAYSTACK_HOME is set, and can then copy data to Haystack, so we can keep separate CVS
16:26:36 [Rob]
ACTION: Mark to chase up IPSSources to get David K IPSSources access
16:27:36 [stefanom]
Nice job mark!
16:29:11 [Rob]
Transforms updated to use subclass relationships for hierarchical controlled vocabs (e.g. China, province)
16:30:05 [Rob]
Type info added to OCW. Looking into getting Haystack to display that.
16:30:24 [Rob]
(Low-level type info, e.g. date, number etc. was previously missing)
16:30:49 [Rob]
Now have common stylesheet to canonicalise names in Artstor + OCW
16:31:15 [Rob]
Need to keep OCW in line with LOM schemas, however
16:31:25 [mickBass]
mickBass has joined #simile
16:31:42 [mickBass]
mick is back after proxy-server difficulty..
16:33:14 [Rob]
where to put artstor thumbnails?
16:33:52 [Rob]
Need to think about long-term infrastructure needs... worry about that after demo
16:34:22 [Rob]
Haystack/SIMILE progress update
16:34:53 [Rob]
Haystack Web browser client: Good progress, should have something usable by next week
16:36:10 [Rob]
ACTION: Steve to leave a Haystack running, so people can try out it remotely
16:38:08 [Rob]
Haystack/Joseki progress update: seems to work OK. Fine-grained queries rather than a big GET, so won't work for large data sets
16:38:37 [Rob]
potential for cache on Haystack side
16:39:42 [Rob]
Need to fix simile.ad file in Haystack, updating OCW data uncovered problems
16:40:01 [Rob]
ACTION: Need to fix simile.ad file in Haystack, updating OCW data uncovered problems
16:40:22 [Rob]
ACTION: Need to find out if/how Haystack deals with sub-classing information
16:41:11 [Rob]
Probably don't need Haystack/Joseki integration for demo
16:43:19 [Rob]
Progress on thin Web client. Now can display OCW + ARTstor simultaneously, as well as hierarchical facets. In CVS, has been run on Linux + Windows, some problems to fix
16:43:32 [Rob]
(problems related to instal//build)
16:44:58 [Rob]
Issues: Need to add interface can use Lucene or RDQL; searchable/paging of facet values (900+ even in small dataset); ant/XSLT probs; support for Jena persistent models; demonstrate inferencing
16:45:14 [Rob]
also UI issues, sorting facets by alpha/frequency etc.
16:46:06 [Rob]
CVS Access progress update: Cleaned up a bit
16:46:28 [Rob]
Invitations re-issued to those who never logged in
16:46:44 [Rob]
ACTION: Invitations must be acted on within 7 days
16:47:37 [Rob]
Identifying corpus subset: Kevin has done some work. uses 'visulaising cultures course': USCDAA segment
16:48:36 [Rob]
Need user testing for Haystack + custom browser
16:48:45 [Rob]
Eric's suggestion of brownsauce+ RDFNavigator
16:53:48 [Rob]
simile.mit.edu: need a machine for infrastructure (issue lists, wiki etc.)... but simile.mit.edu is heavyweight for that, can probably find a lighter-weight machine for that. simile.mit.edu could be used for demo's where heavy-weight processing required
16:54:09 [Rob]
could run Haystack Web client on it, but tricky since you need the Haystack UI up and running, so might not work
16:54:48 [stefanom]
I would like that machine to be linux/freebsd, would make it much easier to administer for me
16:54:50 [Rob]
Send someone to Cannes W3C tech plenary with SIMILE demo?
16:55:17 [Rob]
Steve volunteers :)
16:55:31 [Rob]
(plenary in March) Andy going anyway
16:55:53 [Rob]
will likely demo in first 1 or 2 days, need to get slot soon
16:58:20 [Rob]
RRSAgent, pointer?
16:58:20 [RRSAgent]
See http://www.w3.org/2004/01/22-simile-irc#T16-58-20
16:59:06 [Rob]
RRSAgent, bye
16:59:06 [RRSAgent]
I see 6 open action items:
16:59:06 [RRSAgent]
ACTION: Steve G and Rob to sort out logistics for that (PCs etc) [1]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-22-50
16:59:06 [RRSAgent]
ACTION: Mark to chase up IPSSources to get David K IPSSources access [2]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-26-36
16:59:06 [RRSAgent]
ACTION: Steve to leave a Haystack running, so people can try out it remotely [3]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-36-10
16:59:06 [RRSAgent]
ACTION: Need to fix simile.ad file in Haystack, updating OCW data uncovered problems [4]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-40-01
16:59:06 [RRSAgent]
ACTION: Need to find out if/how Haystack deals with sub-classing information [5]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-40-22
16:59:06 [RRSAgent]
ACTION: Invitations must be acted on within 7 days [6]
16:59:06 [RRSAgent]
recorded in http://www.w3.org/2004/01/22-simile-irc#T16-46-44