By marchamilton on Jul 05, 2010
Tom Cramer, Chief Technology Strategist and Associate Director, Digital Library Systems and Services, Stanford University, started off the morning. One of the interesting points Tom made was how Stanford seamlessly pulls data from five digital systems in the process of archiving student thesis papers. Starting with student and professor information from Stanford's Oracle Peoplesoft campus information system, archive metadata is automatically populated and combined with thesis PDFs, a new library catalog data record is automatically created, and finally, PDFs and associated metadata are automatically crawled and published to the world via Google Books.
Next, Oxford's Neil Jefferies took the discussion a bit deeper and talked about the changing nature of intellectual discourse. While Oxford's collection holds over 250 km shelf-miles of paper books, the library is increasingly working to archive more ephemeral university data sources including websites, social media, and linked data. A consistent theme discussed by Neil and many of the other speakers was the increasing focus on providing not only archive and preservation but also access to data.
On formally to the continent, Laurent DuPlouy and Olivier Rouchon from the French National Library presented on the SPAR Project and CINES Collaboration. They were brave enough to show a live demo of their system, including use of a StorageTek SL8500 Modular Library System.
Back to the UK, Brian Hole from The British Library presented on the LIFE3 project which aims to model the long term preservation lifecycle costs of digitized data. Brian's taking suggestions for improvements in LIFE4 and and I suggested he including in his model the Oracle Secure Backup Cloud module which can securely backup databases to Amazon S3 cloud storage.
After a wonderful Spanish lunch the first panel session of the day started with discussions on Community and Tool Set collaborations.
Jan Reichelt, founder and director of Mendeley reference management software used to organize, share, and discover academic research papers. Mendeley tracks over 28 million research papers including information on most read papers and authors.
David Tarrant of EPrints discussed how EPrints software is used to create and manage repositories.
Finally, Bram van der Werf of Open Planets Foundation described the Open Planets suite of tools for managing digital data.
After the panel presentation, we heard from a series of Oracle speakers. The Oracle Enterprise Content Management Suite 11g is broadly applicable to preservation and archive, capable of archiving over 179 million documents a day as shown in a recent benchmark. Of course, many PASIG customers already use the Sun Storage Archive Manager software along with StorageTek modular library systems and there were updates from Oracle speakers on all of products and more.
The final session included short presentations from a number of Oracle software partners in the archive and preservation space. I definitely learned a lot today about what some of the world's leading digital libraries are doing on the preservation and archive front, and hopefully it was a day well spent for all who attended. If you are not already a PASIG member, be sure to signup now, for this growing Oracle community.