Catalgablog

Syndicate content
Library cataloging, classification, metadata, subject access and related topics.
Updated: 2 hours 40 min ago

Comment on RDA

November 18, 2008 - 1:16pm
Form for comments from the US about RDA.

Koha User Group Meeting

November 18, 2008 - 11:09am
On April 16-17th there will be a Koha innovations and sharing group in Plano Texas (suburb of Dallas/Fort Worth). The 2 day workshop would have lab access and presentation space. There would be a charge to cover lunch both days and other expenses. Any leftover money would be given to the KUDOS users group as seed money. Anticipated cost $100.

Algorithms for Clustering Tags

November 18, 2008 - 11:03am
Clustering Tags in Enterprise and Web Folksonomies by Simpson, Edwin will be published and presented at the International Conference on Weblogs & Social Media, Seattle, March 31st, 2008 (HPL-2008-18 )Tags lack organizational structure limiting their utility for navigation. We present two clustering algorithms that improve this by organizing tags automatically. We apply the algorithms to two very different datasets, visualize the results and propose future improvements.

RDA Draft

November 17, 2008 - 5:30pm
The full draft of RDA is now available for comment. Comments needed.

Indexing Tool

November 17, 2008 - 11:30am
Library catalogs these days are mostly ralational databases and related indexes. LuSql is a tool to create an index from a relational databse.LuSql is a simple but powerful tool for building Lucene indexes from relational databases. It is a command-line Java application for the construction of a Lucene index from an arbitrary SQL query of a JDBC-accessible SQL database. It allows a user to control a number of parameters, including the SQL query to use, individual indexing/storage/term-vector nature of fields, analyzer, stop word list, and other tuning parameters. In its default mode it uses threading to take advantage of multiple cores.

LuSql can handle complex queries, allows for additional per record sub-queries, and has a plug-in architecture for arbitrary Lucene document manipulation. Its only dependencies are three Apache Commons libraries, the Lucene core itself, and a JDBC driver.

LuSql has been extensively tested, including a large 6+ million full-text & article metadata document collection, producing an 86GB Lucene index.Lots of the Code4Lib folks are working with Lucene indexes.

Lemon8-XML

November 17, 2008 - 11:17am
Adding semantic mark-up to text is something the cataloger in me always finds good. Microformats, XML, or RDF all make searches more precise. Lemon8-XML is a tool to chamge scholarly papers in MS Word or Open Office formats into XML. Sweet idea.Lemon8-XML is a web-based application designed to make it easier for non-technical editors and authors to convert scholarly papers from typical word-processor editing formats such as MS-Word .DOC and OpenOffice .ODT, into publishing layout formats such as the open, industry-standard NLM Journal Publishing XML format.

To use Lemon8-XML, you don't need to understand XML, all you need is a little time and a general understanding of how scholarly articles are structured. In general, this means a document with:

  1. some information about the article and authors at the top
  2. usually an abstract
  3. several sections, often titled "introduction", "methods", "results", etc.
  4. optional figures or tables, either in-text or as appendices
  5. a list of references or citations in a standardized format (eg. MLA, APA, etc.)
It is from the Public Knowledge Project.

Preliminary Authority Records

November 13, 2008 - 1:48pm
Just what does it take to upgrade a preliminary authority record? I ask because there are some about 25 years old that are still preliminary.

n 83827701
Space Age Astronomy Symposium (1961 : Pasadena, Calif.)

or

n 83827385
Solar Spectrum Symposium (1963 : Rijksuniversiteit te Utrecht)

Preliminary Authority Records

November 13, 2008 - 1:48pm
Just what does it take to upgrade a preliminary authority record? I ask because there are some about 25 years old that are still preliminary.

n 83827701
Space Age Astronomy Symposium (1961 : Pasadena, Calif.)

or

n 83827385
Solar Spectrum Symposium (1963 : Rijksuniversiteit te Utrecht)

Preliminary Authority Records

November 13, 2008 - 1:48pm
Just what does it take to upgrade a preliminary authority record? I ask because there are some about 25 years old that are still preliminary.

n 83827701
Space Age Astronomy Symposium (1961 : Pasadena, Calif.)

or

n 83827385
Solar Spectrum Symposium (1963 : Rijksuniversiteit te Utrecht)

OpenSearch and unAPI Enrichs the Cataloges

November 13, 2008 - 12:23pm
SeeAlso: A Simple Linkserver Protocol by Jakob Voss appears in Ariadne no. 57 (October 2008)In recent years the principle of Service-oriented Architecture (SOA) has grown increasingly important in digital library systems. More and more core functionalities are becoming available in the form of Web-based, standardised services which can be combined dynamically to operate across a broader environment [1]. Standard APIs for searching (SRU [2] [3], OpenSearch [4]), harvesting and syndication (OAI-OMH [5], ATOM [6]), copying (unAPI [7] [8]), publishing, editing (AtomPub [9], Jangle [10], SRU Update [11]), and more basic library operations, either already exist or are being developed.

The creation of the SeeAlso linkserver protocol was occasioned by the need to enrich title views in library catalogues of the German Common Library Network (GBV) with links to additional information. However, instead of integrating those links into title records and tailoring the presentation to our specific OPAC software, we decided to create a general linkserver Web service.

OpenSearch and unAPI Enrichs the Cataloges

November 13, 2008 - 12:23pm
SeeAlso: A Simple Linkserver Protocol by Jakob Voss appears in Ariadne no. 57 (October 2008)In recent years the principle of Service-oriented Architecture (SOA) has grown increasingly important in digital library systems. More and more core functionalities are becoming available in the form of Web-based, standardised services which can be combined dynamically to operate across a broader environment [1]. Standard APIs for searching (SRU [2] [3], OpenSearch [4]), harvesting and syndication (OAI-OMH [5], ATOM [6]), copying (unAPI [7] [8]), publishing, editing (AtomPub [9], Jangle [10], SRU Update [11]), and more basic library operations, either already exist or are being developed.

The creation of the SeeAlso linkserver protocol was occasioned by the need to enrich title views in library catalogues of the German Common Library Network (GBV) with links to additional information. However, instead of integrating those links into title records and tailoring the presentation to our specific OPAC software, we decided to create a general linkserver Web service.

Omeka 0.10

November 12, 2008 - 3:29pm
Omeka 0.10 was released yesterday.Omeka 0.10b incorporates many of the changes you asked for: an unqualified Dublin Core metadata schema and fully extensible element sets to accommodate interoperability with digital repository software and collections management systems; elegant reworkings of our theme API and plugin API to make add-on development more intuitive and more powerful; a new, even more user friendly look for the administrative interface; and a new and improved Exhibit Builder. While the changes are extensive and represent a next-to-last step forward toward a 1.0 release in early 2009, existing users of Omeka should have little trouble switching to 0.10b. New users should have even less trouble getting started. Meanwhile, visitors to Omeka.org will find a new look, a more intuitive information architecture, easily browsable themes and plugins directories, improved documentation and user support, and new ways to get involved in the Omeka community.

Omeka 0.10

November 12, 2008 - 3:29pm
Omeka 0.10 was released yesterday.Omeka 0.10b incorporates many of the changes you asked for: an unqualified Dublin Core metadata schema and fully extensible element sets to accommodate interoperability with digital repository software and collections management systems; elegant reworkings of our theme API and plugin API to make add-on development more intuitive and more powerful; a new, even more user friendly look for the administrative interface; and a new and improved Exhibit Builder. While the changes are extensive and represent a next-to-last step forward toward a 1.0 release in early 2009, existing users of Omeka should have little trouble switching to 0.10b. New users should have even less trouble getting started. Meanwhile, visitors to Omeka.org will find a new look, a more intuitive information architecture, easily browsable themes and plugins directories, improved documentation and user support, and new ways to get involved in the Omeka community.

OPML

November 10, 2008 - 12:12pm
How (and Why) to Create an OPML File by Marshall Kirkpatrick is only new to me. A PR person looks at the Outline Processor Markup Language.There’s a billion other reasons to use OPML - just ask yourself in what circumstances you can imagine sending someone else one link or file that contains a collection of dynamic sources on any topic. I know these are the sorts of questions that keep me up at night.I'm not seeing OPML icons as often as I'd expect. Is this another PICS, a good idea that just never gets adopted?

OPML

November 10, 2008 - 12:12pm
How (and Why) to Create an OPML File by Marshall Kirkpatrick is only new to me. A PR person looks at the Outline Processor Markup Language.There’s a billion other reasons to use OPML - just ask yourself in what circumstances you can imagine sending someone else one link or file that contains a collection of dynamic sources on any topic. I know these are the sorts of questions that keep me up at night.I'm not seeing OPML icons as often as I'd expect. Is this another PICS, a good idea that just never gets adopted?

OPML

November 10, 2008 - 12:12pm
How (and Why) to Create an OPML File by Marshall Kirkpatrick is only new to me. A PR person looks at the Outline Processor Markup Language.There’s a billion other reasons to use OPML - just ask yourself in what circumstances you can imagine sending someone else one link or file that contains a collection of dynamic sources on any topic. I know these are the sorts of questions that keep me up at night.I'm not seeing OPML icons as often as I'd expect. Is this another PICS, a good idea that just never gets adopted?

WorldCat Hackathon

November 6, 2008 - 12:11pm
WorldCat Hackathon is the impetus for some tool development. From OCLC comes this noticeWe added a few more features in this month's xID deployment, hopefully it could be useful in upcoming WorldCat Hackathon.
  • support LCCN query such as: http://xisbn.worldcat.org/webservices/xid/lccn/2004273129?fl=isbn,lccn
  • support deleted OCLCNUM (marc 019 field) http://xisbn.worldcat.org/webservices/xid/oclcnum/47139964?method=getMetadata In this case OCLCNUM 47139964 was merged into 33100112, and we use a flag "presentOclcnum" to mark present OCLC numbers.
  • xISSN project now supports tab-delimited and CSV dissemination http://xissn.worldcat.org/webservices/xid/issn/0036-8075?method=getEditions&format=csv&fl=issn,form,title http://xissn.worldcat.org/webservices/xid/issn/0036-8075?method=getEditions&format=txt&fl=issn,form,title
  • start to support php dissemination format in all XID projects http://xisbn.worldcat.org/webservices/xid/isbn/0596002815?method=getEditions&fl=*&format=php
Matienzo, Mark has announced that Python WorldCat Module v0.1.0 is now available.In preparation for the upcoming WorldCat Hackathon starting this Friday, I've made a few changes to worldcat, my Python module for interacting with OCLC's APIs. Most notably, I've added iterators for SRU and OpenSearch requests, which (like the rest of the module) painfully need documentation.

WorldCat Hackathon

November 6, 2008 - 12:11pm
WorldCat Hackathon is the impetus for some tool development. From OCLC comes this noticeWe added a few more features in this month's xID deployment, hopefully it could be useful in upcoming WorldCat Hackathon.
  • support LCCN query such as: http://xisbn.worldcat.org/webservices/xid/lccn/2004273129?fl=isbn,lccn
  • support deleted OCLCNUM (marc 019 field) http://xisbn.worldcat.org/webservices/xid/oclcnum/47139964?method=getMetadata In this case OCLCNUM 47139964 was merged into 33100112, and we use a flag "presentOclcnum" to mark present OCLC numbers.
  • xISSN project now supports tab-delimited and CSV dissemination http://xissn.worldcat.org/webservices/xid/issn/0036-8075?method=getEditions&format=csv&fl=issn,form,title http://xissn.worldcat.org/webservices/xid/issn/0036-8075?method=getEditions&format=txt&fl=issn,form,title
  • start to support php dissemination format in all XID projects http://xisbn.worldcat.org/webservices/xid/isbn/0596002815?method=getEditions&fl=*&format=php
Matienzo, Mark has announced that Python WorldCat Module v0.1.0 is now available.In preparation for the upcoming WorldCat Hackathon starting this Friday, I've made a few changes to worldcat, my Python module for interacting with OCLC's APIs. Most notably, I've added iterators for SRU and OpenSearch requests, which (like the rest of the module) painfully need documentation.

isbn2marc

November 6, 2008 - 11:01am
William Denton has written a program, isbn2marc, that takes and ISBN and returns a MARC record. It uses Z39.50 and is written in Ruby. Mr. Denton is the person responsible for the FRBR Blog, good stuff.

isbn2marc

November 6, 2008 - 11:01am
William Denton has written a program, isbn2marc, that takes and ISBN and returns a MARC record. It uses Z39.50 and is written in Ruby. Mr. Denton is the person responsible for the FRBR Blog, good stuff.