Statistics: Title Counts

See Best Practices for Decision Pages and Tags for groups
Legend: not started IN PROGRESS STALLED decided

Status

DECIDED

Description

Given our shared understanding that some level of duplication of bibliographic records for the same title exists in our system, how should title counts work?

Decision summary

Use counts of MMS IDs and accept possible overcounting. This is the best approach because duplication issues are widely understood, and other methods of deduplication are too unreliable and inconsistent.

Owning group

AASAP Team sils-aasa-l@listserv.ucop.edu

Approver

 

Consulted

AASAP Team members

Resource Management and Acquisitions experts from UCB

Informed

Leadership Group

Resource Management Group

Decision-making process

Informal consultation among team members, expert consultation, discussion.

Priority

 

Target decision date

Aug 31, 2023

Date decided

Aug 21, 2023

Recommendation

Use MMS ID because it is easy and consistent, and because other options will result in undercounting. We will end up overcounting titles where there are multiple bibliographic records for the same thing, especially for electronic resources. However, there is both 1) widespread understanding of the limitations of this approach and 2) widespread acceptance of its utility.

Impact

Stakeholder group

Impact

Stakeholder group

Impact

UC Libraries

Determinations around what and how we report are for the most part managed/owned by the UC Libraries (i.e., shared ownership).

CDL

CDL analysts, who are responsible for constructing report queries at the Network Zone according to templates agreed upon by the UC Libraries, must exclude items based a variety of parameters…

Background

System-wide, some duplication exists in title-level records. Cataloging rules provide guidance about when separate title-level records are required and when campuses should use the same bibliographic record to represent multiple items. However, in practice, bibliographic records are duplicated with some regularity, especially for electronic materials. For example:

  • CDL OA eBook title duplication.
    From CDL e-resources Acquisitions. Summarized email from cdlacq on Sept 21, 2022:

    • CDL turns on full yearly CZ collections.

    • If publishers add some open access titles to either the yearly CZ frontlist collections or give them to us when we receive marc records for our package, then those titles may end up duplicating titles from the same publisher under a full Open Access collection that we have turned on in the NZ.  

    • We determined that this is ok to leave as they are for now, for the sake of staff time & making collections available to users faster in the NZ, compared to the work it would take to clean them up now or prior to processing a new collection.  This is not exactly a new issue. We may consider internally whether there are ways we can try to mitigate this going forward, in the future when we have more capacity for this.

    • Sample: Ex Libris Discovery - 9780806192116 (exlibrisgroup.com)

  • Vendor-supplied, non-provider-neutral, non-OCLC record duplication. Like those from Cassidy cataloging for resources in Westlaw and LexisNexis.

    • 13,700 bibs in UCI Law Westlaw collection

    • 16,650 MMS IDs for Distinct count MMS ID where the Title + Author Combined and Normalized match

    • Example: separate, non-provide-neutral, non-OCLC bib records for every law review in Lexis & Westlaw

  • Electronic record duplication when campuses use different record sets. For example, UCI Law is using OCLC records for an eBook package without vendor-supplied records, which is time-consuming but gets us the OCLC updates. But another campus is using non-OCLC CZ records. Both of those sets of records have linked NZ bibs.

  • MARCIVE government document records. It’s known that there are duplicative records in the system.

Options Considered

ARL instructions for title counts :

  • Report the total number of titles managed and maintained by the library that are cataloged and made ready for use.

  • Deduplicate titles by counting multiple copies of the same manifestation as one title.

  • Identical content in different formats should not be deduplicated, and each format should be counted as a different title. For example, a serial title available in print, microform and online would be counted as three titles.

  • Count different editions and versions of the same work as separate titles since they denote depth in the collection.

  • Counting the 245 field when the library provides stewardship for those resources may be sufficient.

Alma Options (Count distinct)

Notes

Alma Options (Count distinct)

Notes

Title Normalized, separated by format

This undercounts unique titles because works with different authors and editions are not distinguished.

 

Title Author Combined and Normalized, separated by format

This undercounts unique titles because works in different editions are not distinguished

Concat(Title Author Combined and Normalized, Edition), separated by format

This undercounts unique titles because generically-named titles that are actually different things do not have unique-enough metadata in these fields to distinguish them.

MMS ID

This overcounts unique titles.

This is especially true for electronic resources, where we are using bibliographic records of varying quality from many different sources in order to provide access.

Example from UCI Law

At Irvine, Counts from the Titles Subject Area where:

  • Lifecycle = “In Repository”

  • Resource Type is “Book - Electronic”

  • Library Code (Physical) contains LAW OR Library Code (Electronic) contains LAW

Option

Title Count at UCI Law (rounded)

Option

Title Count at UCI Law (rounded)

Distinct Title Author Combined and Normalized

274,000

Distinct Concat(Title Author Combined and Normalized, Edition)

280,000

Distinct Title (Complete)

301,000

Num of Titles (Active)

364,000

Questions to consider

How much concern about duplication exists in the system? In other words, do we already have widespread acceptance for possible overcounting based on shared and longstanding professional expertise about metadata limitations?
Answer: There is some concern but it tends to be context-specific. Systemwide, there does already exist a shared understanding among campus experts that 1) libraries regularly report record counts as an approximation for title counts, and 2) title count data is inherently imprecise.

Action Log

Action/Point Person

Expected Completion Date

Notes

Status

Action/Point Person

Expected Completion Date

Notes

Status

Gather initial feedback from RM colleagues based on prototype

Aug 17, 2023

Accomplished via email

Complete

Develop proposal for decision-making, with input from a couple of campus experts to help shape future, more inclusive process

Aug 21, 2023

See meeting notes: https://uc-sils.atlassian.net/wiki/spaces/AASAPT/pages/2221670401

Complete

The SILS mission is to transform library services and operations through innovation and collaboration. The future is shared!
Question? Contact AskSILS-L@ucop.edu