• Ready for review
  • Statistics: Title Counts

    See Best Practices for Decision Pages and Tags for groups
    Legend: not started IN PROGRESS STALLED decided

    Status

    DECIDED

    Description

    Given our shared understanding that some level of duplication of bibliographic records for the same title exists in our system, how should title counts work?

    Decision summary

    Use counts of MMS IDs and accept possible overcounting. This is the best approach because duplication issues are widely understood, and other methods of deduplication are too unreliable and inconsistent.

    Owning group

    AASAP Team sils-aasa-l@listserv.ucop.edu

    Approver

     

    Consulted

    AASAP Team members

    Resource Management and Acquisitions experts from UCB

    Informed

    Leadership Group

    Resource Management Group

    Decision-making process

    Informal consultation among team members, expert consultation, discussion.

    Priority

     

    Target decision date

    Aug 31, 2023

    Date decided

    Aug 21, 2023

    Recommendation

    Use MMS ID because it is easy and consistent, and because other options will result in undercounting. We will end up overcounting titles where there are multiple bibliographic records for the same thing, especially for electronic resources. However, there is both 1) widespread understanding of the limitations of this approach and 2) widespread acceptance of its utility.

    Impact

    Stakeholder group

    Impact

    Stakeholder group

    Impact

    UC Libraries

    Determinations around what and how we report are for the most part managed/owned by the UC Libraries (i.e., shared ownership).

    CDL

    CDL analysts, who are responsible for constructing report queries at the Network Zone according to templates agreed upon by the UC Libraries, must exclude items based a variety of parameters…

    Background

    System-wide, some duplication exists in title-level records. Cataloging rules provide guidance about when separate title-level records are required and when campuses should use the same bibliographic record to represent multiple items. However, in practice, bibliographic records are duplicated with some regularity, especially for electronic materials. For example:

    • CDL OA eBook title duplication.
      From CDL e-resources Acquisitions. Summarized email from cdlacq on Sept 21, 2022:

      • CDL turns on full yearly CZ collections.

      • If publishers add some open access titles to either the yearly CZ frontlist collections or give them to us when we receive marc records for our package, then those titles may end up duplicating titles from the same publisher under a full Open Access collection that we have turned on in the NZ.  

      • We determined that this is ok to leave as they are for now, for the sake of staff time & making collections available to users faster in the NZ, compared to the work it would take to clean them up now or prior to processing a new collection.  This is not exactly a new issue. We may consider internally whether there are ways we can try to mitigate this going forward, in the future when we have more capacity for this.

      • Sample: Ex Libris Discovery - 9780806192116 (exlibrisgroup.com)

    • Vendor-supplied, non-provider-neutral, non-OCLC record duplication. Like those from Cassidy cataloging for resources in Westlaw and LexisNexis.

      • 13,700 bibs in UCI Law Westlaw collection

      • 16,650 MMS IDs for Distinct count MMS ID where the Title + Author Combined and Normalized match

      • Example: separate, non-provide-neutral, non-OCLC bib records for every law review in Lexis & Westlaw

    • Electronic record duplication when campuses use different record sets. For example, UCI Law is using OCLC records for an eBook package without vendor-supplied records, which is time-consuming but gets us the OCLC updates. But another campus is using non-OCLC CZ records. Both of those sets of records have linked NZ bibs.

    • MARCIVE government document records. It’s known that there are duplicative records in the system.

    Options Considered

    ARL instructions for title counts :

    • Report the total number of titles managed and maintained by the library that are cataloged and made ready for use.

    • Deduplicate titles by counting multiple copies of the same manifestation as one title.

    • Identical content in different formats should not be deduplicated, and each format should be counted as a different title. For example, a serial title available in print, microform and online would be counted as three titles.

    • Count different editions and versions of the same work as separate titles since they denote depth in the collection.

    • Counting the 245 field when the library provides stewardship for those resources may be sufficient.

    Alma Options (Count distinct)

    Notes

    Alma Options (Count distinct)

    Notes

    Title Normalized, separated by format

    This undercounts unique titles because works with different authors and editions are not distinguished.

     

    Title Author Combined and Normalized, separated by format

    This undercounts unique titles because works in different editions are not distinguished

    Concat(Title Author Combined and Normalized, Edition), separated by format

    This undercounts unique titles because generically-named titles that are actually different things do not have unique-enough metadata in these fields to distinguish them.

    MMS ID

    This overcounts unique titles.

    This is especially true for electronic resources, where we are using bibliographic records of varying quality from many different sources in order to provide access.

    Example from UCI Law

    At Irvine, Counts from the Titles Subject Area where:

    • Lifecycle = “In Repository”

    • Resource Type is “Book - Electronic”

    • Library Code (Physical) contains LAW OR Library Code (Electronic) contains LAW

    Option

    Title Count at UCI Law (rounded)

    Option

    Title Count at UCI Law (rounded)

    Distinct Title Author Combined and Normalized

    274,000

    Distinct Concat(Title Author Combined and Normalized, Edition)

    280,000

    Distinct Title (Complete)

    301,000

    Num of Titles (Active)

    364,000

    Questions to consider

    How much concern about duplication exists in the system? In other words, do we already have widespread acceptance for possible overcounting based on shared and longstanding professional expertise about metadata limitations?
    Answer: There is some concern but it tends to be context-specific. Systemwide, there does already exist a shared understanding among campus experts that 1) libraries regularly report record counts as an approximation for title counts, and 2) title count data is inherently imprecise.

    Action Log

    Action/Point Person

    Expected Completion Date

    Notes

    Status

    Action/Point Person

    Expected Completion Date

    Notes

    Status

    Gather initial feedback from RM colleagues based on prototype

    Aug 17, 2023

    Accomplished via email

    Complete

    Develop proposal for decision-making, with input from a couple of campus experts to help shape future, more inclusive process

    Aug 21, 2023

    See meeting notes: 2023-08-21 10 am AASA-PT Meeting Notes

    Complete

    The SILS mission is to transform library services and operations through innovation and collaboration. The future is shared!
    Question? Contact AskSILS-L@ucop.edu