A library catalog is a database that serves as an inventory of materials owned and locally managed by a library. The catalog is distinct from vendor-provided databases of articles, journals, e-books, and media which a library may subscribe to. In those interfaces, a private company aggregates and indexes the content, and controls the way the resources are described. The library subscribing to the database has limited say in language choices, display, and functionality.
By contrast, a library’s local catalog contains records for the physical materials owned and organized by the library: books, journals, maps, posters, DVDs, CDs, kits, devices, etc. It can also include digitized or born-digital materials hosted on library servers, like PDFs, streaming media files, or scanned materials (1). For these locally-owned and managed materials, the MSU Libraries play a more active role in how resources are described, indexed, and displayed to the public.
The term “metadata” is often used to refer to descriptions of information resources. Metadata includes information which is fairly objective: title of a work, names of creators and contributors, publication date, format, language, edition information. But metadata also includes more subjective elements, such as subject terms and a call number for shelf location. (The call number system is known as classification.) These systems for assigning subject terms and call numbers are attempts to organize knowledge. These systems make it possible to find the book you need among millions in our collection.
Cataloging is the process of:
Historically, maintaining a complete and updated list of an entire local collection was a challenge for every library. This work was complicated by the fact that cataloging practices were not shared or standardized for a long time. If a book was owned by 500 libraries, that meant 500 different librarians had to create a catalog record for it.
In the 1960s, British and American libraries implemented a community standard called the Anglo-American Cataloging Rules, ensuring that a book could be described consistently across institutions. Soon after, technology made it possible for libraries to share catalog records with each other. Only one library needed to create a catalog record. Other libraries could download that and print a catalog card.
Bu the 1990s, the libraries' card catalogs were being turned into databases. Although users can search these databases online, they still contain legacy data -- information compiled 50 or even 100 years ago.
Now, when a resource is acquired -- say, a DVD of the documentary film 13th -- a cataloger checks the WorldCat database of shared catalog records and downloads a matching record to the library’s local database. That record then appears in the library’s online catalog for users to find. In some cases, a record is not available in WorldCat yet. The cataloger creates one, following the community standards used by all WorldCat member libraries. Then the new record is uploaded to WorldCat for other institutions to use.
Why can't you just change the harmful language?
The Library of Congress Subject Headings (also called LC Subject Headings or LCSH) are the main tool we have for describing what each item in the library collection is about. Many problematic LCSH headings have been changed over the years, but many remain. There are both political and technical reasons for this.
Are there alternatives to the Library of Congress Subject Headings?
Yes and no.
The LC Subject Headings cover the full range of knowledge: arts and humanities, social sciences, science and mathematics, and applied fields. The LC Subject Headings are also the most widely-used thesaurus of subject terms in the English-speaking world, which means that nearly all the records we download from WorldCat contain LCSH headings, and the records we upload to WorldCat need to have LCSH headings to be useful to other libraries.
However, the Library of Congress Subject Headings are not the only set of subject headings available; there are many specialized vocabularies. The National Library of Medicine maintains Medical Subject Headings, or MeSH. The Getty Research Institute maintains an Art & Architecture Thesaurus. The Homosaurus International Thesaurus has in-depth coverage of LGBTQ+ vocabulary. We use all of these vocabularies and more to describe our collection. The Library of Congress Subject Headings are notable for their breadth, but specialized vocabularies are often more precise.
What are name authority records?
Imagine that the first book you wrote was in your professional field of chemical engineering. You used Elizabeth G. Rodriguez on the title page. Later you wrote a children's book about science, using a shorter form of your name on the title page: Liz Rodriguez.
The name authority record established for you in the library world typically uses the form of your name in your earliest published work: in this case, Elizabeth G. Rodriguez. But if the author biography in your children's book mentions your earlier book, then librarians know that Liz Rodriguez, children's science book author, is the same person as Elizabeth G. Rodriguez, chemical engineering author. And, we can reflect that in the catalog record. That's the purpose of name authority work: to bring together all the publications by one person, where possible, even if their name changes or different publications use longer or shorter forms of their name.
Creating name authority records is a little like creating Wikipedia entries about people from underrepresented groups, such as scientists who are women and/or people of color. Both efforts are part of the infrastructure that makes it easier to recognize their contributions.