Skip to Main Content

Love Data Week

Find resources related to data at the library for Love Data Week 2024.

CCSU's Murals

"City at the Crossroads of History" was painted in 2014 by CCSU Professor Mike Alewitz. It depicts the history of social activism in New York City and was inspired by the works of Diego Rivera and Hieronymus Bosch. Like many of Alewitz's murals, the painting courted a large amount of controversy and was the subject of censorship.

Alewitz taught in the CCSU Art Department for 17 years, where he ran the mural program. The murals that you can find around the CCSU campus were painted by Alewitz's students. The library is currently in the process of preserving Alewitz's work and that of his students in a digital archive.

Click here to visit the library's Mike Alewitz collection on the Connecticut Digital Archive.

Click here to visit the website for Alewitz's non-profit studio-museum, Red Square.

Click here to learn more about "City at the Crossroads of History".

Jorge Luis Morales-Torres painted "The Roots of the Caribbean" when he was a student at CCSU in 1995. He earned his MSED in Art in 1996, and his rich depiction of Caribbean history and culture has been nationally recognized. "The Roots of the Caribbean" now hangs in the CCSU student center.

Visit the Latin American, Latino, and Caribbean Center's website for more information on the many elements of Caribbean history and culture that this mural depicts.

You can also check out https://rootsofthecaribbeanmural.com/, an interactive website that the library is creating with the LALCC to provide information, historical artifacts, and educational resources related to the mural and the various subjects that it depicts.

What is Metadata?

Metadata is commonly described as "data about data." While easy to remember, this definition is far too vague to be useful. The definitions below provide better explanations in plain English. 

Definition from the National Information Standards Organization (NISO)
"Metadata is structured information that describes, explains, locates, or otherwise makes it easier to retrieve, use, or manage an information resource." 

Definition from Steven Miller, Information and Metadata Lecturer
“Extra baggage associated with any resource that enables a real or potential user to find that resource and to determine value…”

Definition from Karen Coyle, Digital Librarian and Author of Coyle's InFormation
“Metadata is constructed, constructive, and actionable.”

  • Constructed - a man-made artifice, not naturally occurring
  • Constructive - serving a useful purpose, to solve some problem
  • Actionable - can be acted upon, processed by humans and machines

As indicated in the Miller definition above, metadata helps people find resources and determine their value, for whatever need is at hand. This function of metadata is especially critical in digital environments, where humans rely on computer processing for reliable and timely results:

  • Metadata facilitates organization, indexing, discovery, access, analysis, and use of print and online resources.
  • Metadata enables software agents to navigate and "comprehend" web content.
  • Metadata influences search engine results, rankings, and click-through rates.
  • Metadata is even more important for non-textual content that isn't readily processed by machines (e.g. images, multimedia, datasets).
  • Metadata presence and quality (or the lack thereof) can significantly help or hinder time and money expenditures in research activities.

An interesting look at the importance of Metadata and why it really matters can be seen in this slideshow from Rachel Lovinger entitled "Metadata is a Love Note to the Future"

Much of the content on this page comes from Carnegie Mellon University's Metadata Guide.

Metadata Standards

Different communities create, collect, manage, and use different types of information. These communities may have different concerns and approaches to metadata. There is no single standard that can satisfy all the needs across communities.

However, in order for metadata to be understandable to the people and software applications that use it, some sort of consistency is required. Organizations often define and publish metadata standards to meet needs broadly across knowledge domains or within specialized disciplines. Published standards help system designers and end users accomplish their goals effectively.   

Explore the different tabs in this section of the guide to learn more about metadata standards.

Dublin Core is the most common metadata schema for web content.

Named in part for a 1995 metadata conference hosted by the Online Computer Library Center (OCLC) located in Dublin, Ohio, Dublin Core consists of 15 elements that were considered broad and generic enough to describe a wide range of resources. 

Dublin Core Metadata Element Set

Element Description

Title

A name given to the resource, either supplied by the individual assigning metadata or from the object.

Example: "A Pilot's Guide to Aircraft Insurance"

Creator

Entity responsible for making the resource.

Example: "Duncan, P. A."

Subject

The topic of the resource, typically represented using keywords.

Example: "Colonial medicine"

Description

An account of the resource.

Example: "Illustrated guide to airport markings and lighting signals for airports with low visibility conditions."

Publisher

An entity responsible for making the resource available.

Example: "The University of Texas Press"

Contributor

An entity responsible for making contributions to the resource (e.g. editor, transcriber, illustrator).

Example: "Austin Citizen Photograph"

Coverage

The spatial or temporal topic of the resource.

Example: "Austin, TX"

Date

A point or period of time associated with the resource.

Example: "1998-02-16"

Type

The nature or genre of the resource. For a list of possible types, visit the DCMI Type Vocabulary.

Example: "image"

Format

The file format, physical medium, or dimensions of the resource.

Example: "[128] p. : ill. ; 15 cm."

Rights

Information about rights held over the resource.

Example: "This electronic resource is made available by the University of Texas Libraries solely for the purposes of research, teaching and private study."

Source

A related resource from which the described resource is derived.

Example: "ZA 3075 Y69 2007"

Language

Language(s) of the resource.

Example: "Spanish"

Relation

A related resource. For a list of possible relations, visit the Summary Refinement and Scheme Table.

Example: "HasVersion 13th Edition"

Identifier

A unique reference to the resource.

Example: "doi:10.15781/T2251FN91"

MARC, an acronym for MAchine-Readable Cataloging, is a widely used standard among libraries. MARC was developed by the Library of Congress in the 1960s to enable computer production of catalog cards.

The MARC record structure contains ten sets of fields, associated with 3-digit numbers called tags. Some fields are further defined by indicators and subfield codes. 

Components of a MARC Record

Tag

Description

0XX Control information, numbers, and codes
1XX

Main entries related to personal and corporate names

Example: 100 1#  $a Arnosky, Jim.

2XX

Titles, edition, imprint information

Example: 245 10  $a Dinosaurs : $b a visual encyclopedia.

3XX

Physical description

Example: 300 ##  $a 303 p. : $b col. ill. ; $c 29 cm.

4XX Series statements
5XX

Notes

Example: 520 ##  $a Presents an illustrated look at dinosaurs.

6XX

Subject access entries

Example: 650 #1  $a Dinosaurs. 

7XX Added entries other than subjects or series
8XX Series added entries and holdings information
9XX

Fields for locally-defined use

Example: 900 ##  $a 599.74 ARN

Encoded Archival Description (EAD) was developed in the 1990s by the archival community as a way of presenting finding aids in electronic form. It uses Standard Generalized Markup Language (SGML) and XML as the encoding schemes. Maintained by the Society of American Archivists and the Library of Congress, the latest version is EAD3.

Inside the outermost wrapper element, <ead>, the EAD contains two main sections: <control> and <archdesc>. Each of these sections also include required and optional child elements.

Main EAD Elements

Element Description
Encoded Archival Description Contains all the elements in an EAD document, including Control and Archival Description.
Control Contains elements about the finding aid itself, such as title, author, creation date, language, and description rules used.
Archival Description

Contains elements describing the content, context, and extent of the archival collection, rather than the finding aid, such as subjects, format, and box and folder inventory.

Metadata Object Description Schema (MODS) is an XML schema with MARC-like semantics. MODS was developed by the Library of Congress out of the need for something easier to learn than MARC and richer than Dublin Core for describing complex digital objects.

MODS is "friendly" because it uses language-based tags rather than the numeric codes (e.g. 250) traditional to MARC. There are 20 top-level MODS elements, many of which contain subelements for granular desription.

MODS Top-Level Elements

Element Description
Title Information Contains all subelements related to title information.
Name Contains all subelements related to information about the name of a person, organization, or event (conference, meeting, etc.) associated in some way with the resource.
Type of Resource Information about the original item that specifies the characteristics and general type of content of the resource, as chosen from a defined list of terms.
Genre A term that give more specificity for the form, style, or content of an object than the broad terms used in Type of Resource.
Origin Info Contains subelements related to place of origin or publication, publisher/originator, and dates associated with the resource.
Language Contains a subelement to record the language in which the language of the content of a resource is expressed.
Physical Description Contains all subelements relating to physical attributes of the resource.
Abstract A succinct summary of the content of the resource.
Table of Contents A description of the contents of a resource.
Target Audience A description of the intellectual level of the audience for which the resource is intended.
Note General textual information relating to a resource, that is not encoded in other more specific elements.
Subject Contains subelements relating to the primary topic(s) on which a work is focused.
Classification Indication of the subject via a formal system of coding and organizing resources (e.g. call number).
Related Item Contains subelements with information that identifies other resources related to the one being described.
Identifier A unique standard number or code that distinctively identifies a resource.
Location Contains subelements that identify the institution or repository holding the resource, or the electronic location in the form of a URL when available.
Access Condition Information about restrictions imposed on access to a resource.
Part Contains subelements to designate physical parts of a resource in detail.
Extension Provides additional information not covered by MODS (when local elements or elements from other standard schemas are needed).
Record Info Contains subelements relating to information necessary to managing metadata.

VRA Core is a standard commonly employed by cultural heritage organizations to describe images and works of art. VRA Core is hosted by the Visual Resources Association (VRA) and the Library of Congress. 

VRA Core contains 19 elements. There are three primary entities: Work (a built or created object), Collection (an aggregate of such objects), and Image (a visual surrogate of such objects).

VRA Core Elements
Element Description
Work, Collection, or Image A record is described as a Work, a Collection, or an Image.
Agent Individual, group, or corporate body that has contributed to the design, creation, production, etc. of the work.
Cultural Context Name of the culture, people, or country with which the work has been associated. 
Date Date associated with the work.
Description Free-text note about the work that gives additional information not in other categories. 
Inscription All marks added to the work at the time of production (e.g. signatures or stamps).
Location Geographic location or repository whose boundaries include the work.
Material Substance which the work is composed of (e.g oil paint, bronze, or graphite). 
Measurements Dimensions of the work.
Relation Terms describing the relationship between the work and a related work.
Rights Information about the copyright status for the work.
Source Reference to the source of information recorded about the work.
State Edition Identifying number or name assigned to the edition of a work that exists in more than one form.
Style Period Defined style, historical period, school, or movement whose characteristics are represented in the work.
Subject Terms that describe the work.
Technique Production processes, techniques, and methods incorporated in the fabrication of the work.
Textref Unique identifier assigned to the work.
Title Title given to the work.
Work Type Identifies the specific type of Work, Collection, or Image being described in the record.

Controlled Vocabularies

Controlled vocabularies are standardized and organized arrangements of words and phrases and provide a consistent way to describe data. Metadata creators assign terms from vocabularies to improve information retrieval.

Common types of controlled vocabularies include subject headings lists, authority files, and thesauri. Controlled vocabularies can be arranged as alphabetical lists of terms or as taxonomies with a hierarchical structure of broader and narrower terms. Thesauri also include synonyms, related terms, scope and editorial notes, term history, alternate languages, or numerical codes. Ontologies include even more specification, such as descriptions of terms or concepts by their position in the hierarchy and any number of relationships to other terms/concepts.

There are well-established standards to control names of people, geographic names, topics, concepts, resource types or genres, and languages.

Below are examples of controlled vocabularies commonly used in various communities. There are also tools available to help find appropriate vocabularies and ontologies.

General Purpose

Arts & Humanities

Physical & Life Sciences

Social & Behavioural Sciences

Books about Metadata

Report a tech support issue

Elihu Burritt Library
Central Connecticut State University, 1615 Stanley Street, New Britain, CT 06050 - Map

CCSU Home | Central Pipeline | CentralSearch / Catalog | Sign In to CentralSearch | Search Library Website