AMIA-MIG Archive Directory Working Group
TASK FORCE A: Data Elements
Charge
Write the "Data Elements" section of the Moving Image Gateway Archive Directory Report, the working document MIG developers will use to design the Archive Directory database: create a list of Archive Directory Data Elements based on the "Archive Directory Data Elements" document as revised by "Archive Directory Elements draft" (Murtha and Jane's July 26 revision for seven problematic data elements) and the minutes. For each element, write a scope note and indicate 1) mandatory/non-mandatory, 2) repeatable/non-repeatable, and 3) indexed/not indexed. (For indexing, these are recommendations only; indexing is subject to change as the project proceeds.) Leave placeholders for vocabulary lists with scope notes. Resolve data element definition issues raised at the July 25-26 meeting (as described in minutes and breakout session reports/discussion), documenting the rationale for your decisions. As you work, note any issues that you think should be addressed in guidelines for participant (organization) input and guidelines for end users and submit those with your report.
Build on the work done at the July 25-26 Archive Directory Working Group meeting; do not start from scratch. Utilize resources identified below and consult with outside experts as you see fit. If you encounter an issue that you feel needs broader discussion, submit it to the Archive Directory Working Group listserv.
Roster
Linda Tadic, Chair
Snowden Becker
Sally McCallum
Related documents
"Archive Directory Data Elements" as revised by "Archive Directory Elements draft" (Murtha and Jane's July 26 revision for seven problematic data elements)
Minutes, Moving Image Gateway Archive Directory Working Group Meeting
Cornucopia "Structure" document (http://www.cornucopia.org.uk/tech.html)
Breakout session rosters ("Who discussed what")
August 28, 2002
Moving Image Gateway: Organization Directory Data Elements
I. INSTITUTIONAL DATA
1. archiveID
Required
Not repeatable
Indexed
Populated by: ??
Supply the unique alphanumeric identifier for the organization contributing information to the Gateway.
*** NOTE TO COMMITTEE: This field could not be defined since it hasn't been decided how the archiveID will be created: automatically assigned by the system, or based on other identifying systems such as the NUC.
Directory Information:
2. Organization Name
Required
Not repeatable
Indexed
Free text
Populated by: manually
Enter the full name of the organization or unit that is contributing to the Gateway; do not abbreviate or use an acronym. Organizations that are part of a larger institution should enter the name of their specific division or unit here, and the name of their parent institution in field 12: Parent Organization, as applicable. (Ex: Organization Name = UCLA Film and Television Archive, Parent Organization = University of California, Los Angeles)
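[Note to programmer: The attribute lines that head each element entry (required, repeatable, indexed, data type, populated by) could be held in one small record per element. The following is a minimal sketch only, assuming Python; the class, field, and function names are hypothetical and are not part of this report.]

```python
from dataclasses import dataclass
from enum import Enum


class Requirement(Enum):
    """The three requirement levels used in the element entries."""
    REQUIRED = "Required"
    REQUIRED_WHERE_PRESENT = "Required where present"
    NOT_REQUIRED = "Not required"


@dataclass(frozen=True)
class ElementDefinition:
    """One data element and its attribute lines (names are hypothetical)."""
    number: str
    name: str
    requirement: Requirement
    repeatable: bool
    indexed: bool
    data_type: str
    populated_by: str


# Attribute values copied from the entries for fields 2 and 4.
ORGANIZATION_NAME = ElementDefinition(
    number="2", name="Organization Name", requirement=Requirement.REQUIRED,
    repeatable=False, indexed=True, data_type="Free text",
    populated_by="manually",
)
SUBUNIT_URL = ElementDefinition(
    number="4", name="Subunit URL",
    requirement=Requirement.REQUIRED_WHERE_PRESENT,
    repeatable=True, indexed=False, data_type="Free text",
    populated_by="manually",
)


def check_values(definition, values):
    """Return a list of problems with the values submitted for one element."""
    problems = []
    if definition.requirement is Requirement.REQUIRED and not values:
        problems.append(f"{definition.name} is required")
    if not definition.repeatable and len(values) > 1:
        problems.append(f"{definition.name} is not repeatable")
    return problems
```

"Required where present" cannot be checked mechanically, since the system cannot know whether the participant has the information, so the sketch treats it like a non-required element.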
3. Homepage URL
Required where present
Not repeatable
Not indexed
Free text
Populated by: manually
If the organization or unit named in 2: Organization Name has a web page of its own, enter the full URL.
Note: This may be a page on the larger site for a parent organization. If the organization or unit named in this directory entry does not have a web presence apart from the Gateway search page, leave this field empty.
4. Subunit URL
Required where present
Repeatable
Not indexed
Free text
Populated by: manually
If regional offices or other subunits of the organization named in 2: Organization Name have separate web pages, enter the full URL(s).
[Note to programmer: We should probably include a "Subunit name" field to describe instances of the Subunit URL field if this is implemented]
5-5(a). Address
Required
Not Repeatable
Indexed
Free text for field 5; Controlled vocabulary list for "Country" field only (5a)
Populated by: manually
Enter address information in the appropriate fields for the organization or unit that is contributing to the Gateway. This should be the primary physical location, mailing address, or administrative office location if the organization has multiple physical locations. Do not use abbreviations for state/province names or cities.
[Note to programmer: The Address area should be formatted with separate fields for the following information: Street Address 1 (required), Street Address 2 (not required), City (required), State/Province (required), Country (required, controlled vocabulary list). Also see how this field relates to field 16(a) Geographic Area.]
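[Note to programmer: The discrete address fields described above might be modeled as follows. This is a sketch under stated assumptions, written in Python; the class name, attribute names, and the sample country terms are hypothetical, and the actual Country vocabulary list has not yet been defined.]

```python
from dataclasses import dataclass
from typing import Optional

# Hypothetical sample only: the controlled vocabulary list for 5(a) Country
# is still a placeholder in this report.
COUNTRY_VOCABULARY = {"Canada", "United Kingdom", "United States"}


@dataclass
class Address:
    """Field 5-5(a) split into the discrete subfields named in the note."""
    street_address_1: str                    # required
    city: str                                # required
    state_province: str                      # required
    country: str                             # required, controlled vocabulary
    street_address_2: Optional[str] = None   # not required

    def problems(self):
        """Return a list of problems; an empty list means the address passes."""
        found = []
        required = (
            ("Street Address 1", self.street_address_1),
            ("City", self.city),
            ("State/Province", self.state_province),
            ("Country", self.country),
        )
        for label, value in required:
            if not value.strip():
                found.append(f"{label} is required")
        if self.country.strip() and self.country not in COUNTRY_VOCABULARY:
            found.append("Country must come from the controlled vocabulary list")
        return found
```

Keeping City, State/Province, and Country as separate values is what would allow field 16 (Organization Location) to be populated by the system.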
Controlled vocabulary list for 5(a) Country:
6. Telephone
Required
Repeatable
Not indexed
Free text
Populated by: manually
Enter the primary phone number, including area code, for the organization or unit. Repeat as necessary for day/evening numbers, informational recordings, or other telephone access numbers.
Note: If there is no main public or switchboard number, use the direct-dial number for the individual(s) listed in 26: Public Service Contact(s).
7. Fax
Required where present
Not repeatable
Not indexed
Free text
Populated by: manually
Enter the primary fax number, including area code, for the organization or unit.
Note: If there is no main fax number, use the direct fax number for the individual(s) listed in 26: Public Service Contact(s).
8. Email address
Required where present
Not repeatable
Not indexed
Free text
Populated by: manually
Enter the primary email address for general public contact with the organization or unit.
Note: If there is no main email address, use the direct email contact for the individual(s) listed in 26: Public Service Contact(s).
9. Contact person (Title)
Required
Not repeatable
Not indexed
Free text
Populated by: manually
Enter the name and/or title of the primary public contact person for the organization or unit. This may be an administrator who can refer inquiries to specific staff members, a reference librarian or archivist, or other staff member; whoever is listed here should be the person prepared to receive the broadest range of inquiries from users of the Gateway. It is acceptable to enter only a title and not a staff member's name in this area.
10. Year of foundation
Required
Not repeatable
Not indexed
Numeric
Populated by: manually
Enter the four-digit year the organization or unit was founded. Use the date that best reflects the age of the collection, i.e., the year in which the organization began collecting materials, rather than the year it was opened to the public, if there is a difference. Use the date most pertinent to the unit, rather than the parent organization, if there is a difference.
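[Note to programmer: A check on the four-digit year could look like the sketch below, written in Python. The function name is hypothetical, and rejecting years in the future is an assumption of this sketch, not a rule stated in this report.]

```python
import re
from datetime import date


def parse_year_of_foundation(text, today=None):
    """Return the year as an int, or raise ValueError if it is not usable."""
    today = today or date.today()
    value = text.strip()
    # Exactly four ASCII digits, as the scope note asks for a four-digit year.
    if not re.fullmatch(r"[0-9]{4}", value):
        raise ValueError("Year of foundation must be a four-digit year")
    year = int(value)
    # Assumption of this sketch: a founding year cannot lie in the future.
    if year > today.year:
        raise ValueError("Year of foundation cannot be in the future")
    return year
```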
11. Other name(s)
Not required
Repeatable
Indexed
Free text
Populated by: manually
Enter any other names by which the organization or unit is or has been known. The Gateway participant may wish to include here names of collections that have been taken over/acquired by the listed organization, former names for the organization or unit, or commonly used acronyms, nicknames, or abbreviations.
12. Parent organization
Not required
Not repeatable
Indexed
Free text
Populated by: manually
Enter the full name of the parent organization of the unit that is contributing to the Gateway. Participants should name the parent organization in this field even if it is included in some form in the unit name entered in Field 2 above.
Ex: University of California, Los Angeles for UCLA Film and Television Archive.
13. Logo
Not required
Not repeatable
Not indexed
Image file
Populated by: manually
Attach an image file with the organization or unit's logo as you would like it to appear on the Gateway site and in associated web pages. This file should have minimum dimensions of ?? x ?? pixels, minimum resolution of ?? ppi, and be in one of the following formats: TIFF, GIF, JPEG, BMP, PSD, or ??
[Need information from programmers about how this will work, minimum resolution, acceptable file formats, etc. My assumption is that this field would work much like an email "attach" button: the user would be able to browse their network or local drive and select the file to include with their data submission.]
14. Organization Type(s)
Required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Define the primary function of the organization or unit that is contributing to the Gateway. Choose as many terms as apply.
Note: Do not use the general terms "Archives" or "Library" if those do not define the primary function of your organization. For example, a library in a corporation should select "Corporation," but not "Library." However, a collection based in a university library would select the terms "Educational institution" and "Library."
Controlled vocabulary list:
15. Organization Services
Required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe services the organization provides to the public. Choose as many terms as apply.
Note: Do not select services that are provided as a function of organizational activity to internal staff only.
Controlled vocabulary list:
16-16(a) Organization Location
*** NOTE TO COMMITTEE: This field's definition depends on how the Directory Address information is defined. Breakout groups tended to prefer populating this field with data from the Directory Address (field 5-5a), which will require having that data in discrete fields. The Geographic Area field (16a) would then be a lookup list.
Required
Not repeatable
Indexed
Controlled vocabulary list for geographic areas
Populated by: system (16. city, state/province, country) and manually (16a. geographic area)
Define the location of the organization. City, State/Province, and Country information will be populated from the Directory Address information. Select the best term to describe the organization's general geographic area.
[Note to programmer: If the Directory Address information is in discrete drop-down fields for city, state/province, and country, populate the Organization Location with that data.]
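[Note to programmer: One way the system-populated and manually selected parts of this field could be combined is sketched below in Python. The function name, dictionary keys, and sample geographic area terms are hypothetical; the Geographic Area vocabulary list has not yet been defined.]

```python
# Hypothetical sample only: the 16(a) Geographic Area list is a placeholder.
GEOGRAPHIC_AREAS = {"Europe", "North America"}


def organization_location(address, geographic_area):
    """Build field 16-16(a) from the Directory Address plus a chosen area term.

    `address` is assumed to be a mapping holding the discrete address
    subfields of field 5-5(a).
    """
    if geographic_area not in GEOGRAPHIC_AREAS:
        raise ValueError("Geographic area must come from the lookup list")
    return {
        # Populated by the system from the Directory Address.
        "city": address["city"],
        "state_province": address["state_province"],
        "country": address["country"],
        # Selected manually by the participant.
        "geographic_area": geographic_area,
    }
```

Copying the three values rather than re-keying them means the Organization Location cannot drift out of step with the Directory Address, provided it is regenerated whenever the address changes.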
Controlled vocabulary list for 16(a) Geographic Area:
II. COLLECTIONS DATA
17. Collection Strengths (Forms)
Required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe the classes of materials included in the collections of the contributing organization, e.g., News, Animation, Sports. Choose as many terms as apply.
Note: This field is not intended to describe genres (e.g., Screwball comedy), subjects (e.g. Medicine, History, Nature), or physical formats (e.g., film, video, audio).
Controlled vocabulary list:
18. Collection Strengths (Subjects)
Required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe the main topics represented in the collections of the contributing organization (e.g., Medicine, History, Nature). Choose no more than four terms.
Note: Subject strengths can be described in more detail in the Programming, Collections, and Research Support Activities field.
Controlled vocabulary list:
19. Collection Size
Required
Repeatable
Not indexed
Controlled vocabulary list
Populated by: manually
Choose unit(s) of measurement from the lookup list and add numeric value(s). Use broad estimates; the purpose of the field is to provide the user with a sense of the size of the collections. Choose as many terms as apply.
Controlled vocabulary list:
20. Formats Collected
Required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe the general audiovisual formats represented in the organization's collections. Choose as many terms as apply.
Controlled vocabulary list:
· Film (includes audio recordings on film, e.g., mag tracks, optical tracks)
· Videotape (analog and/or digital)
· Audio recordings (all audio not on film or digital file)
· Optical discs (CD, DVD, laser)
· Digital files (sound or picture; include DLT)
20a. Formats Collected: Archival Portal (Film Base and Uncommon Gauges)
Not required
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
This field will appear only on the archival portal. Describe whether the organization's film collection contains nitrate and/or safety base film, and whether it has any uncommon gauges (e.g., 9.5mm, 28mm, 8mm, Super-8mm). Choose as many terms as apply.
Controlled vocabulary list:
21. Programming, Collections, and Research Support Activities
Not required
Not repeatable
Indexed (keyword)
Free text
Populated by: manually
Expanding on the controlled vocabulary terms in the Services, Collection Strengths, Audiences Served, Conditions for Use/Access Restrictions, and Formats fields, this is a free-text description of the programming, collecting (including rate of growth), and research support activities of the contributing organization. Related collections such as stills, posters, scripts, and other documentation can be mentioned here, as well as detailed information on specific formats and subjects collected. This field will be displayed on the organization's dynamic Web page for introducing the organization or unit to potential users.
22. Published Data About the Collections
Not required
Not repeatable
Not indexed
Free text
Populated by: manually
List bibliographic citations for publications on the collections.
23. Other Internet Links
Not required
Not repeatable
Not indexed
Free text
Populated by: manually
Indicate the URLs for sites on the World Wide Web run by third parties that provide information about the collections. Sites can include footage licensing sites such as footage.net. URLs will be hot-linked to the sites.
III. ACCESS DATA
24. Search page URL
Mandatory where present
Not repeatable
Indexed
Free text
Populated by: manually
If the organization or unit has a web site that offers public access to search the collection, enter the complete URL for the search page.
[Note to programmer: Useful to index this data as present/absent, in order to allow users to restrict searches to repositories that offer any searching through their own sites already?]
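[Note to programmer: The present/absent indexing suggested above might be derived as in the following Python sketch. The function names and the "search_page_url" key are hypothetical.]

```python
def has_search_page(record):
    """True when the record carries a non-blank Search page URL (field 24)."""
    url = record.get("search_page_url") or ""
    return bool(url.strip())


def restrict_to_searchable(records):
    """Keep only the organizations that offer searching through their own site."""
    return [record for record in records if has_search_page(record)]
```

Indexing the flag rather than the URL text would let users filter on it without the URL itself being searchable.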
25. Hours and days of service
Mandatory
Not repeatable
Not indexed
Controlled vocabulary list for hours of operation on individual days
Populated by: manually
Enter the normal business hours of the organization or unit. Use normal office hours if the organization is not open to the public; use open hours if public access times differ from the administrative schedule.
[Note to programmer: Set up with seven sets of fields for M/T/W/Th/F/S/Sun, as below?]
Day       Open      Close
Sunday    Closed
Monday    8:30 am   5:00 pm
etc.
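[Note to programmer: The seven sets of open/close fields could be stored and displayed along the lines of this Python sketch. The names are hypothetical, the 24-hour display format is an assumption chosen only to keep the example simple, and a day with no entry is treated as closed.]

```python
from datetime import time

DAYS = ("Sunday", "Monday", "Tuesday", "Wednesday",
        "Thursday", "Friday", "Saturday")


def format_hours(hours):
    """Return one display line per day from a {day: (open, close)} mapping."""
    lines = []
    for day in DAYS:
        slot = hours.get(day)
        if slot is None:
            # No entry for the day means the organization is closed.
            lines.append(f"{day}: Closed")
            continue
        open_at, close_at = slot
        if close_at <= open_at:
            raise ValueError(f"{day}: closing time must be after opening time")
        lines.append(
            f"{day}: {open_at.strftime('%H:%M')}-{close_at.strftime('%H:%M')}"
        )
    return lines


# The sample values from the layout above: closed Sunday, open Monday.
sample = format_hours({"Monday": (time(8, 30), time(17, 0))})
```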
26. Public service contact(s) [Title(s)]
Mandatory where present
Repeatable
Not indexed
Free text
Populated by: manually
Enter the name and/or title of the public service contact person/s for the organization or unit. This may be the reference librarian, archivist, head of sales and licensing, and/or other staff who deal with the public in various functions. It is acceptable to enter only a title and not a staff member's name in this area.
27. Audiences served
Mandatory
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe audiences served by the organization or unit. Check all that apply.
Controlled vocabulary list:
28. Conditions for use/access restrictions
Mandatory
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
List restrictions to access and conditions for use of the collection. Check all that apply.
Controlled vocabulary list:
29. Viewing facilities
Mandatory
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
List all available viewing facilities and equipment. Check all that apply; if viewing facilities are not available, please check "No viewing facilities."
Controlled vocabulary list:
IV. CATALOGING AND PRESERVATION ACTIVITIES DATA
30. Preservation activities
Mandatory
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
List the preservation activities undertaken by the organization or unit. Check all that apply; if no preservation work is done by or with this collection, please check "No preservation activities."
Controlled vocabulary list:
31. Preservation contact(s) [Title(s)]
Mandatory where present
Repeatable
Not indexed
Free text
Populated by: manually
Enter the name and/or title of the public contact person/s for the organization or unit's preservation activities. It is acceptable to enter only a title and not a staff member's name in this area.
32. Cataloging activities (controlled vocabularies)
Mandatory
Repeatable
Indexed
Controlled vocabulary list
Populated by: manually
Describe cataloging processes and standards used by this organization or unit. Check all that currently apply or that were used in records that are still active.
Note: Cataloging processes or standards no longer in use can be described in more detail below, in Cataloging activities (free text).
Controlled vocabulary list:
33. Cataloging activities (free text)
Mandatory
Not repeatable
Not indexed
Free text
Populated by: manually
Briefly describe general cataloging procedures, policies, and standards in use, now or formerly. Where known, summarize information like percentage of holdings cataloged and level of detail used to describe portions of the collection. Please elaborate on information from above regarding content standards, subject heading lists or classification schemas in use, form of catalog, access to catalog information, significant changes in cataloging procedure or policy, and any other pertinent details.
34. Cataloging contact(s) [Title(s)]
Mandatory where present
Repeatable
Not indexed
Free text
Populated by: manually
Enter the name and/or title of the contact person/s for the organization or unit's cataloging activities. It is acceptable to enter only a title and not a staff member's name in this area.
V. DATABASE ADMINISTRATION DATA
35. Database Contact(s)
Required
Repeatable
Not Indexed
Free text
Populated by: manually
Give the title of the position at the organization that is responsible for administration of the AMIA-MIG database, including scheduling data loads, evaluating data maps, etc.
36. OAI Participation Flag
Required
Not repeatable
Indexed
Yes/No flag
Populated by: manually
The Yes/No flag indicates whether the AMIA-MIG should make records from the organization available for harvesting for OAI initiatives.
37. Z39.50 Database Flag
Required
Not repeatable
Indexed
Yes/No flag
Populated by: manually
The Yes/No flag indicates whether the organization's browser-based database is Z39.50 compatible.
38. Date of Last Database Update
Required
Not repeatable
Indexed
Populated by: system
This field indicates the currency of the record displayed in the search result. The system automatically populates this field as a timestamp.
39. Data History
Required
Not repeatable
Not indexed
Populated by: system
Timestamp history about the creation and updating of the organization's Directory entry itself is stored here. The system can automatically populate this field based on user login.
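[Note to programmer: Fields 38 and 39 could be populated together each time a logged-in user saves the entry. The following Python sketch shows one possible approach; the function name, dictionary keys, and the choice of UTC ISO 8601 timestamps are assumptions, not decisions of the task force.]

```python
from datetime import datetime, timezone


def record_change(entry, user, action, now=None):
    """Append to Data History (39) and refresh Date of Last Update (38)."""
    now = now or datetime.now(timezone.utc)
    stamp = now.isoformat()
    # Field 39: one history line per create/update, tied to the user login.
    entry.setdefault("data_history", []).append(
        {"timestamp": stamp, "user": user, "action": action}
    )
    # Field 38: always the timestamp of the most recent change.
    entry["date_of_last_update"] = stamp
    return entry
```

In this sketch the whole history is kept as a list inside the single, non-repeatable Data History field.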
40. Sources
Not required
Not repeatable
Not indexed
Free text
Populated by: manually
Describe the sources consulted in creating the Directory record.
41. Portal IDs
Not required
Repeatable
Not indexed (linked instead)
Controlled vocabulary list
Populated by: manually
The Portal ID field will link organizations' collections to specific Gateway portals. An organization might consider associating its collections with a particular portal if its services and/or collections would be of special interest to groups of users. Examples of portals include Archival, Education, etc.