Convert An Excel Spreadsheet Into MARC Data With MarcEdit: Tutorial
Convert An Excel Spreadsheet Into MARC Data With MarcEdit: Tutorial
Contents
1 Initial Steps 2
1.1 Edit the E-bib File . . . . . . . . . . . . . . . . . . . . . . . . . 2
1.2 Change Some Column Headers . . . . . . . . . . . . . . . . . 4
Abstract
This tutorial explains how to use MarcEdit’s Delimited Text Translator
feature to convert a book vendor’s Excel file with metadata into MARC
records. This capability can be helpful when there are not MARC
1
records available from “typical” bibliographic sources (such as OCLC
WorldCat, the Library of Congress, etc.).
Although this tutorial uses a sample spreadsheet from Tsai Fong
Books, you should be able to apply the procedure outlined here to an
Excel file from any other vendor and tailor the data mapping (as well
as the post-conversion bibliographic editing) to suit your needs.
The screenshots in this tutorial show version 7.1.180 of MarcEdit,
which is the most current version at the time of writing.
1 Initial Steps
• Tsai Fong Books (a vendor specializing in Asian-language materials)
provides free “e-bib” files which contain bibliographic metadata in
Excel spreadsheet form.
• Before editing your e-bib file, make a copy of it, and save the original
in a safe place.
2
• Notice that there are two Author columns: one has a Romanized
version of the author’s name (using pinyin1 Romanization); the other
column presents the author’s name in Chinese characters.
• There are two Title columns: one has the title Romanized in pinyin;
the other column has the title in Chinese characters.
• There are also two Publisher columns: one in pinyin; the other col-
umn with Chinese characters.
1
Pinyin is a system to transcribe the sounds of Mandarin Chinese using the Western
(Roman) alphabet. Pinyin was developed in the 1950s in Mainland China and is now the
official Romanization system of China, Singapore, the US Library of Congress, and the
American Library Association.
3
1.2 Change Some Column Headers
Change the column headers listed below. This will make it easier when you
are doing the “data mapping” step (which comes later in the conversion
process).
4
Tip:
For greater simplicity and efficiency in creating these brief biblio-
graphic records, it is recommended that the vernacular CJKa script
metadata columns be used (for Author, Title and Publisher), rather
than the Romanized pinyin columns.
Some previous library studies have shown that end-users are not
well served with pinyin-based searching. Although pinyin is useful
for non-native students who are learning Chinese, it is a crutch—
like training wheels on a bicycle—which will eventually have to be
discarded. Native Chinese speakers / readers are unlikely to be familiar
with pinyin, so they will prefer searching with CJK characters.
a
Initialism for “Chinese, Japanese and Korean.” This term is commonly used in the
library world to refer to materials written in the Chinese, Japanese, and/or Korean
languages.
5
• A new window will appear:
• From here, navigate to the location where you saved your e-bib file:
• Open / select your source file. (It should be in Microsoft Excel format).
6
• Select the location for your output file (which will be saved in the
program’s ‘mnemonic’ .mrk format):
• A pop-up window will appear. From the LDR drop-down menu, select
the Book option:
7
• Click on the OK button to close the pop-up window.
8
2.2 Map the Excel Metadata into MARC
Tip:
These steps are the most important in the entire conversion process
because they involve mapping the vendor’s metadata from Excel into
MARC 21 format. Please work carefully as you proceed.
9
• To begin, select Field 1 (020) and map it to the 020 tag. Remember
to add $a (‘subfield a’) to the MARC tag.
The first and second indicator values are correctly set as \ \ (‘blank
blank’), so they do not need to be changed:
• After mapping each desired field, click on the Add Argument button.
• Once the button has been pressed, you will see the Argument added to
the summary box below:
10
Value 1 Value 2 Value 3
Select: Map To: Indicators:
Field 4 100$a 1\
(i.e., first indicator is ‘1’, second indicator is ‘blank’)
Field 8 245$a 10
Field 11 264$b \\
Field 12 264$c \\
Field 15 650$a \4
Field 16 546$a \\
Field 17 035$a \\
Field 23 520$a \\
• When you complete this part of the procedure, your screen should look
something like this:
• Do not worry that some of the MARC tags are out of numerical order;
just make sure that the Sort Fields option is checked / activated:
11
• Next, right-click and choose Join Items:
• After being joined, you will notice that the 264 fields now have a
preceding asterisk:
Tip:
I have not tried using the Save Template option, so I do not
know how it will behave when used for future data mapping.
12
• MarcEdit will display a pop-up confirmation window that your file has
been saved in the program’s internal .mrk file format:
13
2.4.1 Delete “Placeholder” Record At Head of File
• The e-bib file’s metadata was successfully mapped and converted to
MARC format.
However, you must delete the empty “placeholder” record at the begin-
ning of the file (circled above, in red ink).
14
• To make this change globally, go to the Tools menu > Edit Indicator
Data (or press the F8 function key).
• Click on the Replace button; afterwards, you will see a pop-up confir-
mation window:
• Go to the Tools menu > Edit Subfield Data (or press function key
F9).
15
Tip:
Pay special attention to the Subfield: box, and enter the data
exactly as shown: [ˆb]
The notation [ˆb] alerts MarcEdit that you wish to insert $a
before $b in data field 264.
(If you fail to add the special notation, MarcEdit will place the
new $a after $c — at the end of the data field).
• Next, click on the Replace Text button in the upper right-hand corner.
• MarcEdit will ask if you wish to continue:
16
• Click on the Yes button to complete the operation.
• The 264 $a will be added, and correctly placed at the beginning of the
field:
2.4.4 Add 336 / 337 / 338 Fields (for Books) via the RDA Helper
Another easy enhancement is to add the “triple” of MARC fields (336/7/8)
identifying content [336], media format [337] and carrier or physical format
[338] of the cataloged items. (The following global example assumes that all
the items in the e-bib file are printed books).
• Go to the Tools menu > MARC Processing Tools > RDA Helper.
• Check the two boxes shown below, then click the OK button:
17
• The selected tags will be added to the records:
• When you are satisfied with your file, proceed to the next step. You’re
almost done!
18
• Give your enhanced / improved file an appropriate name, to distinguish
it from your original file.
19