MARC ( machine-readable cataloging ) is a standard set of digital formats for the machine-readable description of items catalogued by libraries, such as books, DVDs, and digital resources. Computerized library catalogs and library management software need to structure their catalog records as per an industry-wide standard, which is MARC, so that bibliographic information can be shared freely between computers. The structure of bibliographic records almost universally follows the MARC standard. Other standards work in conjunction with MARC, for example, Anglo-American Cataloguing Rules (AACR)/ Resource Description and Access (RDA) provide guidelines on formulating bibliographic data into the MARC record structure, while the International Standard Bibliographic Description (ISBD) provides guidelines for displaying MARC records in a standard, human-readable form.
31-537: WSV can mean: The ISO 639 code for Wotapuri-Katarqalai language Winter service vehicle , a snow-clearing vehicle Washington Square Village , an apartment complex in New York City Windows Server Virtualization , virtual machine software from Microsoft Any of a number of German football clubs: SV Wehen SV Wilhelmshaven Wuppertaler SV Borussia Topics referred to by
62-463: A web service , often following the SRU or OAI-PMH standards. MARC encodes information about a bibliographic item, not information about the content of that item; this means it is a metadata transmission standard, not a content standard. The actual content that a cataloger places in each MARC field is usually governed and defined by standards outside of MARC, except for a handful of fixed fields defined by
93-527: A code in Set 1, have a B/T code, and are classified as macrolanguages in Set 3. One representative of these four elements is "Persian" fa / per / fas . These differences are due to the following factors. In ISO 639 Set 2 , two distinct codes were assigned to 22 individual languages, namely a bibliographic and a terminology code (B/T codes). B codes were included for historical reasons because previous widely used bibliographic systems used language codes based on
124-629: A key element of locale data. The codes also find use in various applications, such as Misplaced Pages URLs for its different language editions. The early form of ISO's language coding system was manifested by ISO/R 639:1967 titled Symbols for Languages, Countries and Authorities , which aimed chiefly to regulate vocabularies signifying languages, countries, and standardization agencies of ISO member bodies. Its "language symbols" consisted of one- or two-letter variable-length identifiers in capitalized Latin alphabets, e.g. E or En for English; S , Sp , or Es for Spanish; and In for Indonesian. It
155-578: A larger system. Set 2 defines four special codes mis , mul , und , zxx , a reserved range qaa-qtz (20 × 26 = 520 codes) and has 20 double entries (the B/T codes), plus 2 entries with deprecated B-codes. This sums up to 520 + 22 + 4 = 546 codes that cannot be used in Set 3 to represent languages or in Set 5 to represent language families or groups. The remainder is 17,576 – 546 = 17,030. There are somewhere around six to seven thousand languages on Earth today. So those 17,030 codes are adequate to assign
186-484: A part 6 was published but withdrawn. It was first approved in 1967 as a single-part ISO Recommendation , ISO/R 639 , superseded in 2002 by part 1 of the new series, ISO 639-1 , followed by additional parts. All existing parts of the series were consolidated into a single standard in 2023, largely based on the text of ISO 639-4. The language codes defined in the several sections of ISO 639 are used for bibliographic purposes and, in computing and internet environments, as
217-620: A set of characters at the beginning of each record that provide a directory for locating the fields and subfields within the record. In 2002, the Library of Congress developed the MARCXML schema as an alternative record structure, allowing MARC records to be represented in XML ; the fields remain the same, but those fields are expressed in the record in XML markup . Libraries typically expose their records as MARCXML via
248-489: A unique code to each language, although some languages may end up with arbitrary codes that sound nothing like the traditional name(s) of that language. "Alpha-4" codes (for codes composed of 4 letters of the ISO basic Latin alphabet ) were proposed to be used in ISO 639-6 , which has been withdrawn. The upper limit for the number of languages and dialects that can be represented is 26 = 456,976. MARC standards Working with
279-505: A way that no code means one thing in one set and something else in another. However, not all languages are in all sets, and there is a variety of different ways that specific languages and other elements are treated in the different sets. This depends, for example, whether a language is listed in Sets 1 or 2, whether it has separate B/T codes in Set 2, or is classified as a macrolanguage in Set 3, and so forth. These various treatments are detailed in
310-608: Is based on ISO 2022 and allows the use of Hebrew, Cyrillic, Arabic, Greek, and East Asian scripts. MARC 21 in UTF-8 format allows all the languages supported by Unicode. MARCXML is an XML schema based on the common MARC 21 standards. MARCXML was developed by the Library of Congress and adopted by it and others as a means of facilitating the sharing of, and networked access to, bibliographic information. Being easy to parse by various systems allows it to be used as an aggregation format, as it
341-512: Is different from Wikidata All article disambiguation pages All disambiguation pages ISO 639 ISO 639 is a standard by the International Organization for Standardization (ISO) concerned with representation of languages and language groups . It currently consists of four sets (1-3, 5) of code, named after each part which formerly described respective set (part 4 was guidelines without its own coding system);
SECTION 10
#1733085615821372-440: Is further divided into subfield "a" for the place of publication, "b" for the name of the publisher, and "c" for the date of publication. MARC records are typically stored and transmitted as binary files, usually with several MARC records concatenated together into a single file. MARC uses the ISO 2709 standard to define the structure of each record. This includes a marker to indicate where each record begins and ends, as well as
403-592: Is in software packages such as MetaLib , though that package merges it into a wider DTD specification. The MARCXML primary design goals included: The future of the MARC formats is a matter of some debate among libraries. On the one hand, the storage formats are quite complex and are based on outdated technology. On the other, there is no alternative bibliographic format with an equivalent degree of granularity. The billions of MARC records in tens of thousands of individual libraries (including over 50,000,000 records belonging to
434-466: The Library of Congress , American computer scientist Henriette Avram developed MARC between 1965 and 1968, making it possible to create records that could be read by computers and shared between libraries. By 1971, MARC formats had become the US national standard for dissemination of bibliographic data . Two years later, they became the international standard . There are several versions of MARC in use around
465-669: The OCLC consortium alone) create inertia. The Library of Congress has launched the Bibliographic Framework Initiative (BIBFRAME), which aims at providing a replacement for MARC that provides greater granularity and easier re-use of the data expressed in multiple catalogs. Beginning in 2013, OCLC Research exposed data detailing how various MARC elements have been used by libraries in the 400 million MARC records (as of early 2018) contained in WorldCat. The MARC formats are managed by
496-482: The English name for the language. In contrast, the Set 1 codes were based on the native name for the language, and there was also a strong desire to have Set 2 codes (T codes) for these languages which were similar to the corresponding 2-character code in Set 1. Individual languages in Set 2 always have a code in Set 3 (only the Set 2 terminology code is reused there) but may or may not have a code in Set 1, as illustrated by
527-471: The ISO 639-2 namespace that cover individual languages and groups were established as ISO 639-3 and ISO 639-5, respectively. There was also an attempt to code more precise language variants using four-letter identifiers as ISO 639-6, which was later withdrawn and to be reorganized under another framework, ISO 21636 . Relatively constant updates in parts of ISO 639 had been handled by each own authority in charge until
558-414: The MARC standards themselves. Resource Description and Access , for example, defines how the physical characteristics of books and other items should be expressed. The Library of Congress Subject Headings (LCSH) are a list of authorized subject terms used to describe the main subject content of the work. Other cataloging rules and classification schedules can also be used. MARC 21 was designed to redefine
589-511: The United States, and Canada . MARC 21 is a result of the combination of the United States and Canadian MARC formats (USMARC and CAN/MARC). MARC 21 is based on the NISO / ANSI standard Z39.2 , which allows users of different software products to communicate with each other and to exchange data. MARC 21 allows the use of two character sets , either MARC-8 or Unicode encoded as UTF-8 . MARC-8
620-461: The current ISO 639 Set 1. Since then, the standard has been adopted as a fundamental technology of the rapidly expanding computer industry ( RFC 1766 ), leading to development of more expressive three-letter framework, published as ISO 639-2:1998, largely based on MARC codes for languages. The original two-letter system was redefined as ISO 639-1 in 2001. Seeking for more extensive support of languages for widening applications, separate supersets of
651-414: The following chart. In each group of rows (one for each scope of Set 3), the last four columns contain codes for a representative language that exemplifies a specific type of relation between the sets of ISO 639, the second column provides an explanation of the relationship, and the first column indicates the number of elements that have that type of relationship. For example, there are four elements that have
SECTION 20
#1733085615821682-479: The following examples: Some codes (62) in Set 3 are macrolanguages. These are groups containing multiple individual languages that have a good mutual understanding and are commonly mixed or confused. Some macrolanguages developed a default standard form on one of their individual languages (e.g. Mandarin is implied by default for the Chinese macrolanguage, other individual languages may be still distinguished if needed but
713-403: The format for bibliographic records . The MARC standards define three aspects of a MARC record: the field designations within each record, the structure of the record, and the actual content of the record itself. Each field in a MARC record provides particular information about the item the record is describing, such as the author, title, publisher, date, language, media type, etc. Since it
744-487: The original MARC record format for the 21st century and to make it more accessible to the international community. MARC 21 has formats for the following five types of data: Bibliographic Format, Authority Format, Holdings Format, Community Format, and Classification Data Format. Currently MARC 21 has been implemented successfully by The British Library , the European Institutions and the major library institutions in
775-624: The publication of ISO 639:2023, which harmonized and reunified the body text of former standards and brought about organizational change with a joint maintenance agency supervising all sets and issuing newsletters [1] . Each set of the standard is maintained by a maintenance agency, which adds codes and changes the status of codes when needed. ISO 639-6 was withdrawn in 2014, and not included in ISO 639:2023. Scopes: Types (for individual languages): Individual languages and macrolanguages with two distinct three-letter codes in Set 2: The different sets of ISO 639 are designed to work together, in such
806-402: The same term [REDACTED] This disambiguation page lists articles associated with the title WSV . If an internal link led you here, you may wish to change the link to point directly to the intended article. Retrieved from " https://en.wikipedia.org/w/index.php?title=WSV&oldid=697050652 " Category : Disambiguation pages Hidden categories: Short description
837-555: The specific code cmn for Mandarin is rarely used). Collective codes in Set 2 have a code in Set 5: e.g. aus in Sets 2 and 5, which stands for Australian languages . Sets 2 and 3 also have a reserved range and four special codes: Two-letter (formerly "Alpha-2") identifiers (for codes composed of 2 letters of the ISO basic Latin alphabet ) are used in Set 1 . When codes for a wider range of languages were desired, more than 2 letter combinations could cover (a maximum of 26 = 676), Set 2
868-634: The world, the most predominant being MARC 21 , created in 1999 as a result of the harmonization of U.S. and Canadian MARC formats, and UNIMARC. UNIMARC is maintained by the Permanent UNIMARC Committee of the International Federation of Library Associations and Institutions (IFLA), and is widely used in some parts of Europe. The MARC 21 family of standards now includes formats for authority records , holdings records, classification schedules , and community information, in addition to
899-408: Was also allowed to use (the pre-1993 version of) UDC numeral auxiliaries to indicate languages. After decoupling the country code into ISO 3166 in 1974, the first edition of the standard ISO 639:1988 Code for the representation of names of languages was published with a framework of uniformly two-letter identifiers in lowercase Latin alphabets, mostly identical in format and vocabulary to that of
930-438: Was developed using three-letter codes. (However, the latter was formally published first. ) Three-letter (formerly "Alpha-3") identifiers (for codes composed of 3 letters of the ISO basic Latin alphabet ) are used in Set 2 , Set 3 , and Set 5 . The number of languages and language groups that can be so represented is 26 = 17,576. The common use of three-letter codes by three sets of ISO 639 requires some coordination within
961-425: Was first developed at a time when computing power was low, and space precious, MARC uses a simple three-digit numeric code (from 001-999) to identify each field in the record. MARC defines field 100 as the primary author of a work, field 245 as the title and field 260 as the publisher, for example. Fields above 008 are further divided into subfields using a single letter or number designation. The 260, for example,