Schematron is a rule-based validation language for making assertions about the presence or absence of patterns in XML trees. It is a structural schema language expressed in XML using a small number of elements and XPath languages. In many implementations, the Schematron XML is processed into XSLT code for deployment anywhere that XSLT can be used.
20-403: ODD is an initialism, which might refer to: ODD (Text Encoding Initiative) ("One Document Does it all"), an abstracted literate-programming format for describing XML schemas Oodnadatta Airport (IATA: ODD), South Australia Operational design domain (ODD), for automated systems Operational due diligence Oppositional defiant disorder ,
40-454: A customisation using a project-specific subset of the tags and attributes defined by the Guidelines. The TEI defines a sophisticated customization mechanism known as ODD for this purpose. In addition to documenting and describing each TEI tag, an ODD specification specifies its content model and other usage constraints, which may be expressed using schematron . TEI Lite is an example of such
60-453: A customization. It defines an XML-based file format for exchanging texts. It is a manageable selection from the extensive set of elements available in the full TEI Guidelines. As an XML-based format, TEI cannot directly deal with overlapping markup and non-hierarchical structures. A variety of options to represent this sort of data is suggested by the guidelines. The text of the TEI guidelines
80-435: A mental disorder characterized by anger-guided, hostile behavior ODD , a 2007 play by Hal Corley about a teenager with oppositional defiant disorder Optical disc drive See also [ edit ] Odd (disambiguation) Topics referred to by the same term [REDACTED] This disambiguation page lists articles associated with the title ODD . If an internal link led you here, you may wish to change
100-604: Is a literate programming language for XML schemas . In literate-programming style, ODD documents combine human-readable documentation and machine-readable models using the Documentation Elements module of the Text Encoding Initiative. Tools generate localised and internationalised HTML , ePub , or PDF human-readable output and DTDs , W3C XML Schema , Relax NG Compact Syntax, or Relax NG XML Syntax machine-readable output. The Roma web application
120-416: Is a text-centric community of practice in the academic field of digital humanities , operating continuously since the 1980s. The community currently runs a mailing list, meetings and conference series, and maintains the TEI technical standard , a journal , a wiki , a GitHub repository and a toolchain . The TEI Guidelines collectively define a type of XML format, and are the defining output of
140-584: Is built around the ODD format and can use it to generate schemas in DTD , W3C XML Schema , Relax NG Compact Syntax, or Relax NG XML Syntax formats, as used by many XML validation tools and services. ODD is the format used internally by the Text Encoding Initiative for the TEI technical standard . Although ODD files generally describe the difference between a customized XML format and the full TEI model, ODD also can be used to describe XML formats that are entirely separate from
160-404: Is grounded in one or more academic disciplines and examples are given. The standard is split into two parts, a discursive textual description with extended examples and discussion and set of tag-by-tag definitions. Schemata in most of the modern formats ( DTD , RELAX NG and XML Schema (W3C) ) are generated automatically from the tag-by-tag definitions. A number of tools support the production of
180-403: Is rich in examples. There is also a samples page on the TEI wiki, which gives examples of real-world projects that expose their underlying TEI. TEI allows texts to be marked up syntactically at any level of granularity, or mixture of granularities. For example, this paragraph (p) has been marked up into sentences (s) and clauses (cl). TEI has tags for marking up verse. This example (taken from
200-522: The French translation of the TEI Guidelines) shows a sonnet. The choice tag is used to represent sections of text that might be encoded or tagged in more than one possible way. In the following example, based on one in the standard, choice is used twice, once to indicate an original and a corrected number, and once to indicate an original and regularised spelling. One Document Does it all ("ODD")
220-489: The TEI Guidelines are based on a TEI customization documented in a TEI ODD file. Even when users choose one of the off-the-shelf pre-generated schemas to validate against, these have been created from freely available customization files. The format is used by many projects worldwide. Practically all projects are associated with one or more universities. Some well-known projects that encode texts using TEI include: Prior to
SECTION 10
#1732854621317240-522: The TEI. One example of this is the W3C's Internationalization Tag Set which uses the ODD format to generate schemas and document its vocabulary. TEI customizations are specializations of the TEI XML specification for use in particular fields or by specific communities. Customization in the TEI is done through the ODD mechanism mentioned above. In truth since its P5 version, all so-called 'TEI Conformant' uses of
260-401: The community of practice. The format differs from other well-known open formats for text (such as HTML and OpenDocument ) in that it is primarily semantic rather than presentational: the semantics and interpretation of every tag and attribute are specified. There are some 500 different textual components and concepts: word , sentence , character , glyph , person , etc. Each
280-410: The creation of TEI, humanities scholars had no common standards for encoding electronic texts in a manner that would serve their academic goals ( Hockey 1993, p. 41). In 1987, a group of scholars representing fields in humanities, linguistics, and computing convened at Vassar College to put forth a set of guidelines known as the “Poughkeepsie Principles”. These guidelines directed the development of
300-584: The error messages themselves. The current ISO recommendation is Information technology, Document Schema Definition Languages (DSDL) , Part 3: Rule-based validation, Schematron (ISO/IEC 19757-3:2020). Constraints are specified in Schematron using an XPath-based language that can be deployed as XSLT code, making it practical for applications such as the following: Schematron was invented by Rick Jelliffe while at Academia Sinica Computing Centre, Taiwan. He described Schematron as "a feather duster to reach
320-709: The first TEI standard, "P1". Schematron Schematron is capable of expressing constraints in ways that other XML schema languages like XML Schema and DTD cannot. For example, it can require that the content of an element be controlled by one of its siblings. Or it can request or require that the root element, regardless of what element that is, must have specific attributes. Schematron can also specify required relationships between multiple XML files. Constraints and content rules may be associated with "plain-English" (or any language) validation error messages, allowing translation of numeric Schematron error codes into meaningful user error messages. Users of Schematron define all
340-1166: The following namespace: Schematron rules can be created using a standard XML editor or XForms application. The following is a sample schema: This rule checks to make sure that the ContractDate XML element has a date that is before the current date. If this rule fails the validation will fail and an error message which is the body of the assert element will be returned to the user. Schematron schemas are suitable for use in XML Pipelines , thereby allowing workflow process designers to build and maintain rules using XML manipulation tools. The W3C's XProc pipelining language, for example, has native support for Schematron schema processing through its "validate-with-schematron" step. Since Schematron schemas can be transformed into XSLT stylesheets, these can themselves be used in XML Pipelines which support XSLT transformation. An Apache Ant task can be used to convert Schematron rules into XSLT files. There exists also native Schematron implementation, like
360-408: The guidelines and the application of the guidelines to specific projects. A number of special tags are used to circumvent restrictions imposed by the underlying Unicode ; glyph to allow representation of characters that do not qualify for Unicode inclusion and choice to allow overcome the required strict linearity. Most users of the format do not use the complete range of tags, but produce
380-433: The link to point directly to the intended article. Retrieved from " https://en.wikipedia.org/w/index.php?title=ODD&oldid=1159850226 " Categories : Disambiguation pages Initialisms Hidden categories: Short description is different from Wikidata All article disambiguation pages All disambiguation pages ODD (Text Encoding Initiative) The Text Encoding Initiative ( TEI )
400-621: The parts other schema languages cannot reach". The most common versions of Schematron are: Schematron has been standardized by the ISO as Information technology, Document Schema Definition Languages (DSDL) , Part 3: Rule-based validation, Schematron (ISO/IEC 19757-3:2020). This standard is currently not listed on the ISO Publicly Available Specifications list. Paper versions may be purchased from ISO or national standards bodies. Schemas that use ISO/IEC FDIS 19757-3 should use
#316683