Guide: Setting up XML validation

Track

Test bed setup

This guide walks you through the points to consider when setting up XML validation and the steps to bring your validation service online.

What you will achieve

At the end of this guide you will have understood what you need to consider when starting to implement validation services for your XML-based specification. You will also have gone through the steps to bring it online and make it available to your users.

An XML validation service can be created using multiple approaches depending on your needs. You can have an on-premise (or local to your workstation) service through Docker or use the test bed’s resources and, with minimal configuration, bring online a public service that is automatically kept up-to-date.

For the purpose of this guide you will be presented the options to consider and start with a Docker-based instance that could be replaced (or complemented) by a setup through the test bed. Interestingly, the configuration relevant to the validator is the same regardless of the approach you choose to follow.

What you will need

  • About 30 minutes.

  • A text editor.

  • A web browser.

  • Access to the Internet.

  • Docker installed on your machine (only if you want to run the validator as a Docker container).

  • A basic understanding of XML-related technologies. For more information you can check out online resources on XML Schema (XSD), XPath and Schematron.

How to complete this guide

The steps described in this guide are for the most part hands-on, resulting in you creating a fully operational validation service. For these practical steps there are no prerequisites and the content for all files to be created are provided in each step. In addition, if you choose to try your setup as a Docker container you will also be issuing commands on a command line interface (all commands are provided and explained as you proceed).

Steps

You can complete this guide by following the steps described in this section. Not all steps are required, with certain ones being optional or complementary depending on your needs. The following diagram presents an overview of all steps highlighting the ones that apply in all cases (marked as mandatory):

../_images/step_overview4.png

When and why you should skip or consider certain steps depends on your testing needs. Each step’s description covers the options you should consider and the next step(s) to follow depending on your choice.

Step 1: Determine your testing needs

Before proceeding to setup your validator you need to clearly determine your testing needs. A first outline of the approach to follow would be provided by answering the following questions:

  • Will the validator be available to your users as a tool to be used on an ad-hoc basis?

  • Do you plan on measuring the conformance of your community’s members to the XML-based specification?

  • Is the validator expected to be used in a larger conformance testing context (e.g. during testing of a message exchange protocol)?

  • Should the validator be publicly accessible?

  • Should test data and validation reports be treated as confidential?

The first choice to make is on the type of solution that will be used to power your validation service:

  • Standalone validator: A service allowing validation of individual XML instances based on a predefined configuration of validation artefacts, including XML Schema (XSD) (for syntax validation) and Schematron (for business rule validation). The service supports fine-grained customisation and configuration of different validation types (e.g. specification versions) and supported communication channels. Importantly, use of the validator is anonymous and it is fully stateless in that none of the test data or validation reports are maintained once validation completes.

  • Complete test bed: The test bed is used to realise a full conformance testing campaign. It supports the definition of test scenarios as test cases, organised in test suites that are linked to specifications. Access is account-based allowing users to claim conformance to specifications and execute in a self-service manner their defined test cases. All results are recorded to allow detailed reporting, monitoring and eventually certification. Test cases can address XML validation but are not limited to that, allowing validation of any complex exchange of information.

It is important to note that these two approaches are by no means exclusive. It is often the case that a standalone validator is defined as a first step that is subsequently used from within test cases in the test bed. The former solution offers a community tool to facilitate work towards compliance supporting ad-hoc data validation, whereas the latter allows for rigorous conformance testing to take place where proof of conformance is required. This could apply in cases where conformance is a qualification criterion before receiving funding or before being accepted as a partner in a distributed system. Finally, it is interesting to consider that non-trivial XML validation may involve multiple validation artefacts (e.g. Schematron files). In such a case, even if ad-hoc data validation is not needed, defining a separate validator simplifies management of the validation artefacts by consolidating them in a single location, as opposed to bundling them within test suites.

Regardless of the choice of solution, the next point to consider will be the type of access. If public access is important then the obvious choice is to allow access over the Internet. An alternative would be an installation that allows access only through a restricted network, be it an organisation’s internal network or a virtual private network accessible only by your community’s members. Finally, an extreme case would be access limited to individual workstations where each community member would be expected to run the service locally (albeit of course without the expectation to test message exchanges with remote parties).

If access to your validation services over the Internet is preferred or at least acceptable, the simplest case is to opt for using the shared DIGIT test bed resources, both regarding the standalone validator and the test bed itself. If such access is not acceptable or is technically not possible (e.g. access to private resources is needed), the proposed approach would be to go for a Docker-based on-premise installation of all components.

Summarising the options laid out in this section, you will first want to choose:

  • Whether you will be needing a standalone validator, a complete test bed or both.

  • Whether the validator and/or test bed will be accessible over the Internet or not.

Your choices here can help you better navigate the remaining steps of this guide. Specifically:

Step 2: Prepare validation artefacts

As an example case for XML validation we will consider the EU purchase order case first seen in Guide: Creating a test suite. In short, for the purposes of this guide you are considered to be leading an EU cross-border initiative to define a new common specification for the exchange of purchase orders between retailers.

To specify the content of purchase orders your experts have created the following XML Schema:

<xs:schema xmlns:xs="http://www.w3.org/2001/XMLSchema" targetNamespace="http://itb.ec.europa.eu/sample/po.xsd" xmlns="http://itb.ec.europa.eu/sample/po.xsd" elementFormDefault="qualified">

  <xs:element name="purchaseOrder" type="PurchaseOrderType"/>

  <xs:element name="comment" type="xs:string"/>

  <xs:complexType name="PurchaseOrderType">
    <xs:sequence>
      <xs:element name="shipTo" type="Address"/>
      <xs:element name="billTo" type="Address"/>
      <xs:element ref="comment" minOccurs="0"/>
      <xs:element name="items"  type="Items"/>
    </xs:sequence>
    <xs:attribute name="orderDate" type="xs:date"/>
  </xs:complexType>

  <xs:complexType name="Address">
    <xs:sequence>
      <xs:element name="name"   type="xs:string"/>
      <xs:element name="street" type="xs:string"/>
      <xs:element name="city"   type="xs:string"/>
      <xs:element name="zip"    type="xs:decimal"/>
    </xs:sequence>
    <xs:attribute name="country" type="CountryType" use="required"/>
  </xs:complexType>

  <xs:complexType name="Items">
    <xs:sequence>
      <xs:element name="item" minOccurs="0" maxOccurs="unbounded">
        <xs:complexType>
          <xs:sequence>
            <xs:element name="productName" type="xs:string"/>
            <xs:element name="quantity" type="xs:positiveInteger"/>
            <xs:element name="priceEUR"    type="xs:decimal"/>
            <xs:element ref="comment"   minOccurs="0"/>
          </xs:sequence>
          <xs:attribute name="partNum" type="xs:string" use="required"/>
        </xs:complexType>
      </xs:element>
    </xs:sequence>
  </xs:complexType>

  <xs:simpleType name="CountryType">
    <xs:restriction base="xs:string">
      <xs:pattern value="[A-Z]{2}"/>
    </xs:restriction>
  </xs:simpleType>

</xs:schema>

Based on this, a sample purchase order would be as follows:

<?xml version="1.0"?>
<purchaseOrder xmlns="http://itb.ec.europa.eu/sample/po.xsd" orderDate="2018-01-22">
  <shipTo country="BE">
    <name>John Doe</name>
    <street>Europa Avenue 123</street>
    <city>Brussels</city>
    <zip>1000</zip>
  </shipTo>
  <billTo country="BE">
    <name>Jane Doe</name>
    <street>Europa Avenue 210</street>
    <city>Brussels</city>
    <zip>1000</zip>
  </billTo>
  <comment>Send in one package please</comment>
  <items>
    <item partNum="XYZ-123876">
      <productName>Mouse</productName>
      <quantity>20</quantity>
      <priceEUR>15.99</priceEUR>
      <comment>Confirm this is wireless</comment>
    </item>
    <item partNum="ABC-32478">
      <productName>Keyboard</productName>
      <quantity>15</quantity>
      <priceEUR>25.50</priceEUR>
    </item>
  </items>
</purchaseOrder>

A first obvious validation for purchase orders would be against their XML Schema. However, your business requirements also define the concept of a large purchase order which is one that includes more than 10 of each ordered item. This restriction is not reflected in the XML Schema which is considered as a base for all purchase orders but rather in a Schematron rule file that checks this only for orders that are supposed to be “large”. Such a rule file would be as follows:

<?xml version="1.0" encoding="UTF-8"?>
<schema xmlns="http://purl.oclc.org/dsdl/schematron" queryBinding="xslt2">
  <title>Large Purchase Order business rules</title>
  <ns prefix="po" uri="http://itb.ec.europa.eu/sample/po.xsd"/>
  <pattern name="Check order items">
    <rule context="/po:purchaseOrder/po:items/po:item">
      <assert test="number(po:quantity) > 10" flag="fatal" id="PO-01">[PO-01] The quantities of items for large orders must be greater than 10.</assert>
    </rule>
  </pattern>	
</schema>

Given these requirements and validation artefacts we want to support two types of validation (or profiles):

  • basic: For all purchase orders acting as a common base. This is realised by XML Schema validation.

  • large: For large purchase orders. This includes validation against the XML Schema and also against the relevant Schematron rule.

As the first configuration step for the validator we will prepare a folder with the required resources. For this purpose create a root folder named validator with the following subfolders and files:

validator
 |
 +-- resources
      |
      +-- order
           |
           +-- sch
                |
                +-- PurchaseOrder.xsd
           +-- xsd
                |
                +-- LargePurchaseOrder.sch

You will likely note that we are creating several folders of no obvious use. Nonetheless please follow this structure as it will facilitate subsequent steps where we add resources depending on our needs. In terms of meaning of these folders consider the following:

  • validator is the root folder for all files.

  • resources is the root folder for all files that will be considered by the validator.

  • order is the root folder for all files pertinent to purchase order validation. We separate this as the validator could be used to also validate completely different content.

  • sch is the folder containing all Schematron files.

  • xsd is the folder containing all XML Schemas.

Regarding the PurchaseOrder.xsd and LargePurchaseOrder.sch files you can create them from the above content or download them (here: PurchaseOrder.xsd and LargePurchaseOrder.sch). Finally, note that you are free to use any names for the files and folders; the ones used here will however be the ones considered in this guide’s subsequent steps.

Step 3: Prepare validator configuration

After having defined your testing needs and the validation artefacts for your specific case, the next step will be to configure the validator. The validator is defined by a core engine maintained by the test bed team and a layer of configuration, provided by you, that defines its use for a specific scenario. In terms of features the validator supports the following:

  • Validation channels including a SOAP web service API, a web user interface, validation via email and a command-line tool.

  • Configuration of XML Schema and Schematron validation artefacts to drive the validation that can be local or remote.

  • Definition of different validation types as logically-related sets of validation artefacts.

  • Support per validation type allowing user-provided XML Schema and Schematron extensions.

  • Definition of separate validator configurations that are logically split but run as part of a single validator instance. Such configurations are termed “validation domains”.

  • Customisation of all texts presented to users.

Configuration is provided by means of key-value pairs in a property file. This file can be named as you want but needs to end with the .properties extension. In our case we will name this config.properties and place it within the order folder. Recall that the purpose of this folder is to store all resources relevant to purchase order validation. These are the validation artefacts themselves (PurchaseOrder.xsd and LargePurchaseOrder.sch) and the configuration file (config.properties).

Define the content of the config.properties file as follows:

# The different types of validation to support. These values are reflected in other properties.
validator.type = basic, large
# Labels to describe the defined types.
validator.typeLabel.basic = Basic purchase order
validator.typeLabel.large = Large purchase order
# Validation artefacts (XML Schema) to consider for the "basic" type.
validator.schemaFile.basic = xsd/PurchaseOrder.xsd
# Validation artefacts (XML Schema and Schematron) to consider for the "large" type.
validator.schemaFile.large = xsd/PurchaseOrder.xsd
validator.schematronFile.large = sch/LargePurchaseOrder.sch
# The title to display for the validator's user interface.
validator.uploadTitle = Purchase Order Validator

All validator properties share a validator. prefix. The validator.type property is key as it defines one or more types of validation that will be supported (multiple are provided as a comma-separated list of values). The values provided here are important not only because they define the available validation types but also because they drive most other configuration properties. Regarding the validation artefacts themselves, these are provided by means of the validator.schemaFile and validator.schematronFile properties:

  • validator.schemaFile.TYPE defines one or more (comma-separated) file paths (relative to the configuration file) to lookup XML Schema files.

  • validator.schematronFile.TYPE defines one or more (comma-separated) file paths (relative to the configuration file) to lookup Schematron files.

Using these properties you define the validator’s validation artefacts as local files, where in both cases each provided path can be for a file or a folder. If a folder is referenced it will load all contained top-level files (i.e. ignoring subfolders). Note that if your XML Schema or Schematron files import or include other files you need to only point to the “root” or “master” file(s) per case. In case you want your validator to skip XML Schema or Schematron validation you would simply not include any of the relevant configuration properties.

Note

Further validation artefact configuration: You may also define validation artefacts as remote resource references and/or as being user-provided. In addition, you can pre-process configured artefacts before they are used for validation.

The purpose of the remaining properties is to customise the text descriptions presented to users:

  • validator.typeLabel defines a label to present to users on the validator’s user interface for the type in question.

  • validator.uploadTitle defines the title label to present to users on the validator’s user interface.

Once you have created the config.properties file, the validator folder should be as follows:

validator
 |
 +-- resources
      |
      +-- order
           |
           +-- config.properties
           +-- sch
                |
                +-- PurchaseOrder.xsd
           +-- xsd
                |
                +-- LargePurchaseOrder.sch

This limited configuration file assumes numerous default configuration properties. An important example is that by default, the validator will expose a web user interface and a SOAP web service API. This configuration is driven through the validator.channels property that by default is set to form, webservice (for a user form and SOAP web service respectively). All configuration properties provided in config.properties relate to the specific domain in question, notably purchase orders, reflected in the validator’s resources as the order folder. Although rarely needed, you may define additional validation domains each with its own set of validation artefacts and configuration file (see Configuring additional validation domains for details on this). Finally, if you are planning to host your own validator instance you can also define configuration at the level of the complete validator (see Additional configuration options regarding application-level configuration options).

For the complete reference of all available configuration properties and their default values refer to section Validator configuration properties.

Remote validation artefacts

Defining the validator’s artefacts as local files is not the only option. If these are available online you can also reference them remotely by means of the following properties:

  • validator.schemaFile.TYPE.remote.N.url for an XML Schema that is to be loaded remotely (e.g. from a GitHub repository).

  • validator.schematronFile.TYPE.remote.N.url for one or more Schematron files that are to be loaded remotely (e.g. from a GitHub repository).

The N element in the properties’ names is a zero-based positive integer allowing you to define more than one entries to match the number of remote files. Similar to the case of local files, you are expected to only reference the “master” or “root” files assuming that included resources can also be looked up remotely based on the defined locations. Note that loading referenced resources from within remote Schematrons (e.g. via sch:include) is currently not supported.

The example that follows illustrates the loading of one remote XML Schema file and one Schematron file for a validation type named v2 from a remote location:

validator.type = v2
...
validator.schemaFile.v2.remote.0.url = https://my.server.com/my_rules_1.xsd
validator.schematronFile.v2.remote.0.url = https://my.server.com/my_rules_2.sch

You may also combine local and remote Schematron files by defining a validator.schematronFile.TYPE property and one or more validator.schematronFile.TYPE.remote.N.url properties. In all cases, the Schematrons from all sources will be aggregated into a single model for the validation. Such combinations are not possible for XML Schemas where only one schema source is considered.

Note

Remote XML Schema and Schematron caching: Caching is used to avoid constant lookups of remote files. Once loaded, remote files will be automatically refreshed every hour.

User-provided validation artefacts

Apart from defining the XML Schema and Schematron to apply as local and/or remote files, you may also define for a given validation type to allow or not user-provided XSDs and user-provided Schematrons. This is achieved through the following properties:

...
validator.externalSchemaFile.TYPE = optional
validator.externalSchematronFile.TYPE = required

These properties allow three possible values:

  • required: The relevant validation artefact(s) must be provided by the user.

  • optional: Providing the relevant validation artefact(s) is allowed but not mandatory.

  • none (the default value): No such validation artefacts are requested or considered.

Specifying that for a given validation type you allow users to provide XML Schema and Schematron artefacts will result in any such extensions being combined with your predefined artefacts (if present). This could be useful in scenarios where you want to define a common validation base but allow also ad-hoc extensions for e.g. restrictions defined at user-level (e.g. National validation rules to consider in addition to a common set of EU rules). In the case of XML Schemas, if predefined ones are present no user-provided ones are ever considered (i.e. the relevant property is fixed as none).

Note

Generic validator: It is possible to not predefine any XML Schemas or Schematron resulting in a validator that is trully generic, expecting all validation artefacts to be provided by users. Such a generic instance actually exists at https://www.itb.ec.europa.eu/xml/upload.

Validation artefact pre-processing

An advanced configuration option available to you is to enable for a given validation type the pre-processing of the validator’s artefacts, both pre-configured ones (local or remote) as well as those provided by users (if enabled). Pre-processing allows you to run an XSLT transformation on a resource, in order to produce the final file to be used for the validation (either an XML Schema or Schematron). The input for such pre-processing can be any text-based file, allowing you to dynamically generate validation artefacts based on a flexible input.

XSLT pre-processing can be configured for all cases of resources:

  • Local pre-configured files (per file).

  • Remote pre-configured files (per file).

  • User-provided files (per file type).

Pre-processing is enabled by adding to the relevant configuration properties the following postfixes:

  • .preprocessor: The reference to a locally available XSLT file to be used for the transformation.

  • .preprocessor.output: The file extension for the resulting file (by default xsd for XML Schema and sch for Schematron).

Defining the .preprocessor.output postfix can be interesting for Schematron output in case the result is not a raw Schematron (.sch) but rather is itself an XSLT file.

To illustrate use of these properties consider the following sample for a v2 validation type that addresses all cases:

...
# A local XML file to be used as input to generate the XML Schema
validator.schemaFile.v2 = xsds/MyXSD.xml
validator.schemaFile.v2.preprocessor = resources/xsd_template.xslt
# A remote XML file to be used as input to generate the (raw) Schematron
validator.schematronFile.v2.remote.0.url = https://my.server.com/my_rules.xml
validator.schematronFile.v2.remote.0.url.preprocessor = resources/sch_template.xslt
validator.schematronFile.v2.remote.0.url.preprocessor.output = sch
# User-provided XML files to be used to generate (raw) Schematron files
validator.externalSchematronFile.v2 = required
validator.externalSchematronFile.v2.preprocessor = resources/sch_template.xslt
validator.externalSchematronFile.v2.preprocessor.output = sch

The above configuration effectively expects all pre-configured resources and user input to serve as input in generating the actual validation artefacts to use. The XML Schema will be generated using XSLT file resources/sch_template.xslt whereas Schematrons will be generated using the resources/sch_template.xslt file.

Using such pre-processing can allow you to achieve powerful customisations for your XML validator. A good example would be a configuration with a fixed, predefined XML Schema, with Schematron files that are generated on-the-fly based on one or more XML files provided by the user. To complement such customisation, the label used to prompt users for Schematron files would also be adapted via the validator.label.externalSchematronLabel property to reflect the expected XML input files (see Properties related to UI labels).

Supporting multiple languages

Certain configuration properties we have seen up to now define texts that are visible to the validator’s users. Examples of these include the title of the validator’s user interface (validator.uploadTitle) or the labels to present for the available validation types (validator.typeLabel.TYPE), which in the sample configuration are set with English values. Depending on your validator’s audience you may want to switch to a different language or support several languages at the same time. Supporting multiple languages affects:

  • The texts, labels and messages presented on the validator’s user interface.

  • The reports produced after validating content via any of the validator’s interfaces.

The text values used by default by the validator are defined in English (see default values here), with English being the language considered by the validator if no other is selected. If your validator needs to support only a single language, a simple approach is to ensure that the domain-level configuration properties for texts presented to users are defined in the domain configuration file with the values for your selected language. Note that as long as the validator’s target language is an EU official language you need not provide translations for user interface labels and messages as these are defined by the validator itself. You are nonetheless free to redefine these to override the defaults or to define them for a non-supported language.

In case you want your validator to support multiple languages at the same time you need to adapt your configuration to define the supported languages and their specific translations. To do this adapt your domain configuration property file making use of the following properties:

  • validator.locale.available: The list of languages to be supported by the validator, provided as a comma-separated list of language codes (locales). The order these are defined with determines their presentation order in the validator’s user interface.

  • validator.locale.default: The validator’s default language, considered if no specific language has been requested. If multiple languages are supported the default needs to be set to one of these.

  • validator.locale.translations: The path to a folder (absolute or relative to the domain configuration file) that contains the translation property files for the validator’s supported languages.

Each language (locale) is defined as a 2-character lowercase language code (based on ISO 639), followed by an optional 2-character uppercase country code (based on ISO 3166) separated using an underscore character (_). The format is in fact identical to that used to define locales in the Java language. Valid examples include “fr” for French, “fr_FR” for French in France, or “fr_BE” for French in Belgium. Such language codes are the values expected to be used for properties validator.locale.available and validator.locale.default.

Regarding property validator.locale.translations, the value is expected to be a folder containing the translation files for your selected languages. These are defined exactly as you would define a resource bundle in a Java program, specifically:

  • The names of all property files start with the same prefix. Any value can be selected with good examples being “labels” or “translations”.

  • The common prefix is followed by the relevant locale value (language code and country code) separated with an underscore.

  • The files’ extension is set as “.properties”.

Considering the above, good examples of translation property file names would be “labels_de.properties”, “labels_fr.properties” and “labels_fr_FR.properties”. Note that these files are implicitly hierarchical meaning that for related locales you need not redefine all properties. For example you may have your French texts defined in “labels_fr.properties” and only override what you need in “labels_fr_BE.properties”. You can also define an overall default translation file by omitting the locale in its name (labels.properties) which will be used when no locale-specific file exists or if it exists but does not include the given property key. Note additionally that if you define translatable text values in your main domain configuration file these are considered as overall defaults if no specific translations could be found in translation files.

In terms of contents, the translation files are simple property files including key-value pairs. Each such pair defines as its key the property key for the given text, with the value being the translation to use. The properties that can be defined in such files are:

Considering that you typically wouldn’t need to override labels and messages, the texts you would need to translate are the ones relevant to your specification. These are most often the following:

  • The title of the validator’s UI (validator.uploadTitle).

  • The UI’s HTML banner content (validator.bannerHtml).

  • The descriptions for the validation types that you define and their options (validator.typeLabel.TYPE, validator.optionLabel.OPTION, validator.typeOptionLabel.TYPE.OPTION and validator.completeTypeOptionLabel.TYPE.OPTION).

The information up to this point covers the translation of texts, labels and messages but has not yet addressed the validator’s validation artefacts. For the XML validator these artefacts are XML Schemas and Schematron files to validate XML syntax and content respectively. For XML Schemas default translations are already in place for you but you will still need to define language-specific messages for your Schematron files. This is done by defining a diagnostics element within which you define a diagnostic for each translated message. The diagnostic messages define an identifier (attribute id) to allow referencing, and a language attribute (attribute xml:lang) to specify the relevant language. The Schematron assertions (elements assert) are then updated to reference with their diagnostics attributes the IDs of the corresponding diagnostic elements, separated with spaces.

To illustrate how all this comes together let’s revisit our Purchase Order example. In our current, single-language and English-only setup, the configuration files are structured as follows:

validator
 |
 +-- resources
      |
      +-- order
           |
           +-- config.properties
           +-- sch
                |
                +-- PurchaseOrder.xsd
           +-- xsd
                |
                +-- LargePurchaseOrder.sch

The domain configuration file (config.properties) defines itself the user-presented texts (see highlighted lines):

validator.type = basic, large
validator.typeLabel.basic = Basic purchase order
validator.typeLabel.large = Large purchase order
validator.schemaFile.basic = xsd/PurchaseOrder.xsd
validator.schemaFile.large = xsd/PurchaseOrder.xsd
validator.schematronFile.large = sch/LargePurchaseOrder.sch
validator.uploadTitle = Purchase Order Validator

In addition, our Schematron file (LargePurchaseOrder.sch) includes in its assert element the text of the rule’s error message in English:

<?xml version="1.0" encoding="UTF-8"?>
<schema xmlns="http://purl.oclc.org/dsdl/schematron" queryBinding="xslt2">
    <title>Large Purchase Order business rules</title>
    <ns prefix="po" uri="http://itb.ec.europa.eu/sample/po.xsd"/>
    <pattern name="Check order items">
        <rule context="/po:purchaseOrder/po:items/po:item">
            <assert test="number(po:quantity) > 10" flag="fatal" id="PO-01">[PO-01] The quantities of items for large orders must be greater than 10.</assert>
        </rule>
    </pattern>
</schema>

Starting from this point we will make the necessary changes to support alongside English (which remains the default language), German and French translations. The first step is to adapt the config.properties file to remove the contained translations. We could have kept these here for English but as we will be adding specific translation files it is cleaner to move all translations to them. The content of config.properties becomes now as follows:

validator.type = basic, large
validator.schemaFile.basic = xsd/PurchaseOrder.xsd
validator.schemaFile.large = xsd/PurchaseOrder.xsd
validator.schematronFile.large = sch/LargePurchaseOrder.sch
validator.locale.available = en,fr,de
validator.locale.default = en
validator.locale.translations = translations

To define the translations we will introduce a new folder translations (as defined in property validator.locale.translations) that includes the property files per locale:

validator
 |
 +-- resources
      |
      +-- order
           |
           +-- config.properties
           +-- sch
                |
                +-- PurchaseOrder.xsd
           +-- xsd
                |
                +-- LargePurchaseOrder.sch
           +-- translations
                |
                +-- labels_en.properties
                +-- labels_fr.properties
                +-- labels_de.properties

The English translations are provided in labels_en.properties (these are simply moved here from the config.properties file):

validator.typeLabel.basic = Basic purchase order
validator.typeLabel.large = Large purchase order
validator.uploadTitle = Purchase Order Validator

French translations are defined in labels_fr.properties:

validator.typeLabel.basic = Bon de commande de base
validator.typeLabel.large = Bon de commande important
validator.uploadTitle = Validateur de bon de commande

And finally German translations are defined in labels_de.properties:

validator.typeLabel.basic = Grundbestellung
validator.typeLabel.large = Großbestellung
validator.uploadTitle = Bestellbestätigung

Finally, we must not forget to adapt LargePurchaseOrder.sch with the error messages per language:

<?xml version="1.0" encoding="UTF-8"?>
<schema xmlns="http://purl.oclc.org/dsdl/schematron" queryBinding="xslt2">
    <title>Large Purchase Order business rules</title>
    <ns prefix="po" uri="http://itb.ec.europa.eu/sample/po.xsd"/>
    <pattern name="Check order items">
        <rule context="/po:purchaseOrder/po:items/po:item">
            <assert test="number(po:quantity) > 10" flag="fatal" id="PO-01" diagnostics="d01_en d01_fr d01_de"/>
        </rule>
    </pattern>
    <diagnostics>
        <diagnostic id="d01_en" xml:lang="en">[PO-01] The quantities of items for large orders must be greater than 10.</diagnostic>
        <diagnostic id="d01_fr" xml:lang="fr">[PO-01] Les quantités d'articles pour les grosses commandes doivent être supérieures à 10.</diagnostic>
        <diagnostic id="d01_de" xml:lang="de">[PO-01] Die Artikelmengen für Großbestellungen müssen größer als 10 sein.</diagnostic>
    </diagnostics>
</schema>

This completes the validator’s localisation configuration. With this setup in place, the user will be able to select one of the supported languages to change the validator’s user interface and resulting report. Note that localised reports can also now be produced from the validator’s SOAP API and command-line tool.

Step 4: Setup validator as Docker container

Note

When to setup a Docker container: The purpose of setting up your validator as a Docker container is to host it yourself or run it locally on workstations. If you prefer or don’t mind the validator being accessible over the Internet it is simpler to delegate hosting to the test bed team by reusing the test bed’s infrastructure. If this is the case skip this section and go directly to Step 5: Setup validator on test bed. Note however that even if you opt for a validator managed by the test bed, it may still be interesting to create a Docker image for development purposes (e.g. to test new validation artefact versions) or to make available to your users as a complementary service (i.e. use online or download and run locally).

Once the validator’s configuration is ready (configuration file and validation artefacts) you can proceed to create a Docker image.

The configuration for your image is driven by means of a Dockerfile. Create this file in the validator folder with the following contents:

FROM isaitb/xml-validator:latest
COPY resources /validator/resources/
ENV validator.resourceRoot /validator/resources/

This Dockerfile represents the most simple Docker configuration you can provide for the validator. Let’s analyse each line:

FROM isaitb/xml-validator:latest

This tells Docker that your image will be built over the latest version of the test bed’s isaitb/xml-validator image. This represents the validator’s core that expects configuration to drive the validation. It is available on the public Docker Hub and as such can be directly used through any Docker installation with Internet access.

COPY resources /validator/resources/

This copies your resources folder to the image under path /validator/resources/.

ENV validator.resourceRoot /validator/resources/

This instructs the validator that it should consider as the root of all its configuration resources the /validator/resources/ folder (which was just copied into it).

The contents of the validator folder should now be as follows:

validator
 |
 +-- Dockerfile
 +-- resources
      |
      +-- order
           |
           +-- config.properties
           +-- sch
                |
                +-- PurchaseOrder.xsd
           +-- xsd
                |
                +-- LargePurchaseOrder.sch

That’s it. To build the Docker image open a command prompt to the validator folder and issue:

docker build -t po-validator .

This command will create a new local Docker image named po-validator based on the Dockerfile it finds in the current directory. It will proceed to download missing images (e.g. the isaitb/xml-validator:latest image) and eventually print the following output:

Sending build context to Docker daemon  32.77kB
Step 1/3 : FROM isaitb/xml-validator:latest
---> 39ccf8d64a50
Step 2/3 : COPY resources /validator/resources/
---> 66b718872b8e
Step 3/3 : ENV validator.resourceRoot /validator/resources/
---> Running in d80d38531e11
Removing intermediate container d80d38531e11
---> 175eebf4f59c
Successfully built 175eebf4f59c
Successfully tagged po-validator:latest

The new po-validator:latest image can now be pushed to a local Docker registry or to the Docker Hub. In our case we will proceed directly to run this as follows:

docker run -d --name po-validator -p 8080:8080 po-validator:latest

This command will create a new container named po-validator based on the po-validator:latest image you just built. It is set to run in the background (-d) and expose its internal listen port through the Docker machine (-p 8080:8080). Note that by default the listen port of the container (which you can map to any available host port) is 8080.

Your validator is now online and ready to validate XML documents. If you want to try it out immediately skip to Step 6: Use the validator. Otherwise, read on to see additional configuration options for the image.

Running without a custom Docker image

The discussed approach involved building a custom Docker image for your validator. Doing so allows you to run the validator yourself but also potentially push it to a public registry such as the Docker Hub. This would then allow anyone else to pull it and run a self-contained copy of your validator.

If such a use case is not important for you, or if you want to only use Docker for your local artefact development, you could also skip creating a custom image and use the base isaitb/xml-validator image directly. To do so you would need to:

  • Define a named or unnamed Docker volume pointing to your configuration files.

  • Run your container by configuring it with the volume.

Considering the same file structure of the /validator folder you can launch your validator using an unnamed volume as follows:

docker run -d --name po-validator -p 8080:8080 \
           -v /validator/resources:/validator/resources/ \
           -e validator.resourceRoot=/validator/resources/  \
           isaitb/xml-validator

As you see here we create the validator directly from the base image and pass it as a volume our resource root folder. When doing so you need to also make sure that the validator.resourceRoot environment variable is set to the path within the container.

Using this approach to run your validator has the drawback of being unable to share it as-is with other users. The benefit however is one of simplicity given that there is no need to build intermediate images. As such, updating the validator for configuration changes means that you only need to restart your container.

Configuring additional validation domains

Up to this point you have configured validation for purchase orders which defines one or more validation types (basic and large). This configuration can be extended by providing additional types to reflect:

  • Additional profiles with different business rules (e.g. minimal).

  • Specification versions (e.g. basic_v1.0, large_v1.0, basic_v1.1_beta).

  • Other types of content that are linked to purchase orders (e.g. purchase_order_basic_v1.0 and order_receipt_v1.0).

All such extensions would involve defining potentially additional validation artefacts and updating the config.properties file accordingly.

Apart from extending the validation possibilities linked to purchase orders you may want to configure a completely separate validator to address an unrelated specification that would most likely not be aimed to the same user community. To do so you have two options:

  • Repeat the previous steps to define a separate configuration and a separate Docker image. In this case you would be running two separate containers that are fully independent.

  • Reuse your existing validator instance to define a new validation domain. The result will be two validation services that are logically separate but are running as part of a single validator instance.

The rationale behind the second option is simply one of required resources. If you are part of an organisation that needs to support validation for dozens of different XML-based specifications that are unrelated, it would probably be preferable to have a single application to host rather than one per specification.

Note

Sharing artefact files accross domains: Setting the application property validator.restrictResourcesToDomain to false allows to add paths of validation artefacts that are outside of the domain root folder. This enables sharing artefacts between different domains.

In your current single domain setup, the purchase order configuration is reflected through folder order. The name of this folder is also by default assumed to match the name of the domain. A new domain could be named invoice that is linked to XML-based invoices. This is represented by an invoice folder next to order that contains similarly its validation artefacts and domain-level configuration property file. Considering this new domain, the contents of the validator folder would be as follows:

validator
 |
 +-- Dockerfile
 +-- resources
      |
      +-- invoice
        (Further contents skipped)
      +-- order
        (Further contents skipped)

If you were now to rebuild the validator’s Docker image this would setup two logically-separate validation domains (invoice and order).

Note

Validation domains vs types: In almost all scenarios you should be able to address your validation needs by having a single validation domain with multiple validation types. Validation types under the same domain will all be presented as options for users. Splitting in domains would make sense if you don’t want the users of one domain to see the supported validation types of other domains.

Important: Support for such configuration is only possible if you are defining your own validator as a Docker image. If you plan to use the test bed’s shared validator instance (see Step 5: Setup validator on test bed), your configuration needs to be limited to a single domain. Note of course that if you need additional domains you can in this case simply repeat the configuration process multiple times.

Additional configuration options

We have seen up to now that configuring how validation takes place is achieved through domain-level configuration properties provided in the domain configuration file (file config.properties in our example). When setting up the validator as a Docker image you may also make use of application-level configuration properties to adapt the overall validator’s operation. Such configuration properties are provided as environment variables through ENV directives in the Dockerfile.

We already saw this when defining the validator.resourceRoot property that is the only mandatory property for which no default exists. Other such properties that you may choose to override are:

  • validator.domain: A comma-separated list of names that are to be loaded as the validator’s domains. By default the validator scans the provided validator.resourceRoot folder and selects as domains all subfolders that contain a configuration property file (folder order in our case). You may want to configure the list of folder names to consider if you want to ensure that other folders get ignored.

  • validator.domainName.DOMAIN: A mapping for a domain (replacing the DOMAIN placeholder) that defines the name that should be presented to users. This would be useful if the folder name itself (order in our example) is not appropriate (e.g. if the folder was named files).

  • validator.acceptedMimeTypes: The mime types of received XML files to consider as acceptable input for the validator.

  • validator.acceptedSchematronExtensions: The file extensions to consider when looking up Schematron files.

The following example Dockerfile illustrates use of these properties. The values set correspond to the applied defaults so the resulting Docker images from this Dockerfile and the original one (see Step 4: Setup validator as Docker container) are in fact identical:

FROM isaitb/xml-validator:latest
COPY resources /validator/resources/
ENV validator.resourceRoot /validator/resources/
ENV validator.domain order
ENV validator.domainName.order order
ENV validator.acceptedMimeTypes application/xml, text/xml, text/plain
ENV validator.acceptedSchematronExtensions xsl, xslt, sch

See Application-level configuration for the full list of supported application-level properties.

Finally, it may be the case that you need to adapt further configuration properties that relate to how the validator’s application is ran. The validator is built as a Spring Boot application which means that you can override all configuration properties by means of environment variables. This is rarely needed as you can achieve most important configuration through the way you run the Docker container (e.g. defining port mappings). Nonetheless the following adapted Dockerfile shows how you could ensure the validator’s application starts up on another port (9090) and uses a specific context path (/ctx).

FROM isaitb/xml-validator:latest
COPY resources /validator/resources/
ENV validator.resourceRoot /validator/resources/
ENV server.servlet.context-path /ctx
ENV server.port 9090

Note

Custom port: Even if you define the server.port property to a different value other than the default 8080 this remains internal to the Docker container. The port through which you access the validator will be the one you map on your host through the -p flag of the docker run command.

The full list of such application configuration properties, as well as their default values, are listed in the Spring Boot configuration property documentation.

Environment-specific domain configuration

In the previous section you saw how could can configure the validator’s application by means of environment properties. Use of environment properties is also possible for specific validator domains, allowing you to externalise and override their configuration aspects. Typical cases where you may want to do this are:

  • To adapt the configuration for different instances of the same validator.

  • To hide sensitive properties such as internal URLs or passwords.

External configuration properties can be provided through environment variables or system properties (the latter being interesting when using a command line validator). These can contribute to the configuration:

  • Complete properties, by defining a variable or property with the same name as a configuration property.

  • Values, by referring to arbitrary variables or properties within a domain property file. This is done by defining a placeholder ${} and using within it the prefixes env: for environment variables or sys: for system properties (e.g. ${env:MY_VARIABLE}).

As a simple example of this, consider the definition of the title of the validator’s web UI. We want to adapt this title depending on the purpose of the specific validator instance. To begin with we can define the title via the validator.uploadTitle property in the config.properties file as follows:

validator.uploadTitle = Purchase Order Validator

The problem with this approach is that the upload title remains fixed across all instances of the validator. Alternatively we can define the value for the title as an environment variable named VALIDATOR_TITLE. To use this we can adapt config.properties to reference it as follows:

validator.uploadTitle = ${env:VALIDATOR_TITLE}

We can then adapt this for each Docker container we create by defining a specific value for the title:

docker run -e VALIDATOR_TITLE="Purchase Order Validator (Demo)" ...

A further alternative would be to externalise the complete validator.uploadTitle property by removing it from config.properties and specifying it as an environment variable:

docker run -e validator.uploadTitle="Purchase Order Validator (Demo)" ...

Note that such external configuration can also be used as a partial value. If we define an environment variable named VALIDATOR_ENV we could also use it within config.properties as follows:

validator.uploadTitle = Purchase Order Validator (${env:VALIDATOR_ENV})

Finally, you can use environment or system variables to override properties that are already defined in config.properties. This is useful if you use the file’s properties as default values and selectively override certain of them as needed. A property’s value is looked up in sequence as follows:

  1. Look in environment variables.

  2. If not found look in system properties.

  3. If not found look in the domain configuration file.

  4. If not found consider the property’s overall default value (see Domain-level configuration).

Step 5: Setup validator on test bed

Note

When to setup on test bed resources: Setting up your validator on the test bed’s resources removes hosting concerns and allows you to benefit from automatic service reloads for configuration changes. In doing so however you need to keep in mind that the validator will be exposed over the Internet. If this approach is not suitable for you (e.g. you want to expose the validator within a restricted network) you should consider setting up the validator as a Docker container (see Step 4: Setup validator as Docker container) that you can then host as you see fit.

To configure a validator using the test bed’s resources all you need to do is get in touch with the test bed team and provide the validator’s configuration. Specifically:

  1. Send an email to DIGIT-ITB@ec.europa.eu describing your case: This step is needed for two reasons. Firstly you may want to have a further discussion and potentially a demo to better understand the available options. Secondly the test bed’s team would need to ensure that you qualify to use its resources (to e.g. avoid that you are a private company planning to offer commercial validation services).

  2. Share the configuration for the validator: Once contact has been established you need to provide the initial configuration for the validator.

Regarding the second step, the validator’s configuration to be shared is the contents of the validator folder as described in Step 3: Prepare validator configuration. The eventual goal here will be to have the configuration available through an accessible Git repository. This can be done in a number of ways listed below in decreasing order of preference:

  • Create a new Git repository: You can push all resources (the validator folder) to a new Git repository (e.g. on GitHub or the European Commission’s CITNet Bitbucket server). You can of course add any other resources to this repository as you see fit (e.g. a README file). Once done provide the repository’s URL to the test bed team.

  • Provide the resources to the test bed team: You can send the configuration files themselves to the test bed’s team (e.g. make an archive of the validator folder). Ideally you should define the configuration file but if in doubt you can simply describe the resources and the test bed team will prepare the initial configuration for you. When following this approach a new Git repository will be created for you on the European Commission’s CITNet Bitbucket server for which you will be assigned write access (assuming you have a CITNet user account).

  • Update an existing Git repository: If you already have a Git repository to maintain the validation artefacts you can reuse this by adding to it the required configuration file (config.properties in our case). When ready you will need to provide the test bed team with the URL to the repository and the location of the configuration file.

Following the initial configuration, the resulting Git repository will be monitored to detect any changes to the validation artefacts or the configuration file. If such a change is detected, the validation service will be automatically updated within a few minutes.

Note

Using a dedicated Git repository for the validator: Whether you define a new Git repository yourself or the test bed team creates one for you, the result is a repository that is dedicated to the validator. This approach is preferable to reusing an existing Git repository to avoid unwanted changes to the validator. whether or not this is done through GitHub, CITNet’s Bitbucket or another service depends on what best suits your needs.

As part of the initial setup for the validator the test bed team will also configure how it is accessed. The name used will match the name of the folder that contains your configuration file (order in the considered example), but this can differ according to your preferences. If this is the case make sure to inform the test bed team of your preferred naming.

Considering our example, for a name of order, the resulting root URL through which the validator will be accessed is https://www.itb.ec.europa.eu/order. The specific paths will depend on the supported validation channels as described in Step 6: Use the validator.

Step 6: Use the validator

Well done! At this step your validator has been successfully configured and is ready to use. Depending on which approach was followed, this may have been done either:

The validation channels that are supported depend on the configuration you have supplied. This is done through the validator.channels property of your configuration file (config.properties) that defaults to form, webservice. The supported channels are as follows:

  • form: A web user interface allowing a user to upload the XML document to validate.

  • webservice: A SOAP web service API allowing for machine-to-machine integration.

  • email: Validation via email exchange (disabled by default).

The following sub-sections describe how each channel can be used considering the example EU purchase order specification.

Validation via user interface

The validator’s user interface is available at the /upload path. The exact path depends on how this is deployed:

The first page that you see is a simple form to upload the file to validate.

../_images/validator_upload3.png

This form expects the following input:

  • Content to validate: The content that will be validated.

  • Validate as: The type of validation to apply.

The dropdown menu to the right of the Content to validate label selects the input method. For this you have three choices:

  • File: Content provided as a file upload (the default).

  • URI: Content provided as a remote URI reference.

  • Direct input: Content encoded directly in an on-screen editor.

In case the validator is configured to support multiple languages, the form includes an additional dropdown menu to list them in the bottom-right corner. Selecting one of these languages will reload the interface and will record your choice to apply it automatically for future visits.

../_images/validator_upload_languages3.png

Note that all displayed labels can be adapted through the config.properties configuration file (see Properties related to UI labels). The available validation types match the ones defined in the validator.type property, displayed using the validator.typeLabel.TYPE labels. Moreover, the text title could be replaced by a configurable HTML banner, and further complemented with a HTML footer (see Domain-level configuration).

../_images/validator_upload_selected3.png

It is worth noting also that if your configuration defined only a single validation type, the user interface would be simplified by presenting only a single file upload input.

../_images/validator_upload_simple3.png

Finally, in case the validator’s configuration foresees user-provided validation artefacts (see User-provided validation artefacts) the interface is adapted to allow their provision. In the following example, the “large” validation type is set to allow optional external Schematron files. Any number of these can be provided by means of the provided icons.

../_images/validator_upload_external3.png

Once a file has been uploaded and the validation type is selected click the Validate button to trigger the validation. Upon completion you will be presented with the validation results:

../_images/validator_result3.png

This screen includes an overview of the result listing:

  • The validation timestamp (in UTC), the name of the validated file and the applied validation type.

  • The overall result (SUCCESS or FAILURE).

  • The number of errors, warnings and information messages.

This section is followed by the Details panel, where the details of each report item are listed:

  • It’s type (whether this is an error, warning or information message).

  • It’s description.

  • It’s location in the provided input.

Clicking on each item’s details will open a popup that shows within the provided content the specific point that triggered the issue:

../_images/validator_result_errordetail2.png

In terms or reporting, apart from the on-screen display, buttons are available allowing you to download the validation report:

  • View annotated input: Open a popup with the provided content, including annotations for the lines with relevant report items.

  • Download XML report: Download as XML (in GITB TRL syntax - sample here).

  • Download PDF report: Download as PDF (sample here).

Note that the download buttons are initially disabled but are enabled as soon as the respective reports become available.

Finally, to trigger a new validation you may either use the form from the top of the result screen or click on the form’s title that will take you back to the previous page.

Validation via minimal user interface

If you are exposing a web user interface (see Validation via user interface) for your validator you also have the option of enabling an alternative minimal interface that could be used as an embedded component in another web page (e.g. via an iframe). This is enabled through the validator.supportMinimalUserInterface property in your domain configuration (file config.properties).

...
validator.supportMinimalUserInterface = true

The result of this is to expose a /uploadm path. The path depends on how this is deployed:

The minimal interface offers largely the same functionality as the complete one but with a more condensed layout and minimal styling. The initial input page you see for the validator is as follows:

../_images/validator_upload_minimal3.png

The most significant difference is the result page which provides only an overview and the relevant download controls:

../_images/validator_result_minimal3.png

The meaning of the provided controls and displayed information for both input and result pages is identical to the complete user interface (see Validation via user interface).

Validation via SOAP web service API

The validator’s SOAP API is available under the /api path. The exact path depends on how this is deployed (path to WSDL provided):

Note that the api and order path elements are intentionally inverted in the Docker case. This inconsistency is due to a technical restriction that however doesn’t apply when running on the test bed.

The SOAP API used is the GITB validation service API, meaning that the validator is a GITB-compliant validation service. The importance of this is that apart from using it directly, this SOAP API allows integration of the validator in more complex conformance testing scenarios as a validation step in GITB TDL test cases. This potential is covered further in Step 7: Use the validator in GITB TDL test cases.

The operations supported are as follows:

  • getModuleDefinition: Called to return information on how to call the service (i.e. what inputs are expected).

  • validate: Called to trigger validation for provided content.

You can download this SOAP UI project that includes sample calls of these operations (make sure to change the service URL to match your setup).

Regarding the getModuleDefinition operation, a request of:

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:v1="http://www.gitb.com/vs/v1/">
   <soapenv:Header/>
   <soapenv:Body>
      <v1:GetModuleDefinitionRequest/>
   </soapenv:Body>
</soapenv:Envelope>

Will produce a response as follows:

<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
   <soap:Body>
      <ns4:GetModuleDefinitionResponse xmlns:ns2="http://www.gitb.com/core/v1/" xmlns:ns3="http://www.gitb.com/tr/v1/" xmlns:ns4="http://www.gitb.com/vs/v1/">
         <module operation="V" id="ValidatorService">
            <ns2:metadata>
               <ns2:name>ValidatorService</ns2:name>
               <ns2:version>1.0.0</ns2:version>
            </ns2:metadata>
            <ns2:inputs>
               <ns2:param type="string" name="type" use="R" kind="SIMPLE"/>
               <ns2:param type="object" name="xml" use="R" kind="SIMPLE"/>
               <ns2:param type="string" name="embeddingMethod" use="O" kind="SIMPLE"/>
               <ns2:param type="list[map]" name="externalSchema" use="O" kind="SIMPLE"/>
               <ns2:param type="list[map]" name="externalSchematron" use="O" kind="SIMPLE"/>               
            </ns2:inputs>
         </module>
      </ns4:GetModuleDefinitionResponse>
   </soap:Body>
</soap:Envelope>

This response can be customised through configuration properties in config.properties to provide descriptions specific to your setup. For example, extending config.properties with the following:

...
validator.webServiceId = PurchaseOrderValidator
validator.webServiceDescription.xml = The purchase order XML file to validate
validator.webServiceDescription.type = The type of validation to perform ('basic' or 'large')

Will produce a response as follows:

<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
    <soap:Body>
        <ns4:GetModuleDefinitionResponse xmlns:ns2="http://www.gitb.com/core/v1/" xmlns:ns3="http://www.gitb.com/tr/v1/" xmlns:ns4="http://www.gitb.com/vs/v1/">
            <module operation="V" id="InvoiceValidationService">
                <ns2:metadata>
                    <ns2:name>PurchaseOrderValidator</ns2:name>
                    <ns2:version>1.0.0</ns2:version>
                </ns2:metadata>
                <ns2:inputs>
                    <ns2:param type="object" name="xml" use="R" kind="SIMPLE" description="The purchase order XML file to validate"/>
                    <ns2:param type="string" name="type" use="R" kind="SIMPLE" description = "The type of validation to perform ('basic' or 'large')"/>
                    <ns2:param type="string" name="embeddingMethod" use="O" kind="SIMPLE"/>
                    <ns2:param type="list[map]" name="externalSchema" use="O" kind="SIMPLE"/>
                    <ns2:param type="list[map]" name="externalSchematron" use="O" kind="SIMPLE"/>
                </ns2:inputs>
            </module>
        </ns4:GetModuleDefinitionResponse>
    </soap:Body>
</soap:Envelope>

Running the validation itself is done through the validate operation. This expects the following inputs:

Input

Description

Required?

Type

Default value

xml

The XML content to validate.

Yes

A string that is interpreted based on the embeddingMethod input.

embeddingMethod

The way in which to interpret the xml value.

Yes, but should be skipped in favour of the embeddingMethod attribute - see below.

One of BASE64 (for content embedded as a BASE64 encoded string), URI (for a reference to remote content) or STRING (for content embedded as-is).

type

The type of validation to perform.

Yes, unless a single validation type is defined.

String

The single configured validation type (if defined).

locationAsPath

Whether the report items linked to Schematron validation should be an XPath expression (if true) or the relevant line and column number (if false).

No

boolean

false

addInputToReport

Whether the validated input should be included in the produced response as the report’s context.

No

boolean

true

externalSchema

A list of user-provided XML Schemas to be considered with any predefined ones. These are accepted only if explicitly allowed in the configuration for the validation type in question.

No

A list of map entries (see below for content).

externalSchematron

A list of user-provided Schematron extensions to be considered with any predefined ones. These are accepted only if explicitly allowed in the configuration for the validation type in question.

No

A list of map entries (see below for content).

locale

Locale (language code) to use for reporting of results. If the provided locale is not supported by the validator the default locale will be used instead (e.g. “fr”, “fr_FR”). See Supporting multiple languages for details.

No

String

Note

Configuration for increased throughput: If you expect to be validating large XML files and/or with high frequency it would be advised to call the validator setting locationAsPath to true as determining the position of each finding is costly. In addition, you can set addInputToReport to false to avoid including the validated content (which could be large) in the resulting report’s context.

Regarding the externalSchema and externalSchematron inputs, each item is treated as a map with three named properties:

Input

Description

Required?

Type

content

The content to consider.

Yes

A string that is interpreted based on the embeddingMethod input.

embeddingMethod

The way in which to interpret the content input value.

No

One of BASE64 (for content embedded as a BASE64 encoded string), URI (for a reference to remote content) or STRING (the default - for content embedded as-is).

type

The specific type of artifact.

No

A string with value xsd (as default) for a XML Schema, or sch (default) or xsl for Schematron.

As an alternative to the embeddingMethod inputs, the GITB SOAP API foresees also the embeddingMethod attribute that is defined on each input element. The values it supports are:

Value

Description

STRING

The value is interpreted as-is as an embedded text.

BASE64

The value is interpreted as an embedded BASE64 string that will need to be decoded before processing.

URI

The value is interpreted as a remote URI reference that will be looked up before processing.

Note

embeddingMethod values: The two approaches to provide the embeddingMethod value (as an input or an attribute) is due to a known issue in the GITB software where not all embedding methods can be leveraged within test cases (see Step 7: Use the validator in GITB TDL test cases).

The sample SOAP UI project includes sample requests per case. As an example, validating via URI would be done using the following envelope:

<soapenv:Envelope xmlns:soapenv="http://schemas.xmlsoap.org/soap/envelope/" xmlns:v1="http://www.gitb.com/vs/v1/" xmlns:v11="http://www.gitb.com/core/v1/">
   <soapenv:Header/>
   <soapenv:Body>
      <v1:ValidateRequest>
         <sessionId>?</sessionId>
         <input name="xml" embeddingMethod="URI">
            <v11:value>https://www.itb.ec.europa.eu/files/samples/po-sample-invalid.xml</v11:value>
         </input>
         <input name="type" embeddingMethod="STRING">
            <v11:value>large</v11:value>
         </input>
<!-- 
   Sample illustrating use of a user-provided Schematron

	      <input name="externalSchematron">
        	  <v11:item>
	           <v11:item name="content" embeddingMethod="URI">
	        	    <v11:value>http://a.server/rules.xml</v11:value>
	        	  </v11:item>
	        	  <v11:item name="type" embeddingMethod="STRING">
	        	    <v11:value>sch</v11:value>
	        	  </v11:item>
        	  </v11:item>
         </input>
-->        
      </v1:ValidateRequest>
   </soapenv:Body>
</soapenv:Envelope>

With the resulting report provided as follows:

<soap:Envelope xmlns:soap="http://schemas.xmlsoap.org/soap/envelope/">
   <soap:Body>
      <ns4:ValidationResponse xmlns:ns2="http://www.gitb.com/core/v1/" xmlns:ns3="http://www.gitb.com/tr/v1/" xmlns:ns4="http://www.gitb.com/vs/v1/">
         <report>
            <ns3:date>2019-06-05T15:08:21.537Z</ns3:date>
            <ns3:result>FAILURE</ns3:result>
            <ns3:counters>
               <ns3:nrOfAssertions>0</ns3:nrOfAssertions>
               <ns3:nrOfErrors>1</ns3:nrOfErrors>
               <ns3:nrOfWarnings>0</ns3:nrOfWarnings>
            </ns3:counters>
            <ns3:context>
               <ns2:item name="xml" embeddingMethod="STRING">
                  <ns2:value><![CDATA[<?xml version="1.0"?>
<purchaseOrder xmlns="http://itb.ec.europa.eu/sample/po.xsd" orderDate="2018-01-22">
  <shipTo country="BE">
    <name>John Doe</name>
    <street>Europa Avenue 123</street>
    <city>Brussels</city>
    <zip>1000</zip>
  </shipTo>
  <billTo country="BE">
    <name>Jane Doe</name>
    <street>Europa Avenue 210</street>
    <city>Brussels</city>
    <zip>1000</zip>
  </billTo>
  <comment>Send in one package please</comment>
  <items>
    <item partNum="XYZ-123876">
      <productName>Mouse</productName>
      <quantity>5</quantity>
      <priceEUR>15.99</priceEUR>
      <comment>Confirm this is wireless</comment>
    </item>
    <item partNum="ABC-32478">
      <productName>Keyboard</productName>
      <quantity>15</quantity>
      <priceEUR>25.50</priceEUR>
    </item>
  </items>
</purchaseOrder>]]></ns2:value>
               </ns2:item>
            </ns3:context>
            <ns3:reports>
               <ns3:error xsi:type="ns3:BAR" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
                  <ns3:description>[PO-01] The quantities of items for large orders must be greater than 10.</ns3:description>
                  <ns3:location>xml:17:0</ns3:location>
                  <ns3:test>number(po:quantity) > 10</ns3:test>
               </ns3:error>
            </ns3:reports>
         </report>
      </ns4:ValidationResponse>
   </soap:Body>
</soap:Envelope>

The returned report uses the GITB TRL syntax and is the same as the XML report you can download from the user interface (see Validation via user interface). It includes:

  • The validation timestamp (in UTC).

  • The overall result (SUCCESS or FAILURE).

  • The count of errors, warnings and information messages.

  • The context for the validation (i.e. the XML that was validated).

  • The list of report items displaying per item its description, location in the validated content and performed test.

Validation via email

Validation via email takes place by configuring a mailbox for the validator (at the level of a validation domain) and validating any documents sent to it. The validator processes the XML attachments contained in an incoming message that are validated for a configured validation type based on how they are named. Specifically, the validation type is added as a prefix to the attached file name followed by a .. For example, an attachment named basic.myFile.xml will be validated against the basic validation type.

Note

Validation type names when email validation is enabled: The target validation type is determined by the part of the attachment’s name up to the first .. This means that an attachment named v1.2.myFile.xml will attempt validation for a v1 validation type rather than a v1.2 type. To avoid such issues, ensure your validation type identifiers don’t include . separators if you plan to enable validation via email (e.g. use v1_2 instead of v1.2).

Once validation is completed for all attachments, the results are returned as a reply addressed to the received email’s sender. This email includes for each validated document the XML and PDF version of the validation report as attachments. These are complemented by a summary message in the email’s body such as:

Validation report for [basic.myFile.xml]:
- Date: 14/06/2019 12:43:20
- Result: FAILURE
- Errors: 5
- Warnings: 2
- Messages: 0
- Detailed report in: XML [basic.myFile.report.xml] and PDF [basic.myFile.report.pdf]

Note

Enabling email validation: Validation via email is by default disabled. To enable it you will need to ensure availability of a mail server and mailbox for which you will need to configure its inbound (SMTP) and outbound (IMAP/POP) parameters. See Properties related to email for the properties to configure.

Validation via command-line tool

Note

Command-line tool availability: Command-line tools are supported only for validators hosted on test bed resources and if generation of such a tool has been requested (see Step 5: Setup validator on test bed).

When a command line tool is set up for your validator it is available as an executable JAR file, packaged alongside a README file in a ZIP archive. This ZIP archive is downloadable from a URL of the form https://www.itb.ec.europa.eu/xml-offline/DOMAIN/validator.zip, where DOMAIN is the name of the validator’s domain. Considering our purchase order example, the command-line validator would be available at https://www.itb.ec.europa.eu/xml-offline/order/validator.zip.

To use it you need to:

  1. Ensure you have Java running on your workstation (minimum version 11).

  2. Download and extract the validator’s ZIP archive.

  3. Open a command prompt and change to the directory in which you extracted the JAR file.

  4. View the validator’s help message by issuing java -jar validator.jar

> java -jar validator.jar

Expected usage: java -jar validator.jar -input FILE_OR_URI_1 ... [-input FILE_OR_URI_N] [-noreports] [-xsd SCHEMA_FILE_OR_URI] [-sch SCHEMATRON_FILE_OR_URI_1] ... [-sch SCHEMATRON_FILE_OR_URI_N]  [-locale LOCALE]
Where:
    - FILE_OR_URI_X is the full file path or URI to the content to validate.
    - SCHEMA_FILE_OR_URI is the full file path or URI to a schema for the validation.
    - SCHEMATRON_FILE_OR_URI_X is the full file path or URI to a schematron file for the validation.
    - LOCALE is the language code to consider for reporting of results. If the provided locale is not supported by the validator the default locale will be used instead (e.g. 'fr', 'fr_FR').

The summary of each validation will be printed and the detailed reports produced in the current directory (as "report.X.xml" and "report.X.pdf").

Running the validator will produce a summary output on the command console as well as the detailed validation report(s) (unless flag -noreports has been specified). To resolve potential problems during execution, an output log is also generated with a detailed log trace.

> java -jar validator.jar -input sample.xml -xsd PurchaseOrder.xsd -sch LargePurchaseOrder.sch

Validating 1 of 1 ... Done.

Validation report summary [sample.xml]:
- Date: 2020-12-07T17:53:03.396+01:00
- Result: FAILURE
- Errors: 1
- Warnings: 0
- Messages: 0
- Detailed reports in [D:\tmp\report.0.xml] and [D:\tmp\report.0.pdf]

Note

Offline validator use and remote files: Depending on the validator’s configuration, one or more of its XSD or Schematron files may be defined as URIs (see Step 3: Prepare validator configuration). In addition, you may also provide the content to validate as a reference to a remote file. In these cases the workstation running the validator would need access to the remote resources. Any proxy settings applicable for the workstation will automatically be used for the connections.

Step 7: Use the validator in GITB TDL test cases

As a next step over the standalone XML validator you may consider using it from within GITB TDL test cases running in the test bed. You would typically do this for the following reasons:

  • You want to control access to the validation service based on user accounts.

  • You prefer to record all data linked to validations (for e.g. subsequent inspection).

  • You want to build complete conformance testing scenarios that are either focused on the validator or that use it as part of validation steps.

As described in Validation via SOAP web service API, the standalone XML validator offers by default a SOAP API for machine-to-machine integration that realises the GITB validation service specification. In short this means that the service can be easily included in any GITB TDL test case as the handler of a verify step. This is done by supplying as the handler value the full URL to the service’s WSDL, as illustrated in the following example that requests the user to upload the file to validate:

<?xml version="1.0" encoding="UTF-8"?>
<testcase id="testCase1_upload" xmlns="http://www.gitb.com/tdl/v1/" xmlns:gitb="http://www.gitb.com/core/v1/">
    <metadata>
        <gitb:name>testCase1_upload</gitb:name>
        <gitb:version>1.0</gitb:version>
        <gitb:description>Test case that allows the developer of an EU retailer system to upload a purchase order for validation.</gitb:description>
    </metadata>
    <variables>
        <var name="purchaseOrderToValidate" type="binary"/>
    </variables>
    <actors>
        <gitb:actor id="Retailer" name="Retailer" role="SUT"/>
    </actors>
    <steps>
        <interact desc="Upload content">
            <request desc="Purchase order to validate:">$purchaseOrderToValidate</request>
        </interact>
        <verify handler="https://www.itb.ec.europa.eu/order/api/validation?wsdl" desc="Validate purchase order">
            <input name="xml">$purchaseOrderToValidate</input>
            <input name="type">"basic"</input>
        </verify>
    </steps>
</testcase>

Notice in this example how the xml and type inputs are provided as input elements to the verify step. We included the type input because we define two validation types (basic and large) but this could be omitted if only a single validation type is supported. In addition, note that although you can define the service’s WSDL URL directly, a better approach to improve portability is to define this in the test bed’s domain configuration as a domain parameter. Defining a validationService parameter in the domain you could thus redefine the verify step as:

...
<verify handler="$DOMAIN{validationService}" desc="Validate purchase order">
    ...
</verify>
...

Using an external service for XML validation is not the only way to support XML validation from within a test case. The test bed actually offers numerous embedded validators to cover most XML validation needs:

  • The XSDValidator to check syntax against an XSD.

  • The SchematronValidator to validate content against business rules.

  • The XPathValidator to validate content through XPath expressions (resulting in true or false values).

  • The XmlMatchValidator to check an XML document against a given template (potentially skipping certain elements or complete sections).

Given these built-in validators you may wonder why use an external service to begin with. The following are reasons you may choose to do so:

  • To allow data validation also as a separate public, anonymous and stateless service. If you need to define such a service you might as well reuse it within test cases.

  • To simplify management of validation artefacts. Artefacts used in test cases such as XSDs and Schematrons need to either be included in each test suite or referred to remotely through URIs. Given multiple test suites and artefacts it is likely simpler to bundle resources once in a separate validation service where you can have a single update reflected automatically across all relevant test cases.

  • To aggregate reporting. If your needs include validation against multiple independent XSDs and Schematrons, each relevant verify step will produce a separate report. It may be preferable in such a case to always have a single aggregated report produced by the external service regardless of how underlying artefacts are organised.

Summary

Congratulations! You have just setup a validation service for your XML specification. In doing so you considered your needs and defined your service through configuration on the DIGIT test bed or as a Docker container. In addition, you used this service via its different APIs and considered how this could be used as part of complete conformance testing scenarios.

See also

In Step 7: Use the validator in GITB TDL test cases we briefly touched upon using the test bed for complete conformance testing scenarios. If this interests you, several additional guides are available that can provide you with further information:

For the full information on GITB TDL test cases check out the GITB TDL documentation, the reference for all test step constructs as well as a source of numerous complete examples.

In case you need to consider validation of further content types, be aware that the test bed provides similar support for:

If you are planning on operating a validator on your own-premises for production use check the validator production installation guide for the steps to follow and guidelines to consider.

Finally, for more information on Docker and the commands used in this guide, check out the Docker online documentation.

References

This section contains additional references linked to this guide.

Validator configuration properties

The following sections list the configuration properties you can use to customise the operation of your validation service.

Domain-level configuration

The properties in this section are to be provided in the configuration property file (one per configured validation domain) you define as part of your validator configuration. The properties marked as being translatable (in the listed property Type) can also be defined in translation property files if the validator is configured to support multiple languages (see Supporting multiple languages).

Note

Property placeholders: Numerous configuration properties listed below are presented with placeholders. These are marked in uppercase and have the following meaning:

  • TYPE: The validation type to perform.

  • OPTION: An option for a given validation type (e.g. a specific version).

  • FULL_TYPE: For a validation type with options this is equals to TYPE.OPTION, otherwise it is the TYPE value itself.

  • N: A zero-based integer (used as a counter).

Property

Description

Type

Default value

validator.bannerHtml

Configurable HTML banner replacing the text title.

String (translatable)

validator.channels

Comma-separated list of validation channels to have enabled. Possible values are (form, email, webservice).

Comma-separated Strings

form, webservice

validator.completeTypeOptionLabel.TYPE.OPTION

Label to display for the full validation type and option combination (visible in the validator’s result screen).

String (translatable)

validator.defaultPlugins.N.class

Paired with the its corresponding jar property (see above), this defines the fully qualified names of the classes to be loaded as plugin implementations.

Comma-separated Strings

validator.defaultPlugins.N.jar

Configuration for custom plugins that can be used to extend the validation. This is a default plugin definition that applies to all validation types. This property is the relative path pointing to the all-in-one JAR file that contains the plugin implementation(s).

String

validator.externalSchematronFile.FULL_TYPE

Whether or not user-provided Schematron are allowed for the given validation type (added as a postfix). Possible values are (required, optional, none).

String

none

validator.externalSchematronFile.FULL_TYPE.preprocessor

The relative path to a XSLT file that will be used for Schematron pre-processing.

String

validator.externalSchematronFile.FULL_TYPE.preprocessor.output

The file extension for the file resulting from the Schematron pre-processing.

String

sch

validator.externalSchemaFile.FULL_TYPE

Whether or not user-provided XML Schemas are allowed for the given validation type (added as a postfix). Possible values are (required, optional, none).

String

none

validator.externalSchemaFile.FULL_TYPE.preprocessor

The relative path to a XSLT file that will be used for XSD pre-processing.

String

validator.externalSchemaFile.FULL_TYPE.preprocessor.output

The file extension for the file resulting from the XSD pre-processing.

String

xsd

validator.footerHtml

Configurable HTML banner for the footer.

String (translatable)

validator.importProperties

The path of a file declaring additional domain properties. If the application property validator.restrictResourcesToDomain is set to true or omitted, only relative paths of files under the domain root folder are accepted.

String

validator.includeTestDefinition

Whether test expressions (if available) should be included in the resulting reports.

Boolean

true

validator.locale.available

The list of locales (language codes) that will be available for the user to select. This is provided as a comma-separated set of String values, the order of which determines the order that they will be listed in the UI’s language selection control.

Comma-separated Strings

validator.locale.default

The default locale (language code) to consider for the validator. This can be provided alone or with a country variant (e.g. “en”, “en_US”, “en_GB”).

String

en

validator.locale.translations

The path to a folder (absolute or relative to the domain configuration file) that contains the translation property files for the validator’s supported languages.

String

validator.maximumReportsForDetailedOutput

The maximum number of report items for which a PDF validation report will be generated (no report will be preoduced if exceeded).

Integer

5000

validator.maximumReportsForXmlOutput

The maximum number of report items to include in the XML validation report.

Integer

50000

validator.optionLabel.OPTION

Label to display in the web form for an option across all validation types (added as a postfix of validator.optionLabel). Only displayed if there are options defined.

String (translatable)

validator.plugins.FULL_TYPE.N.class

Paired with the its corresponding jar property (see above), this defines the fully qualified names of the classes to be loaded as plugin implementations.

Comma-separated Strings

validator.plugins.FULL_TYPE.N.jar

Configuration for plugins that can be used to extend the validation specific to a given type (FULL_TYPE) and extending any default plugins (see above). This property is the relative path pointing to the all-in-one JAR file that contains the plugin implementation(s).

String

validator.reportsOrdered

Whether the report items are to be ordered (errors first, then warnings, then messages). Otherwise the items will appear based on where they were raised in the validated content.

Boolean

false

validator.schematronFile.FULL_TYPE

Comma-separated list of Schematron files loaded for a given validation type (added as a postfix). These can be files or folders.

Comma-separated Strings

validator.schematronFile.FULL_TYPE.preprocessor

The relative path to a XSLT file that will be used for Schematron pre-processing.

String

validator.schematronFile.FULL_TYPE.preprocessor.output

The file extension for the file resulting from the Schematron pre-processing.

String

sch

validator.schematronFile.FULL_TYPE.remote.N.preprocessor

The relative path to a XSLT file that will be used for Schematron pre-processing.

String

validator.schematronFile.FULL_TYPE.remote.N.preprocessor.output

The file extension for the file resulting from the Schematron pre-processing.

String

sch

validator.schematronFile.FULL_TYPE.remote.N.url

Reference for a remotely loaded Schematron file for a given validation type (added as the FULL_TYPE placeholder). One or more such entries can be defined by incrementing the zero-based N counter.

String

validator.schemaFile.FULL_TYPE

Comma-separated list of XSD files loaded for a given validation type (added as a postfix). These can be a files or folders.

Comma-separated Strings

validator.schemaFile.FULL_TYPE.preprocessor

The relative path to a XSLT file that will be used for XSD pre-processing.

String

validator.schemaFile.FULL_TYPE.preprocessor.output

The file extension for the file resulting from the XSD pre-processing.

String

xsd

validator.schemaFile.FULL_TYPE.remote.N.preprocessor

The relative path to a XSLT file that will be used for XSD pre-processing.

String

validator.schemaFile.FULL_TYPE.remote.N.preprocessor.output

The file extension for the file resulting from the XSD pre-processing.

String

xsd

validator.schemaFile.FULL_TYPE.remote.N.url

Reference for a remotely loaded XML Schema file for a given validation type (added as the FULL_TYPE placeholder). One or more such entries can be defined by incrementing the zero-based N counter.

String

validator.showAbout

Whether or not to show the about panel on the web UI.

Boolean

true

validator.supportMinimalUserInterface

Enable a minimal user interface useful for embedding in other UIs or portals (applies only if the form validation channel is enabled).

Boolean

false

validator.type

Comma-separated list of supported validation types. Values need to be reflected in the other properties’ TYPE and FULL_TYPE placeholders. FULL_TYPE equals TYPE.OPTION (in case type options are defined) or simply TYPE if there are no options (see also validator.typeOptions).

Comma-separated Strings

validator.typeLabel.TYPE

Label to display in the web form for a given validation type (added as a postfix of validator.typeLabel). Only displayed if there are multiple types.

String (translatable)

validator.typeOptions.TYPE

Comma-separated list of options defined for a given validation type (added as a postfix). Values need to be reflects in the other properties’ OPTION and FULL_TYPE placeholders. FULL_TYPE equals TYPE.OPTION (in case type options are defined) or simply TYPE if there are no options (see also validator.type).

Comma-separated Strings

validator.typeOptionLabel.TYPE.OPTION

Label to display for an option for a specific validation type.

String (translatable)

validator.webServiceDescription.addInputToReport

The description of the SOAP web service for element “addInputToReport”.

String

Whether the returned XML validation report should also include the validated input as context information.

validator.webServiceDescription.embeddingMethod

The description of the SOAP web service for element “embeddingMethod”.

String

The embedding method to consider for the ‘xml’ input (‘BASE64’, ‘URL’ or ‘STRING’).

validator.webServiceDescription.externalSchema

The description of the SOAP web service for element “externalSchema”.

String

A list of maps that defines external XSDs to consider in addition to any preconfigured ones.

validator.webServiceDescription.externalSchematron

The description of the SOAP web service for element “externalSchematron”.

String

A list of maps that defines external Schematrons to consider in addition to any preconfigured ones.

validator.webServiceDescription.locale

The description of the SOAP web service for element “locale”.

String

Locale (language code) to use for reporting of results. If the provided locale is not supported by the validator the default locale will be used instead (e.g. “fr”, “fr_FR”).

validator.webServiceDescription.locationAsPath

The description of the SOAP web service for element “locationAsPath”.

String

Whether error locations should be XPath expressions or resolve their line and column locations in the provided input.

validator.webServiceDescription.type

The description of the web service for element “type”. Only displayed if there are multiple types.

String

The type of validation to perform (if multiple types are supported).

validator.webServiceDescription.xml

The description of the web service for element “xml”.

String

The XML content to validate, provided as a string, BASE64 or a URI.

validator.webServiceId

The ID of the web service.

String

ValidatorService

Application-level configuration

These properties govern the validator’s application instance itself. They apply only when you are defining your own validator as a Docker image in which case they are supplied as environment variables (ENV directives in a Dockerfile). Note that apart from these properties any Spring Boot configuration property can also be supplied.

Note

Mandatory property: The only mandatory property that needs to be defined is validator.resourceRoot.

Property

Description

Type

Default value

logging.file.path

Path to a folder that will hold the validator’s log output.

String

/validator/logs

validator.acceptedMimeTypes

Accepted mime-types for input files.

Comma-separated Strings

application/xml, text/xml, text/plain

validator.acceptedSchemaMimeType

Accepted XML Schema files mime type.

Comma-separated Strings

application/xml, text/xml, text/plain

validator.acceptedSchematronExtensions

Accepted Schematron file extensions. All other files found in validator.schematronFile.FULL_TYPE (when folders are defined) are ignored.

Comma-separated Strings

xsl, xslt, sch

validator.acceptedSchematronMimeType

Accepted Schematron files mime type.

Comma-separated Strings

application/xslt+xml, application/xml, text/xml, text/plain

validator.acceptedZipMimeType

Accepted ZIP files mime type.

Comma-separated Strings

application/zip, application/octet-stream, application/x-zip-compressed, multipart/x-zip

validator.cleanupPollingRate

The rate at which the validator.reportFolder folder is polled for forced cleanup (in ms).

Integer

60000

validator.disablePreprocessingCache

Whether to disable caching for pre-processing XSLTs.

Boolean

false

validator.domain

The names of the domain subfolders to consider. By default all folders under validator.resourceRoot will be considered.

Comma-separated Strings

validator.domainName.DOMAIN

The name to display for a given domain folder (the folder name replacing the DOMAIN placeholder). This value will also be used in request paths.

String

The folder name is used.

validator.identifier

The validator’s identifier to be sent for usage statistics reporting.

String

xml

validator.inputFilePrefix

Prefix of input files in the report folder.

String

ITB-

validator.mailPollingRate

The rate at which the configured email addresses (if configured) are polled for received input files (in ms).

Integer

60000

validator.minimumCachedInputFileAge

Time to keep XML input files in milliseconds (600000 = 10 minutes).

Integer

600000

validator.reportFilePrefix

Prefix of report files in the report folder.

String

TAR-

validator.reportFolder

Path to a folder that contains temporary data and reports.

String

/validator/reports

validator.resourceRoot

The root folder under which domain subfolders will be loaded from.

String

validator.restrictResourcesToDomain

Whether local validation artefacts can be loaded from outside the domain root folder. If false, both relative and absolute paths are accepted.

Boolean

true

validator.webhook.ipheader

The HTTP header to use to retrieve the user’s IP address for usage statistics if the validator is behind a proxy. This property is only used to detect the user’s country, when such feature is enabled (see validator.webhook.statisticsEnableCountryDetection).

String

X-Real-IP

validator.webhook.statistics

The URL of the backend service that will collect usage statistics. This property is optional and, if (and only if) present, the validator will report usage statistics.

String

validator.webhook.statisticsCountryDetectionDbFile

The path to the .mmdb file with the geolocation database that is used to resolve the user’s country from an IP address. This database will only be used when the property validator.webhook.statisticsEnableCountryDetection takes the value true.

String

validator.webhook.statisticsEnableCountryDetection

Whether the usage statistics reporting service can detect users’ countries from their IP addresses. If false no information about the user’s country or IP address will be reported.

Boolean

false

validator.webhook.statisticsSecret

The validator client secret to be passed to the backend service for usage statistics reporting.

String

Change history