seed-tei-transformations

XSL Transformations for processing TEI documents

MIT License

Stars
0
Committers
1

SEED TEI Transformations

This project is a collection of XSL stylesheets and packages for transforming TEI-XML documents to HTML and to LaTeX. It offers lower level building blocks for putting together sophisticated transformations, as well as it offers some example transformations put together from these building blocks. The XSLT package system makes the building blocks highly reusable.

  • support right-to-left as well as left-to-right scripts
  • critical apparatus
    • support all encoding methods (parallel segmentation, double
      end-point, location referenced)
    • support an arbitrary number of distinct critical apparatus
    • support a variety of ways of printing out (visualizing)
      • HTML: popups, or endnotes with backlinks, or linenumber-based
        referencing for short texts like poems, standalone apparatus to
        be scrolled with main text in sync
      • LaTeX; reledmac
    • i18n for scholar terms like "omisit"
  • editorial comments
    • support an arbitrary number of separate comment types
    • support the same ways of output as critical apparatus
  • named entities
    • output either as a type of comment or a type of critical apparatus
      or mixed into one of these
  • support TEI's @rendition and @rend
  • provide fully configurable internationalization support for HTML
    output by using i18next.js and for LaTeX output by translation
    files
  • security: written for running in a web service and with the specific
    security requirements of such a runtime environment in mind: Do not
    allow the caller to run arbitrary code by giving access to
    <xsl:evaluate>

See the documentation in the Wiki!

This is part of the SEED, which is an recursive acronym for SEED electronic editions, orif you dislike recursionSCDH electronic editions.

Getting started

Using the packages

For using an XSLT package, the XSLT processor has to be told where to find the package. The most convenient way to pass this information to a Saxon processor is through a Saxon configuration file. Such a file can link packages names and versions to source files and optionally compiled packages. saxon.xml has such a mapping for the packages defined in this directory.

The Saxon configuration file can be loaded from the commandline or in an Oxygen project.

Please note, that the package names cannot be mapped to locations through an XML catalog.

A Saxon configuration file also contains information about the edition (home, enterprise, professional) of the processor.

Commandline

By running the following command, you will get a current Saxon HE (home edition), wrapper scripts and a configuration file for this edition.

./mvnw package

After a successful build there is:

  1. a current Saxon HE and dependencies in target/lib/*
  2. wrapper scripts for running XSLT in target/bin/*
  3. saxon.he.xml, a Saxon configuration for the Home Edition, derived
    from saxon.xsml

Using all the packages is now as simple as running

target/bin/xslt.sh -config:saxon.he.xml -xsl:STYLESHEET -:s:SOURCE PARAMETER=VALUE

For example:

target/bin/xslt.sh -config:saxon.he.xml -xsl:xsl/html/prose.xsl -s:test/samples/Trawr-Gesang2.xml use-libhtml=true {http://scdh.wwu.de/transform/source#}mode=6

The commandline parameters are just passed through to the Saxon processor. Run target/bin/xslt.sh -? for help or have a look at the Saxon documentation.

The wrapper scripts have debugging turned on by default. Use Saxons -o:... option or the shell's 1> and 2> redirection to fork output from stdout and stderr.

Alternatively, you can go the hard way and not let Maven help you and use -lib to point to every package you want to use:

java -jar path-to-saxon.jar -lib:xsl/common/libentry2.xsl -lib:xsl/common/libapp2.xsl -lib:xsl/html/libapp2.xsl -lib:... -xsl:... -s:...

Oxygen

This whole project ist distributed as plugin which can be installed from the following URL:

https://scdh.zivgitlabpages.uni-muenster.de/tei-processing/seed-tei-transformations/descriptor.xml

There are no transformation scenarios in the distribution, because we distribute reusable resources only and not transformations with project-specific parameter values. However, the XSLT resources can be used to declare scenarios based on it. We suggest to define such scenarios either project-wide in the xpr-file or in a framework. In fact, keeping XSLT in a repository like this one and distributing it for different platforms is a key to using the same set everywhere.

Per scenario configuration

The Saxon config file can be declared on the basis of an XSL Transformation scenario. See Oxygen docs.

What to enter into the URL field?

${pluginDirURL(de.wwu.scdh.tei.seed-transformations)}/saxon.ee.xml

Please notice the ee in the file name: There are versions of the config file for all three editions of Saxon in the plugin package. ee is for the Enterprise Edition, which ships with oXygen. The default saxon.xml config file is also present, but it would restrict the transformation engine to the functions of the Home Edition.

Project-wide configuration

A project-wide config is very helpful for developing XSL transformations based on the packages. But please note, that it is not needed for defining, distributing and using scenarios.

  • See Oxygen docs (Use a configuration file ("-config")) how to set this up. This option can be set on a project-basis. The options dialog is accessible from the Options menu, menu item Preferences; then descend into XML, XSLT-Proc, XSLT, Saxon, Saxon-HE/PE/EE.

  • See tei-transform.xpr for an example for a project-wide configuration: The options named saxon.latest.config.file and saxon.latest.use.config.file do the thing.

Stylesheet parameters with fully qualified names

<transformationParameter>
  <field name="paramDescription">
	<paramDescriptor>
	  <field name="localName">
		<String>lb-start</String>
	  </field>
	  <field name="prefix">
		<null/>
	  </field>
	  <field name="namespace">
		<String>http://scdh.wwu.de/hsde/transform/hidefacs#</String>
	  </field>
	</paramDescriptor>
  </field>
  <field name="value">
	<String>''</String>
  </field>
  <field name="hasXPathValue">
	<Boolean>false</Boolean>
  </field>
  <field name="isStatic">
	<Boolean>false</Boolean>
  </field>
</transformationParameter>

XSpec

For testing a stylesheet that uses a package, put a configuration into the XSpec file, see XSpec issue 762. See any of the XSpec files in this repository.

SEED XML Transformer

Running ./mvnw clean package makes a distribution that can be deployed as transformation resources on a SEED XML Transformer RESTful web service. The bundled resources are in target/seed-tei-transformations-VERSION-seed-resources.tar.zip which can be passed into the Kybernetes deployment as a configMap.

The tar ball contains a yaml file that defines all resources available in the REST service. It is also in target/seed-config.yaml after running Maven. Its content is determined by the transformationSet using the ${seed-config-xsl.url} as stylesheet in pom.xml.

Conventions

There are many rules and conventions followed throughout this projects. For details see the contributing notes.

Contributing

See contributing notes!

License

MIT