Meta-Programming in Extensible Documents

One source block to one file The document contains program listings that support the development of ideas. These are usually written in elements, siblings to paragraphs, and for Docbook, of type <programlisting>. The most important attribute, “language”, identifies the programming language. However, there is no attribute in Docbook that tells the tangling program where each piece of code should end up. This is why we introduce our first extension: the “mped:tangle-to” attribute. To tangle a document, an XSLT stylesheet is defined. It reads a Docbook document, and outputs a shell script that writes the correct pieces of code to the correct file names. The key template to do the task is .

Copy a specific programlisting to disk mkdir -p $(dirname "

<< "_MPED_EOF"

_MPED_EOF ]]>

This template starts by creating the directory where the file should go, then fills the file with the source code. XML has a precise behavior when it comes to whitespace preservation, but it’s not always the prettiest when we write it with whitespace output in mind. So, the code output is frequently not indented correctly, and has too many empty lines. To counter this effect, we use a code source formatter, a program that reads source code and indents it correctly. For XSLT, in , we can use xmllint from libxml2. The important thing about the formatter is that it should take its input from standard input and write the formatted code to standard output.

Use xmllint as a formatter for XML languages xmllint --format - ]]> Unfortunately, xmllint does not accept an XML processing instruction after the first line, so you will still need to put no whitespace between the start tag of programlisting and the text for XML listings. If no formatter applies, then we can resort to cat, see .

Use cat as the default formatter for any language cat ]]> We also need to specify how the source code is copied. It is very simple: copies the text verbatim.

Copy the source code as text

]]>

Tangling should never touch anything else. So, text should not be copied to output. This is why we disable text matching by default with .

Ignore text by default when tangling ]]>

Paste other listings in place Literate programming requires the author to be able to discuss bits of code in isolation, and then insert each bit into a larger bit. Mped provides this operation with a new tag, “mped:copy”. It has a “linkend” attribute that resolves to a program listing anywhere in the document. When copying source code, matching this element will insert the linked listing directly here. This is done in . More precisely, it looks if there is a single program listing that is directly under a figure with the given ID. This way, we can refer to listings as the figure they appear in, which makes cross-referencing easier.

Insert literally a listing in another when tangling There are no listing directly within a figure with ID '

There are multiple listings directly within a figure with ID '

]]>

Putting it all together The collection of all these templates gives a full stylesheet, in .

The full stylesheet for tangling

]]> ]]>

Clean program listings We need to replace the <mped:copy> tags within the program listing, with the title of the listing as a comment. See to put the comment in the XML language, and for a default language (do not put a comment).

Insert a call-out comment in XML <!--

--> ]]>

Do not insert a call-out if the language comment syntax is unknown ]]> Finally, all other elements must be copied as-is. This is why the catch-all template is used.

Copy everything else without modification

]]>

Putting it all together The collection of all these templates gives a full stylesheet, in .

The full stylesheet to apply mped markup ]]> ]]> Starting from this file, mped.xml, and the bootstrap tangling stylesheet, tangle-bootstrap.xsl, you would obtain a mped-less docbook file with .

How to convert this document