*Working Draft 10-Jul-1997*

- 5.1 Why two different kinds of tags?
- 5.2 Reasons to Mix Tags
- 5.3 Anticipating Macros for Combined Markup

In what follows, we describe how to mix presentation and content tags for the purpose of creating MathML objects.

The previous sections describe two levels of tags for markup in mathematical documents:

**Presentation
markup** captures mathematical *notation*.
It provides a notational expression with sufficient
structure for rendering in various contexts. For example,
presentation markup will allow an expression to be displayed
sensibly in a window, whether the window is wide or narrow,
or to be enunciated in a sensible way when spoken. It does
this by providing information such as grouping of expression
parts, classification of symbols, etc. Presentation markup
does *not* concern itself with the mathematical
meaning of text, but it *is* concerned with the way
an expression is presented to the eye or ear.

Employing presentation tags alone may limit the ability to render or re-use a MathML object in another context, especially for evaluation by external applications.

**Content markup** conveys mathematical *meaning*.
It provides a mathematical expression with sufficient
structure for the contents to be understood. For
example, with a sufficiently sophisticated renderer and
style-sheet mechanism, content markup might allow a user to
read mathematical documents using familiar notations. With
content markup, an expression has the same mathematical
value independently of where or how it appears. For
example, an expression could be cut from a web browser and
pasted into a mathematical software tool (such as future
versions of Axiom, Maple or Mathematica) with confidence
that correct values will be computed.

Employing content tags alone may limit the ability of the author to precisely control how an expression is rendered, or easily use non-standard notation.

It is important to emphasize that both content and
presentation tags are necessary in order to provide the full
expressive capability one would like in a mathematical
markup language. Often the same mathematical notation is
used to represent several completely different concepts. For
example, the notation *x ^{i}* may be intended
(in polynomial algebra) as the

Thus we see that the same notation may represent several mathematical ideas, and, conversely, that the same mathematical idea often has several notations. Thus, in order to provide authors with the ability to precisely control notational nuances while at the same time encoding meanings in a uniform way for renderers and other applications, both content and presentation markup are needed.

If it is important that the expression appear in exactly a certain way, then presentation markup should be used. If it is important that the expression have a certain meaning, then content markup should be used.

An analogy may be drawn with the non-mathematical aspects of HTML. At one level, writing

is akin to using MathML content tags, while writing<H2>Why two different kinds of tags?</H2>

is akin to using MathML presentation tags.<BR><B>Why two different kinds of tags?</B></BR>

However, this analogy is only partly accurate. In natural language, we encode our ideas by means of many thousands of words, filled with subtle shades of syntactic nuance which cannot be captured by a rigid, formalized system. In a typical prose document, there are relatively few purely notational constructs like paragraphs and headings, and they carry little of the semantic meaning of the document. Most of the meaning of the document is encoded in the words themselves.

In mathematics, by contrast, notational constructs are much more complex and express a great deal of the meaning to a reader. A typical math notation might be at roughly the same level as a "question construct" or a "dependent clause" construct in natural language. Indeed, one might argue that notational schema like fractions in mathematics correspond more closely to words in the amount of meaning they carry, than they do to notational devices like paragraphs; a fraction, like many words, has only a few of well-defined meanings, and it is usually clear from context which is intended. From this point of view, the MathML presentation elements represent a level of notational abstraction at least as great as that of ordinary HTML tags, while MathML content elements represent a level of encoding not possible in natural language, and certainly not present in ordinary HTML.

Of course, neither of these analogies captures the
essence of the interplay between notation and meaning in
mathematics. If we compare mathematics to written natural
language, the richness of the notational system has been
expanded, while the universe of discourse has been narrowed
to the point where they nearly coincide. Regardless of the
philosophical implications of such observations, it is clear
that part of the unique character of mathematics is that is
*possible* to communicate virtually without ambiguity.
Just as clearly, both presentational and content markup have
a role to play in facilitating such communication.

There are several reasons why an author may wish to use a combination of presentation markup and content markup

If an author is primarily presentation-oriented, it will still produce better, more accessible results to use some content markup, e.g.

<EXPR><MI>x</MI><POWER/><MN>2</MN></EXPR>

<MSUP><MI>x</MI><MN>2</MN></MSUP>In this example, it is a slightly shorter to use presentation markup, but more importantly, the meaning of the superscript is unambiguously encoded, which facilitates the proper voice rendering among other things.

If an author is primarily content-oriented, there will be some areas of discourse which do not have content tags pre-defined, so it will be necessary to combine notation with semantics in some circumstances. There may also be situations in which an author needs to supplement the default rendering for content markup with additional presentation markup. For example, to express a statement of linear algebra,

we use the presentation tags <MI>, <MO> and <MSUP> together with the content markup:,rank(u^{T}v) = 1

<E> <EXPR> <SEMANTICS semtype="OpenMath"> <MO fontweight="bold">rank</MO> <ST/> <OMSymbol CD="BasicLinAlg">matrix-rank</OMSymbol> </SEMANTICS> <APPLY/> <EXPR> <SEMANTICS semtype="OpenMath"> <MSUP><MI>u</MI><MI>T</MI></MSUP> <ST/> <OMApply> <OMSymbol CD="BasicLinAlg">matrix-transpose<OMSymbol> <MI>u</MI> </OMApply> </SEMANTICS> <TIMES/> <MI>v</MI> </EXPR> </EXPR> <EQ> <MN>1</MN> </E>

Although there is a MathML vector content element, it is necessary to
markup *u* and *v* as MIs, since the default rendering for
vectors is a parenthesized list. Similarly, we must use the
MO schema to produce the bold **rank** operator.

Including presentation with semantic annotation inside content markup greatly broadens the range of mathematical discourse in which the meaning of expressions can be given. This allows MathML expressions to interact with numeric and symbolic computation systems; theorem proving systems; special purpose scientific programs; interactive textbook applications designed to display and manipulate mathematical content; archiving of semantic mathematical information; and the electronic transmission of mathematical data.

As we see from the example above, introducing new mathematical content
as *combined* presentation-semantics pairs can be verbose.

Certainly one can imagine generating this kind of markup with software tools, but it is at the borderline of what might be deemed tolerable to do by hand.

This is one of the areas of the MathML standard which anticipates most strongly the use of macros. As stated earlier, the main priority for the HTML Math working group will to develop extension protocols for MathML which include the definition of a suitable macro mechanism. With such a mechanism, it would be possible, for example, to define

<rank> X </rank>and

<transpose> X </transpose>to expand, respectively, to

<EXPR> <SEMANTICS semtype="OpenMath"> <MO fontweight="bold">rank</MO> <ST/> <OMSymbol CD="BasicLinAlg">matrix-rank</OMSymbol> </SEMANTICS> <APPLY/> <MI>X</MI> </EXPR>and

<EXPR> <SEMANTICS semtype="OpenMath"> <MSUP>X<MI>T</MI></MSUP> <ST/> <OMApply> <OMSymbol CD="BasicLinAlg">matrix-transpose<OMSymbol> <MI>X</MI> </OMApply> </SEMANTICS> </EXPR>Similarly, we might define a <vect> macro to render vectors as we want. Then, the second example of the previous section becomes:

<E> <rank> <EXPR> <transpose><vect>u</vect></transpose> <TIMES/> <vect>v</vect> </EXPR> </rank> <EQ> <MN>1</MN> </E>From this example we see how the combination of presentation and content markup will become much simpler and effective to write by hand as time goes on.