RIF Framework for Logic Dialects

W3C Editor's Draft 09 June 2008

This version:: http://www.w3.org/2005/rules/wg/draft/ED-rif-fld-20080609/
Latest editor's draft:: http://www.w3.org/2005/rules/wg/draft/rif-fld/
Previous version:: http://www.w3.org/2005/rules/wg/draft/ED-rif-fld-20080518/ (color-coded diff)

Editors:: Harold Boley, National Research Council Canada; Michael Kifer, State University of New York at Stony Brook

Abstract

This document, developed by the Rule Interchange Format (RIF) Working Group, specifies the general framework for logic-based RIF dialects (RIF-FLD). The framework describes mechanisms for specifying the syntax and semantics of logic-based RIF dialects through a number of generic concepts such as signatures, symbol spaces, semantic structures, and so on. The actual dialects are required to specialize this framework to produce their syntaxes and semantics.

Status of this Document

May Be Superseded

This section describes the status of this document at the time of its publication. Other documents may supersede this document. A list of current W3C publications and the latest revision of this technical report can be found in the W3C technical reports index at http://www.w3.org/TR/.

This document is being published as one of a set of 6 documents:

Please Comment By 2008-06-13

The Rule Interchange Format (RIF) Working Group seeks public feedback on these Working Drafts. Please send your comments to public-rif-comments@w3.org (public archive). If possible, please offer specific changes to the text that would address your concern. You may also wish to check the Wiki Version of this document for internal-review comments and changes being drafted which may address your concerns.

No Endorsement

Publication as a Working Draft does not imply endorsement by the W3C Membership. This is a draft document and may be updated, replaced or obsoleted by other documents at any time. It is inappropriate to cite this document as other than work in progress.

Patents

This document was produced by a group operating under the 5 February 2004 W3C Patent Policy. W3C maintains a public list of any patent disclosures made in connection with the deliverables of the group; that page also includes instructions for disclosing a patent. An individual who has actual knowledge of a patent which the individual believes contains Essential Claim(s) must disclose the information in accordance with section 6 of the W3C Patent Policy.

1 Overview of RIF-FLD
2 Syntactic Framework
3 Semantic Framework
4 XML Serialization Framework
- 4.1 XML for the RIF-FLD Language
- 4.2 Translation Between the RIF-FLD Presentation and XML Syntaxes
5 Conformance of RIF Processors with RIF Dialects
6 References
- 6.1 Normative References
- 6.2 Informational References

1 Overview of RIF-FLD

The RIF Framework for Logic-based Dialects (RIF-FLD) is a formalism for specifying all logic-based dialects of RIF, including RIF-BLD. It is a logic in which both syntax and semantics are described through a number of mechanisms that are commonly used for various logic languages, but are rarely brought all together. Amalgamation of several different mechanisms is required because the framework must be broad enough to accommodate several different types of logic languages and because various advanced mechanisms are needed to facilitate translation into a common framework. RIF-FLD gives precise definitions to these mechanisms, but allows certain details to vary. The design of RIF envisages that future standard logic dialects will be based on RIF-FLD. Therefore, any logic dialect being developed to become a stardard should either be a specialization of FLD or justify its deviations from (or extensions to) FLD.

The framework described in this document is very general and captures most of the popular logic-based rule languages found in Databases, Logic Programming, and on the Semantic Web. However, it is anticipated that the needs of future dialects might stimulate further evolution of RIF-FLD. In particular, future extensions might include a logic rendering of actions as found in production and reactive rule languages.

This document is mostly intended for the designers of future RIF dialects. All logic-based RIF dialects are required to be derived from RIF-FLD by specialization, as explained in Sections Syntax of a RIF Dialect as a Specialization of RIF-FLD and Semantics of a RIF Dialect as a Specialization of RIF-FLD. In addition to specialization, to lower the barrier of entry for their intended audiences, a dialect designer may choose to specify the syntax and semantics in a direct, but equivalent, way, which does not require familiarity with RIF-FLD. For instance, the RIF Basic Logic Dialect is specified both by specialization from RIF-FLD and also directly, without relying on the framework. Thus, the reader who is interested in RIF-BLD only can proceed directly to that document.

RIF-FLD has the following main components:

Syntactic framework. This framework defines the mechanisms for specifying the formal presentation syntax of RIF logic dialects by specializing the presentation syntax of the framework. The presentation syntax is used in RIF to define the semantics of the dialects and to illustrate the main ideas with examples. This syntax is not intended to be a concrete syntax for the dialects; it leaves out details such as the delimiters of the various syntactic components, parenthesizing, precedence of operators, and the like. Since RIF is an interchange format, it uses XML as its concrete syntax.
Semantic framework. The semantic framework describes the mechanisms that are used for specifying the models of RIF logic-based dialects.
XML serialization framework. This framework defines the general principles that logic-based dialects are to use in specifying their concrete XML-based syntaxes. For each dialect, its concrete XML syntax is a derivative of the dialect's presentation syntax. It can be seen as a serialization of that syntax.

Syntactic framework. The syntactic framework defines six types of RIF terms:

Constants and variables. These terms are common to most logic languages.
Positional terms. These terms are commonly used in first-order logic. RIF-FLD defines positional terms in a slightly more general way in order to enable dialects with higher-order syntax, such as HiLog [CKW93].
Terms with named arguments. These are like positional terms except that each argument of a term is named and the order of the arguments is immaterial. Terms with named arguments generalize the notion of rows in relational tables, where column headings correspond to argument names.
Frames. A frame term represents an assertion about an object and its properties. These terms correspond to molecules of F-logic [KLW95]. There is syntactic similarity between terms with named arguments and frames, since object properties resemble named arguments. However, the semantics of these terms are different.
Classification. These terms are used to define the subclass and class membership relationships. Like frames, they are also borrowed from F-logic [KLW95].
Equality. These terms are used to equate other terms.
RIF dialects can choose to support all or some of the aforesaid categories of terms. The syntactic framework also defines the following mechanisms for specializing these terms:
Symbol spaces.
Symbol spaces partition the set of all non-logical symbols (symbols used as variables, individual constants, predicates, and functions) and each partition is then given its own semantics. A symbol space has an identifier and a lexical space, which defines the "shape" of the symbols in that symbol space. Some symbol spaces in RIF are used to identify Web entities and their lexical space consists of strings that syntactically look like internationalized resource identifiers [RFC-3987], or IRIs (e.g., rif:iri). Other symbol spaces are used to represent the data types used in RIF (for example, xsd:integer).
Signatures.
Signatures determine which terms and formulas are well-formed. It is a generalization of the notion of a sort in classical first-order logic [Enderton01]. Each nonlogical symbol (and some logical symbols, like =) has an associated signature. A signature defines, in a precise way, the syntactic contexts in which the symbol is allowed to occur.

For instance, the signature associated with a symbol p might allow p to appear in a term of the form f(p), but disallow it to occur in a term like p(a,b). The signature for f, on the other hand, might allow that symbol to appear in f(p) and f(p,q), but disallow f(p,q,r) and f(f). In this way, it is possible to control which symbols are used for predicates and which for functions, where variables can occur, and so on.

Semantic framework. This framework defines the notion of a semantic structure or interpretation (both terms are used in the literature [Enderton01, Mendelson97], but here we will mostly use the first). Semantic structures are used to interpret formulas and to define logical entailment. As with the syntax, this framework includes a number of mechanisms that RIF logic-based dialects can specialize to suit their needs. These mechanisms include:

Truth values. RIF-FLD is designed to accommodate dialects that support reasoning with inconsistent and uncertain information. Most of the logics that are designed to deal with these situations are multi-valued. Consequently, RIF-FLD postulates that there is a set of truth values, TV, which includes the values t (true) and f (false) and possibly others. For example, RIF Basic Logic Dialect is two-valued, but other dialects can have additional truth values.
Data types. Some symbol spaces that are part of the RIF syntactic framework have fixed interpretations. For instance, symbols in the symbol space xsd:string are always interpreted as sequences of unicode characters, and a ≠ b for any pair of distinct symbols. A symbol space whose symbols have a fixed interpretation in any semantic structure is called a data type.
Entailment. This notion is fundamental to logic-based dialects. Given a set of formulas (e.g., facts and rules) G, entailment determines which other formulas necessarily follow from G. Entailment is the main mechanism underlying query answering in databases, logic programming, and the various reasoning tasks in Description Logic.

A set of formulas G logically entails another formula g if for every semantic structure I in some set S, if G is true in I then g is also true in I. Almost all logics define entailment this way. The difference lies in which set S they use. For instance, logics that are based on the classical first-order predicate calculus, such as Description Logic, assume that S is the set of all semantic structures. In contrast, logic programming languages, which use default negation, assume that S contains only the so-called "minimal" Herbrand models of G and, furthermore, only the minimal models of a special kind. See [Shoham87] for a more detailed exposition of this subject.

XML serialization framework. This framework defines the general principles for mapping the presentation syntax of RIF-FLD to the concrete XML interchange format. This includes:

A specification of the XML syntax for RIF-FLD, including the associated XML Schema document.
A specification of a one-to-one mapping from the presentation syntax of RIF-FLD to its XML syntax. This mapping must map any well-formed group formula of RIF-FLD to an XML document that is valid with respect to the aforesaid XML Schema document.

This document is the latest draft of the RIF-FLD specification. Each RIF dialect that is derived from RIF-FLD will be described in its own document. The first of such dialects, RIF Basic Logic Dialect, is described in the document RIF-BLD.

2 Syntactic Framework

The next subsection explains how to derive the presentation syntax of a RIF dialect from the presentation syntax of the RIF framework. The actual syntax of the RIF framework is given in subsequent subsections.

2.1 Syntax of a RIF Dialect as a Specialization of RIF-FLD

The presentation syntax for a RIF dialect can be obtained from the general syntactic framework of RIF by specializing the following parameters, which are defined later in this document:

The alphabet of RIF-FLD can be restricted.

An assignment of signatures to each constant and variable symbol.

Signatures determine which terms in the dialect are well-formed and which are not.

The exact way signatures are assigned depends on the dialect. An assignment can be explicit or implicit (for instance, derived from the context in which each symbol is used).

The choice of the types of terms supported by the dialect.

The RIF logic framework introduces the following types of terms:

constant
variable
positional
with named arguments
equality
frame
class membership
subclass
external

A dialect might support all of these terms or just a subset. For instance, some dialects might not support terms with named arguments or frame terms.

The choice of symbol spaces supported by the dialect.

Symbol spaces determine the syntax of the constant symbols that are allowed in the dialect.

The choice of the formulas supported by the dialect.

RIF-FLD allows formulas of the following kind:
- Atomic
- Conjunction
- Disjunction
- Classical negation
- Default negation (as in logic programming)
- Rule (as in logic programming as opposed to the classical material implication)
- Quantification (universal and existential)
A dialect might support all of these formulas or it might impose various restrictions. For instance, the formulas allowed in the conclusion and/or premises of implications might be restricted (e.g., RIF-BLD essentially allows Horn rules only), certain types of quantification might be prohibited (e.g., RIF-BLD disallows existential quantification in the rule head), classical or default negation (or both) might not be allowed (as in RIF-BLD), etc. A subdialect of RIF-BLD might disallow equality formulas in the conclusions of the rules.

2.2 Alphabet

Definition (Alphabet). The alphabet of the presentation language of RIF-FLD consists of

a countably infinite set of constant symbols Const
a countably infinite set of variable symbols Var (disjoint from Const)
a countably infinite set of argument names ArgNames (disjoint from both Const and Var)
connective symbols And, Or, Naf, Neg, and :-
quantifiers Exists and Forall
the symbols =, #, ##, ->, External, Prefix, and Base
the symbols Document, Group, and Import
auxiliary symbols, such as "(", ")", "[", "]", and "^^".

The set of connective symbols, quantifiers, =, etc., is disjoint from Const and Var. Variables are written as Unicode strings preceded with the symbol "?". The argument names in ArgNames are written as Unicode strings that do not start with a "?". The syntax for constant symbols is given in Section Symbol Spaces.

The symbols =, #, and ## are used in formulas that define equality, class membership, and subclass relationships. The symbol -> is used in terms that have named arguments and in frame terms. The symbol External indicates that an atomic formula or a function term is defined externally (e.g., a builtin) and the symbols Prefix and Base are used in abridged representations of IRIs.

The symbol Document is used to define RIF documents, Import is an import directive, and the symbol Group is used to organize RIF-FLD formulas into collections. ☐

2.3 Symbol Spaces

Throughout this document, we will be using the following abbreviations:

xsd: stands for the XML Schema URI http://www.w3.org/2001/XMLSchema#
rdf: stands for http://www.w3.org/1999/02/22-rdf-syntax-ns#
pred: stands for http://www.w3.org/2007/rif-builtin-predicates#
rif: stands for the URI of RIF, http://www.w3.org/2007/rif#

These and other abbreviations will be used as prefixes in the compact URI notation [CURIE], a notation for succinct representation of IRIs. The precise meaning of this notation in RIF is defined in the document Data Types and Builtins.

The set of all constant symbols in a RIF dialect is partitioned into a number of subsets, called symbol spaces, which are used to represent XML Schema data types, data types defined in other W3C specifications, such as rdf:XMLLiteral, and to distinguish other sets of constants. All constant symbols have a syntax (and sometimes also semantics) imposed by the symbol space to which they belong.

Definition (Symbol space). A symbol space is a named subset of the set of all constants, Const. The semantic aspects of symbol spaces will be described in Section Semantic Framework. Each symbol in Const belongs to exactly one symbol space.

Each symbol space has an associated lexical space and a unique identifier. More precisely,

The lexical space of a symbol space is a non-empty set of Unicode character strings.
The identifier of a symbol space is a sequence of Unicode characters that form an absolute IRI.
Different symbol spaces cannot share the same identifier.

The identifiers for symbol spaces are not themselves constant symbols in RIF. ☐

To simplify the language, we will often use symbol space identifiers to refer to the actual symbol spaces (for instance, we may use "symbol space xsd:string" instead of "symbol space identified by xsd:string").

To refer to a constant in a particular RIF symbol space, we use the following presentation syntax:

     "literal"^^symspace

where literal is called the lexical part of the symbol, and symspace is an identifier of the symbol space. Here literal is a sequence of Unicode characters that must be an element in the lexical space of the symbol space symspace. For instance, "1.2"^^xsd:decimal and "1"^^xsd:decimal are legal symbols because 1.2 and 1 are members of the lexical space of the XML Schema data type xsd:decimal. On the other hand, "a+2"^^xsd:decimal is not a legal symbol, since a+2 is not part of the lexical space of xsd:decimal.

The set of all symbol spaces that partition Const is considered to be part of the logic language of RIF-FLD.

RIF requires that all dialects include the symbol spaces listed in Section Constants and Symbol Spaces of the document Data Types and Built-Ins. These symbol spaces include constants that belong to several important XML Schema data types, certain RDF data types, and constant symbols specific to RIF. The latter include the symbol spaces rif:iri and rif:local, which are used to represent internationalized resource identifiers (IRIs) and constant symbols that are not visible outside of the RIF document in which they occur, respectively. Rule sets that are exchanged through RIF can use additional symbol spaces.

The lexical spaces of the mandatory RIF symbol spaces are described in Section Constants and Symbol Spaces of the document Data Types and Built-Ins.

2.4 Terms

The most basic construct of a logic language is a term. RIF-FLD supports several kinds of terms: constants, variables, the regular positional terms, plus terms with named arguments, equality, classification terms, and frames. The word "term" will be used to refer to any kind of term.

Definition (Term). A term is a statement of one of the following forms:

Constants and variables. If t ∈ Const or t ∈ Var then t is a simple term.

Positional terms. If t and t₁, ..., t_n are terms then t(t₁ ... t_n) is a positional term.
Positional terms in RIF-FLD generalize the regular notion of a term used in first-order logic. For instance, the above definition allows variables everywhere, as in ?X(?Y ?Z(?V "12"^^xsd:integer)), where ?X, ?Y, ?Z, and ?V are variables. Even ?X("abc"^^xsd:string ?W)(?Y ?Z(?V "33"^^xsd:integer)) is a positional term (as in HiLog [CKW93]).
Terms with named arguments. A term with named arguments is of the form t(s₁->v₁ ... s_n->v_n), where t, v₁, ..., v_n are terms, and s₁, ..., s_n are (not necessarily distinct) symbols from the set ArgNames.
The term t here represents a predicate or a function; s₁, ..., s_n represent argument names; and v₁, ..., v_n represent argument values. Terms with named arguments are like regular positional terms except that the arguments are named and their order is immaterial. Note that a term with no arguments, like f(), is both positional and also with named arguments.

For instance, "person"^^xsd:string(name->?Y address->?Z), ?X("123"^^xsd:integer ?W)(arg->?Y arg2->?Z(?V)), and "Closure"^^rif:local(relation->"http://example.com/Flight"^^rif:iri)(from->?X to->?Y) are terms with named arguments. The second of these terms has a positional term ?X(abc,?W), which occurs in the position of a function, and the third term's function is represented by a named arguments term.
Equality terms. An equality term has the form t = s, where t and s are terms.
Classification terms. There are two kinds of classification terms: class membership terms (or just membership terms) and subclass terms.
- t#s is a membership term if t and s are terms.
- t##s is a subclass term if t and s are terms.
Classification terms are used to describe class hierarchies.
Frame terms. t[p₁->v₁ ... p_n->v_n] is a frame term (or simply a frame) if t, p₁, ..., p_n, v₁, ..., v_n, n ≥ 0, are terms.
Frame terms are used to describe properties of objects. As in the case of the terms with named arguments, the order of the properties p_i->v_i in a frame is immaterial.
Externally defined terms. If t is a constant, positional term, a term with named arguments, or a frame term then External(t) is an externally defined term.
Such terms are used for representing builtin functions and predicates as well as "procedurally attached" terms or predicates, which might exist in various rule-based systems, but are not specified by RIF. ☐

The above definition is very general. It makes no distinction between constant symbols that represent individuals, predicates, and function symbols. The same symbol can occur in multiple contexts at the same time. For instance, if p, a, and b are symbols then p(p(a) p(a p c)) is a term. Even variables and general terms are allowed to occur in the position of predicates and function symbols, so p(a)(?v(a c) p) is also a term. Externally defined data sources can also be interfaced with using very general mechanisms: not only predicates are allowed, but also frames. In this way, externally defined objects can be accessed using the more natural frame-based interface. For instance, External("http://example.com/acme"^^rif:iri["http://example.com/acme/president"^^rif:iri(?Year) -> ?Pres]) could be an interface provided to access an externally defined method "http://example.com/mycompany/president"^^rif:iri in an external object "http://example.com/acme"^^rif:iri.

Frame, classification, and other terms can be freely nested, as exemplified by p(?X q#r[p(1,2)->s](d->e f->g)). Some language environments, like FLORA-2 [FL2], OO jDREW [OOjD], NxBRE [NxBRE], and CycL [CycL] support fairly large (partially overlapping) subsets of RIF-FLD terms, but most languages support much smaller subsets. RIF dialects are expected to carve out the appropriate subsets of RIF-FLD terms, and the general form of the RIF logic framework allows a considerable degree of freedom.

Dialects can also restrict the contexts in which the various terms are allowed by using the mechanism of signatures. The RIF-FLD language associates a signature with each symbol (both constant and variable symbols) and uses signatures to define well-formed terms. Each RIF dialect is expected to select appropriate signatures for the symbols in its alphabet, and only the terms that are well-formed according to the selected signatures are allowed in that particular dialect.

2.5 Schemas for Externally Defined Terms

This section introduces the notion of external schemas, which serve as templates for externally defined terms. These schemas determine which externally defined terms are acceptable in a RIF dialect. Externally defined terms include RIF builtins, which are specified in the document Data Types and Builtins, but are more general. They are designed to accommodate the ideas of procedural attachments and querying of external data sources. Because of the need to accommodate many difference possibilities, the RIF logical framework supports a very general notion of an externally defined term. Such a term is not necessarily a function or a predicate -- it can be a frame, a classification term, and so on.

Definition (Schema for external term). An external schema is a statement of the form (?X₁ ... ?X_n; τ) where

τ is a term, as defined earlier, with the exception that it is not permitted to be a variable or an externally defined term.
?X₁ ... ?X_n is a list of all distinct variables that occur in τ

The names of the variables in an external schema are immaterial, but their order is important. For instance, (?X ?Y; ?X[foo->?Y]) and (?V ?W; ?V[foo->?W]) are considered to be the same schema, but (?X ?Y; ?X[foo->?Y]) and (?Y ?X; ?X[foo->?Y]) are viewed as different schemas.

A term t is an instance of an external schema (?X₁ ... ?X_n; τ) iff t can be obtained from τ by a simultaneous substitution ?X₁/s₁ ... ?X_n/s_n of the variables ?X₁ ... ?X_n with terms s₁ ... s_n, respectively. Some of the terms s_i can be variables themselves. For example, ?Z[foo->f(a ?P)] is an instance of (?X ?Y; ?X[foo->?Y]) by the substitution ?X/?Z ?Y/f(a ?P). ☐

Observe that a variable cannot be an instance of an external schema, since τ in the above definition cannot be a variable. It will be seen later that this implies that a term of the form External(?X) is not well-formed in RIF.

The intuition behind the notion of an external schema, such as (?X ?Y; ?X["foo"^^xsd:string->?Y]) or (?V; "pred:isTime"^^rif:iri(?V)), is that ?X["foo"^^xsd:string->?Y] or "pred:isTime"^^rif:iri(?V) are invocation patterns for querying external sources, and instances of those schemas correspond to concrete invocations. Thus, External("http://foo.bar.com"^^rif:iri["foo"^^xsd:string->"123"^^xsd:integer]) and External("pred:isTime"^^rif:iri("22:33:44"^^xsd:time) are examples of invocations of external terms -- one querying an external source and another invoking a builtin.

Definition (Coherent set of external schemas). A set of external schemas is coherent if there can be no term, t, that is an instance of two distinct schemas. ☐

The intuition behind this notion is to ensure that any use of an external term is associated with at most one external schema. This assumption is relied upon in the definition of the semantics of externally defined terms. Note that the coherence condition is easy to verify syntactically and that it implies that schemas like (?X ?Y; ?X[foo->?Y]) and (?Y ?X; ?X[foo->?Y]), which differ only in the order of their variables, cannot be in the same coherent set.

It important to understand that external schemas are not part of the language in RIF, since they do not appear anywhere in RIF statements. Instead, like signatures, which are defined below, they are best thought of as part of the grammar of the language. In particular, they will be used to determine which external terms, i.e., the terms of the form External(t) are well-formed.

2.6 Signatures

In this section we introduce the concept of a signature, which is a key mechanism that allows RIF-FLD to control the context in which the various symbols are allowed to occur. For instance, a symbol f with signature {(term term) => term, (term) => term} can occur in terms like f(a b), f(f(a b) a), f(f(a)), etc., if a and b have signature term. But f is not allowed to appear in the context f(a b a) because there is no =>-expression in the signature of f to support such a context.

The above example provides intuition behind the use of signatures in RIF-FLD. Much of the development, below, is inspired by [CK95]. It should be kept in mind that signatures are not part of the logic language in RIF, since they do not appear anywhere in RIF-FLD formulas. Instead they are part of the grammar: they are used to determine which sequences of tokens are in the language and which are not. The actual way by which signatures are assigned to the symbols of the language may vary from dialect to dialect. In some dialects (for example RIF-BLD), this assignment is derived from the context in which each symbol occurs and no separate language for signatures is used. Other dialects may choose to assign signatures explicitly. In that case, they would require a concrete language for signatures (which would be separate from the language for specifying the logic formulas of the dialect).

Definition (Signature name). Let SigNames be a non-empty, partially-ordered finite or countably infinite set of symbols, called signature names. Since signatures are not part of the logic language, their names do not have to be disjoint from Const, Var, and ArgNames. We require that this set includes at least the following signature names:

atomic -- used to represent the syntactic context where atomic formulas are allowed to appear.
= -- used for representing contexts where equality terms can appear.
# -- a signature name reserved for membership terms.
## -- a signature reserved for subclass terms.
-> -- a signature reserved for frame terms. ☐

Dialects may introduce additional signature names. For instance, RIF-BLD introduces one other signature name, individual. The partial order on SigNames is dialect-specific; it is used in the definition of well-formed terms below.

We use the symbol < to represent the partial order on SigNames. Informally, α < β means that terms with signature α can be used wherever terms with signature β are allowed. We will write α ≤ β if either α = β or α < β.

Definition (Signature). A signature is a statement of the form η{e₁, ..., e_n, ...} where η ∈ SigNames is the name of the signature and {e₁, ..., e_n, ...} is a countable set of arrow expressions. Such a set can thus be infinite, finite, or even empty. In RIF-BLD, signatures can have at most one arrow expression. Other dialects (such as HiLog [CKW93], for example) may require polymorphic symbols and thus allow signatures with more than one arrow expression in them.

An arrow expression is defined as follows:

If κ, κ₁, ..., κ_n ∈ SigNames, n≥0, are signature names then (κ₁ ... κ_n) ⇒ κ is a positional arrow expression.
For instance, () ⇒ term and (term) ⇒ term are positional arrow expressions, if term is a signature name.
If κ, κ₁, ..., κ_n ∈ SigNames, n≥0, are signature names and p₁, ..., p_n ∈ ArgNames are argument names then (p₁->κ₁ ... p_n->κ_n) => κ is an arrow expression with named arguments.
For instance, (arg1->term arg2->term) => term is an arrow signature expression with named arguments. The order of the arguments in arrow expressions with named arguments is immaterial, so any permutation of arguments yields the same expression. ☐

RIF dialects are always associated with sets of coherent signatures, defined next. The overall idea is that a coherent set of signatures must include all the predefined signatures (such as signatures for equality and classification terms) and the signatures included in a coherent set should not conflict with each other. For instance, two different signatures should not have identical names and if one signature is said to extend another then the arrow expressions of the supersignature should be included among the arrow expressions of the subsignature (a kind of an arrow expression "inheritance").

Definition (Coherent signature set). A set Σ of signatures is coherent iff

Σ contains the special signature atomic{ }, which represents the context of atomic formulas.
Σ contains the signature ={e₁, ..., e_n, ...} for the equality symbol.
All arrow expressions e_i here have the form (κ κ) ⇒ γ (the arguments in an equation must be compatible) and at least one of these expressions must have the form (κ κ) ⇒ atomic (i.e., equation terms are also atomic formulas). Dialects may further specialize this signature.
Σ contains the signature #{e₁, ..., e_n...}.
Here all arrow expressions e_i are binary (have two arguments) and at least one has the form (κ γ) ⇒ atomic. Dialects may further specialize this signature.
Σ contains the signature ##{e₁, ..., e_n...}.
Here all arrow expressions e_i have the form (κ κ) ⇒ γ (the arguments must be compatible) and at least one of these arrow expressions has the form (κ κ) ⇒ atomic. Dialects may further specialize this signature.
Σ contains the signature ->{e₁, ..., e_n...}.
Here all arrow expressions e_i are ternary (have three arguments) and at least one of them is of the form (κ₁ κ₂ κ₃) ⇒ atomic. Dialects may further specialize this signature.
Σ has at most one signature for any given signature name.
Whenever Σ contains a pair of signatures, ηA and κB, such that η<κ then B⊆A.
Here ηA denotes a signature with the name η and the associated set of arrow expressions A; similarly κB is a signature named κ with the set of expressions B. The requirement that B⊆A ensures that symbols that have signature η can be used wherever the symbols with signature κ are allowed. ☐

The requirement that coherent sets of signatures must include the signatures for =, #, ->, and so on is just a technicality needed to simplify the definitions. Some of these signatures may go "unused" in a dialect even though, technically speaking, they must be present in the signature set associated with that dialect. If a dialect disallows equality, classification terms, or frames in its syntax then the corresponding signatures will remain unused. Such restrictions can be imposed by specializing RIF-FLD -- see Section Syntax of a RIF Dialect as a Specialization of RIF-FLD.

An incoherent set of signatures would be one that includes signatures mysig{() ⇒ atomic} and mysig{atomic ⇒ atomic} because it has two different signatures with the same name. Likewise, if this set contains mysig₁{() ⇒ atomic} and mysig₂{atomic ⇒ atomic} and mysig₁ < mysig₁ then it is incoherent because the set of arrow expressions of mysig₁ does not contain the set of arrow expressions of mysig₂.

2.7 Presentation Language of a RIF Dialect

The presentation language of a RIF dialect is a set of all well-formed formulas, as defined in the next section. The language is determined by the following parameters (see Syntax of a RIF Dialect as a Specialization of RIF-FLD):

An alphabet.
A set of symbol spaces.
An assignment of signatures from a coherent set of signatures to the symbols in Var and Const:
Each variable symbol is associated with exactly one signature from a coherent set of signatures. A constant symbol can have one or more signatures, and different symbols can be associated with the same signature. (If variables were allowed to have multiple signatures then well-formed terms would not be closed under substitutions. For instance, a term like f(?X,?X) could be well-formed, but f(a,a) could be ill-formed.)
Restrictions on the classes of terms allowed in the language of the dialect.
Restrictions on the classes of formulas allowed in the language of the dialect.
A coherent set of external schemas.

We have already seen how the alphabet and the symbol spaces are used to define RIF terms. The next section shows how signatures and external schemas are used to further specialize this notion to define well-formed RIF-FLD terms.

2.8 Well-formed Terms and Formulas

Since signature names uniquely identify signatures in coherent signature sets, we will often refer to signatures simply by their names. For instance, if one of f's signatures is atomic{ }, we may simply say that symbol f has signature atomic.

Definition (Well-formed term).

A constant or variable symbol with signature η is a well-formed term with signature η.

A positional term t(t₁ ... t_n), 0≤n, is well-formed and has a signature σ iff

t is a well-formed term that has a signature that contains an arrow expression of the form (σ₁ ... σ_n) ⇒ σ; and
Each t_i is a well-formed term whose signature is γ_i, such that γ_i, ≤ σ_i.

As a special case, when n=0 we obtain that t( ) is a well-formed term with signature σ, if t's signature contains the arrow expression () ⇒ σ.

A term with named arguments t(p₁->t₁ ... p_n->t_n), 0≤n, is well-formed and has a signature σ iff

t is a well-formed term that has a signature that contains an arrow expression with named arguments of the form (p₁->σ₁ ... p_n->σ_n) ⇒ σ; and
Each t_i is a well-formed term whose signature is γ_i, such that γ_i ≤ σ_i.

As a special case, when n=0 we obtain that t( ) is a well-formed term with signature σ, if t's signature contains the arrow expression () ⇒ σ.

An equality term of the form t₁=t₂ is well-formed and has a signature κ iff

The signature = has an arrow expression (σ σ) ⇒ κ
t_i and t₂ are well-formed terms with signatures γ₁ and γ₂, respectively, such that γ_i ≤ σ, i=1,2.

A membership term of the form t₁#t₂ is well-formed and has a signature κ iff

The signature # has an arrow expression (σ₁ σ₂) ⇒ κ
t₁ and t₂ are well-formed terms with signatures γ₁ and γ₂, respectively, such that γ_i ≤ σ_i, i=1,2.

A subclass term of the form t₁##t₂ is well-formed and has a signature κ iff

The signature ## has an arrow expression (σ σ) ⇒ κ
t₁ and t₂ are well-formed terms with signatures γ₁ and γ₂, respectively, such that γ_i ≤ σ, i=1,2.

A frame term of the form t[s₁->v₁ ... s_n->v_n] is well-formed and has a signature κ iff

The signature -> has arrow expressions (σ σ₁₁ σ₁₂) ⇒ κ, ..., (σ σ_n1 σ_n2) ⇒ κ (these n expressions need not be distinct).
t, s_j, and v_j are well-formed terms with signatures γ, γ_j1, and γ_j2, respectively, such that γ ≤ σ and γ_ji ≤ σ_ji, where j=1,...,n and i=1,2.

An externally defined term, External(t), is well-formed and has signature κ iff

t is well-formed and has signature κ.

t is an instance of an external schema from a coherent set of external schemas of the language.
Note that, according to the definition of coherent sets of schemas, a term can be an instance of at most one external schema. ☐

Note that, like constant symbols, well-formed terms can have more than one signature. Also note that, according to the above definition, f() and f are distinct terms.

Definition (Well-formed formula). A well-formed term is also a well-formed atomic formula iff one of its signatures is atomic or is ≤ atomic. Note that equality, membership, subclass, and frame terms are atomic formulas, since atomic is one of their signatures.

More general formulas are constructed out of atomic formulas with the help of logical connectives. A well-formed formula is a statement that can have one of the following forms:

Atomic: If φ is a well-formed atomic formula then it is also a well-formed formula.
Conjunction: If φ₁, ..., φ_n, n ≥ 0, are well-formed formulas then so is And(φ₁ ... φ_n).
As a special case, And() is allowed and is treated as a tautology, i.e., a formula that is always true.
Disjunction: If φ₁, ..., φ_n, n ≥ 0, are well-formed formulas then so is Or(φ₁ ... φ_n).
When n=0, we get Or() as a special case; it is treated as a contradiction, i.e., a formula that is always false.
Classical negation: If φ is a well-formed formula then Neg φ is a well-formed formula.
Default negation: If φ is a well-formed formula then Naf φ is a well-formed formula.
Rule implication: If φ and ψ are well-formed formulas then φ :- ψ is a well-formed formula.
Quantification: If φ is a well-formed formula and ?V₁, ..., ?V_n are variables then the following formulas are well-formed:
- Exists ?V₁ ... ?V_n(φ) and
- Forall ?V₁ ... ?V_n(φ)
Group: If ρ₁, ..., ρ_n are well-formed formulas or group formulas (they can be mixed) then Group (ρ₁ ... ρ_n) is a well-formed group formula.
Group formulas are intended to represent sets of formulas. Note that some of the ρ_i's can be group formulas themselves, which means that groups can be nested.
Document: An expression of the form Document(directive₁ ... directive₁ Γ) is a well-formed document formula, if
- Γ is a well-formed group formula that makes the actual logical content of the document.
- directive₁, ..., directive₁ are directives. A directive can be an import directive, a prefix directive, or a base directive.
- - An import directive can have one of these two forms: Import(t) or Import(t p). Here t is an IRI constant and p is a term. The constant t indicates the address of another document to be imported and p is called the profile of import.
    RIF-FLD defines the semantics for the directive Import(t) only. The directive Import(t p) is reserved for RIF dialects, which can use it to import non-RIF logical entities, such as RDF data and OWL ontologies. The profile might specify what kind of entity is being imported and under what semantics (for instance, the various RDF entailment regimes).
  - A prefix directive has the form Prefix(p v), where p is an alphanumeric string that serves as the prefix name and v is a macro-expansion for p -- a string that forms an IRI.
    Prefix directives do not affect the semantics of RIF documents. Instead, they are used as shorthands to allow more concise representation of IRI constants. This mechanism is explained in the document Data Types and Built-ins, Section Constants and Symbol Spaces.
  - A base directive has the form Base(iri), where iri is a unicode string in the form of an IRI.
    Like prefix directives, base directives do not affect the semantics. They are used as syntactic shortcuts for expanding relative IRIs into full IRIs, as described in in Section Constants and Symbol Spaces of the document Data Types and Built-ins.
- All parts of the document formula -- the directives and the group formula -- are optional and can be omitted.

In cases (1) -- (7) above, φ, φ_i, and ψ cannot be group or document formulas. In case (8), ρ₁, ..., ρ_n cannot be document formulas. In other words, groups and documents cannot be nested inside regular formulas and documents cannot be nested inside groups. ☐

Example 1 (Signatures, well-formed terms and formulas).

We illustrate the above definitions with the following examples. In addition to atomic, let there be another signature, term{ }, which is intended here to represent the context of the arguments to positional terms or atomic formulas.

Consider the term p(p(a) p(a b c)). If p has the (polymorphic) signature mysig{(term)⇒term, (term term)⇒term, (term term term)⇒term} and a, b, c each has the signature term{ } then p(p(a) p(a b c)) is a well-formed term with signature term{ }. If instead p had the signature mysig2{(term term)⇒term, (term term term)⇒term} then p(p(a) p(a b c)) would not be a well-formed term since then p(a) would not be well-formed (in this case, p would have no arrow expression which allows p to take just one argument).

For a more complex example, let r have the signature mysig3{(term)⇒atomic, (atomic term)⇒term, (term term term)⇒term}. Then r(r(a) r(a b c)) is well-formed. The interesting twist here is that r(a) is an atomic formula that occurs as an argument to a function symbol. However, this is allowed by the arrow expression (atomic term)⇒ term, which is part of r's signature. If r's signature were mysig4{(term)⇒atomic, (atomic term)⇒atomic, (term term term)⇒term} instead, then r(r(a) r(a b c)) would be not only a well-formed term, but also a well-formed atomic formula.

An even more interesting example arises when the right-hand side of an arrow expression is something other than term or atomic. For instance, let John, Mary, NewYork, and Boston have signatures term{ }; flight and parent have signature h₂{(term term)⇒atomic}; and closure has signature hh₁{(h₂)⇒p₂}, where p₂ is the name of the signature p₂{(term term)⇒atomic}. Then flight(NewYork Boston), closure(flight)(NewYork Boston), parent(John Mary), and closure(parent)(John Mary) would be well-formed formulas. Such formulas are allowed in languages like HiLog [CKW93], which support predicate constructors like closure in the above example. ☐

2.9 Metadata in the Presentation Syntax

RIF-FLD allows every term to be optionally preceded by a metadata block of the form (* id φ *) where id is a rif:iri constant and φ is a RIF formula, which is not a document-formula. Both items inside the metadata block are optional. The id part represents the meta-identifier of the term to which the metadata block is attached and φ is the metadata itself. RIF-FLD does not impose any restrictions on φ apart from what is stated above. In particular, it may include variables, function symbols, rif:local constants, and so on.

Observe that there is certain syntactic ambiguity in the above definition. For instance, in (* id φ *) t[w -> v] the metadata block can be attributed to the term t</> or to the entire frame <tt>t[w -> v]. We do not make an attempt to resolve this ambiguity in the presentation syntax, since, as explained earlier, this syntax is not intended to be concrete. The concrete XML syntax of RIF does not have such ambiguities.

Example 2 (A RIF-FLD document with nested groups and metadata).

We illustrate formulas, including documents and groups, with the following complete example (with apologies to Shakespeare for the imperfect rendering of the intended meaning in logic). For better readability, we use the shortcut notation defined in the document Data Types and Builtins. The example also illustrates attachment of metadata.

 Document(
   Prefix(dc http://http://purl.org/dc/terms/)
   Prefix(ex http://example.org/ontology#)
   Prefix(hamlet http://www.shakespeare-literature.com/Hamlet/)
   
   (* hamlet:assertions hamlet:assertions[dc:title->"Hamlet" dc:creator->"Shakespeare"] *)
   Group(
      Exists ?X (And(?X # ex:RottenThing
                     ex:partof(?X <http://www.denmark.dk>)))
      Forall ?X (Or(hamlet:tobe(?X)  Naf hamlet:tobe(?X)))
      Forall ?X (And(Exists ?B (And(ex:has(?X ?B) ?B # ex:business))
                     Exists ?D (And(ex:has(?X ?D) ?D # ex:desire)))
                   :- ?X # ex:man)
      (* hamlet:facts *)
      Group(
         hamlet:Yorick # ex:poor
         hamlet:Hamlet # ex:prince
      )
   )
 )

Observe that the above set of formulas has a nested subset with its own metadata, hamlet:facts, which contains only a global IRI. ☐

2.10 EBNF Grammar for the Presentation Syntax of RIF-FLD

Up to now we have used mathematical English to specify the syntax of RIF-FLD. We will now specify it using the familiar EBNF notation. The following points about the EBNF notation should be kept in mind:

The syntax of RIF-FLD relies on the signature mechanism and is not context-free, so EBNF does not capture this syntax precisely. As a result, the EBNF grammar defines a strict superset of RIF-FLD (not all formulas that are derivable using the EBNF grammar are well-formed).

The EBNF syntax is not a concrete syntax: it does not address the details of how constants and variables are represented, and it is not sufficiently precise about the delimiters and escape symbols. Instead, white space is informally used as a delimiter, and white space is implied in productions that use Kleene star. For instance, TERM* is to be understood as TERM TERM ... TERM, where each ' ' abstracts from one or more blanks, tabs, newlines, etc. This is done intentionally since RIF's presentation syntax is used as a tool for specifying the semantics and for illustration of the main RIF concepts through examples.

RIF defines a concrete syntax only for exchanging rules, and that syntax is XML-based, obtained as a refinement and serialization of the EBNF syntax via the presentation-syntax-to-XML mapping for RIF-FLD.

In view of the above, the EBNF grammar can be viewed as just an intermediary between the mathematical English and the XML. However, it also gives a succinct overview of the syntax of RIF-FLD and as such can be useful for dialect designers and users alike.

  Document       ::= 'Document' '(' IRIMETA? Directive? Group? ')'
  Directive      ::= Import*
  Import         ::= 'Import' '(' IRI PROFILE? ')'
  Group          ::= 'Group' IRIMETA? '(' (FORMULA | Group)* ')'
  IRIMETA        ::= Frame
  FORMULA        ::= 'And' '(' FORMULA* ')' |
                     'Or' '(' FORMULA* ')' |
                     Implies |
                     'Exists' Var+ '(' FORMULA ')' |
                     'Forall' Var+ '(' FORMULA ')' |
                     'Neg' FORMULA |
                     'Naf' FORMULA |
                     FORM
  Implies        ::= FORMULA ':-' FORMULA
  FORM           ::= Var | ATOMIC | 'External' '(' ATOMIC ')'
  ATOMIC         ::= Const | Atom | Equal | Member | Subclass | Frame
  Atom           ::= UNITERM
  UNITERM        ::= TERM '(' (TERM* | (Name '->' TERM)*) ')'
  Equal          ::= TERM '=' TERM
  Member         ::= TERM '#' TERM
  Subclass       ::= TERM '##' TERM
  Frame          ::= TERM '[' (TERM '->' TERM)* ']'
  TERM           ::= Var | EXPRIC | 'External' '(' EXPRIC ')'
  EXPRIC         ::= Const | Expr | Equal | Member | Subclass | Frame
  Expr           ::= UNITERM
  Const          ::= '"' UNICODESTRING '"^^' SYMSPACE
  IRI            ::= UNICODESTRING
  PROFILE        ::= UNICODESTRING
  Name           ::= UNICODESTRING
  Var            ::= '?' UNICODESTRING
  SYMSPACE       ::= UNICODESTRING

3 Semantic Framework

Recall that the presentation syntax of RIF-BLD allows the use of macros, which are specified via the Prefix and Base directives. The semantics, below, is described using the full syntax, i.e., the description assumes that all macros have already been expanded as explained in Data Types and Builtins, Section Constants and Symbol Spaces.

3.1 Semantics of a RIF Dialect as a Specialization of RIF-FLD

The RIF-FLD semantic framework defines the notions of semantic structures and of models for RIF-FLD formulas. The semantics of a dialect is derived from these notions by specializing the following parameters.

The effect of the syntax.

The syntax of a dialect may limit the kinds of terms that are supported. For instance, if the dialect does not support frames or terms with named arguments then the parts of the semantic structures whose purpose is to interpret the unsupported types of terms become redundant.

Truth values.

The RIF-FLD semantic framework allows formulas to have truth values from an arbitrary partially ordered set of truth values, TV. A concrete dialect must select a concrete partially or totally ordered set of truth values.

Data types.

A data type is a symbol space whose symbols have a fixed interpretation in any semantic structure. RIF-FLD defines a set of core data types that each dialect is required to support, but its semantics does not limit support to just the core types. RIF dialects can introduce additional data types, and each dialect must define the exact set of data types that it supports.

Logical entailment.

Logical entailment in RIF-FLD is defined with respect to an unspecified set of intended models. A RIF dialect must define which models are considered to be intended. For instance, one dialect might specify that all models are intended (which leads to classical first-order entailment), another may consider only the minimal models as intended, while a third one might only use well-founded or stable models [GRS91, GL88].

These notions are defined in the remainder of this document.

3.2 Truth Values

Definition (Set of truth values). Each RIF dialect must define the set of truth values, denoted by TV. This set must have a partial order, called the truth order, denoted <_t. In some dialects, <_t can be a total order. We write a ≤_t b if either a <_t b or a and b are the same element of TV. In addition,

TV must be a complete lattice with respect to <_t, i.e., the least upper bound (lub_t) and the greatest lower bound (glb_t) must exist for any subset of TV.
TV is required to have two distinguished elements, f and t, such that f ≤_t elt and elt ≤_t t for every elt∈TV.
TV has an operator of negation, ~: TV → TV, such that
- ~ is idempotent, i.e., applying ~ twice gives the identity mapping.
- ~t = f (and thus ~f = t). ☐

RIF dialects can have additional truth values. For instance, the semantics of some versions of NAF, such as well-founded negation, requires three truth values: t, f, and u (undefined), where f <_t u <_t t. Handling of contradictions and uncertainty usually requires at least four truth values: t, u, f, and i (inconsistent). In this case, the truth order is partial: f <_t u <_t t and f <_t i <_t t.

3.3 Primitive Data Types

Definition (Primitive data type). A primitive data type (or just a data type, for short) is a symbol space that has

an associated set, called the value space, and
a mapping from the lexical space of the symbol space to the value space, called lexical-to-value-space mapping. ☐

Semantic structures are always defined with respect to a particular set of data types, denoted by DTS. In a concrete dialect, DTS always includes the data types supported by that dialect. All RIF dialects must support the primitive data types that are listed in Section Data Types of the document Data Types and Built-Ins. Their value spaces and the lexical-to-value-space mappings fot these data types are described in the same section.

Although the lexical and the value spaces might sometimes look similar, one should not confuse them. Lexical spaces define the syntax of the constant symbols in the RIF language. Value spaces define the meaning of the constants. The lexical and the value spaces are often not even isomorphic. For example, 1.2^^xsd:decimal and 1.20^^xsd:decimal are two legal -- and distinct -- constants in RIF because 1.2 and 1.20 belong to the lexical space of xsd:decimal. However, these two constants are interpreted by the same element of the value space of the xsd:decimal type. Therefore, 1.2^^xsd:decimal = 1.20^^xsd:decimal is a RIF tautology. Likewise, RIF semantics for data types implies certain inequalities. For instance, abc^^xsd:string ≠ abcd^^xsd:string is a tautology, since the lexical-to-value-space mapping of the xsd:string type maps these two constants into distinct elements in the value space of xsd:string.

3.4 Semantic Structures

The central step in specifying a model-theoretic semantics for a logic-based language is defining the notion of a semantic structure, also known as an interpretation. Semantic structures are used to assign truth values to RIF-FLD formulas.

Definition (Semantic structure). A semantic structure, I, is a tuple of the form <TV, DTS, D, I_C, I_V, I_F, I_frame, I_SF, I_sub, I_isa, I₌, I_external, I_truth>. Here D is a non-empty set of elements called the domain of I. We will continue to use Const to refer to the set of all constant symbols and Var to refer to the set of all variable symbols. TV denotes the set of truth values that the semantic structure uses and DTS is a set of identifiers for primitive data types.

The other components of I are total mappings defined as follows:

I_C maps Const to elements of D.
This mapping interprets constant symbols.
I_V maps Var to elements of D.
This mapping interprets variable symbols.
I_F maps D to functions D* → D (here D* is a set of all sequences of any finite length over the domain D)
This mapping interprets positional terms.
I_SF interprets terms with named arguments. It is a total mapping from D to the set of total functions of the form SetOfFiniteBags(ArgNames × D) → D.
This is analogous to the interpretation of positional terms with two differences:
- Each pair <s,v> ∈ ArgNames × D represents an argument/value pair instead of just a value in the case of a positional term.
- The argument to a term with named arguments is a finite bag of argument/value pairs rather than a finite ordered sequence of simple elements.
- Bags are used here because the order of the argument/value pairs in a term with named arguments is immaterial and the pairs may repeat: p(a->b a->b).
  To see why such repetition can occur, note that argument names may repeat: p(a->b a->c). This can be understood as treating a as a set-valued argument. Identical argument/value pairs can then arise as a result of a substitution. For instance, p(a->?A a->?B) becomes p(a->b a->b) if the variables ?A and ?B are both instantiated with the symbol b.
I_frame is a total mapping from D to total functions of the form SetOfFiniteBags(D × D) → D.
This mapping interprets frame terms. An argument, d ∈ D, to I_frame represents an object and a finite bag {<a1,v1>, ..., <ak,vk>} represents a bag (multiset) of attribute-value pairs for d. We will see shortly how I_frame is used to determine the truth valuation of frame terms.

Bags are employed here because the order of the attribute/value pairs in a frame is immaterial and the pairs may repeat. For instance, o[a->b a->b]. Such repetitions arise naturally when variables are instantiated with constants. For instance, o[?A->?B ?C->?D] becomes o[a->b a->b] if variables ?A and ?C are instantiated with the symbol a and ?B, ?D with b.
I_sub gives meaning to the subclass relationship. It is a total function D × D → D.
The operator ## is required to be transitive, i.e., c1 ## c2 and c2 ## c3 must imply c1 ## c3. This is ensured by a restriction in Section Interpretation of Formulas.
I_isa gives meaning to class membership. It is a total function D × D → D.
The relationships # and ## are required to have the usual property that all members of a subclass are also members of the superclass, i.e., o # cl and cl ## scl must imply o # scl. This is ensured by a restriction in Section Interpretation of Formulas.
I₌ is a total function D × D → D.
It gives meaning to the equality operator.
I_truth is a total mapping D → TV.
It is used to define truth valuation for formulas.
I_external is a mapping from the coherent set of schemas for externally defined functions to total functions D* → D. For each external schema σ = (?X₁ ... ?X_n; τ) in the coherent set of such schemas associated with the language, I_external(σ) is a function of the form Dⁿ → D.
For every external schema, σ, associated with the language, I_external(σ) is assumed to be specified externally in some document (hence the name external schema). In particular, if σ is a schema of a RIF builtin predicate or function, I_external(σ) is specified in the document Data Types and Builtins so that:
- If σ is a schema of a builtin function then I_external(σ) must be the function defined in the aforesaid document.
- If σ is a schema of a builtin predicate then I_truth ο (I_external(σ)) (the composition of I_truth and I_external(σ), a truth-valued function) must be as specified in Data Types and Builtins.

For convenience, we also define the following mapping I :

I(k) = I_C(k), if k is a symbol in Const
I(?v) = I_V(?v), if ?v is a variable in Var
I(f(t₁ ... t_n)) = I_F(I(f))(I(t₁),...,I(t_n))
I(f(s₁->v₁ ... s_n->v_n)) = I_SF(I(f))({<s₁,I(v₁)>,...,<s_n,I(v_n)>})
Here we use {...} to denote a bag of argument/value pairs.
I(o[a₁->v₁ ... a_k->v_k]) = I_frame(I(o))({<I(a₁),I(v₁)>, ..., <I(a_n),I(v_n)>})
Here {...} denotes a bag of attribute/value pairs.
I(c1##c2) = I_sub(I(c1), I(c2))
I(o#c) = I_isa(I(o), I(c))
I(x=y) = I₌(I(x), I(y))
I(External(t)) = I_external(σ)(I(s₁), ..., I(s_n)), if t is an instance of the external schema σ = (?X₁ ... ?X_n; τ) by substitution ?X₁/s₁ ... ?X_n/s_n.
Note that, by definition, External(t) is well formed only if t is an instance of an external schema. Furthermore, by the definition of coherent sets of external schemas, t can be an instance of at most one such schema, so I(External(t)) is well-defined.

The effect of signatures. For every signature, sg, supported by a dialect, there is a subset D_sg ⊆ D, called the domain of the signature. Terms that have a given signature, sg, must be mapped by I to D_sg, and if a term has more than one signature it must be mapped into the intersection of the corresponding signature domains. To ensure this, the following is required:

If sg < sg' then D_sg⊆D_sg'.
If k is a constant that has signature sg then I_C(k) ∈ D_sg.
If ?v is a variable that has signature sg then I_V(?v) ∈ D_sg.
If sg has an arrow expression of the form (s1 ... sn)⇒s then, for every d∈D_sg, I_F(d) must map D_s1× ... ×D_sn to D_s.
If sg has an arrow expression of the form (p1->s1 ... pn->sn)⇒s then, for every d∈D_sg, I_SF(d) must map the set {<p1,D_s1>, ..., <pn,D_sn>} to D_s.
If the signature -> has arrow expressions (sg,s₁,r₁)⇒k, ..., (sg,s_n,r_n)⇒k, then, for every d∈D_sg, I_frame(d) must map {<D_s1,D_r1>, ..., <D_sn,D_rn>} to D_k.
If the signature # has an arrow expression (s r)⇒k then I_isa must map D_s×D_r to D_k.
If the signature ## has an arrow expression (s s)⇒k then I_sub must map D_s×D_s to D_k.
If the signature = has an arrow expression (s s)⇒k then I₌ must map D_s×D_s to D_k.

The effect of data types. The data types in DTS impose the following restrictions. If dt ∈ DTS is a symbol space identifier of a data type, let LS_dt denote the lexical space of dt, VS_dt denote its value space, and L_dt: LS_dt → VS_dt the lexical-to-value-space mapping. Then the following must hold:

VS_dt ⊆ D; and
For each constant lit^^dt such that lit ∈ LS_dt, I_C(lit^^dt) = L_dt(lit).

That is, I_C must map the constants of a data type dt in accordance with L_dt. ☐

RIF-FLD does not impose special requirements on I_C for constants in the symbol spaces that do not correspond to primitive datatypes in DTS. Dialects may have such requirements, however. An example of such a restriction could be a requirement that no constant in a particular symbol space (such as rif:local) can be mapped to VS_dt of a data type dt.

3.5 Metadata and Semantics

Observe that metadata blocks are ignored by all the mappings that constitue RIF-FLD semantic structures, so metadata has no effect on the formal semantics.

Note that although metadata associated with RIF-FLD formulas is ignored by the semantics, it can be extracted by XML tools. Since metadata is represented by frame terms, it can be reasoned with by the rules.

3.6 Interpretation of Non-document Formulas

This section defines how a semantic structure, I, determines the truth value TVal_I(φ) of a RIF-FLD formula, φ, where φ is any formula other than a document formula. Truth valuation of document formulas is defined in the next section.

To this end, we define a mapping, TVal_I, from the set of all non-document formulas to TV. Note that the definition implies that TVal_I(φ) is defined only if the set DTS of the data types of I includes all the data types mentioned in φ.

Definition (Truth valuation). Truth valuation for well-formed formulas in RIF-FLD is determined using the following function, denoted TVal_I:

Constants: TVal_I(k) = I_truth(I(k)), if k ∈ Const.

Variables: TVal_I(?v) = I_truth(I(?v)), if ?v ∈ Var.

Positional atomic formulas: TVal_I(r(t₁ ... t_n)) = I_truth(I(r(t₁ ... t_n))).

Atomic formulas with named arguments: TVal_I(p(s₁->v₁ ... s_k->v_k)) = I_truth(I(p(s₁-> v₁ ... s_k->v_k))).

Equality: TVal_I(x = y) = I_truth(I(x = y)).

To ensure that equality has precisely the expected properties, it is required that

I_truth(I(x = y)) = t if and only if I(x) = I(y) and that I_truth(I(x = y)) = f otherwise.

Subclass: TVal_I(sc ## cl) = I_truth(I(sc ## cl)).

To ensure that the operator ## is transitive, i.e., c1 ## c2 and c2 ## c3 imply c1 ## c3, the following is required: For all c1, c2, c3 ∈ D, glb_t(TVal_I(c1 ## c2), TVal_I(c2 ## c3)) ≤_t TVal_I(c1 ## c3).

Membership: TVal_I(o # cl) = I_truth(I(o # cl)).

To ensure that all members of a subclass are also members of the superclass, i.e., o # cl and cl ## scl implies o # scl, the following is required:

For all o, cl, scl ∈ D, glb_t(TVal_I(o # cl), TVal_I(cl ## scl)) ≤_t TVal_I(o # scl).

Frame: TVal_I(o[a₁->v₁ ... a_k->v_k]) = I_truth(I(o[a₁->v₁ ... a_k->v_k])).

Since the bag of attribute/value pairs represents the conjunction of all the pairs, the following is required:

TVal_I(o[a₁->v₁ ... a_k->v_k]) = glb_t(TVal_I(o[a₁->v₁]), ..., TVal_I(o[a_k->v_k])).

Externally defined atomic formula: TVal_I(External(t)) = I_truth(I_external(σ)(I(s₁), ..., I(s_n))), if t is an atomic formula that is an instance of the external schema σ = (?X₁ ... ?X_n; τ) by substitution ?X₁/s₁ ... ?X_n/s_n.
Note that, by definition, External(t) is well-formed only if t is an instance of an external schema. Furthermore, by the definition of coherent sets of external schemas, t can be an instance of at most one such schema, so I(External(t)) is well-defined.
Conjunction: TVal_I(And(c₁ ... c_n)) = glb_t(TVal_I(c₁), ..., TVal_I(c_n)).
The empty conjunction is treated as a tautology, so TVal_I(And()) = t.
Disjunction: TVal_I(Or(c₁ ... c_n)) = lub_t(TVal_I(c₁), ..., TVal_I(c_n)).
The empty disjunction is treated as a contradiction, so TVal_I(Or()) = f.
Negation: TVal_I(Neg φ) = ~TVal_I(φ) and TVal_I(Naf φ) = ~TVal_I(φ).
The symbol ~ here is the idempotent operator of negation on TV introduced in Section Truth Values. Note that both classical and default negation are interpreted the same way in any concrete semantic structure. The difference between the two kinds of negation comes into play when logical entailment is defined.
Quantification:
- TVal_I(Exists ?v₁ ... ?v_n (φ)) = lub_t(TVal_I*(φ)).
- TVal_I(Forall ?v₁ ... ?v_n (φ)) = glb_t(TVal_I*(φ)).
Here lub_t (respectively, glb_t) is taken over all interpretations I* of the form <TV, DTS, D, I_C, I*_V, I_F, I_frame, I_SF, I_sub, I_isa, I₌, I_external, I_truth>, which are exactly like I, except that the mapping I*_V, is used instead of I_V. I*_V is defined to coincide with I_V on all variables except, possibly, on ?v₁,... ,?v_n.
Rule implication:
- TVal_I(head :- body)=t, if TVal_I(head) ≥_t TVal_I(body).
- TVal_I(head :- body)=f otherwise.
Groups of formulas:
If Γ is a group formula of the form Group(ρ₁ ... ρ_n) then

TVal_I(Γ) = glb_t(TVal_I(ρ₁), ..., TVal_I(ρ_n)).

This means that a group of formulas is treated as a conjunction. ☐

Note that rule implications and equality formulas are always two-valued, even if TV has more than two values.

3.7 Interpretation of Documents

Document formulas are interpreted using semantic multi-structures.

Definition (Semantic multi-structures). A semantic multi-structure is a set {I^Δ₁, ..., I^Δ_n}, n>0, where I^Δ₁, ..., I^Δ_n are semantic structures labeled with document formulas. These structures must be identical in all respects except that the mappings I_C^Δ₁, ..., I_C^Δ_n might differ on the constants in Const that belong to the rif:local symbol space. The above set is allowed to have at most one semantic structure with the same label. ☐

Definition (Imported document). Let Δ be a document formula and Import(t) be one of its import directives, which references another document formula, Δ'. In this case, we say that Δ' is directly imported into Δ.

A document formula Δ' is said to be imported into Δ if it is either directly imported into Δ or it is imported (directly or not) into another formula, which is directly imported into Δ. ☐

With the help of semantic multi-structures we can now explain the semantics of RIF documents.

Definition (Truth valuation of document formulas). Let Δ be a document formula and let Δ₁, ..., Δ_k be all the RIF-FLD document formulas that are imported (directly or indirectly, according to the previous definition) into Δ. Let Γ, Γ₁, ..., Γ_k denote the respective group formulas associated with these documents. If any of these Γ_i is missing (which is a possibility, since every part of a document is optional), assume that it is a tautology, such as a = a, so that every TVal function maps such a Γ_i to the truth value t. Let I = {I^Δ, I^Δ₁, ..., I^Δ_k, ...} be a semantic multi-structure, which contains semantic structures labeled with at least the documents Δ, Δ₁, ..., Δ_k. Then we define:

TVal_I(Δ) = glb_t(TVal_I^Δ(Γ), TVal_I^Δ₁(Γ₁), ..., TVal_{I^Δ_k}(Γ_k)).

Note that this definition considers only those document formulas that are reachable via the one-argument import directives. Two argument import directives are ignored by RIF-FLD. Their semantics is supposed to be defined by other documents, such as RIF RDF and OWL Compatibility. ☐

The above definitions make the intent behind the rif:local constants clear: rif:local constants that occur in different documents can be interpreted differently even if they have the same name. Therefore, each document can choose the names for the rif:local constants freely and without regard to the names of such constants used in the imported documents.

From now on, every formula is assumed to be part of some document. It a formula is not physically part of any document, it will be said to belong to a special query document. This allows us to define TVal_I(φ), where I is a multi-structure, for arbitrary formulas, not just for document formulas: If φ is a formula that is not a document-formula and I is a semantic multi-structure that contains a component I^Δ that corresponds to the document of φ, then TVal_I(φ) is defined as TVal_I^Δ(φ). Otherwise, TVal_I(φ) is undefined.

Definition (Models). A multi-structure I is a model of a formula, φ, written as I|=φ, iff TVal_I(φ) is defined and equals t. ☐

3.8 Intended Semantic Structures

The semantics of a set of formulas, Γ, is the set of its intended semantic multi-structures. RIF-FLD does not specify what these intended multi-structures are, leaving this to RIF dialects. Different logic theories may have different criteria for what is considered an intended semantic multi-structure.

For the classical first-order logic, every model is an intended semantic multi-structure. For RIF-BLD, which is based on Horn rules, intended multi-structures are defined only for sets of rules: an intended semantic multi-structure of a RIF-BLD set Γ is the unique minimal Herbrand model of Γ. For the dialects in which rule bodies may contain literals negated with the negation-as-failure connective Naf, only some of the minimal Herbrand models of a set of rules are intended. Each logic-based dialect of RIF must define the set of intended semantic multi-structures precisely. The two most common such theories are the well-founded models [GRS91] and stable models [GL88].

The following example illustrates the notion of intended semantic structures. Suppose Γ consists of a single rule formula p :- Naf q. If Naf were interpreted as classical negation, not, then this rule would be simply equivalent to Or(p q), and so it would have two kinds of models: those where p is true and those where q is true. In contrast to first-order logic, most rule-based systems do not consider p and q symmetrically. Instead, they view the rule p :- Naf q as a statement that p must be true if it is not possible to establish the truth of q. Since it is, indeed, impossible to establish the truth of q, such theories would derive p even though it does not logically follow from Or(p q). The logic underlying rule-based systems also assumes that only the minimal Herbrand models are intended (minimality here is with respect to the set of true facts). Furthermore, although our example has two minimal Herbrand models -- one where p is true and q is false, and the other where p is false, but q is true, only the first model is considered to be intended.

The above concept of intended models and the corresponding notion of logical entailment with respect to the intended models, defined below, is due to [Shoham87].

3.9 Logical Entailment

We will now define what it means for a set of RIF-FLD formulas to entail another RIF-FLD formula. This notion is typically used for defining queries to knowledge bases and for other tasks, such as testing subsumption of concepts (e.g., in OWL). We assume that each set of formulas has an associated set of intended semantic structures.

Definition (Logical entailment). Let Γ be a RIF-FLD formula and φ another RIF-FLD formula. We say that Γ entails φ, written as Γ |= φ, if and only if, for every intended semantic multi-structure I of Γ for which both TVal_I(Γ) and TVal_I(φ) are defined, it is the case that TVal_I(Γ) ≤_t TVal_I(φ). ☐

This general notion of entailment covers both first-order logic and the non-monotonic logics that underlie many rule-based languages [Shoham87].

Note that one consequence of the multi-document semantics is that local constants specified in one document cannot be queried from another document. In particular, they cannot be returned as query answers. For instance, if one document, Δ', has the fact "http://example.com/ppp"^^rif:iri("abc"^^rif:local) while another document formula, Δ, imports Δ' and has the rule "http://example.com/qqq"^^rif:iri(?X) :- "http://example.com/ppp"^^rif:iri(?X) , then Δ |= "http://example.com/qqq"^^rif:iri("abc"^^rif:local) does not hold. This is because "abc"^^rif:local in Δ' and "abc"^^rif:local in the query on the right-hand side of |= are treated as different constants by semantic multi-structures.

4 XML Serialization Framework

The RIF XML serialization framework defines

a normative mapping from the RIF-FLD presentation syntax to XML, and
a normative XML Schema for that XML syntax.

As explained in the overview section, the design of RIF envisages that the presentation syntaxes of future logic-based RIF dialects will be specializations of the presentation syntax of RIF-FLD. This means that every well-formed formula in the presentation syntax of a standardized RIF dialect must also be well-formed in RIF-FLD. The goal of the XML serialization framework is to provide a similar yardstick for the RIF XML syntax. This amounts to the requirement that any valid XML document for a logic-based RIF dialect must also be a valid XML document for RIF-FLD. In terms of the presentation-to-XML syntax mappings, this means that each mapping for a logic-based RIF dialect must be a restriction of the corresponding mapping for RIF-FLD. For instance, the mapping from the presentation syntax of RIF-BLD to XML is a restriction of the presentation-syntax-to-XML mapping for RIF-FLD. In this way, RIF-FLD provides a framework for extensibility and mutual compatibility between XML syntaxes of RIF dialects.

Editor's Note: This section is incomplete in the present draft: the XML Schema is missing. The next draft will include a full treatment of the XML serialization framework.

4.1 XML for the RIF-FLD Language

Editor's Note: The XML syntax, and especially the element tags, are being discussed by the Working Group. Input is welcome. See Issue-49

The XML serialization for RIF-FLD is alternating or fully striped [ANF01]. A fully striped serialization views XML documents as objects and divides all XML tags into class descriptors, called type tags, and property descriptors, called role tags. We use capitalized names for type tags and lowercase names for role tags. The RIF serialization framework for the syntax of Section EBNF Grammar for the Presentation Syntax of RIF-FLD uses the following XML tags.

- Group     (nested collection of sentences annotated with metadata)
- meta      (meta role, containing metadata, which is represented as a Frame)
- sentence  (sentence role, containing FORMULA or Group)
- Forall    (quantified formula for 'Forall', containing declare and formula roles)
- Exists    (quantified formula for 'Exists', containing declare and formula roles)
- declare   (declare role, containing a Var)
- formula   (formula role, containing a FORMULA)
- Implies   (implication, containing if and then roles)
- if        (antecedent role, containing FORMULA)
- then      (consequent role, containing FORMULA)
- And       (conjunction)
- Or        (disjunction)
- Neg       (strong negation, containing a formula role)
- Naf       (negation as failure, containing a formula role)
- Atom      (atom formula, positional or with named arguments)
- External  (external call, containing a content role)
- content   (content role, containing an Atom, for predicates, or Expr, for functions)
- Member    (member formula)
- Subclass  (subclass formula)
- Frame     (Frame formula)
- object    (Member/Frame role containing a TERM or an object description)
- op        (Atom/Expr role for predicates/functions as operations)
- arg       (argument role)
- upper     (Member/Subclass upper class role)
- lower     (Member/Subclass lower instance/class role)
- slot      (Atom/Expr/Frame slot role, containing a Prop)
- Prop      (Property, prefix version of slot infix '->')
- key       (Prop key role, containing a Const)
- val       (Prop val role, containing a TERM)
- Equal     (prefix version of term equation '=')
- Expr      (expression formula, positional or with named arguments)
- side      (Equal left-hand side and right-hand side role)
- Const     (individual, function, or predicate symbol, with optional 'type' attribute)
- Name      (name of named argument)
- Var       (logic variable)

Example 3 (Serialization of a nested RIF-FLD group annotated with metadata).

This example shows an XML serialization for the formulas in Example 2. For convenience of reference, the original formulas are included at the top. For better readability, we again use the shortcut syntax defined in Data Types and Builtins.

Presentation syntax:

  Document(
   Prefix(dc http://http://purl.org/dc/terms/)
   Prefix(ex http://example.org/ontology#)
   Prefix(hamlet http://www.shakespeare-literature.com/Hamlet/)
     
   Group hamlet:assertions[dc:title -> "Hamlet", dc:creator -> "Shakespeare"]
   (
      Exists ?X (And(?X # ex:RottenThing
                     ex:partof(?X <http://www.denmark.dk>)))
      Forall ?X (Or(hamlet:tobe(?X)  Naf hamlet:tobe(?X)))
      Forall ?X (And(Exists ?B (And(ex:has(?X ?B) ?B # ex:business))
                     Exists ?D (And(ex:has(?X ?D) ?D # ex:desire)))
                   :- ?X # ex:man)
      Group hamlet:facts[ ]
      (
         hamlet:Yorick # ex:poor
         hamlet:Hamlet # ex:prince
      )
   )
  )


XML serialization:

   <Group>
    <meta>
      <Frame>
        <object>
          <Const type="rif:iri">hamlet:assertions</Const>
        </object>
        <slot>
          <Prop>
            <key><Const type="rif:iri">dc:title</Const></key>
            <val><Const type="xsd:string">Hamlet</Const></val>
          </Prop>
        </slot>
        <slot>
          <Prop>
            <key><Const type="rif:iri">dc:creator</Const></key>
            <val><Const type="xsd:string">Shakespeare</Const></val>
          </Prop>
        </slot>
      </Frame>
    </meta>
    <sentence>
     <Exists>
       <declare><Var>X</Var></declare>
       <formula>
         <And>
           <formula>
             <Member>
               <lower><Var>X</Var></lower>
               <upper><Const type="rif:iri">ex:RottenThing</Const></upper>
             </Member>
           </formula>
           <formula>
             <Atom>
               <op><Const type="rif:iri">ex:partof</Const></op>
               <arg><Var>X</Var></arg>
               <arg><Const type="rif:iri">http://www.denmark.dk</Const></arg>
             </Atom>
           </formula>
         </And>
       </formula>
     </Exists>
    </sentence>
    <sentence>
     <Forall>
       <declare><Var>X</Var></declare>
       <formula>
         <Or>
           <formula>
             <Atom>
               <op><Const type="rif:iri">hamlet:tobe</Const></op>
               <arg><Var>X</Var></arg>
             </Atom>
           </formula>
           <formula>
             <Naf>
               <formula>
                 <Atom>
                   <op><Const type="rif:iri">hamlet:tobe</Const></op>
                   <arg><Var>X</Var></arg>
                 </Atom>
               </formula>
             </Naf>
           </formula>
         </Or>
       </formula>
     </Forall>
    </sentence>
    <sentence>
     <Forall>
       <declare><Var>X</Var></declare>
       <formula>
         <Implies>
           <if>
             <Member>
               <lower><Var>X</Var></lower>
               <upper><Const type="rif:iri">ex:man</Const></upper>
             </Member>
           </if>
           <then>
             <And>
               <formula>
                 <Exists>
                   <declare><Var>B</Var></declare>
                   <And>
                     <formula>
                       <Atom>
                         <op><Const type="rif:iri">ex:has</Const></op>
                         <arg><Var>X</Var></arg>
                         <arg><Var>B</Var></arg>
                       </Atom>
                     </formula>
                     <formula>
                       <Member>
                         <lower><Var>B</Var></lower>
                         <upper><Const type="rif:iri">ex:business</Const></upper>
                       </Member>
                     </formula>
                   </And>
                 </Exists>
               </formula>
               <formula>
                 <Exists>
                   <declare><Var>D</Var></declare>
                   <And>
                     <formula>
                       <Atom>
                         <op><Const type="rif:iri">ex:has</Const></op>
                         <arg><Var>X</Var></arg>
                         <arg><Var>D</Var></arg>
                       </Atom>
                     </formula>
                     <formula>
                       <Member>
                         <lower><Var>D</Var></lower>
                         <upper><Const type="rif:iri">ex:desire</Const></upper>
                       </Member>
                     </formula>
                   </And>
                 </Exists>
               </formula>
             </And>
           </then>
         </Implies>
       </formula>
     </Forall>
   </sentence>
   <sentence>
     <Group>
       <meta>
         <Frame>
           <object>
             <Const type="rif:iri">hamlet:facts</Const>
           </object>
         </Frame>
       </meta>
       <sentence>
         <Member>
           <lower><Const type="rif:iri">hamlet:Yorick</Const></lower>
           <upper><Const type="rif:iri">ex:poor</Const></upper>
         </Member>
       </sentence>
       <sentence>
         <Member>
           <lower><Const type="rif:iri">hamlet:Hamlet</Const></lower>
           <upper><Const type="rif:iri">ex:prince</Const></upper>
         </Member>
       </sentence>
     </Group>
    </sentence>
   </Group>

4.2 Translation Between the RIF-FLD Presentation and XML Syntaxes

We now present a translation of the syntax of Section EBNF Grammar for the Presentation Syntax of RIF-FLD to the XML syntax of RIF-FLD.

This translation is specified by the table below. Since the presentation syntax of RIF-FLD is context sensitive, the translation must differentiate between the terms that occur in the position of the individuals from terms that occur as atomic formulas. To this end, in the translation table, the positional and named argument terms that occur in the context of atomic formulas are denoted by the expressions of the form pred(...) and the terms that occur as individuals are denoted by expressions of the form func(...).

Editor's Note: This XML syntax translation table is expected to be made more formal in future versions of this draft.

The prime symbol (for instance, variable') indicates that the translation function defined by the table must be applied recursively (i.e., to variable in our example).

Presentation Syntax	XML Syntax
Group ( clause₁ . . . clause_n )	<Group> <sentence>clause₁'</sentence> . . . <sentence>clause_n'</sentence> </Group>
Group metaframe ( clause₁ . . . clause_n )	<Group> <meta>metaframe'</meta> <sentence>clause₁'</sentence> . . . <sentence>clause_n'</sentence> </Group>
Forall variable₁ . . . variable_n ( body )	<Forall> <declare>variable₁'</declare> . . . <declare>variable_n'</declare> <formula>body'</formula> </Forall>
Exists variable₁ . . . variable_n ( body )	<Exists> <declare>variable₁'</declare> . . . <declare>variable_n'</declare> <formula>body'</formula> </Exists>
conclusion :- condition	<Implies> <if>condition'</if> <then>conclusion'</then> </Implies>
And ( conjunct₁ . . . conjunct_n )	<And> <formula>conjunct₁'</formula> . . . <formula>conjunct_n'</formula> </And>
Or ( disjunct₁ . . . disjunct_n )	<Or> <formula>disjunct₁'</formula> . . . <formula>disjunct_n'</formula> </Or>
Neg form	<Neg> <formula>form'</formula> </Neg>
Naf form	<Naf> <formula>form'</formula> </Naf>
pred ( argument₁ . . . argument_n )	<Atom> <op>pred'</op> <arg>argument₁'</arg> . . . <arg> argument_n'</arg> </Atom>
External ( atomicexpr )	<External> <content>atomicexpr'</content> </External>
func ( argument₁ . . . argument_n )	<Expr> <op>func'</op> <arg>argument₁'</arg> . . . <arg> argument_n'</arg> </Expr>
pred ( unicodestring₁ -> filler₁ . . . unicodestring_n -> filler_n )	<Atom> <op>pred'</op> <slot> <Prop> <key><Name>unicodestring₁</Name></key> <val>filler₁'</val> </Prop> </slot> . . . <slot> <Prop> <key><Name>unicodestring_n</Name></key> <val>filler_n'</val> </Prop> </slot> </Atom>
func ( unicodestring₁ -> filler₁ . . . unicodestring_n -> filler_n )	<Expr> <op>func'</op> <slot> <Prop> <key><Name>unicodestring₁</Name></key> <val>filler₁'</val> </Prop> </slot> . . . <slot> <Prop> <key><Name>unicodestring_n</Name></key> <val>filler_n'</val> </Prop> </slot> </Expr>
inst [ key₁ -> filler₁ . . . key_n -> filler_n ]	<Frame> <object>inst'</object> <slot> <Prop> <key>key₁'</key> <val>filler₁'</val> </Prop> </slot> . . . <slot> <Prop> <key>key_n'</key> <val>filler_n'</val> </Prop> </slot> </Frame>
inst # class [ key₁ -> filler₁ . . . key_n -> filler_n ]	<Frame> <object> <Member> <lower>inst'</lower> <upper>class'</upper> </Member> </object> <slot> <Prop> <key>key₁'</key> <val>filler₁'</val> </Prop> </slot> . . . <slot> <Prop> <key>key_n'</key> <val>filler_n'</val> </Prop> </slot> </Frame>
sub ## super [ key₁ -> filler₁ . . . key_n -> filler_n ]	<Frame> <object> <Subclass> <lower>sub'</lower> <upper>super'</upper> </Subclass> </object> <slot> <Prop> <key>key₁'</key> <val>filler₁'</val> </Prop> </slot> . . . <slot> <Prop> <key>key_n'</key> <val>filler_n'</val> </Prop> </slot> </Frame>
inst # class	<Member> <lower>inst'</lower> <upper>class'</upper> </Member>
sub ## super	<Subclass> <lower>sub'</lower> <upper>super'</upper> </Subclass>
left = right	<Equal> <side>left'</side> <side>right'</side> </Equal>
unicodestring^^space	<Const type="space">unicodestring</Const>
?unicodestring	<Var>unicodestring</Var>

5 Conformance of RIF Processors with RIF Dialects

Let Τ be a set of data types, which includes the data types specified in the RIF-DTB document, and suppose Ε is a set of external predicates and functions, which includes the built-ins listed in the RIF-DTB document. Let D be a RIF dialect (e.g., RIF-BLD). We say that a formula φ is a D_Τ,Ε formula iff

it is a formula in the dialect D,
all the data types used in φ are in Τ, and
all the externally defined functions and predicates used in φ are in Ε.

A RIF processor is a conformant D_Τ,Ε consumer iff it implements a semantics-preserving mapping, μ, from the set of all D_Τ,Ε formulas to the language L of the processor.

Formally, this means that for any pair φ, ψ of D_Τ,Ε formulas for which φ |=_D ψ is defined, φ |=_D ψ iff μ(φ) |=_L μ(ψ). Here |=_D denotes the logical entailment in the RIF dialect D and |=_L is the logical entailment in the language L of the RIF processor. In addition, a D_Τ,Ε compliant consumer must reject any document that contains a non-D_Τ,Ε formula.

A RIF processor is a conformant D_Τ,Ε producer iff it implements a semantics-preserving mapping, μ, from a subset of the language L of the processor to the set of D_Τ,Ε formulas.

Formally this means that for any pair φ, ψ of formulas in L for which φ |=_L ψ is defined, φ |=_L ψ iff μ(φ) |=_D μ(ψ).

6 References

6.1 Normative References

[RDF-CONCEPTS]

[RDF-SEMANTICS]: RDF Semantics, Patrick Hayes, Editor, W3C Recommendation, 10 February 2004, http://www.w3.org/TR/2004/REC-rdf-mt-20040210/. Latest version available at http://www.w3.org/TR/rdf-mt/.

[RDF-SCHEMA]: RDF Vocabulary Description Language 1.0: RDF Schema, Brian McBride, Editor, W3C Recommendation 10 February 2004, http://www.w3.org/TR/rdf-schema/.

[RFC-3066]: RFC 3066 - Tags for the Identification of Languages, H. Alvestrand, IETF, January 2001. This document is at http://www.isi.edu/in-notes/rfc3066.txt.

[RFC-3987]: RFC 3987 - Internationalized Resource Identifiers (IRIs), M. Duerst and M. Suignard, IETF, January 2005. This document is at http://www.ietf.org/rfc/rfc3987.txt.

[XML-SCHEMA2]: XML Schema Part 2: Datatypes, W3C Recommendation, World Wide Web Consortium, 2 May 2001. This version is http://www.w3.org/TR/2001/REC-xmlschema-2-20010502/. The latest version is available at http://www.w3.org/TR/xmlschema-2/.

6.2 Informational References

[ANF01]: Normal Form Conventions for XML Representations of Structured Data, Henry S. Thompson. October 2001.

[KLW95]: Logical foundations of object-oriented and frame-based languages, M. Kifer, G. Lausen, J. Wu. Journal of ACM, July 1995, pp. 741--843.

[CKW93]: HiLog: A Foundation for higher-order logic programming, W. Chen, M. Kifer, D.S. Warren. Journal of Logic Programming, vol. 15, no. 3, February 1993, pp. 187--230.

[CK95]: Sorted HiLog: Sorts in Higher-Order Logic Data Languages, W. Chen, M. Kifer. Sixth Intl. Conference on Database Theory, Prague, Czech Republic, January 1995, Lecture Notes in Computer Science 893, Springer Verlag, pp. 252--265.

[RDFSYN04]

[Shoham87]: Nonmonotonic logics: meaning and utility, Y. Shoham. Proc. 10th International Joint Conference on Artificial Intelligence, Morgan Kaufmann, pp. 388--393, 1987.

[CURIE]: CURIE Syntax 1.0: A syntax for expressing Compact URIs, Mark Birbeck, Shane McCarron. W3C Working Draft 2 April 2008. Available at http://www.w3.org/TR/curie/.

[CycL]: The Syntax of CycL, Web site. Available at http://www.cyc.com/cycdoc/ref/cycl-syntax.html.

[FL2]: FLORA-2: An Object-Oriented Knowledge Base Language, M. Kifer. Web site. Available at http://flora.sourceforge.net.

[OOjD]: Object-Oriented jDREW, Web site. Available at http://www.jdrew.org/oojdrew/.

[NxBRE]: .NET Business Rule Engine, Web site. Available at http://nxbre.wiki.sourceforge.net/.

[GRS91]: The Well-Founded Semantics for General Logic Programs, A. Van Gelder, K.A. Ross, J.S. Schlipf. Journal of ACM, 38:3, pages 620-650, 1991.

[GL88]: The Stable Model Semantics for Logic Programming, M. Gelfond and V. Lifschitz. Logic Programming: Proceedings of the Fifth Conference and Symposium, pages 1070-1080, 1988.

[vEK76]: The semantics of predicate logic as a programming language, M. van Emden and R. Kowalski. Journal of the ACM 23 (1976), 733-742.

[Enderton01]: A Mathematical Introduction to Logic, Second Edition, H. B. Enderton. Academic Press, 2001.

[Mendelson97]: Introduction to Mathematical Logic, Fourth Edition, E. Mendelson. Chapman & Hall, 1997.