A.1.0 Nucleus: Positive Conditions

Notes about the Rationale for the Changes

This document is a version of "A.1_Basis:_Positive_Conditions". It proposes a modified syntax and semantics to account for Web-izing (the use of URIs for constants, predicates, and functions) and primitive data types (integers, floats, time, date, etc.). The main change is that there is no longer a wall between the domains of constants, functions, and predicates. Instead, all these symbols are now drawn from the same domain. Separation between the different kinds of symbols is introduced through the mechanism of sorts. For instance, we can introduce a sort URI for URIs and the sorts integer, float, time, string, etc. We can decide that certain sorts must be disjoint (integers, time) and others are not (eg, URI can be a subsort of string). We can control what sorts can be used for predicates, function symbols, etc. For example, we can decide that only strings can be predicates (which includes URIs, if URI is a subsort of string). Or we can decide that only URI and localSymbol (where localSymbol is some kind of a subsort of string) can be predicates and functions.

The first few sections on the syntax and semantics simply rework the material from "A.1_Basis:_Positive_Conditions" using the single domain for constants, functions, and predicates. The multisorted extension is described in sections Multisorted Extensions of the RIF Core, Formalization of Multisorted Syntax, and Semantics of the Multisorted RIF Core.

Introduction

The basis of the language is formed by what can appear in the bodies of Horn-like rules with equality -- conjunctions and disjunctions of atomic formulas and equations (such rules reduce to pure Horn). We later extend our proposal to include builtins. As mentioned in the introduction, the motivation is that this sublanguage can be shared among the bodies of the rules expressed in the following RIF dialects:

FO (first order)
LP (logic programming)
PR (production rules)
RR (reactive rules)

This sublanguage can also be used to uniformly express:

IC (integrity constraints)
QY (queries)

SYNTAX

Essential BNF for human-readable syntax:

  Var        ::= '?' NAME
  TERM       ::= Var | Expr
  Expr       ::= Con '(' TERM* ')' | Con
  Atom       ::= Expr | TERM '=' TERM
  LITFORM    ::= Atom
  QUANTIF    ::= 'Exists' Var+ '(' CONDIT ')'
  CONJ       ::= 'And' '(' CONDIT* ')'
  DISJ       ::= 'Or' '(' CONDIT* ')'
  CONDIT     ::= LITFORM | QUANTIF | CONJ | DISJ

Here LITFORM stands for Literal Formula and anticipates the introduction of negated atoms later on. QUANTIF stands for Quantified Formula, which |for Horn-like conditions can only be 'Exists' Formulas (Var+ variables should occur free in the scoped CONDIT, so 'Exists' can quantify them; free variables are discussed below). More explicitly than in logic programming, CONJ expresses formula conjunctions, and DISJ expresses disjunctions. Finally, CONDIT combines everything and defines RIF conditions, which can later be extended beyond LITFORM, QUANTIF, CONJ, or DISJ.

We assume that all constants (Con) belong to one logical sort: the sort of elementary entities. Relation names and function symbols are also in Con. This will be subsequently refined to allow multiple sorts for different data types. See "Sorted Extensions of the RIF Core".

At this point we do not commit to any particular vocabulary for the names of variables and for constants. For instance, NAME could be any alphanumeric string and a variety of options could be used for Con. We leave the decision till later time.

In the present version, variables are not sorted and thus can range over all constants. Sorted variables might be introduced in a later version of the language.

Note that there are two uses of variables in the RIF Condition Language: free and quantified. All quantified variables are quantified explicitly, existentially (and also universally, later). We adopt the usual scoping rules for quantification from first-order logic. Variables that are not explicitly quantified are free.

The free variables are needed because we are dealing with conditions that occur in rule bodies only. When a condition occurs in such a rule body, the free variables in the condition are precisely those that also occur in the rule head. Such variables are quantified universally outside of the rule, and the scope of such quantification is the entire rule. For instance, the variable ?X in the rule below is free in the condition that occurs in the rule body, but it is universally quantified outside of the rule.

Condition with a free variable ?X:
                             ... Exists ?Y (condition(..?X..?Y..)) ...

Rule using the condition in its body:
Forall ?X (head(...?X...) :- ... Exists ?Y (condition(..?X..?Y..)) ...)

When conditions are used as queries, their free variables are to be bound to carry the answer bindings back to the caller.

The semantics of conditions is defined in the section SEMANTICS.

Example 1 (A RIF condition in human-readable syntax):

  In this condition, ?Buyer is quantified existentially, while ?Seller
  and ?Author are free:

  And ( Exists ?Buyer (purchase(?Buyer ?Seller book(?Author LeRif) $49))
        ?Seller=?Author )

This syntax is similar in style, and compatible to, the OWL Abstract Syntax http://www.w3.org/TR/owl-semantics/syntax.html.

An XML syntax can be obtained from the above BNF as follows. The non-terminals in all-upercase such as CONDIT become XML entities, which act like macros and will not be visible in instance markups. The other non-terminals and symbols ('=', 'Exists', etc.) become XML elements, which are adapted from RuleML as shown below.

- Con (constant)
- Var (logic variable) 
- Expr   (expression formula)
- Atom   (atomic formula)
- Equal  (prefix version of term equation '=')
- Exists (quantified formula for 'Exists')
- And    (conjunction)
- Or     (disjunction)

Based on the FOL RuleML http://www.w3.org/Submission/FOL-RuleML experience, this could be directly rewritten as a DTD or an XML Schema.

The condition formula in Example 1 can be serialized in XML as shown below.

Example 2 (A RIF condition in XML syntax):

  <And>
    <Exists>
      <Var>Buyer</Var>
      <Atom>
        <Con>purchase</Con>
        <Var>Buyer</Var>
        <Var>Seller</Var>
        <Expr>
          <Con>book</Con>
          <Var>Author</Var>
          <Con>LeRif</Con>
        </Expr>
        <Con>$49</Con>
      </Atom>
    </Exists>
    <Equal>
      <Var>Seller</Var>
      <Var>Author</Var>
    </Equal>
  </And>

SEMANTIC STRUCTURES (a.k.a. INTERPRETATIONS)

The first step in defining a model-theoretic semantics for a logic-based language is to define the notion of a semantic structure, also known as interpretation, and then to define the notion of truth valuation for the formulas in the language.

In case of the first-order semantics, the setting given here is one of the standard common definitions. Although it is not as frequently used as some other well-known definitions of semantic structures, it has the advantage of being easy to generalize to non-first-order cases --- for instance, rule sets with negation as failure (NAF), some of which (e.g., well-founded negation) use three-valued semantic structures, and settings, such as the Web, where information can be uncertain or contradictory. In the latter case, four-valued and other multi-valued semantic structures are used. (See, for example, M. Fitting, Fixpoint Semantics for Logic Programming A Survey, Theoretical Computer Science, 1999.)

A semantic structure is a mappings of the form

I: Set of formulas → TV

where TV is the set of all truth values. Thus, if φ if a formula then I(φ) is its truth value.

The set of truth values TV typically has only two values, t and f. However, some versions of NAF have three, t, u (undefined), and f, and, as we remarked, treatment of contradictions and uncertainty requires at least four: t, u, f, and i (inconsistent).

The set TV is assumed to have a total or partial order, called the truth order; it is denoted <_t. For instance, in the first-order case, f <_t t, and it is a total order. In the well-founded semantics, f <_t u <_t t, and it is again a total order. But in Belnap-style four-valued logics, which are suitable for dealing with uncertain or inconsistent information, the truth order is partial: f <_t u <_t t and f <_t i <_t t.

As a side remark, Belnap-style logics also have another order, called the knowledge order <_k: u <_k t <_k i; and u <_k f <_k i. Under the knowledge order, true and false are incomparable, and facts that are both true and false receive the truth value i, which is the least upper bound of f and t in the knowledge order.

More formally, let us define the following sets:

D - a non-empty set (of domain elements),
Con - the set of syntax elements recognized by the Con / entity production,
Var - the set of syntax elements recognized by the Var / ?name production

An interpretation I consists of four mappings:

I_C from Con to elements of D
I_V from Var to elements of D
I_F from Con to functions from D* into D (D* is a set of all tuples over domain D)
I_R from Con to truth-valued mappings D* -> TV

Using these mappings, we can define a more general mapping, I, as follows:

I(k) = I_C(k) if k is a constant
I(?v) = I_V(?v) if v is a variable
I(f(t1,...,tn)) = I_F(f)(I(t1),...,I(tn))

As explained earlier, an interpretation is supposed to map formulas to truth values. We define this mapping now:

Atomic formulas: I(r(t1,...,tn)) = I_R(r)(I(t1),...,I(tn))
Equality: I(t1=t2) = t iff I(t1) = I(t2) and it is f otherwise.
Conjunction: I(And(c1,...,cn)) = min_t(I(c1),...,I(cn)), where min_t is minimum with respect to the truth order.
Disjunction: I(Or(c1,...,cn)) = max_t(I(c1),...,I(cn)), where max_t is maximum with respect to the truth order.
Quantification: I(Exists v1 ... vn (c)) = max_t(I*(c)), where max_t is taken over all interpretations I* of the form <I_C, I*_V, I_F, I_R>, where I*_V is the same as I_V except possibly on the variables v1,...,vn (i.e., I* agrees with I everywhere except possibly in its interpretation of the mappings of variables v1 ... vn).

Multisorted Extensions of the RIF Core

The classical idea of sorted logic can easily account for the ideas of primitive data types, URIs as identifiers of objects and concepts, and more. Many logic languages (e.g., Prolog, HiLog, F-logic, RDF) allow the same symbol to play multiple roles. For instance, the same symbol foo can be used as a constant, a predicate of several different arities, and as a function symbol of different arities. To account for such languages, we will use a multisorted logic.

In a multisorted RIF core, each constant from Con is associated with one or more sorts. A sort can be primitive, an arrow sort, or a Boolean sort. Arrow sorts are also known as function sorts and Boolean sorts are also known as predicate sorts.

Primitive sorts are drawn from a fixed collection of sorts PS₁, ..., PS_n. These sorts are intended to model primitive data types. For instance, we could have the sorts integer, strings, time, dates, etc. The same constant can be associated with more than one primitive sort, so it is possible that the sort of short integers will be a subsort of the sort of long integers (i.e., every constant that is associated with the sort short will also be associated with the sort long). It is a common practice to distinguish the constants of different primitive sorts syntactically. For instance, constants of the primitive sort integer, would have a different syntax from constants of sort string, and constants of primitive type URI would have yet another syntax.

An arrow sort is a statement of the form s₁ × ... × s_k → s, where s₁, ..., s_k, s are names of primitive sorts (i.e., one of the PS₁, ..., PS_n). A Boolean sort is a statement of the form s₁ × ... × s_k, where, again, s₁, ..., s_k are names of primitive sorts.

Recall that RIF core uses the symbols from Con to denote constants, predicates, and function symbols alike, so the same symbol can occur in multiple contexts. However, it is useful to restrict the contexts in which various symbols are allowed to occur. For instance, Prolog or RDF don't place any such restrictions, but OWL-DL has a unique role for each symbol. This restriction of the context is accomplished by controlling the sorts that are associated with each constant. For instance, if one doesn't want integers to occur as predicate and function symbols then we don't associate any arrow or Boolean sorts with the constants that are associated with primitive sort integer. On the other hand, we do want URIs to denote concepts and other predicates. In that case, we would associate every Boolean sort with every constant that has a primitive sort URI. If we want to also allow local names for concepts and other predicates, then we might introduce a separate primitive sort, localPred, and endow it with every Boolean type.

Formalization of Multisorted Syntax

Formally, the syntax of RIF core needs the following adjustments. We introduce new functions:

PSort: Con → powerset(Primitive_Sorts)
ASort: Con → powerset(Arrow_Sorts)
BSort: Con → powerset(Boolean_Sorts)

Each of these functions associates a (possibly empty) set of sorts (primitive, arrow, or Boolean) with every constant c ∈ Con.

PSort is also defined on variables:

PSort: Var → powerset(Primitive_Sorts)

The intended meaning is that if ?v ∈ Var and PSort(?v) = {s₁, ..., s_k} then ?v can be bound only to function terms that are simultaneously of sorts s₁, ..., s_k (we define what it means for a function term to belong to a primitive sort below). In theory, PSort(?v) can be an empty set. However, such a variable would be useless, since it cannot be bound to anything.

Well-formed function terms. If c ∈ Con is a constant and s ∈ PSort(c) then we say that c (and c(), which we identify with c) is a well-formed function term of sort s. Note that the same constant can be a well-formed term of several different sorts because we allow several primitive sorts to be associated with the same constant. The informal meaning of such a happenstance is that the term belongs to the "intersection" of all the sorts with which it is associated.

By induction, if f(t₁, ..., t_k) is a function term then it is a well-formed function term of sort s if there is an arrow sort s₁, ..., s_k → s ∈ ASort(f) such that t₁, ..., t_k are well-formed function terms of sorts s₁, ..., s_k, respectively.

It is convenient to extend the mapping PSort from constants to function terms as follows:

PSort(t) = { s | t is a well-formed term of sort s }

Well-formed atomic formula. We can now say that an atomic formula p(t₁, ..., t_k) is well-formed if and only if t₁, ..., t_k are well-formed function terms and there is a Boolean sort s₁ × ... × s_k ∈ BSort(p) such that s₁ ∈ PSort(t₁), ..., s_k ∈ PSort(t_k).

The only other modification to the definition of the RIF syntax is that we must require that all atomic formulas that occur in RIF conditions and rules must be well-formed.

Semantics of the Multisorted RIF Core

The semantics of RIF core needs the following adjustments in order to be compatible with the multisorted syntax:

The domain D of an interpretation is now split into several subdomains:
- D = D_s1 ∪ ... ∪ D_sn, where each D_si is the domain of interpretation of the primitive sort s_i.
If c ∈ Con or ?v ∈ Var is a constant or a variable of
- primitive sort s then I_C(c) ∈ D_s and I_V(?v) ∈ D_s.
If f has an arrow type s₁, ..., s_k → s ∈ ASort(f) then
- I_F(f) should be a (possibly polymorphic) function of type D_s1 × ... × D_sk → D_s, i.e., if d₁ ∈ D_s1, ..., d_k ∈ D_sk then I_F(d₁,...,d_k) must be in D_sk (if the arguments are not in D_s1 × ... × D_sk then the result does not need to be in D_s, but I_F(f) might have other types, which restrict its behavior).
The definition of I_P requires no adjustments.