Next:
Limited Expressiveness and
Up:
Essential Characters
Previous:
Essential Characters
Though there is no set of formal principles that define precisely the
structure or content of the ontology we are developing, we can list
several principles that constitute the foundation of our methodology
and that suggest the guidelines we follow. These can also be viewed as
the desiderata for the ontology we are developing.
These principles are more task oriented or situated unlike the
structural principles for ontologies that have been proposed in the
literature (Bouaud, Bachimont, Charlet, and Zweigenbaum, 1995; Zarri,
1995). In fact, we show below that many of those structural principles
had to be violated in our ontology in order to meet the practical
objectives of simplicity in representation and software requirements
and understandability and ease of browsing by non-experts.
The
K ontology is based on the following principles:
-
The ontology is language independent. An ontology is not a transfer
dictionary between a pair of languages. Our ontology is language
independent in two ways:
-
It is not specific to any particular natural language such as English or
Spanish.
-
The concepts in the ontology do not have a one-to-one mapping to word
senses in natural languages. Many concepts may not map to any single
word in a language;
other concepts may map to more than one word in the same language.
-
An ontology must be independently motivated, not dictated by the
lexicon of a particular language. Ontology development is not
subservient to lexicography. The two are sister processes that aid
each other and at the same time constrain each other in significant
ways.
-
An ontology must be well-formed according to a precise, axiomatic
specification (such as the one shown in Appendix B) and internally
consistent in structure, naming, and content across concepts.
-
Consistent and compatible with other knowledge and processing
resources such as the lexicon, the semantic analyzer, the language of
the TMR, and any expert microtheories.
-
An ontology must be rich in content, conceptual structure, and degree
of connectivity between concepts. For machine translation and other
NLP tasks, its structure is necessarily rich; it is not just a
hierarchy of concept names. Each concept is not just a label but has
rich internal structure resulting in a high degree of connectivity
with other concepts in the ontology.
-
Limited expressiveness is a virtue. In particular, we reject full
first-order predicate logic and the ability to make any assertion
within the ontology. Limited expressiveness enables users to browse
and comprehend the ontology.
-
An ontology must be easy to comprehend and simple to search for
concepts. it must be easy to browse, easy to train acquirers,
presentable, and so on. For example, an And-Or tree with disjunctive
inheritance is not suitable for our ontology because it is too hard to
comprehend and use for ontology acquirers and
lexicographers. Computational complexity of the inheritance algorithm
is not the reason for rejecting complex inheritance methods.
-
It must have a high utility for the task it is meant for. It must
ultimately aid language processing in resolving a variety of
ambiguities and making necessary inferences. Ontology development is a
goal driven process: we have a task, well understood ways of using the
product, and an immediate need for the knowledge base we are
acquiring. Ontology is language independent but rather
language-processing dependent in our methodology:
-
must have all the necessary knowledge to sufficiently constrain
language interpretation and generation.
-
must be a richly connected network of concepts to enable a
variety of inferences.
-
variable-depth semantics but nevertheless deep. Deeper semantics than
possible with just word names or word senses.
-
Cost effectiveness: Do not add all the knowledge just because we
know how.
-
Limited scope: This is not an Encyclopedia Britannica. We are
different from CyC.
-
Non-episodic. The knowledge we are acquiring is conceptual, not
episodic. Non-stereotypic and less unbounded. Onomasticon development
is significantly different in scope, methodology, and cost from
ontology development. Our methods for concept acquisition are not
cost-effective for acquiring instances in the onomasticon. Onomasticon
acquisition must be automated to a far higher degree.
-
Not domain specific but does focus on a domain of choice.
-
Technology Aided: The task is made more tractable by the
deployment of latest technologies: faster machines, color graphical
user interfaces, graphical browsers and editors, on-line lexica,
corpora, and other ``ontologies,'' as well as semi-automated interfaces
for customer interactions and database maintenance.
Next:
Limited Expressiveness and
Up:
Essential Characters
Previous:
Essential Characters
Kavi Mahesh
Sun Nov 12 13:56:16 MST 1995