The efficiency of a content realizer can be evaluated more straightforwardly. There are four areas that can be judged: the basic algorithmic complexity, the extent of backtracking, the ability to prune the search space and the processing speed of each node in the tree.
The ``baseline'' I would use for comparison is the number of rules in the tree
after ``look-ahead'' (305). This is because the ``look-ahead'' function uses many
simple techniques
to trim the tree which most
text planners would have available. After initial island processing, then, there
are only 156 rules that have not been eliminated (a 49 percent decrease). The
situation is actually much better than this, though, because of these 156 rules,
94 of them are associated with an island. Because of the optimization techniques
described above, islands require no further processing. Thus only 62 rules
need to actually be considered, a reduction of 80 percent over the baseline.
Note also that the initial processing increases the number of islands 52 percent
over the baseline.
The elimination of backtracking serves to increase the efficiency by making
the solution significantly better than the ``worst case''. The worst case
scenario occurs when the maximum amount of backtracking possible actually
occurs, and the correct solution is the last one tried. This system eliminates
backtracking, first of all, by reducing the number of alternatives that are tried,
as described directly above
. Backtracking
is also eliminated because the system recognizes an impossible plan at the earliest
moment possible, instead of carrying the plan through and only finding out
later that it must be failed. The results of processing on the sample text
follow:
Thus, a decrease of 34 percent (over and above the 80 percent reduction in search space, which has an exponential benefit) was attained. It was interesting to note the relative contributions of the soundness processing versus the island processing components. Contrary to my expectations, the soundness principle almost completely overwhelmed the island principle during the initial processing. Much of this is due to the fact that the soundness principle is applied first, but, nevertheless, all but one of the island-related failures had already been accounted for by the soundness processing. This needs further research to determine if this is a general effect, or simply related to the example text and rule set used.