ILEX plans a single page of text, describing a single entity, at a time. The text potential represents the information we can express, and the interconnectivity of information. When we receive the resquest for an entity description, the planner sets that entity as the global focus of the current page. Opportunistic planning then commences. The facts directly connected to that entity represent opportunities: the system can coherently include these facts in the text. If any of these facts are actually selected, then new opportunities are created in two ways:
An entity-based move from an individual entity to its generic class entity can be made once the appropriate ``isa'' fact has been selected:
This item is an organic jewel.
Organic jewels tend to be ...
: An Entity-Based Move
: A Relation-Based Move
Once we select a new fact in either of the ways described above, the new fact may act as the starting point for new opportunistic expansion. Alternatively, we may decide to backtrack to some earlier point, effecting a focus pop in Grosz and Sidner's [Grosz and Sidner 86] terms.
The selection of which opportunity to explore is determined by a number of heuristic factors. Firstly, facts are weighted according to the chain of relations back to the focus of the page [O'Donnell 97]. This is a way of preventing lengthly digressions from the supposed topic of the text. Secondly, each fact is associated with numbers which represent the opportunity `value' of the fact. The opportunities are of two kinds:
These three values, interest, importance and assimilation are taken together to calculate the opportunity value of each fact, which is used together with the evaluation of the chain of relations to select which textual opportunities to follow. We have no special theory about where interest and importance come from, though the above examples suggest that there may be domain- and user-type-specific rules that can be used to derive some of them.
In summary, content-determination in ILEX is seen as the task of optimising the selection of opportunities that are offered by the topic of the text, subject to not moving too far from that topic. The result of content-determination is a connected subgraph of the text potential (Figure 5). The use of interest and importance in ILEX is analogous to the use of ``salience'' in [McDonald and Conklin 82]. Because the process is seen as a graph traversal problem, there are also similarities with work on generating text from semantic networks [Simmons and Slocum 72,Sibun 92]. In a sense, our work aims to combine the best of both.
Figure 5: Result of Content Determination