Overview

This document describes how to perform batch generation within LOGON. In extension of this it also describes how to produce a generation treebank on the basis of an existing parsing treebank. We first give a step-by-step descripton of how to do this using only menu choices from the [incr tsdb()] podium. Then we show how the same steps can be carried out from the command line, using the generate script provided in the LOGON source tree (i.e. $LOGONROOT/generate).

Using the Menu Options

For the podium approach, there are two main steps; 1) generate and 2) update. In the first step we exhaustively generate all "paraphrases" for the input MRS. In the update step we identify and label the references among these alternative realizations by matching them against the references in the original parse treebank.

(1) Generation

(2) Update

In this step we identify and label the references among the newly generated sets of paraphrases. First we set some switches controlling how the realizations are matched against the references of the original parse treebank:

Then, to perform the labeling step, execute Trees | Update

PS: Some [incr tsdb()] parameters that are relevant for the matching (labeling) of references include the following:

Using the Command Line Script

The procedure described above can also be performed by using the $LOGONROOT/generate script.

  cd $LOGONROOT
  ./generate --binary mrs

The first, skeleton argument to generate should name the skeleton you want to use for the new target profile that will be created. We document the available command-line options below, as well as some related and relevant Lisp variables.

PS: In order to control the maximum number of edges allowed in the chart during generation, look for *tsdb-maximal-number-of-edges* in the generate script (the current default is 100,000)).

LogonProcessing/BatchGeneration (last edited 2011-10-08 21:12:08 by localhost)

(The DELPH-IN infrastructure is hosted at the University of Oslo)