public final class DiskTreebank extends Treebank
DiskTreebank is a Collection of
Trees.
A DiskTreebank object stores merely the information to
get at a corpus of trees that is stored on disk. Access is usually
via apply()'ing a TreeVisitor to each Tree in the Treebank or by using
an iterator() to get an iteration over the Trees.
DEFAULT_TREE_FILE_SUFFIX| Constructor and Description |
|---|
DiskTreebank()
Create a new DiskTreebank.
|
DiskTreebank(int initialCapacity)
Create a new Treebank.
|
DiskTreebank(int initialCapacity,
TreeReaderFactory trf)
Create a new Treebank.
|
DiskTreebank(java.lang.String encoding)
Create a new treebank, set the encoding for file access.
|
DiskTreebank(TreeReaderFactory trf)
Create a new DiskTreebank.
|
DiskTreebank(TreeReaderFactory trf,
java.lang.String encoding)
Create a new DiskTreebank.
|
| Modifier and Type | Method and Description |
|---|---|
void |
apply(TreeVisitor tp)
Applies the TreeVisitor to to all trees in the Treebank.
|
void |
clear()
Empty a
Treebank. |
java.lang.String |
getCurrentFilename()
Returns the absolute path of the file currently being read.
|
java.util.List<java.io.File> |
getCurrentPaths() |
java.util.Iterator<Tree> |
iterator()
Return an Iterator over Trees in the Treebank.
|
void |
loadPath(java.io.File path,
java.io.FileFilter filt)
Load trees from given directory.
|
void |
printFileNames() |
decimate, encoding, loadPath, loadPath, loadPath, loadPath, loadPath, remove, size, textualSummary, textualSummary, toString, transform, treeReaderFactoryadd, addAll, contains, containsAll, isEmpty, removeAll, retainAll, toArray, toArrayclone, equals, finalize, getClass, hashCode, notify, notifyAll, wait, wait, waitpublic DiskTreebank()
LabeledScoredTreeReaderFactory.public DiskTreebank(java.lang.String encoding)
encoding - The charset encoding to use for treebank file decodingpublic DiskTreebank(TreeReaderFactory trf)
trf - the factory class to be called to create a new
TreeReaderpublic DiskTreebank(TreeReaderFactory trf, java.lang.String encoding)
trf - the factory class to be called to create a new
TreeReaderencoding - The charset encoding to use for treebank file decodingpublic DiskTreebank(int initialCapacity)
LabeledScoredTreeReaderFactory.initialCapacity - The initial size of the underlying Collection.
For a DiskTreebank, this parameter is ignored.public DiskTreebank(int initialCapacity,
TreeReaderFactory trf)
initialCapacity - The initial size of the underlying Collection,
For a DiskTreebank, this parameter is ignored.trf - the factory class to be called to create a new
TreeReaderpublic void clear()
Treebank.public void loadPath(java.io.File path,
java.io.FileFilter filt)
public void apply(TreeVisitor tp)
public java.lang.String getCurrentFilename()
public java.util.List<java.io.File> getCurrentPaths()
public void printFileNames()
public java.util.Iterator<Tree> iterator()
apply().