public class TueBaDZLanguagePack extends AbstractTreebankLanguagePack
Fields inherited from class AbstractTreebankLanguagePack: DEFAULT_ENCODING, DEFAULT_GF_CHAR, generateOriginalDependencies, gfCharacter

| Constructor and Description |
|---|
| TueBaDZLanguagePack(): Gives a handle to the TreebankLanguagePack. |
| TueBaDZLanguagePack(boolean leaveGF): Make a new language pack with grammatical functions used based on the value of leaveGF. |
| TueBaDZLanguagePack(boolean useLimitedGF, boolean leaveGF, char gfChar): Make a new language pack with grammatical functions used based on the value of leaveGF and marked with the character gfChar. |
| TueBaDZLanguagePack(boolean leaveGF, char gfChar): Make a new language pack with grammatical functions used based on the value of leaveGF and marked with the character gfChar. |

| Modifier and Type | Method and Description |
|---|---|
| java.lang.String | basicCategory(java.lang.String category): Returns the basic syntactic category of a String. |
| java.lang.String | getEncoding(): Return the input Charset encoding for the Treebank. |
| HeadFinder | headFinder(): The HeadFinder to use for your treebank. |
| boolean | isLeaveGF() |
| boolean | isLimitedGF() |
| char[] | labelAnnotationIntroducingCharacters(): Return an array of characters at which a String should be truncated to give the basic syntactic category of a label. |
| static void | main(java.lang.String[] args): Prints a few aspects of the TreebankLanguagePack, just for debugging. |
| java.lang.String[] | punctuationTags(): Returns a String array of punctuation tags for this treebank/language. |
| java.lang.String[] | punctuationWords(): Returns a String array of punctuation words for this treebank/language. |
| java.lang.String[] | sentenceFinalPunctuationTags(): Returns a String array of sentence final punctuation tags for this treebank/language. |
| java.lang.String[] | sentenceFinalPunctuationWords(): Returns a String array of sentence final punctuation words for this treebank/language. |
| void | setLeaveGF(boolean leaveGF) |
| void | setLimitedGF(boolean limitedGF) |
| java.lang.String[] | startSymbols(): Returns a String array of treebank start symbols. |
| java.lang.String | stripGF(java.lang.String category): Returns the category for a String with everything following the gf character (which may be language specific) stripped. |
| java.lang.String | treebankFileExtension(): Returns the extension of treebank files for this treebank. |
| TreeReaderFactory | treeReaderFactory(): Returns a TreeReaderFactory suitable for general purpose use with this language/treebank. |
| HeadFinder | typedDependencyHeadFinder(): The HeadFinder to use when making typed dependencies. |

Methods inherited from class AbstractTreebankLanguagePack: categoryAndFunction, evalBIgnoredPunctuationTagAcceptFilter, evalBIgnoredPunctuationTagRejectFilter, evalBIgnoredPunctuationTags, generateOriginalDependencies, getBasicCategoryFunction, getCategoryAndFunctionFunction, getGfCharacter, getTokenizerFactory, grammaticalStructureFactory, grammaticalStructureFactory, grammaticalStructureFactory, isEvalBIgnoredPunctuationTag, isLabelAnnotationIntroducingCharacter, isPunctuationTag, isPunctuationWord, isSentenceFinalPunctuationTag, isStartSymbol, morphFeatureSpec, punctuationTagAcceptFilter, punctuationTagRejectFilter, punctuationWordAcceptFilter, punctuationWordRejectFilter, sentenceFinalPunctuationTagAcceptFilter, setGenerateOriginalDependencies, setGfCharacter, startSymbol, startSymbolAcceptFilter, supportsGrammaticalStructures, treeTokenizerFactory

public TueBaDZLanguagePack()
public TueBaDZLanguagePack(boolean leaveGF)
public TueBaDZLanguagePack(boolean leaveGF,
char gfChar)
public TueBaDZLanguagePack(boolean useLimitedGF,
boolean leaveGF,
char gfChar)
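
public char[] labelAnnotationIntroducingCharacters()
Specified by: labelAnnotationIntroducingCharacters in interface TreebankLanguagePack
Overrides: labelAnnotationIntroducingCharacters in class AbstractTreebankLanguagePack

public java.lang.String[] punctuationTags()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of punctuation tags for this treebank/language.
Specified by: punctuationTags in interface TreebankLanguagePack
Overrides: punctuationTags in class AbstractTreebankLanguagePack

public java.lang.String[] punctuationWords()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of punctuation words for this treebank/language.
Specified by: punctuationWords in interface TreebankLanguagePack
Overrides: punctuationWords in class AbstractTreebankLanguagePack

public java.lang.String[] sentenceFinalPunctuationTags()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of sentence final punctuation tags for this treebank/language.
Specified by: sentenceFinalPunctuationTags in interface TreebankLanguagePack
Overrides: sentenceFinalPunctuationTags in class AbstractTreebankLanguagePack

public java.lang.String[] startSymbols()
Description copied from class: AbstractTreebankLanguagePack
Returns a String array of treebank start symbols.
Specified by: startSymbols in interface TreebankLanguagePack
Overrides: startSymbols in class AbstractTreebankLanguagePack

public java.lang.String[] sentenceFinalPunctuationWords()
Description copied from interface: TreebankLanguagePack
Returns a String array of sentence final punctuation words for this treebank/language.

public java.lang.String treebankFileExtension()
Description copied from interface: TreebankLanguagePack
Returns the extension of treebank files for this treebank.

public java.lang.String basicCategory(java.lang.String category)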
Description copied from class: AbstractTreebankLanguagePack
Returns the basic syntactic category of a String. The label is truncated at an occurrence of one of the labelAnnotationIntroducingCharacters(). However, there are special cases for labelAnnotationIntroducingCharacters in category labels: (i) if the first char is in this set, it is never truncated (e.g., '-' or '=' as a token), and (ii) if the label starts with one of this set, a second instance of the same item from this set is also excluded (to deal with '-LRB-', '-RCB-', etc.).
Specified by: basicCategory in interface TreebankLanguagePack
Overrides: basicCategory in class AbstractTreebankLanguagePack
Parameters: category - The whole String name of the label

public java.lang.String stripGF(java.lang.String category)
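Description copied from interface: TreebankLanguagePack
Returns the category for a String with everything following the gf character (which may be language specific) stripped.
Specified by: stripGF in interface TreebankLanguagePack
Overrides: stripGF in class AbstractTreebankLanguagePack
Parameters: category - The String name of the label (may previously have had basic category called on it)

public boolean isLeaveGF()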
public void setLeaveGF(boolean leaveGF)
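
The truncation rules described for basicCategory and stripGF can be illustrated with a small standalone sketch. This is not the library's source: the class name BasicCategorySketch, the character set {'-', '='}, and the choice to strip from the last occurrence of the gf character are assumptions made for illustration only. The real character set comes from labelAnnotationIntroducingCharacters() and is not listed on this page.

```java
public class BasicCategorySketch {

    // Hypothetical annotation-introducing characters (assumption, see lead-in).
    static final char[] ANNOTATION_CHARS = {'-', '='};

    static boolean isAnnotationChar(char ch) {
        for (char c : ANNOTATION_CHARS) {
            if (c == ch) {
                return true;
            }
        }
        return false;
    }

    // Returns the index at which the basic category ends.
    static int basicCategoryEnd(String label) {
        boolean sawAtZero = false;
        char seenAtZero = '\0';
        int i = 0;
        for (; i < label.length(); i++) {
            char ch = label.charAt(i);
            if (!isAnnotationChar(ch)) {
                continue;
            }
            if (i == 0) {
                // Rule (i): a leading annotation character never truncates.
                sawAtZero = true;
                seenAtZero = ch;
            } else if (sawAtZero && ch == seenAtZero) {
                // Rule (ii): the matching second instance is also skipped.
                sawAtZero = false;
            } else {
                break; // ordinary case: truncate here
            }
        }
        return i;
    }

    static String basicCategory(String label) {
        if (label == null) {
            return null;
        }
        return label.substring(0, basicCategoryEnd(label));
    }

    // Assumption: strip from the LAST occurrence of gfChar, and only if it is
    // not the first character of the label.
    static String stripGF(String label, char gfChar) {
        if (label == null) {
            return null;
        }
        int idx = label.lastIndexOf(gfChar);
        return idx > 0 ? label.substring(0, idx) : label;
    }

    public static void main(String[] args) {
        System.out.println(basicCategory("NP-SBJ")); // prints "NP"
        System.out.println(basicCategory("-LRB-"));  // prints "-LRB-"
        System.out.println(basicCategory("-"));      // prints "-"
        System.out.println(stripGF("NX-ON", '-'));   // prints "NX"
    }
}
```

The two special cases keep bracket tokens such as '-LRB-' and bare punctuation tokens such as '-' intact, while an ordinary label such as "NP-SBJ" loses its annotation suffix.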
public java.lang.String getEncoding()
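Return the input Charset encoding for the Treebank. See the documentation for the Charset class.
Specified by: getEncoding in interface TreebankLanguagePack
Overrides: getEncoding in class AbstractTreebankLanguagePack

public static void main(java.lang.String[] args)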
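
Because getEncoding() returns the encoding as a String name, callers would typically resolve it with the JDK's Charset class before decoding treebank bytes. The following is a minimal sketch using only the JDK. The class name EncodingSketch and the value "UTF-8" are assumptions standing in for whatever getEncoding() actually returns, which this page does not state.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

public class EncodingSketch {

    // Decodes raw bytes using an encoding given by name, as a caller might do
    // with the String returned by getEncoding().
    static String decode(byte[] raw, String encodingName) {
        return new String(raw, Charset.forName(encodingName));
    }

    public static void main(String[] args) {
        // Assumption: stand-in for the value returned by getEncoding().
        String encodingName = "UTF-8";
        String original = "T\u00fcBa-D/Z";
        byte[] raw = original.getBytes(StandardCharsets.UTF_8);
        System.out.println(decode(raw, encodingName).equals(original)); // prints "true"
    }
}
```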
public boolean isLimitedGF()
public void setLimitedGF(boolean limitedGF)
public TreeReaderFactory treeReaderFactory()
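Description copied from class: AbstractTreebankLanguagePack
Returns a TreeReaderFactory suitable for general purpose use with this language/treebank.
Specified by: treeReaderFactory in interface TreebankLanguagePack
Overrides: treeReaderFactory in class AbstractTreebankLanguagePack

public HeadFinder headFinder()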
public HeadFinder typedDependencyHeadFinder()