Lexical Probabilities File

This file can be generated from a tagged corpus using the script src/utilities/train-tagger/bin/TRAIN.sh provided in FreeLing package. See src/utilities/train-tagger/README find out how to use it.

The probabilities file has nine sections: <TagsetFile>, <UnknownTags>, <Theeta>, <Suffixes>, <SingleTagFreq>, <ClassTagFreq>, <FormTagFreq>, <BiassSuffixes>, <LidstoneLambda>. Each section is closed by its corresponding tag </TagsetFile>, </UnknownTags>, </Theeta>, </Suffixes>, </SingleTagFreq>, </ClassTagFreq>, </FormTagFreq>, </BiassSuffixes>, </LidstoneLambda>.

Lluís Padró 2013-09-09