Supported Languages

The current version supports (to different extents, see Table 1.3) Asturian (as), Catalan (ca), English (en), French (fr), Galician (gl), Italian (it), Portuguese (pt), Russian (ru), Slovene (sl), Spanish (es), and Welsh (cy).


Table 1.1: Analysis services available for each language.
  as ca cy en es fr gl it pt ru sl
Tokenization X X X X X X X X X X  
Sentence splitting X X X X X X X X X X  
Number detection   X   X X   X X X X  
Date detection   X   X X   X   X X  
Morphological dictionary X X X X X X X X X X  
Affix rules X X X X X X X X X    
Multiword detection X X X X X X X X X    
Basic named entity detection X X X X X X X X X X  
B-I-O named entity detection   X   X X   X   X    
Named Entity Classification   X   X X       X    
Quantity detection   X   X X   X   X X  
PoS tagging X X X X X X X X X X  
Phonetic encoding       X X            
WN sense annotation   X   X X   X       X
UKB sense disambiguation   X   X X           X
Shallow parsing X X   X X   X   X    
Full/dependency parsing X X   X X   X        
Coreference resolution         X            


FreeLing also includes WordNet-based sense dictionaries for some of the covered languages, as well as some knowledge extracted from WordNet, such as semantic file codes, or hypernymy relationships. See http://wordnet.princeton.edu and http://www.illc.uva.nl/EuroWordNet for details on WordNet and EuroWordNet, respectively.

See the Linguistic Data section on FreeLing webpage to find out more about the size and origin the linguistic resources for these languages.

See file COPYING in the distribution packages to find out the license of each third-party linguistic resource included in FreeLing packages.

Lluís Padró 2013-09-09