FreeLing  3.1
Public Member Functions | Private Member Functions | Private Attributes
freeling::splitter Class Reference

Class splitter implements a sentence splitter, which accumulates lists of words until a sentence is completed, and then returns a list of sentence objects. More...

#include <splitter.h>

Collaboration diagram for freeling::splitter:
Collaboration graph
[legend]

List of all members.

Public Member Functions

 splitter (const std::wstring &)
 Constructor.
void split (const std::list< word > &, bool, std::list< sentence > &ls)
 split sentences with default options
std::list< sentencesplit (const std::list< word > &, bool)

Private Member Functions

bool end_of_sentence (std::list< word >::const_iterator, const std::list< word > &) const
 check for sentence markers

Private Attributes

bool SPLIT_AllowBetweenMarkers
 configuration options
int SPLIT_MaxWords
std::set< std::wstring > starters
 Sentence delimiters.
std::map< std::wstring, boolenders
std::map< std::wstring, intmarkers
 Open-close marker pairs (parenthesis, etc)
bool betweenMrk
int no_split_count
std::list< intmark_type
std::list< std::wstring > mark_form
sentence buffer
 accumulated list of returned sentences

Detailed Description

Class splitter implements a sentence splitter, which accumulates lists of words until a sentence is completed, and then returns a list of sentence objects.


Constructor & Destructor Documentation

freeling::splitter::splitter ( const std::wstring &  )

Member Function Documentation

bool freeling::splitter::end_of_sentence ( std::list< word >::const_iterator  ,
const std::list< word > &   
) const [private]

check for sentence markers

Check whether a word is a sentence end (eg a dot followed by a capitalized word).

void freeling::splitter::split ( const std::list< word > &  ,
bool  ,
std::list< sentence > &  ls 
)

split sentences with default options

std::list<sentence> freeling::splitter::split ( const std::list< word > &  ,
bool   
)

Member Data Documentation

accumulated list of returned sentences

accumulated words of current sentence

std::map<std::wstring,bool> freeling::splitter::enders [private]
std::list<std::wstring> freeling::splitter::mark_form [private]
std::list<int> freeling::splitter::mark_type [private]
std::map<std::wstring,int> freeling::splitter::markers [private]

Open-close marker pairs (parenthesis, etc)

configuration options

std::set<std::wstring> freeling::splitter::starters [private]

Sentence delimiters.


The documentation for this class was generated from the following files: