Concordia
#include <token_annotation.hpp>
Public Member Functions | |
TokenAnnotation (const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end, const int annotationType, const std::string &value) | |
virtual | ~TokenAnnotation () |
int | getType () const |
std::string | getValue () const |
Public Member Functions inherited from Interval | |
Interval (const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end) | |
virtual | ~Interval () |
bool | intersects (Interval &interval) |
bool | contains (Interval &interval) |
SUFFIX_MARKER_TYPE | getLength () |
SUFFIX_MARKER_TYPE | getStart () const |
SUFFIX_MARKER_TYPE | getEnd () const |
Static Public Attributes | |
static int | NE = 0 |
static int | WORD = 1 |
static int | HTML_TAG = 2 |
static int | STOP_WORD = 3 |
Class representing the annotation of a character sequence as a token. It is an Interval that additionally stores the annotation type and value.
TokenAnnotation::TokenAnnotation (const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end, const int annotationType, const std::string &value)
Constructor.
start | start index of the annotation (char-level, 0-based) |
end | end index of the annotation (char-level, 0-based) |
annotationType | annotation type, e.g. one of the static constants NE, WORD, HTML_TAG, STOP_WORD |
value | annotation value |
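A minimal sketch of constructing an annotation and reading it back through the getters. The real `token_annotation.hpp` is not reproduced here, so the snippet defines stand-in `Interval` and `TokenAnnotation` classes that mirror only the signatures listed on this page. `SUFFIX_MARKER_TYPE` is assumed to be an unsigned integer typedef (its actual definition lives elsewhere in Concordia), and `describe` and `example` are hypothetical helpers, not part of the library.

```cpp
#include <cassert>
#include <sstream>
#include <string>

// Assumption: SUFFIX_MARKER_TYPE is defined elsewhere in Concordia; an
// unsigned integer type is substituted here only so the sketch compiles.
typedef unsigned long SUFFIX_MARKER_TYPE;

// Stand-in for Interval: only the documented constructor and getters.
class Interval {
public:
    Interval(const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end)
        : _start(start), _end(end) {}
    virtual ~Interval() {}
    SUFFIX_MARKER_TYPE getStart() const { return _start; }
    SUFFIX_MARKER_TYPE getEnd() const { return _end; }
private:
    SUFFIX_MARKER_TYPE _start;
    SUFFIX_MARKER_TYPE _end;
};

// Stand-in for TokenAnnotation with the members documented on this page.
class TokenAnnotation : public Interval {
public:
    static int NE;
    static int WORD;
    static int HTML_TAG;
    static int STOP_WORD;

    TokenAnnotation(const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end,
                    const int annotationType, const std::string &value)
        : Interval(start, end), _type(annotationType), _value(value) {}
    virtual ~TokenAnnotation() {}
    int getType() const { return _type; }
    std::string getValue() const { return _value; }
private:
    int _type;
    std::string _value;
};

// Values as listed under "Static Public Attributes".
int TokenAnnotation::NE = 0;
int TokenAnnotation::WORD = 1;
int TokenAnnotation::HTML_TAG = 2;
int TokenAnnotation::STOP_WORD = 3;

// Hypothetical helper: renders an annotation as a one-line summary.
std::string describe(const TokenAnnotation &annotation) {
    std::ostringstream out;
    out << "type=" << annotation.getType()
        << " start=" << annotation.getStart()
        << " end=" << annotation.getEnd()
        << " value=" << annotation.getValue();
    return out.str();
}

// Usage: build a WORD annotation and check what the getters report.
void example() {
    TokenAnnotation word(0, 5, TokenAnnotation::WORD, "hello");
    assert(word.getStart() == 0);
    assert(word.getEnd() == 5);
    assert(word.getType() == TokenAnnotation::WORD);
    assert(word.getValue() == "hello");
}
```

The sketch makes no claim about whether `end` is inclusive or exclusive; it only stores and returns the indices it was given.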
virtual TokenAnnotation::~TokenAnnotation ()
Destructor.
inline int TokenAnnotation::getType () const
Getter for annotation type.
inline std::string TokenAnnotation::getValue () const
Getter for annotation value.
static int TokenAnnotation::HTML_TAG = 2
HTML tag annotation type.
static int TokenAnnotation::NE = 0
Named entity annotation type.
static int TokenAnnotation::STOP_WORD = 3
Stop word annotation type.
static int TokenAnnotation::WORD = 1
Word annotation type.
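The type constants are most useful for telling annotations apart after tokenization. The following sketch, under stated assumptions, collects the values of all annotations of one type. The `TokenAnnotation` class below is a reduced stand-in mirroring only the members documented on this page (the `Interval` base is omitted for brevity), `SUFFIX_MARKER_TYPE` is assumed to be an unsigned integer typedef, and `valuesOfType` and `example` are hypothetical helpers rather than Concordia functions.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Assumption: stand-in for Concordia's real SUFFIX_MARKER_TYPE definition.
typedef unsigned long SUFFIX_MARKER_TYPE;

// Reduced stand-in for TokenAnnotation (Interval base class left out).
class TokenAnnotation {
public:
    static int NE;
    static int WORD;
    static int HTML_TAG;
    static int STOP_WORD;

    TokenAnnotation(const SUFFIX_MARKER_TYPE start, const SUFFIX_MARKER_TYPE end,
                    const int annotationType, const std::string &value)
        : _start(start), _end(end), _type(annotationType), _value(value) {}
    virtual ~TokenAnnotation() {}
    SUFFIX_MARKER_TYPE getStart() const { return _start; }
    SUFFIX_MARKER_TYPE getEnd() const { return _end; }
    int getType() const { return _type; }
    std::string getValue() const { return _value; }
private:
    SUFFIX_MARKER_TYPE _start;
    SUFFIX_MARKER_TYPE _end;
    int _type;
    std::string _value;
};

// Values as listed under "Static Public Attributes".
int TokenAnnotation::NE = 0;
int TokenAnnotation::WORD = 1;
int TokenAnnotation::HTML_TAG = 2;
int TokenAnnotation::STOP_WORD = 3;

// Hypothetical helper: returns the values of all annotations whose
// getType() equals the requested type, in their original order.
std::vector<std::string> valuesOfType(
        const std::vector<TokenAnnotation> &annotations, const int type) {
    std::vector<std::string> result;
    for (std::size_t i = 0; i < annotations.size(); ++i) {
        if (annotations[i].getType() == type) {
            result.push_back(annotations[i].getValue());
        }
    }
    return result;
}

// Usage: keep only WORD annotations from a mixed list.
void example() {
    std::vector<TokenAnnotation> annotations;
    annotations.push_back(TokenAnnotation(0, 3, TokenAnnotation::STOP_WORD, "the"));
    annotations.push_back(TokenAnnotation(4, 7, TokenAnnotation::WORD, "cat"));
    annotations.push_back(TokenAnnotation(8, 11, TokenAnnotation::HTML_TAG, "<b>"));

    std::vector<std::string> words =
        valuesOfType(annotations, TokenAnnotation::WORD);
    assert(words.size() == 1);
    assert(words[0] == "cat");
}
```

Comparing against the named constants rather than the raw integers keeps calling code readable and independent of the specific numeric values.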