Treebank Statistics: UD_Romanian-SiMoNERo: POS Tags: PUNCT
There are 28 PUNCT
lemmas (0%), 28 PUNCT
types (0%) and 19614 PUNCT
tokens (13%).
Out of 16 observed tags, the rank of PUNCT
is: 11 in number of lemmas, 12 in number of types and 3 in number of tokens.
The 10 most frequent PUNCT
lemmas: ,, ., ), (, /, -, ;, –, :, %
The 10 most frequent PUNCT
types: ,, ., ), (, /, -, ;, –, :, %
The 10 most frequent ambiguous lemmas:
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of PUNCT
is 1.000000 (the average of all parts of speech is 1.666637).
The 1st highest number of forms (1) was observed with the lemma “%”: %.
The 2nd highest number of forms (1) was observed with the lemma “&”: &.
The 3rd highest number of forms (1) was observed with the lemma “(”: (.
PUNCT
occurs with 1 features: AdpType (596; 3% instances)
PUNCT
occurs with 1 feature-value pairs: AdpType=Prep
PUNCT
occurs with 2 feature combinations.
The most frequent feature combination is _
(19018 tokens).
Examples: ,, ., ), (, -, –, :, %, ], [
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (19614; 100% instances)
Parents of PUNCT
nodes belong to 16 different parts of speech: NOUN (7887; 40% instances), VERB (6108; 31% instances), NUM (2722; 14% instances), ADJ (1322; 7% instances), ADV (523; 3% instances), X (337; 2% instances), ADP (332; 2% instances), PRON (146; 1% instances), PROPN (141; 1% instances), CCONJ (48; 0% instances), AUX (31; 0% instances), DET (11; 0% instances), INTJ (2; 0% instances), PART (2; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)
19613 (100%) PUNCT
nodes are leaves.
1 (0%) PUNCT
nodes have one child.
The highest child degree of a PUNCT
node is 1.
Children of PUNCT
nodes are attached using 1 different relations: punct (1; 100% instances)
Children of PUNCT
nodes belong to 1 different parts of speech: PUNCT (1; 100% instances)