Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: PUNCT
There are 35 PUNCT
lemmas (1%), 43 PUNCT
types (0%) and 3973 PUNCT
tokens (15%).
Out of 17 observed tags, the rank of PUNCT
is: 10 in number of lemmas, 15 in number of types and 2 in number of tokens.
The 10 most frequent PUNCT
lemmas: ·, …, :, (…), ⁞, |, ·:·, ·:, [·], ·
The 10 most frequent PUNCT
types: ·, …, :, (…), ⁞, |, ·:·, :], ·:, [·]
The 10 most frequent ambiguous lemmas: … (PUNCT 1219, X 5), ·-· (PUNCT 3, NUM 2), (-) (PUNCT 2, X 2), :-: (PUNCT 2, NUM 1), :-҃: (PUNCT 2, NUM 1), —– (X 11, PUNCT 1)
The 10 most frequent ambiguous types: … (PUNCT 1210, X 10), (-) (PUNCT 2, X 2), :-҃: (PUNCT 2, NUM 1), —– (X 11, PUNCT 1)
- …
- (-)
- :-҃:
- —–
Morphology
The form / lemma ratio of PUNCT
is 1.228571 (the average of all parts of speech is 2.410435).
The 1st highest number of forms (5) was observed with the lemma “:”: :, :), :], [:, ·.
The 2nd highest number of forms (4) was observed with the lemma “·”: (·, [·, ·, ·].
The 3rd highest number of forms (3) was observed with the lemma “…”: (…, …, …).
PUNCT
does not occur with any features.
Relations
PUNCT
nodes are attached to their parents using 1 different relations: punct (3973; 100% instances)
Parents of PUNCT
nodes belong to 15 different parts of speech: NOUN (1193; 30% instances), VERB (1162; 29% instances), PROPN (813; 20% instances), X (258; 6% instances), ADJ (176; 4% instances), PRON (123; 3% instances), NUM (100; 3% instances), DET (78; 2% instances), ADV (27; 1% instances), ADP (17; 0% instances), PART (11; 0% instances), CCONJ (8; 0% instances), SCONJ (5; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)
3973 (100%) PUNCT
nodes are leaves.
The highest child degree of a PUNCT
node is 0.