Treebank Statistics: UD_Old_East_Slavic-Birchbark: POS Tags: PUNCT
There are 41 PUNCT lemmas (1%), 48 PUNCT types (0%) and 4095 PUNCT tokens (15%).
Out of 17 observed tags, the rank of PUNCT is: 9 in number of lemmas, 15 in number of types and 2 in number of tokens.
The 10 most frequent PUNCT lemmas: ·, …, :, (…), ⁞, ∙, |, ., ·:·, ·:
The 10 most frequent PUNCT types: ·, …, :, (…), ⁞, ∙, |, ., ·:·, :]
The 10 most frequent ambiguous lemmas: … (PUNCT 1268, X 5), ·-· (PUNCT 3, NUM 2), (-) (PUNCT 2, X 2), :-: (PUNCT 2, NUM 1), :-҃: (PUNCT 2, NUM 1), _ (X 26, NUM 3, VERB 3, PROPN 2, PUNCT 2, DET 1), —– (X 11, PUNCT 1)
The 10 most frequent ambiguous types: … (PUNCT 1260, X 10), (-) (PUNCT 2, X 2), :-҃: (PUNCT 2, NUM 1), —– (X 11, PUNCT 1)
- …
- (-)
- :-҃:
- —–
Morphology
The form / lemma ratio of PUNCT is 1.170732 (the average of all parts of speech is 2.421872).
The 1st highest number of forms (5) was observed with the lemma “:”: :, :), :], [:, ·.
The 2nd highest number of forms (4) was observed with the lemma “·”: (·, [·, ·, ·].
The 3rd highest number of forms (3) was observed with the lemma “…”: (…, …, …).
PUNCT does not occur with any features.
Relations
PUNCT nodes are attached to their parents using 1 different relations: punct (4095; 100% instances)
Parents of PUNCT nodes belong to 15 different parts of speech: NOUN (1226; 30% instances), VERB (1195; 29% instances), PROPN (843; 21% instances), X (260; 6% instances), ADJ (181; 4% instances), PRON (131; 3% instances), NUM (104; 3% instances), DET (79; 2% instances), ADV (28; 1% instances), ADP (18; 0% instances), PART (11; 0% instances), CCONJ (8; 0% instances), AUX (5; 0% instances), SCONJ (5; 0% instances), SYM (1; 0% instances)
4095 (100%) PUNCT nodes are leaves.
The highest child degree of a PUNCT node is 0.