Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: PUNCT
There are 24 PUNCT lemmas (0%), 23 PUNCT types (0%) and 19308 PUNCT tokens (18%).
Out of 16 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.
The 10 most frequent PUNCT lemmas: ,, ., “, -, ?, !, :, –, ), ;
The 10 most frequent PUNCT types: ,, ., “, -, ?, !, :, –, ), ;
The 10 most frequent ambiguous lemmas: . (PUNCT 6284, X 1), - (PUNCT 426, X 36), – (PUNCT 196, X 14), _ (X 22, PUNCT 1)
The 10 most frequent ambiguous types: . (PUNCT 6283, X 1)
- .
Morphology
The form / lemma ratio of PUNCT is 0.958333 (the average of all parts of speech is 1.931827).
The 1st highest number of forms (2) was observed with the lemma “-”: ,, -.
The 2nd highest number of forms (2) was observed with the lemma “.”: ,, ..
The 3rd highest number of forms (1) was observed with the lemma “!”: !.
PUNCT occurs with 1 features: Typo (6; 0% instances)
PUNCT occurs with 1 feature-value pairs: Typo=Yes
PUNCT occurs with 2 feature combinations.
The most frequent feature combination is _ (19302 tokens).
Examples: ,, ., “, -, ?, !, :, –, ), ;
Relations
PUNCT nodes are attached to their parents using 1 different relations: punct (19308; 100% instances)
Parents of PUNCT nodes belong to 15 different parts of speech: VERB (9536; 49% instances), NOUN (4901; 25% instances), ADJ (1318; 7% instances), ADV (1280; 7% instances), PROPN (1203; 6% instances), NUM (358; 2% instances), PART (224; 1% instances), PRON (203; 1% instances), DET (55; 0% instances), INTJ (50; 0% instances), AUX (45; 0% instances), X (41; 0% instances), ADP (40; 0% instances), CCONJ (29; 0% instances), SCONJ (25; 0% instances)
19308 (100%) PUNCT nodes are leaves.
The highest child degree of a PUNCT node is 0.