home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ukrainian-ParlaMint: POS Tags: PUNCT

There are 24 PUNCT lemmas (0%), 23 PUNCT types (0%) and 19308 PUNCT tokens (18%). Out of 16 observed tags, the rank of PUNCT is: 13 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., “, -, ?, !, :, –, ), ;

The 10 most frequent PUNCT types: ,, ., “, -, ?, !, :, –, ), ;

The 10 most frequent ambiguous lemmas: . (PUNCT 6284, X 1), - (PUNCT 426, X 36), (PUNCT 196, X 14), _ (X 22, PUNCT 1)

The 10 most frequent ambiguous types: . (PUNCT 6283, X 1)

Morphology

The form / lemma ratio of PUNCT is 0.958333 (the average of all parts of speech is 1.931827).

The 1st highest number of forms (2) was observed with the lemma “-”: ,, -.

The 2nd highest number of forms (2) was observed with the lemma “.”: ,, ..

The 3rd highest number of forms (1) was observed with the lemma “!”: !.

PUNCT occurs with 1 features: Typo (6; 0% instances)

PUNCT occurs with 1 feature-value pairs: Typo=Yes

PUNCT occurs with 2 feature combinations. The most frequent feature combination is _ (19302 tokens). Examples: ,, ., “, -, ?, !, :, –, ), ;

Relations

PUNCT nodes are attached to their parents using 1 different relations: punct (19308; 100% instances)

Parents of PUNCT nodes belong to 15 different parts of speech: VERB (9536; 49% instances), NOUN (4901; 25% instances), ADJ (1318; 7% instances), ADV (1280; 7% instances), PROPN (1203; 6% instances), NUM (358; 2% instances), PART (224; 1% instances), PRON (203; 1% instances), DET (55; 0% instances), INTJ (50; 0% instances), AUX (45; 0% instances), X (41; 0% instances), ADP (40; 0% instances), CCONJ (29; 0% instances), SCONJ (25; 0% instances)

19308 (100%) PUNCT nodes are leaves.

The highest child degree of a PUNCT node is 0.