home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian: POS Tags: PUNCT

There are 17 PUNCT lemmas (0%), 16 PUNCT types (0%) and 18810 PUNCT tokens (19%). Out of 16 observed tags, the rank of PUNCT is: 14 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., –, ), (, ``, '', -, :, ;

The 10 most frequent PUNCT types: ,, ., –, ), (, ``, '', -, :, ;

The 10 most frequent ambiguous lemmas: (PUNCT 4, SYM 2)

The 10 most frequent ambiguous types: (PUNCT 4, SYM 2)

Morphology

The form / lemma ratio of PUNCT is 0.941176 (the average of all parts of speech is 1.591329).

The 1st highest number of forms (1) was observed with the lemma “!”: !.

The 2nd highest number of forms (1) was observed with the lemma “''”: ''.

The 3rd highest number of forms (1) was observed with the lemma “’”: APOSTROPHE.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 10 different relations: punct (17700; 94% instances), goeswith (1091; 6% instances), obl (6; 0% instances), parataxis (3; 0% instances), cc (2; 0% instances), conj (2; 0% instances), flat (2; 0% instances), nmod (2; 0% instances), appos (1; 0% instances), nummod (1; 0% instances)

Parents of PUNCT nodes belong to 16 different parts of speech: VERB (7501; 40% instances), NOUN (6176; 33% instances), PROPN (2054; 11% instances), ADV (1240; 7% instances), ADJ (1166; 6% instances), NUM (335; 2% instances), ADP (129; 1% instances), PRON (80; 0% instances), SYM (47; 0% instances), DET (23; 0% instances), AUX (19; 0% instances), CCONJ (16; 0% instances), PUNCT (13; 0% instances), PART (7; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances)

18775 (100%) PUNCT nodes are leaves.

19 (0%) PUNCT nodes have one child.

7 (0%) PUNCT nodes have two children.

9 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 5.

Children of PUNCT nodes are attached using 15 different relations: goeswith (15; 21% instances), case (14; 20% instances), punct (13; 19% instances), nmod (11; 16% instances), amod (4; 6% instances), acl (2; 3% instances), ccomp (2; 3% instances), nsubj (2; 3% instances), acl:relcl (1; 1% instances), advmod (1; 1% instances), conj (1; 1% instances), det (1; 1% instances), mark (1; 1% instances), nummod:gov (1; 1% instances), obl (1; 1% instances)

Children of PUNCT nodes belong to 11 different parts of speech: ADP (19; 27% instances), NOUN (15; 21% instances), PUNCT (13; 19% instances), VERB (6; 9% instances), ADJ (5; 7% instances), ADV (3; 4% instances), PROPN (3; 4% instances), NUM (2; 3% instances), PRON (2; 3% instances), DET (1; 1% instances), SCONJ (1; 1% instances)