home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-GSD: POS Tags: PUNCT

There are 18 PUNCT lemmas (0%), 17 PUNCT types (0%) and 18819 PUNCT tokens (19%). Out of 16 observed tags, the rank of PUNCT is: 14 in number of lemmas, 14 in number of types and 2 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., –, ), (, ``, '', -, :, ;

The 10 most frequent PUNCT types: ,, ., –, ), (, ``, '', -, :, ;

The 10 most frequent ambiguous lemmas: / (SYM 37, PUNCT 9), (PUNCT 4, SYM 2)

The 10 most frequent ambiguous types: / (SYM 37, PUNCT 9), (PUNCT 4, SYM 2)

Morphology

The form / lemma ratio of PUNCT is 0.944444 (the average of all parts of speech is 1.592402).

The 1st highest number of forms (1) was observed with the lemma “!”: !.

The 2nd highest number of forms (1) was observed with the lemma “''”: ''.

The 3rd highest number of forms (1) was observed with the lemma “’”: APOSTROPHE.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 11 different relations: punct (17555; 93% instances), goeswith (1085; 6% instances), case (161; 1% instances), cc (5; 0% instances), obl (4; 0% instances), parataxis (3; 0% instances), flat (2; 0% instances), appos (1; 0% instances), conj (1; 0% instances), nmod (1; 0% instances), nummod (1; 0% instances)

Parents of PUNCT nodes belong to 16 different parts of speech: VERB (7434; 40% instances), NOUN (6147; 33% instances), PROPN (2058; 11% instances), ADV (1243; 7% instances), ADJ (1181; 6% instances), NUM (329; 2% instances), AUX (148; 1% instances), ADP (84; 0% instances), PRON (84; 0% instances), SYM (48; 0% instances), DET (24; 0% instances), CCONJ (15; 0% instances), PUNCT (13; 0% instances), PART (7; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances)

18786 (100%) PUNCT nodes are leaves.

19 (0%) PUNCT nodes have one child.

7 (0%) PUNCT nodes have two children.

7 (0%) PUNCT nodes have three or more children.

The highest child degree of a PUNCT node is 5.

Children of PUNCT nodes are attached using 15 different relations: punct (13; 21% instances), case (12; 20% instances), goeswith (11; 18% instances), nmod (10; 16% instances), acl (2; 3% instances), amod (2; 3% instances), ccomp (2; 3% instances), nsubj (2; 3% instances), acl:relcl (1; 2% instances), advmod (1; 2% instances), conj (1; 2% instances), det (1; 2% instances), mark (1; 2% instances), nummod:gov (1; 2% instances), obl (1; 2% instances)

Children of PUNCT nodes belong to 11 different parts of speech: ADP (15; 25% instances), PUNCT (13; 21% instances), NOUN (12; 20% instances), VERB (6; 10% instances), ADJ (3; 5% instances), ADV (3; 5% instances), PROPN (3; 5% instances), NUM (2; 3% instances), PRON (2; 3% instances), DET (1; 2% instances), SCONJ (1; 2% instances)