home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: PUNCT

There are 121 PUNCT lemmas (0%), 121 PUNCT types (0%) and 382774 PUNCT tokens (22%). Out of 17 observed tags, the rank of PUNCT is: 12 in number of lemmas, 14 in number of types and 1 in number of tokens.

The 10 most frequent PUNCT lemmas: ,, ., –, «, », !, :, ), ?, (

The 10 most frequent PUNCT types: ,, ., –, «, », !, :, ), ?, (

The 10 most frequent ambiguous lemmas: : (PUNCT 9970, SYM 5), ) (PUNCT 8386, SYM 85), ( (PUNCT 8214, SYM 4), - (PUNCT 5584, SYM 11, ADJ 1), ; (PUNCT 2348, SYM 1), (PUNCT 504, SYM 18), / (PUNCT 291, SYM 11, ADP 7), * (PUNCT 107, SYM 57, X 9, CCONJ 1), > (PUNCT 81, SYM 13), «/em> (PUNCT 76, SYM 6)

The 10 most frequent ambiguous types: : (PUNCT 9970, SYM 5), ) (PUNCT 8386, SYM 85), ( (PUNCT 8214, SYM 4), - (PUNCT 5583, SYM 11, ADJ 1), ; (PUNCT 2348, SYM 1), (PUNCT 504, SYM 18), / (PUNCT 291, SYM 11, ADP 7), * (PUNCT 107, SYM 57, X 9, CCONJ 1), > (PUNCT 81, SYM 13), «/em> (PUNCT 76, SYM 6)

Morphology

The form / lemma ratio of PUNCT is 1.000000 (the average of all parts of speech is 2.706171).

The 1st highest number of forms (2) was observed with the lemma “-”: -, –.

The 2nd highest number of forms (2) was observed with the lemma ““”: “, ”.

The 3rd highest number of forms (1) was observed with the lemma “!”: !.

PUNCT does not occur with any features.

Relations

PUNCT nodes are attached to their parents using 2 different relations: punct (382730; 100% instances), root (44; 0% instances)

Parents of PUNCT nodes belong to 18 different parts of speech: VERB (194696; 51% instances), NOUN (98069; 26% instances), ADJ (34255; 9% instances), PROPN (15693; 4% instances), ADV (13558; 4% instances), PRON (7531; 2% instances), PART (5061; 1% instances), X (4975; 1% instances), DET (2671; 1% instances), INTJ (2650; 1% instances), NUM (2640; 1% instances), ADP (390; 0% instances), AUX (193; 0% instances), CCONJ (140; 0% instances), SCONJ (88; 0% instances), SYM (63; 0% instances), PUNCT (57; 0% instances), (44; 0% instances)

382744 (100%) PUNCT nodes are leaves.

3 (0%) PUNCT nodes have one child.

27 (0%) PUNCT nodes have two children.

The highest child degree of a PUNCT node is 2.

Children of PUNCT nodes are attached using 1 different relations: punct (57; 100% instances)

Children of PUNCT nodes belong to 1 different parts of speech: PUNCT (57; 100% instances)