home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: SYM

There are 12 SYM lemmas (0%), 12 SYM types (0%) and 379 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 15 in number of lemmas, 16 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, =, +, *, x, §, /, >, <, .

The 10 most frequent SYM types: %, =, +, *, x, §, /, >, <, .

The 10 most frequent ambiguous lemmas: * (PUNCT 148, SYM 20), x (NOUN 32, SYM 19), / (PUNCT 118, SYM 6), . (PUNCT 18491, SYM 2), : (PUNCT 1296, SYM 2), - (PUNCT 2364, SYM 1)

The 10 most frequent ambiguous types: * (PUNCT 148, SYM 20), x (NOUN 32, SYM 19), / (PUNCT 118, SYM 6), . (PUNCT 18491, SYM 2), : (PUNCT 1296, SYM 2), - (PUNCT 2364, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.964432).

The 1st highest number of forms (1) was observed with the lemma “%”: %.

The 2nd highest number of forms (1) was observed with the lemma “*”: *.

The 3rd highest number of forms (1) was observed with the lemma “+”: +.

SYM occurs with 1 features: ConjType (42; 11% instances)

SYM occurs with 1 feature-value pairs: ConjType=Oper

SYM occurs with 2 feature combinations. The most frequent feature combination is _ (337 tokens). Examples: %, =, +, *, §, /

Relations

SYM nodes are attached to their parents using 16 different relations: nmod (237; 63% instances), cc (37; 10% instances), root (28; 7% instances), flat (17; 4% instances), advmod (12; 3% instances), parataxis (9; 2% instances), obj (8; 2% instances), conj (7; 2% instances), advcl (5; 1% instances), case (5; 1% instances), nsubj (5; 1% instances), dep (4; 1% instances), appos (2; 1% instances), csubj (1; 0% instances), obl:arg (1; 0% instances), orphan (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: NUM (232; 61% instances), NOUN (61; 16% instances), (28; 7% instances), VERB (25; 7% instances), SYM (11; 3% instances), ADJ (10; 3% instances), PROPN (6; 2% instances), ADV (3; 1% instances), X (2; 1% instances), AUX (1; 0% instances)

192 (51%) SYM nodes are leaves.

98 (26%) SYM nodes have one child.

16 (4%) SYM nodes have two children.

73 (19%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 21 different relations: punct (94; 22% instances), nmod (85; 20% instances), obj (57; 13% instances), nummod (43; 10% instances), nsubj (39; 9% instances), case (24; 6% instances), obl (24; 6% instances), parataxis (10; 2% instances), conj (8; 2% instances), advmod:emph (7; 2% instances), mark (7; 2% instances), cc (6; 1% instances), cop (6; 1% instances), advcl (5; 1% instances), advmod (5; 1% instances), acl:relcl (3; 1% instances), flat (3; 1% instances), appos (2; 0% instances), aux (2; 0% instances), dep (2; 0% instances), amod (1; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: NOUN (128; 30% instances), NUM (119; 27% instances), PUNCT (94; 22% instances), ADP (24; 6% instances), VERB (15; 3% instances), SYM (11; 3% instances), ADV (8; 2% instances), AUX (8; 2% instances), CCONJ (7; 2% instances), SCONJ (7; 2% instances), ADJ (4; 1% instances), DET (3; 1% instances), PART (3; 1% instances), PRON (1; 0% instances), PROPN (1; 0% instances)