home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: POS Tags: SYM

There are 1 SYM lemmas (0%), 1 SYM types (0%) and 3570 SYM tokens (1%). Out of 16 observed tags, the rank of SYM is: 16 in number of lemmas, 16 in number of types and 14 in number of tokens.

The 10 most frequent SYM lemmas: &cwildcard;

The 10 most frequent SYM types: *

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: * (SYM 3570, ADP 3)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.185616).

The 1st highest number of forms (1) was observed with the lemma “&cwildcard;”: *.

SYM occurs with 1 features: Abbr (3570; 100% instances)

SYM occurs with 1 feature-value pairs: Abbr=Yes

SYM occurs with 1 feature combinations. The most frequent feature combination is Abbr=Yes (3570 tokens). Examples: *

Relations

SYM nodes are attached to their parents using 20 different relations: nmod (2474; 69% instances), advmod (405; 11% instances), conj (256; 7% instances), nsubj (141; 4% instances), obj (82; 2% instances), root (53; 1% instances), obl:arg (28; 1% instances), cc (25; 1% instances), dep (24; 1% instances), orphan (23; 1% instances), nsubj:pass (19; 1% instances), appos (14; 0% instances), acl:relcl (7; 0% instances), advcl (6; 0% instances), xcomp (4; 0% instances), advmod:emph (3; 0% instances), mark (3; 0% instances), acl (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)

Parents of SYM nodes belong to 12 different parts of speech: NOUN (1190; 33% instances), NUM (971; 27% instances), PROPN (466; 13% instances), VERB (402; 11% instances), ADJ (224; 6% instances), SYM (161; 5% instances), AUX (73; 2% instances), (53; 1% instances), DET (15; 0% instances), ADV (13; 0% instances), ADP (1; 0% instances), PRON (1; 0% instances)

2099 (59%) SYM nodes are leaves.

877 (25%) SYM nodes have one child.

344 (10%) SYM nodes have two children.

250 (7%) SYM nodes have three or more children.

The highest child degree of a SYM node is 10.

Children of SYM nodes are attached using 30 different relations: case (599; 24% instances), nmod (402; 16% instances), cc (221; 9% instances), punct (212; 8% instances), nummod (211; 8% instances), conj (207; 8% instances), amod (154; 6% instances), nsubj (81; 3% instances), obl (68; 3% instances), advmod (47; 2% instances), cop (47; 2% instances), advmod:emph (43; 2% instances), mark (35; 1% instances), det (30; 1% instances), obj (27; 1% instances), obl:arg (23; 1% instances), acl:relcl (21; 1% instances), orphan (19; 1% instances), dep (17; 1% instances), expl:pv (16; 1% instances), aux (12; 0% instances), xcomp (11; 0% instances), expl:pass (7; 0% instances), advcl (6; 0% instances), nsubj:pass (5; 0% instances), appos (4; 0% instances), acl (3; 0% instances), csubj (3; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: ADP (595; 23% instances), NOUN (555; 22% instances), NUM (222; 9% instances), PUNCT (212; 8% instances), CCONJ (200; 8% instances), ADJ (177; 7% instances), SYM (161; 6% instances), ADV (88; 3% instances), AUX (63; 2% instances), VERB (59; 2% instances), DET (50; 2% instances), PRON (46; 2% instances), PART (38; 1% instances), PROPN (37; 1% instances), SCONJ (31; 1% instances)