home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CAC: POS Tags: SYM

There are 1 SYM lemmas (0%), 1 SYM types (0%) and 3570 SYM tokens (1%). Out of 16 observed tags, the rank of SYM is: 16 in number of lemmas, 16 in number of types and 14 in number of tokens.

The 10 most frequent SYM lemmas: &cwildcard;

The 10 most frequent SYM types: *

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: * (SYM 3570, ADP 3)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.185616).

The 1st highest number of forms (1) was observed with the lemma “&cwildcard;”: *.

SYM occurs with 1 features: Abbr (3570; 100% instances)

SYM occurs with 1 feature-value pairs: Abbr=Yes

SYM occurs with 1 feature combinations. The most frequent feature combination is Abbr=Yes (3570 tokens). Examples: *

Relations

SYM nodes are attached to their parents using 21 different relations: nmod (2474; 69% instances), advmod (367; 10% instances), conj (262; 7% instances), nsubj (141; 4% instances), obj (82; 2% instances), root (80; 2% instances), obl:arg (27; 1% instances), cc (25; 1% instances), dep (24; 1% instances), orphan (24; 1% instances), nsubj:pass (19; 1% instances), appos (14; 0% instances), acl:relcl (9; 0% instances), advcl (6; 0% instances), xcomp (4; 0% instances), advmod:emph (3; 0% instances), mark (3; 0% instances), acl (2; 0% instances), ccomp (2; 0% instances), iobj (1; 0% instances), parataxis (1; 0% instances)

Parents of SYM nodes belong to 12 different parts of speech: NOUN (1202; 34% instances), NUM (971; 27% instances), PROPN (467; 13% instances), VERB (405; 11% instances), ADJ (227; 6% instances), SYM (165; 5% instances), (80; 2% instances), ADV (22; 1% instances), DET (15; 0% instances), AUX (13; 0% instances), PRON (2; 0% instances), ADP (1; 0% instances)

2098 (59%) SYM nodes are leaves.

847 (24%) SYM nodes have one child.

337 (9%) SYM nodes have two children.

288 (8%) SYM nodes have three or more children.

The highest child degree of a SYM node is 10.

Children of SYM nodes are attached using 30 different relations: case (599; 22% instances), nmod (402; 15% instances), punct (251; 9% instances), cc (225; 8% instances), nummod (212; 8% instances), conj (211; 8% instances), amod (154; 6% instances), nsubj (116; 4% instances), cop (86; 3% instances), obl (72; 3% instances), advmod (48; 2% instances), advmod:emph (43; 2% instances), mark (38; 1% instances), det (30; 1% instances), obj (27; 1% instances), obl:arg (23; 1% instances), acl:relcl (21; 1% instances), dep (19; 1% instances), orphan (19; 1% instances), expl:pv (16; 1% instances), aux (12; 0% instances), xcomp (11; 0% instances), expl:pass (7; 0% instances), advcl (6; 0% instances), nsubj:pass (5; 0% instances), appos (4; 0% instances), csubj (4; 0% instances), acl (3; 0% instances), parataxis (3; 0% instances), ccomp (2; 0% instances)

Children of SYM nodes belong to 15 different parts of speech: ADP (595; 22% instances), NOUN (594; 22% instances), PUNCT (251; 9% instances), NUM (224; 8% instances), CCONJ (204; 8% instances), ADJ (178; 7% instances), SYM (165; 6% instances), AUX (99; 4% instances), ADV (89; 3% instances), VERB (63; 2% instances), DET (51; 2% instances), PRON (47; 2% instances), PART (38; 1% instances), PROPN (37; 1% instances), SCONJ (34; 1% instances)