
This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SYM

There are 93 SYM lemmas (0%), 93 SYM types (0%) and 557 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 16 in number of tokens.
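As a rough illustration of how counts and ranks of this kind can be derived, the sketch below tallies distinct lemmas, distinct types (word forms) and tokens per tag, then ranks one tag against the others. The function names `tag_stats` and `rank`, the toy rows, and the case-sensitive treatment of types are assumptions made for the example, not the procedure that generated this page.

```python
from collections import defaultdict

# Hypothetical helpers for illustration; not the tool that produced this page.
def tag_stats(rows):
    """rows: iterable of (form, lemma, upos).
    Returns {upos: (n_distinct_lemmas, n_distinct_types, n_tokens)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in rows:
        lemmas[upos].add(lemma)
        types[upos].add(form)   # assumption: types are compared case-sensitively
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}

def rank(stats, tag, index):
    """1-based rank of `tag` when tags are sorted by stats[tag][index], descending."""
    ordered = sorted(stats, key=lambda t: stats[t][index], reverse=True)
    return ordered.index(tag) + 1

# Toy data, invented for the example.
rows = [
    ("50", "50", "NUM"),
    ("%", "%", "SYM"), ("&", "&", "SYM"), ("%", "%", "SYM"),
    ("suns", "suns", "NOUN"), ("suni", "suns", "NOUN"),
    ("kaķis", "kaķis", "NOUN"), ("kaķis", "kaķis", "NOUN"),
]
stats = tag_stats(rows)
sym_rank_by_tokens = rank(stats, "SYM", 2)
```

In the toy data SYM has 2 lemmas, 2 types and 3 tokens, and ranks second by token count behind NOUN.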

The 10 most frequent SYM lemmas: %, , a, Re, &, u.c., utt., **, b, c

The 10 most frequent SYM types: %, , a, Re, &, u.c., utt., **, b, c

The 10 most frequent ambiguous lemmas: a (SYM 17, CCONJ 7, X 2), & (SYM 17, X 1), T (SYM 4, NOUN 1), AB.LV (PROPN 1, SYM 1), U (SYM 1, X 1), i (PART 6, CCONJ 2, NUM 1, SYM 1), m (NOUN 3, SYM 1), s (X 2, SYM 1), ā (INTJ 3, SYM 1)

The 10 most frequent ambiguous types: a (SYM 16, CCONJ 2, X 2), Re (SYM 20, INTJ 6), & (SYM 17, X 1), T (SYM 4, NOUN 1), AB.LV (PROPN 1, SYM 1), M (PROPN 1, SYM 1), N (NOUN 1, SYM 1), U (SYM 1, X 1), i (PART 6, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 2.340184).

The 1st highest number of forms (1) was observed with the lemma “%”: %.

The 2nd highest number of forms (1) was observed with the lemma “&”: &.

The 3rd highest number of forms (1) was observed with the lemma “*”: *.
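The form / lemma ratio and the per-lemma form counts above can be sketched as follows. This assumes the ratio is the number of distinct (lemma, form) pairs divided by the number of distinct lemmas, which is consistent with the 93 types and 93 lemmas reported for SYM but is not confirmed by the page itself. The names `forms_per_lemma` and `form_lemma_ratio` and the sample pairs are hypothetical.

```python
from collections import defaultdict

# Hypothetical helpers; the exact definition used by the page is an assumption.
def forms_per_lemma(pairs):
    """pairs: iterable of (form, lemma). Returns {lemma: set of distinct forms}."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms

def form_lemma_ratio(pairs):
    """Distinct (lemma, form) pairs divided by distinct lemmas."""
    forms = forms_per_lemma(pairs)
    if not forms:
        raise ValueError("no tokens given")
    return sum(len(f) for f in forms.values()) / len(forms)

# Toy data, invented for the example.
sym_pairs = [("%", "%"), ("&", "&"), ("%", "%"), ("*", "*")]
noun_pairs = [("suns", "suns"), ("suni", "suns"), ("suņa", "suns"), ("kaķis", "kaķis")]

sym_ratio = form_lemma_ratio(sym_pairs)
noun_ratio = form_lemma_ratio(noun_pairs)

# Lemma with the highest number of distinct forms among the toy nouns.
noun_forms = forms_per_lemma(noun_pairs)
richest_lemma = max(noun_forms, key=lambda lemma: len(noun_forms[lemma]))
```

A ratio of exactly 1.0, as for the toy symbols here, means every lemma appears in a single form, which is what one would expect of a non-inflecting class.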

SYM occurs with 2 features: Abbr (46; 8% instances), Typo (1; 0% instances)

SYM occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (511 tokens). Examples: %, , a, Re, &, **, b, c, +, L

Relations

SYM nodes are attached to their parents using 19 different relations: flat (68; 12% instances), conj (57; 10% instances), flat:name (55; 10% instances), parataxis (55; 10% instances), nsubj (48; 9% instances), root (48; 9% instances), obl (44; 8% instances), nmod (33; 6% instances), discourse (31; 6% instances), iobj (29; 5% instances), amod (28; 5% instances), dep (27; 5% instances), acl (9; 2% instances), advmod (9; 2% instances), nsubj:pass (6; 1% instances), xcomp (4; 1% instances), advcl (2; 0% instances), cc (2; 0% instances), orphan (2; 0% instances)

Parents of SYM nodes belong to 11 different parts of speech: VERB (168; 30% instances), NOUN (155; 28% instances), SYM (81; 15% instances), no part of speech, i.e. the artificial root node (48; 9% instances), PROPN (46; 8% instances), ADJ (22; 4% instances), NUM (20; 4% instances), X (9; 2% instances), ADV (6; 1% instances), PART (1; 0% instances), PRON (1; 0% instances)
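A minimal sketch of how incoming relations and parent parts of speech might be tallied for one tag is shown below. It assumes each token is reduced to an `(id, head, upos, deprel)` tuple in the spirit of CoNLL-U, with head 0 denoting the artificial root. The function `parent_stats`, the `"ROOT"` placeholder label and the toy sentences are hypothetical.

```python
from collections import Counter

# Hypothetical helper; "ROOT" is a placeholder label chosen for this example.
def parent_stats(sentences, tag="SYM"):
    """sentences: list of sentences, each a list of (id, head, upos, deprel).
    Returns (Counter of incoming relations, Counter of parent UPOS) for `tag` nodes."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences:
        upos_by_id = {tok_id: upos for tok_id, _head, upos, _rel in sent}
        for _tok_id, head, upos, deprel in sent:
            if upos == tag:
                relations[deprel] += 1
                # Head 0 has no token, so it falls back to the placeholder.
                parent_pos[upos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

def share(count, total):
    """Whole-number percentage, as in the '(count; N% instances)' notation."""
    return round(100 * count / total)

# Toy data, invented for the example.
sentences = [
    [(1, 0, "VERB", "root"), (2, 3, "NUM", "nummod"),
     (3, 1, "SYM", "obl"), (4, 1, "PUNCT", "punct")],
    [(1, 0, "SYM", "root")],
]
relations, parent_pos = parent_stats(sentences)
```

Under these assumptions, `share(68, 557)` gives 12, matching the figure reported for the `flat` relation.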

229 (41%) SYM nodes are leaves.

71 (13%) SYM nodes have one child.

162 (29%) SYM nodes have two children.

95 (17%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.
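The leaf / one child / two children / three or more breakdown above can be sketched as a simple bucketed count of children per node. The example assumes tokens are given as `(id, head, upos)` tuples per sentence. The function `child_degree_buckets` and the toy sentences are hypothetical, not the page's actual generator.

```python
from collections import Counter

# Hypothetical helper for illustration.
def child_degree_buckets(sentences, tag="SYM"):
    """sentences: list of sentences, each a list of (id, head, upos).
    Returns (Counter keyed by 0, 1, 2, 3 where 3 means 'three or more', max degree)."""
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per head id within this sentence.
        n_children = Counter(head for _tok_id, head, _upos in sent)
        for tok_id, _head, upos in sent:
            if upos != tag:
                continue
            degree = n_children[tok_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree

# Toy data, invented for the example.
sentences = [
    [(1, 0, "VERB"), (2, 3, "NUM"), (3, 1, "SYM"), (4, 1, "PUNCT")],
    [(1, 0, "SYM")],
]
buckets, max_degree = child_degree_buckets(sentences)
```

In the toy data one SYM node is a leaf and one has a single child. On this page the four buckets (229, 71, 162, 95) sum to the 557 SYM tokens.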

Children of SYM nodes are attached using 21 different relations: nummod (214; 28% instances), punct (196; 26% instances), case (77; 10% instances), nmod (71; 9% instances), flat (51; 7% instances), flat:name (41; 5% instances), cc (23; 3% instances), amod (13; 2% instances), cop (13; 2% instances), nsubj (12; 2% instances), obl (11; 1% instances), conj (9; 1% instances), discourse (8; 1% instances), advmod (4; 1% instances), acl (3; 0% instances), dep (3; 0% instances), orphan (3; 0% instances), det (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: NUM (217; 29% instances), PUNCT (196; 26% instances), NOUN (99; 13% instances), SYM (81; 11% instances), ADP (77; 10% instances), CCONJ (23; 3% instances), VERB (16; 2% instances), AUX (13; 2% instances), X (12; 2% instances), PART (7; 1% instances), ADV (4; 1% instances), ADJ (3; 0% instances), PRON (3; 0% instances), PROPN (3; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)