
This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SYM

There are 67 SYM lemmas (0%), 68 SYM types (0%) and 494 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 10 in number of lemmas, 13 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, , u.c., Re, &, utt., **, t.i., +, :)

The 10 most frequent SYM types: %, , Re, &, u.c., utt., .u, **, t.i., +

The 10 most frequent ambiguous lemmas: & (SYM 17, X 1), AB.LV (PROPN 1, SYM 1), a (PROPN 16, CCONJ 7, X 2, SYM 1)

The 10 most frequent ambiguous types: Re (SYM 20, INTJ 7), & (SYM 17, X 1), A (PROPN 13, CCONJ 5, SYM 1, X 1), AB.LV (PROPN 1, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.014925 (the average of all parts of speech is 2.341252).
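The reported ratio agrees with the counts given earlier on this page (68 SYM types over 67 SYM lemmas). A minimal sketch of that arithmetic, assuming the ratio is simply the number of distinct forms divided by the number of distinct lemmas; the helper name form_lemma_ratio is illustrative and not part of any UD tool:

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Distinct word forms per distinct lemma (hypothetical helper)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


# Counts for SYM reported on this page: 68 types, 67 lemmas.
ratio = form_lemma_ratio(68, 67)
print(f"{ratio:.6f}")  # 1.014925
```

A value this close to 1 means almost every SYM lemma appears in a single form; the only lemma listed with two forms is "u.c.".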

The 1st highest number of forms (2) was observed with the lemma “u.c.”: .u, u.c..

The 2nd highest number of forms (1) was observed with the lemma “%”: %.

The 3rd highest number of forms (1) was observed with the lemma “&”: &.

SYM occurs with 2 features: Abbr (58; 12% instances), Typo (13; 3% instances)

SYM occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (436 tokens). Examples: %, , Re, &, **, +, :), =, M-3, x

Relations

SYM nodes are attached to their parents using 19 different relations: flat (67; 14% instances), conj (65; 13% instances), obl (50; 10% instances), root (47; 10% instances), nsubj (46; 9% instances), nmod (32; 6% instances), parataxis (32; 6% instances), discourse (30; 6% instances), dep (27; 5% instances), flat:name (27; 5% instances), iobj (27; 5% instances), amod (12; 2% instances), acl (10; 2% instances), advmod (9; 2% instances), xcomp (4; 1% instances), nsubj:pass (3; 1% instances), advcl (2; 0% instances), cc (2; 0% instances), orphan (2; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (161; 33% instances), NOUN (132; 27% instances), SYM (72; 15% instances), [root, no POS tag] (47; 10% instances), PROPN (41; 8% instances), ADJ (14; 3% instances), NUM (14; 3% instances), ADV (6; 1% instances), X (6; 1% instances), PART (1; 0% instances)

185 (37%) SYM nodes are leaves.

66 (13%) SYM nodes have one child.

151 (31%) SYM nodes have two children.

92 (19%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.
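The child-degree figures above can be recomputed from a CoNLL-U file by counting, for each SYM node, how many other nodes name it as their head. The following is a sketch under stated assumptions: the function name child_degrees is hypothetical, only the basic dependency columns (ID, UPOS, HEAD) are read, and the three-token sample is an invented toy sentence, not data from UD_Latvian-LVTB.

```python
from collections import Counter


def child_degrees(conllu_text: str, upos: str = "SYM") -> Counter:
    """Count how many nodes tagged `upos` have 0, 1, 2, ... children."""
    degrees = Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic tree nodes only: skip multiword ranges (1-2)
        # and empty nodes (1.1), whose IDs are not plain integers.
        rows = [r for r in rows if r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # column 7: HEAD
        for r in rows:
            if r[3] == upos:  # column 4: UPOS
                degrees[n_children[r[0]]] += 1
    return degrees


# Invented toy sentence "5 % ." (not taken from the treebank).
SAMPLE = "\n".join("\t".join(cols) for cols in [
    ["1", "5", "5", "NUM", "_", "_", "2", "nummod", "_", "_"],
    ["2", "%", "%", "SYM", "_", "_", "0", "root", "_", "_"],
    ["3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
])

print(child_degrees(SAMPLE))  # Counter({2: 1})
```

In the toy sentence the single SYM node has two children, one nummod and one punct, which are also the two most frequent child relations reported for SYM on this page.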

Children of SYM nodes are attached using 22 different relations: nummod (224; 31% instances), punct (145; 20% instances), case (81; 11% instances), nmod (69; 10% instances), flat (53; 7% instances), flat:name (27; 4% instances), cc (20; 3% instances), cop (14; 2% instances), amod (13; 2% instances), nsubj (13; 2% instances), goeswith (12; 2% instances), obl (11; 2% instances), conj (8; 1% instances), discourse (8; 1% instances), advmod (4; 1% instances), dep (3; 0% instances), orphan (3; 0% instances), acl (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances), vocative (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: NUM (227; 32% instances), PUNCT (145; 20% instances), NOUN (98; 14% instances), ADP (81; 11% instances), SYM (72; 10% instances), CCONJ (20; 3% instances), X (17; 2% instances), VERB (15; 2% instances), AUX (14; 2% instances), PART (7; 1% instances), ADV (4; 1% instances), PROPN (4; 1% instances), ADJ (3; 0% instances), DET (3; 0% instances), PRON (2; 0% instances), SCONJ (1; 0% instances)