home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SYM

There are 130 SYM lemmas (1%), 131 SYM types (0%) and 615 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, , a, Re, &, nozare.lv, u.c., utt., **, b

The 10 most frequent SYM types: %, , a, Re, &, u.c., **, Nozare.lv, b, utt.

The 10 most frequent ambiguous lemmas: * (SYM 94, PUNCT 1), a (SYM 17, CCONJ 7, X 2), T (SYM 4, NOUN 1), piem. (NOUN 2, SYM 2), 16:00 (NUM 1, SYM 1), AB.LV (PROPN 1, SYM 1), U (SYM 1, X 1), i (PART 6, CCONJ 2, NUM 1, SYM 1), m (NOUN 3, SYM 1), s (X 2, SYM 1)

The 10 most frequent ambiguous types: * (SYM 94, PUNCT 1), a (SYM 16, CCONJ 2, X 2), Re (SYM 20, INTJ 6), T (SYM 4, NOUN 1), piem. (SYM 2, NOUN 1), 16:00 (NUM 1, SYM 1), AB.LV (PROPN 1, SYM 1), M (PROPN 1, SYM 1), N (NOUN 1, SYM 1), U (SYM 1, X 1)

Morphology

The form / lemma ratio of SYM is 1.007692 (the average of all parts of speech is 2.328168).

The 1st highest number of forms (2) was observed with the lemma “Nozare.lv”: NOZARE.LV, Nozare.lv.

The 2nd highest number of forms (2) was observed with the lemma “nozare.lv”: NOZARE.LV, Nozare.lv.

The 3rd highest number of forms (2) was observed with the lemma “utt.”: utt, utt..

SYM occurs with 2 features: Abbr (48; 8% instances), Typo (1; 0% instances)

SYM occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (567 tokens). Examples: %, , a, Re, &, **, Nozare.lv, b, c, +

Relations

SYM nodes are attached to their parents using 22 different relations: flat (65; 11% instances), parataxis (63; 10% instances), flat:name (55; 9% instances), conj (54; 9% instances), obl (54; 9% instances), nsubj (52; 8% instances), nmod (51; 8% instances), root (48; 8% instances), dep (34; 6% instances), iobj (34; 6% instances), discourse (33; 5% instances), amod (28; 5% instances), acl (10; 2% instances), advmod (9; 1% instances), nsubj:pass (7; 1% instances), xcomp (5; 1% instances), obj (4; 1% instances), advcl (2; 0% instances), cc (2; 0% instances), flat:foreign (2; 0% instances), orphan (2; 0% instances), nummod (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (202; 33% instances), NOUN (172; 28% instances), SYM (81; 13% instances), PROPN (52; 8% instances), (48; 8% instances), ADJ (22; 4% instances), NUM (20; 3% instances), X (10; 2% instances), ADV (7; 1% instances), PART (1; 0% instances)

233 (38%) SYM nodes are leaves.

99 (16%) SYM nodes have one child.

174 (28%) SYM nodes have two children.

109 (18%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 22 different relations: punct (253; 30% instances), nummod (213; 25% instances), nmod (98; 11% instances), case (80; 9% instances), flat (51; 6% instances), flat:name (41; 5% instances), cc (20; 2% instances), obl (17; 2% instances), amod (13; 2% instances), cop (13; 2% instances), nsubj (13; 2% instances), conj (10; 1% instances), discourse (10; 1% instances), advmod (6; 1% instances), acl (4; 0% instances), dep (3; 0% instances), orphan (3; 0% instances), appos (2; 0% instances), advcl (1; 0% instances), det (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: PUNCT (253; 30% instances), NUM (216; 25% instances), NOUN (132; 15% instances), SYM (81; 9% instances), ADP (79; 9% instances), CCONJ (20; 2% instances), VERB (19; 2% instances), AUX (13; 2% instances), X (12; 1% instances), PART (9; 1% instances), ADV (6; 1% instances), PRON (4; 0% instances), PROPN (4; 0% instances), ADJ (3; 0% instances), SCONJ (2; 0% instances), DET (1; 0% instances)