home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Latvian-LVTB: POS Tags: SYM

There are 174 SYM lemmas (1%), 175 SYM types (0%) and 608 SYM tokens (0%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 10 in number of types and 16 in number of tokens.

The 10 most frequent SYM lemmas: %, , a, Re, &, nozare.lv, u.c., **, b, utt.

The 10 most frequent SYM types: %, , a, Re, &, u.c., **, b, Nozare.lv, OV

The 10 most frequent ambiguous lemmas: * (SYM 94, PUNCT 1), a (SYM 17, CCONJ 6, X 2, INTJ 1), & (SYM 16, X 1), T (SYM 3, NOUN 1), Facebook (SYM 2, X 1), Firmas.lv (PROPN 2, SYM 2), (PUNCT 10, SYM 2), 14.00 (NUM 1, SYM 1), 16:00 (NUM 1, SYM 1), :) (PUNCT 1, SYM 1)

The 10 most frequent ambiguous types: * (SYM 94, PUNCT 1), a (SYM 16, X 2, CCONJ 1, INTJ 1), Re (SYM 20, INTJ 6), & (SYM 16, X 1), Nozare.lv (SYM 8, PROPN 1), NOZARE.LV (SYM 4, PROPN 1), T (SYM 3, NOUN 1), Facebook (SYM 2, X 1), Firmas.lv (PROPN 2, SYM 2), (PUNCT 10, SYM 2)

Morphology

The form / lemma ratio of SYM is 1.005747 (the average of all parts of speech is 2.233228).

The 1st highest number of forms (2) was observed with the lemma “Nozare.lv”: NOZARE.LV, Nozare.lv.

The 2nd highest number of forms (2) was observed with the lemma “nozare.lv”: NOZARE.LV, Nozare.lv.

The 3rd highest number of forms (2) was observed with the lemma “utt.”: utt, utt..

SYM occurs with 2 features: Abbr (39; 6% instances), Typo (1; 0% instances)

SYM occurs with 2 feature-value pairs: Abbr=Yes, Typo=Yes

SYM occurs with 3 feature combinations. The most frequent feature combination is _ (569 tokens). Examples: %, , a, Re, &, **, b, Nozare.lv, OV, c

Relations

SYM nodes are attached to their parents using 21 different relations: parataxis (65; 11% instances), flat:name (61; 10% instances), flat (59; 10% instances), conj (53; 9% instances), root (53; 9% instances), nsubj (48; 8% instances), nmod (47; 8% instances), amod (46; 8% instances), obl (45; 7% instances), dep (35; 6% instances), discourse (32; 5% instances), iobj (30; 5% instances), acl (9; 1% instances), advmod (7; 1% instances), nsubj:pass (6; 1% instances), obj (4; 1% instances), advcl (2; 0% instances), cc (2; 0% instances), orphan (2; 0% instances), flat:foreign (1; 0% instances), nummod (1; 0% instances)

Parents of SYM nodes belong to 10 different parts of speech: VERB (185; 30% instances), NOUN (174; 29% instances), SYM (101; 17% instances), (53; 9% instances), PROPN (37; 6% instances), NUM (21; 3% instances), ADJ (16; 3% instances), X (13; 2% instances), ADV (7; 1% instances), PART (1; 0% instances)

228 (38%) SYM nodes are leaves.

81 (13%) SYM nodes have one child.

169 (28%) SYM nodes have two children.

130 (21%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 22 different relations: punct (329; 36% instances), nummod (170; 19% instances), nmod (102; 11% instances), flat:name (69; 8% instances), case (67; 7% instances), flat (52; 6% instances), cc (21; 2% instances), parataxis (19; 2% instances), amod (13; 1% instances), discourse (13; 1% instances), obl (11; 1% instances), conj (9; 1% instances), nsubj (8; 1% instances), dep (7; 1% instances), cop (6; 1% instances), advmod (5; 1% instances), acl (4; 0% instances), appos (3; 0% instances), orphan (2; 0% instances), advcl (1; 0% instances), det (1; 0% instances), mark (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: PUNCT (329; 36% instances), NUM (189; 21% instances), NOUN (134; 15% instances), SYM (101; 11% instances), ADP (67; 7% instances), CCONJ (21; 2% instances), X (20; 2% instances), VERB (17; 2% instances), PART (12; 1% instances), AUX (6; 1% instances), ADV (5; 1% instances), ADJ (4; 0% instances), PRON (3; 0% instances), PROPN (3; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)