home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EWT: POS Tags: SYM

There are 70 SYM lemmas (1%), 71 SYM types (0%) and 358 SYM tokens (0%). Out of 16 observed tags, the rank of SYM is: 10 in number of lemmas, 13 in number of types and 15 in number of tokens.

The 10 most frequent SYM lemmas: @, %, :), :D, ;), :s, :/, =), :(, :lol:

The 10 most frequent SYM types: @, %, :), :D, ;), :s, :/, =), :(, :lol:

The 10 most frequent ambiguous lemmas: @ (SYM 96, PROPN 5, INTJ 1), :) (SYM 25, INTJ 5, PUNCT 1), :D (SYM 22, INTJ 2), * (PUNCT 10, SYM 5), (SYM 5, NOUN 1), (PUNCT 580, SYM 4), + (PUNCT 16, SYM 4), = (PUNCT 5, SYM 3), & (CCONJ 7, SYM 2), -> (SYM 2, PUNCT 1)

The 10 most frequent ambiguous types: @ (SYM 96, PROPN 5, INTJ 1), :) (SYM 25, INTJ 5, PUNCT 1), :D (SYM 22, INTJ 1), * (PUNCT 10, SYM 5), (SYM 5, NOUN 1), (PUNCT 578, SYM 4), + (PUNCT 16, SYM 4), = (PUNCT 5, SYM 3), & (CCONJ 7, SYM 2), -> (SYM 2, PUNCT 1)

Morphology

The form / lemma ratio of SYM is 1.014286 (the average of all parts of speech is 1.732282).

The 1st highest number of forms (2) was observed with the lemma “S3”: S3’med, S3-el.

The 2nd highest number of forms (1) was observed with the lemma “””: .

The 3rd highest number of forms (1) was observed with the lemma “%”: %.

SYM occurs with 6 features: Abbr (14; 4% instances), Case (3; 1% instances), Number (3; 1% instances), Hyph (1; 0% instances), NumForm (1; 0% instances), NumType (1; 0% instances)

SYM occurs with 8 feature-value pairs: Abbr=Yes, Case=Ade, Case=Nom, Hyph=Yes, NumForm=Digit, NumType=Card, Number=Plur, Number=Sing

SYM occurs with 7 feature combinations. The most frequent feature combination is _ (341 tokens). Examples: @, %, :), :D, ;), :s, :/, =), :(, :lol:

Relations

SYM nodes are attached to their parents using 15 different relations: discourse (220; 61% instances), root (23; 6% instances), obl (20; 6% instances), advmod (18; 5% instances), dep (17; 5% instances), parataxis (12; 3% instances), nsubj:cop (11; 3% instances), nmod (8; 2% instances), nsubj (8; 2% instances), obj (7; 2% instances), flat (5; 1% instances), ccomp (3; 1% instances), conj (3; 1% instances), advcl (2; 1% instances), acl:relcl (1; 0% instances)

Parents of SYM nodes belong to 11 different parts of speech: VERB (137; 38% instances), NOUN (93; 26% instances), PROPN (49; 14% instances), ADJ (23; 6% instances), (23; 6% instances), ADV (14; 4% instances), NUM (9; 3% instances), INTJ (3; 1% instances), PRON (3; 1% instances), PUNCT (2; 1% instances), X (2; 1% instances)

271 (76%) SYM nodes are leaves.

47 (13%) SYM nodes have one child.

23 (6%) SYM nodes have two children.

17 (5%) SYM nodes have three or more children.

The highest child degree of a SYM node is 8.

Children of SYM nodes are attached using 16 different relations: nummod (76; 43% instances), punct (23; 13% instances), nmod (19; 11% instances), nsubj:cop (14; 8% instances), cop (12; 7% instances), conj (6; 3% instances), mark (6; 3% instances), advmod (5; 3% instances), case (4; 2% instances), aux (3; 2% instances), det (3; 2% instances), cc (2; 1% instances), obl (2; 1% instances), flat (1; 1% instances), parataxis (1; 1% instances), xcomp (1; 1% instances)

Children of SYM nodes belong to 12 different parts of speech: NUM (76; 43% instances), NOUN (33; 19% instances), PUNCT (23; 13% instances), AUX (15; 8% instances), ADV (6; 3% instances), SCONJ (6; 3% instances), ADP (4; 2% instances), DET (4; 2% instances), PROPN (4; 2% instances), VERB (3; 2% instances), CCONJ (2; 1% instances), PRON (2; 1% instances)