This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

SYM: symbol

Definition

A symbol is a word-like entity that differs from ordinary words by form, function, or both.
Mail and web addresses are also tagged SYM.

Examples:
§, %, +, =, 11°, 25°53’
info.euroopaliikumine.ee


Treebank Statistics (UD_Estonian)

There are 62 SYM lemmas (0%), 65 SYM types (0%) and 105 SYM tokens (0%). Out of 15 observed tags, the rank of SYM is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.

The 10 most frequent SYM lemmas: &, C5, U, AB, %, sulev@ekspress.ee, B, D66, anne@ekspress.ee, x

The 10 most frequent SYM types: &, C5, U, AB, %, sulev@ekspress.ee, &, D66, anne@ekspress.ee, x

The 10 most frequent ambiguous lemmas: D (ADJ 4, SYM 1)

The 10 most frequent ambiguous types:

Morphology

The form / lemma ratio of SYM is 1.048387 (the average of all parts of speech is 1.839644).
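
The reported ratio is consistent with dividing the number of distinct SYM word forms by the number of distinct SYM lemmas (65 / 62 ≈ 1.048387). The following is a minimal Python sketch of that calculation over CoNLL-U text, assuming the standard 10-column layout with FORM, LEMMA and the universal POS tag in columns 2 to 4. The names `make_row`, `SAMPLE` and `form_lemma_ratio` are hypothetical, the sample tokens are toy data rather than treebank content, and the script that produced these statistics may count forms differently (for example with respect to case folding).

```python
def make_row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (hypothetical helper for the toy sample)."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"])


# Toy sample, not taken from UD_Estonian.
SAMPLE = "\n".join([
    make_row(1, "C5", "C5", "SYM", 0, "root"),
    make_row(2, "C5-ga", "C5", "SYM", 1, "conj"),
    make_row(3, "%", "%", "SYM", 1, "conj"),
    "",
])


def form_lemma_ratio(conllu_text, upos="SYM"):
    """Return (distinct forms) / (distinct lemmas) for tokens tagged `upos`."""
    forms, lemmas = set(), set()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, and multiword-token ranges such as "1-2".
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])
            lemmas.add(cols[2])
    return len(forms) / len(lemmas) if lemmas else float("nan")


print(form_lemma_ratio(SAMPLE))  # toy sample: 3 forms over 2 lemmas
print(round(65 / 62, 6))         # type and lemma counts reported on this page
```

On the toy sample the lemma "C5" contributes two forms, which is the same pattern as the "C5, C5-ga" pair listed on this page.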

The 1st highest number of forms (2) was observed with the lemma “&”: &, &.

The 2nd highest number of forms (2) was observed with the lemma “B”: B, B-.

The 3rd highest number of forms (2) was observed with the lemma “C5”: C5, C5-ga.

SYM occurs with 6 features: Abbr (79; 75% instances), Case (11; 10% instances), Number (11; 10% instances), NumForm (3; 3% instances), NumType (3; 3% instances), Hyph (1; 1% instances)

SYM occurs with 9 feature-value pairs: Abbr=Yes, Case=Com, Case=Gen, Case=Nom, Case=Tra, Hyph=Yes, NumForm=Digit, NumType=Card, Number=Sing

SYM occurs with 9 feature combinations. The most frequent feature combination is Abbr=Yes (74 tokens). Examples: C5, AB, sulev@ekspress.ee, D66, U, anne@ekspress.ee, x, °C, .ee, 11°

Relations

SYM nodes are attached to their parents using 10 different relations: nmod (42; 40% instances), cc (19; 18% instances), root (18; 17% instances), nsubj (10; 10% instances), conj (5; 5% instances), nsubj:cop (3; 3% instances), advmod:quant (2; 2% instances), compound (2; 2% instances), dobj (2; 2% instances), parataxis (2; 2% instances)

Parents of SYM nodes belong to 6 different parts of speech: NOUN (36; 34% instances), PROPN (24; 23% instances), ROOT (18; 17% instances), VERB (15; 14% instances), ADJ (8; 8% instances), NUM (4; 4% instances)
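
Counts like the two lists above can be reproduced by looking up, for every SYM token, its DEPREL column and the POS tag of the token named in its HEAD column. The sketch below assumes 10-column CoNLL-U input and treats a head of 0 as the artificial parent "ROOT", matching the label used on this page. The names `row`, `TOY`, `sentences` and `attachment_stats` are hypothetical, and the two toy sentences are illustrative only, not treebank data.

```python
from collections import Counter


def row(idx, form, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (hypothetical helper; lemma = form)."""
    return "\t".join([str(idx), form, form, upos, "_", "_", str(head), deprel, "_", "_"])


# Two toy sentences, not taken from UD_Estonian.
TOY = "\n".join([
    row(1, "hind", "NOUN", 0, "root"),
    row(2, "5", "NUM", 3, "nummod"),
    row(3, "%", "SYM", 1, "nmod"),
    "",
    row(1, "&", "SYM", 0, "root"),
    "",
])


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column token rows."""
    current = []
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            current.append(cols)
        elif not line.strip() and current:
            yield current
            current = []
    if current:
        yield current


def attachment_stats(conllu_text, upos="SYM"):
    """Count incoming relations and parent POS tags for tokens tagged `upos`."""
    relations, parent_pos = Counter(), Counter()
    for sent in sentences(conllu_text):
        pos_by_id = {cols[0]: cols[3] for cols in sent}
        for cols in sent:
            if cols[3] == upos:
                relations[cols[7]] += 1
                # HEAD "0" has no token row, so it falls back to "ROOT".
                parent_pos[pos_by_id.get(cols[6], "ROOT")] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(TOY)
print(relations.most_common())
print(parent_pos.most_common())
```

Because every SYM token has exactly one head, both counters sum to the number of SYM tokens, which is why the two lists above each total 105.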

70 (67%) SYM nodes are leaves.

22 (21%) SYM nodes have one child.

5 (5%) SYM nodes have two children.

8 (8%) SYM nodes have three or more children.

The highest child degree of a SYM node is 11.
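
The child-degree figures above partition the 105 SYM tokens (70 + 22 + 5 + 8), and the percentages are each count divided by 105 and rounded. A minimal sketch of how such a distribution could be computed from CoNLL-U text follows; `row`, `TOY_TREE`, `sentences` and `child_degree_buckets` are hypothetical names, and the four-token sentence is toy data, not treebank content.

```python
from collections import Counter


def row(idx, form, upos, head, deprel):
    """Build one 10-column CoNLL-U token line (hypothetical helper; lemma = form)."""
    return "\t".join([str(idx), form, form, upos, "_", "_", str(head), deprel, "_", "_"])


# One toy sentence: "&" has two children, "%" is a leaf.
TOY_TREE = "\n".join([
    row(1, "A", "NOUN", 2, "nmod"),
    row(2, "&", "SYM", 0, "root"),
    row(3, "B", "NOUN", 2, "conj"),
    row(4, "%", "SYM", 3, "nmod"),
    "",
])


def sentences(conllu_text):
    """Yield each sentence as a list of 10-column token rows."""
    current = []
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            current.append(cols)
        elif not line.strip() and current:
            yield current
            current = []
    if current:
        yield current


def child_degree_buckets(conllu_text, upos="SYM"):
    """Bucket tokens tagged `upos` by number of children: "0", "1", "2" or "3+"."""
    buckets = Counter()
    for sent in sentences(conllu_text):
        # Number of tokens pointing at each ID through the HEAD column.
        n_children = Counter(cols[6] for cols in sent)
        for cols in sent:
            if cols[3] == upos:
                degree = n_children[cols[0]]
                buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets


print(dict(child_degree_buckets(TOY_TREE)))
# Percentages derived from the counts reported on this page.
print([round(100 * count / 105) for count in (70, 22, 5, 8)])
```

The rounded percentages sum to 101 rather than 100, which is an effect of rounding each share independently.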

Children of SYM nodes are attached using 16 different relations: punct (17; 25% instances), nmod (12; 17% instances), nummod (6; 9% instances), cc (5; 7% instances), amod (4; 6% instances), conj (4; 6% instances), acl (3; 4% instances), appos (3; 4% instances), compound (3; 4% instances), nsubj:cop (3; 4% instances), advmod (2; 3% instances), cop (2; 3% instances), det (2; 3% instances), discourse (1; 1% instances), list (1; 1% instances), name (1; 1% instances)

Children of SYM nodes belong to 10 different parts of speech: NOUN (17; 25% instances), PUNCT (17; 25% instances), NUM (9; 13% instances), ADJ (7; 10% instances), PROPN (7; 10% instances), CONJ (5; 7% instances), ADV (2; 3% instances), PRON (2; 3% instances), VERB (2; 3% instances), INTJ (1; 1% instances)


SYM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]