SYM: symbol
Definition
A symbol is a word-like entity that differs from ordinary words by form, function, or both.
E-mail and web addresses are also tagged SYM.
Examples:
§, %, +, =, 11°, 25°53’
info.euroopaliikumine.ee
Treebank Statistics (UD_Estonian)
There are 62 SYM lemmas (0%), 65 SYM types (0%) and 105 SYM tokens (0%).
Out of 15 observed tags, the rank of SYM is: 9 in number of lemmas, 11 in number of types and 14 in number of tokens.
The 10 most frequent SYM lemmas: &, C5, U, AB, %, sulev@ekspress.ee, B, D66, anne@ekspress.ee, x
The 10 most frequent SYM types: &, C5, U, AB, %, sulev@ekspress.ee, &, D66, anne@ekspress.ee, x
The 10 most frequent ambiguous lemmas: D (ADJ 4, SYM 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of SYM is 1.048387 (the average of all parts of speech is 1.839644).
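The reported ratio matches the number of SYM types divided by the number of SYM lemmas given above (65 / 62 ≈ 1.048387). The following is a minimal sketch of that calculation, assuming this is how the figure is derived; the function name and the sample pairs are hypothetical and not part of the treebank tooling.

```python
def form_lemma_ratio(pairs):
    """Return the number of distinct forms divided by distinct lemmas.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    """
    pairs = list(pairs)
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    if not lemmas:
        raise ValueError("no lemmas given")
    return len(forms) / len(lemmas)


# Hypothetical sample built from forms listed on this page.
sample = [("C5", "C5"), ("C5-ga", "C5"), ("B", "B"), ("B-", "B"), ("%", "%")]
sample_ratio = form_lemma_ratio(sample)  # 5 distinct forms / 3 distinct lemmas

# Counts reported above for SYM: 65 types and 62 lemmas.
reported_ratio = round(65 / 62, 6)
```

A ratio close to 1 means that most SYM lemmas occur in a single form, which fits the short list of lemmas with two forms below.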
The 1st highest number of forms (2) was observed with the lemma “&”: &, &.
The 2nd highest number of forms (2) was observed with the lemma “B”: B, B-.
The 3rd highest number of forms (2) was observed with the lemma “C5”: C5, C5-ga.
SYM occurs with 6 features: Abbr (79; 75% instances), Case (11; 10% instances), Number (11; 10% instances), NumForm (3; 3% instances), NumType (3; 3% instances), Hyph (1; 1% instances)
SYM occurs with 9 feature-value pairs: Abbr=Yes, Case=Com, Case=Gen, Case=Nom, Case=Tra, Hyph=Yes, NumForm=Digit, NumType=Card, Number=Sing
SYM occurs with 9 feature combinations.
The most frequent feature combination is Abbr=Yes (74 tokens).
Examples: C5, AB, sulev@ekspress.ee, D66, U, anne@ekspress.ee, x, °C, .ee, 11°
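Counts like the ones above can be obtained by splitting the FEATS column of each SYM token in a CoNLL-U file. The sketch below is one possible way to do it, not the script that produced this page; the function name and the sample values are hypothetical, and treating "_" (no features) as a combination of its own is an assumption.

```python
from collections import Counter


def count_features(feats_values):
    """Count feature names, feature-value pairs, and full combinations.

    `feats_values` is an iterable of CoNLL-U FEATS strings such as
    "Case=Nom|Number=Sing"; "_" marks a token without features.
    """
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_values:
        combos[feats] += 1  # the whole FEATS string is one combination
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


# Hypothetical FEATS values for four SYM tokens.
sample = ["Abbr=Yes", "Abbr=Yes", "Case=Nom|Number=Sing", "_"]
names, pairs, combos = count_features(sample)
top_combo = combos.most_common(1)[0]
```

Here `names` corresponds to the "features" line, `pairs` to the "feature-value pairs" line, and `top_combo` to the most frequent combination.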
Relations
SYM nodes are attached to their parents using 10 different relations: nmod (42; 40% instances), cc (19; 18% instances), root (18; 17% instances), nsubj (10; 10% instances), conj (5; 5% instances), nsubj:cop (3; 3% instances), advmod:quant (2; 2% instances), compound (2; 2% instances), dobj (2; 2% instances), parataxis (2; 2% instances)
Parents of SYM nodes belong to 6 different parts of speech: NOUN (36; 34% instances), PROPN (24; 23% instances), ROOT (18; 17% instances), VERB (15; 14% instances), ADJ (8; 8% instances), NUM (4; 4% instances)
70 (67%) SYM nodes are leaves.
22 (21%) SYM nodes have one child.
5 (5%) SYM nodes have two children.
8 (8%) SYM nodes have three or more children.
The highest child degree of a SYM node is 11.
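The four groups above add up to the 105 SYM tokens (70 + 22 + 5 + 8). The child degree of a node is the number of tokens whose HEAD column points to it. A minimal sketch of that count for a single sentence follows, assuming standard 10-column CoNLL-U input; the sentence fragment and the function name are hypothetical.

```python
from collections import Counter

# Hypothetical CoNLL-U fragment: ID, FORM, LEMMA, UPOS, XPOS, FEATS,
# HEAD, DEPREL, DEPS, MISC, separated by tabs.
CONLLU = "\n".join([
    "1\t25\t25\tNUM\t_\t_\t2\tnummod\t_\t_",
    "2\t%\t%\tSYM\t_\t_\t3\tnsubj\t_\t_",
    "3\tpiisab\tpiisama\tVERB\t_\t_\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])


def child_degrees(conllu_text, upos="SYM"):
    """Map the ID of each token with the given UPOS to its child count.

    Assumes one sentence; comment lines, multiword-token ranges and
    empty nodes (IDs containing "-" or ".") are skipped.
    """
    rows = []
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue
        rows.append(cols)
    heads = Counter(cols[6] for cols in rows)  # how often each ID is a HEAD
    return {cols[0]: heads[cols[0]] for cols in rows if cols[3] == upos}


degrees = child_degrees(CONLLU)
leaves = sum(1 for degree in degrees.values() if degree == 0)

# Shares of the reported groups, rounded as on this page.
shares = [round(100 * n / 105) for n in (70, 22, 5, 8)]
```

In the fragment, the SYM token "%" has one child (the numeral), so it is not a leaf.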
Children of SYM nodes are attached using 16 different relations: punct (17; 25% instances), nmod (12; 17% instances), nummod (6; 9% instances), cc (5; 7% instances), amod (4; 6% instances), conj (4; 6% instances), acl (3; 4% instances), appos (3; 4% instances), compound (3; 4% instances), nsubj:cop (3; 4% instances), advmod (2; 3% instances), cop (2; 3% instances), det (2; 3% instances), discourse (1; 1% instances), list (1; 1% instances), name (1; 1% instances)
Children of SYM nodes belong to 10 different parts of speech: NOUN (17; 25% instances), PUNCT (17; 25% instances), NUM (9; 13% instances), ADJ (7; 10% instances), PROPN (7; 10% instances), CONJ (5; 7% instances), ADV (2; 3% instances), PRON (2; 3% instances), VERB (2; 3% instances), INTJ (1; 1% instances)