SYM: symbol
Definition
A symbol is a word-like entity that differs from ordinary words by form, function, or both.
We recognize as symbols:
- currency symbols: $
- mathematical operators: the / in µg / m3
- ’/’ used as a separator: 2001 / 923 / CE
- emoticons and emoji: :-)
- URLs and email addresses
The following are not symbols:
- Proper nouns with numbers and special characters: 130XE, DC10, DC-10 are tagged PROPN.
- Acronyms for proper nouns: UN, NATO are tagged as PROPN.
- Abbreviated words: Sig. (signore), kg (chilogrammo), km (chilometro), dott (dottore) are tagged NOUN.
- Characters used as bullets in itemized lists (*, •, ‣) are PUNCT.
Examples
- $, %, §, ©
- +, −, ×, ÷, =, <, >
- :), ♥‿♥, 😝
- john.doe@universal.org, http://universaldependencies.org/
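The categories and examples above can be approximated with a small heuristic. The following is only a sketch under stated assumptions: the regular expressions, the `SYMBOL_CHARS` set, and the function name `looks_like_sym` are illustrative inventions rather than part of the guidelines. It also ignores context, so it cannot decide ambiguous cases such as "-" or "x", and it does not cover multi-character emoji such as ♥‿♥.

```python
import re
import unicodedata

# Illustrative patterns only (assumptions, not part of the UD guidelines).
URL_RE = re.compile(r"^(?:https?://|www\.)\S+$", re.IGNORECASE)
EMAIL_RE = re.compile(r"^[\w.+-]+@[\w-]+(?:\.[\w-]+)+$")
EMOTICON_RE = re.compile(r"^[:;=][-']?[()\[\]DPp/\\|]$")
# Single characters taken from the examples above. The ASCII hyphen is left
# out on purpose because it is usually tagged PUNCT.
SYMBOL_CHARS = set("$%§©&+−×÷=<>")


def looks_like_sym(token: str) -> bool:
    """Heuristically decide whether a token fits one of the symbol categories."""
    if URL_RE.match(token) or EMAIL_RE.match(token) or EMOTICON_RE.match(token):
        return True
    if len(token) == 1:
        # Unicode category "So" (Symbol, other) covers most single-code-point emoji.
        return token in SYMBOL_CHARS or unicodedata.category(token) == "So"
    return False


for tok in ["$", ":-)", "www.amnesty.it", "NATO", "kg", "•"]:
    print(tok, looks_like_sym(tok))
```

Acronyms, abbreviations and list bullets fall through to `False`, which matches the "not symbols" list: they would be tagged PROPN, NOUN and PUNCT respectively.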
Treebank Statistics (UD_Italian)
There are 10 SYM lemmas (0%), 10 SYM types (0%) and 100 SYM tokens (0%).
Out of 17 observed tags, the rank of SYM is: 16 in number of lemmas, 16 in number of types and 15 in number of tokens.
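Counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, assuming the standard 10-column CoNLL-U layout; `SAMPLE` is an invented fragment (not taken from UD_Italian) and `tag_counts` is a hypothetical helper, not the script that generated these statistics.

```python
# Hand-made CoNLL-U fragment (hypothetical, not taken from UD_Italian).
SAMPLE = "\n".join([
    "# text = Il 5 % e il 10 %",
    "1\tIl\til\tDET\t_\t_\t3\tdet\t_\t_",
    "2\t5\t5\tNUM\t_\t_\t3\tnummod\t_\t_",
    "3\t%\t%\tSYM\t_\t_\t0\troot\t_\t_",
    "4\te\te\tCONJ\t_\t_\t3\tcc\t_\t_",
    "5\til\til\tDET\t_\t_\t7\tdet\t_\t_",
    "6\t10\t10\tNUM\t_\t_\t7\tnummod\t_\t_",
    "7\t%\t%\tSYM\t_\t_\t3\tconj\t_\t_",
])


def tag_counts(conllu_text, upos="SYM"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep regular word lines; skip comments, blank lines,
        # multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            forms.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
            tokens += 1
    return len(lemmas), len(forms), tokens


print(tag_counts(SAMPLE))  # (1, 1, 2)
```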
The 10 most frequent SYM lemmas: %, &, +, -, http://europa.eu.int/comm/secretariat@general/sgb/state@aids, http://www.linkiesta.it/Locatelli-lombardia-Nicoli-regione, www.amnesty.it, www.centrodonmilani.org, www.legadelcane.org, x
The 10 most frequent SYM types: %, &, +, -, http://europa.eu.int/comm/secretariat@general/sgb/state@aids, http://www.linkiesta.it/Locatelli-lombardia-Nicoli-regione, www.amnesty.it, www.centrodonmilani.org, www.legadelcane.org, x
The 10 most frequent ambiguous lemmas: & (SYM 6, PROPN 1), - (PUNCT 771, SYM 2)
The 10 most frequent ambiguous types: & (SYM 6, PROPN 1), - (PUNCT 772, SYM 2), x (ADJ 1, SYM 1)
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.488836).
The 1st highest number of forms (1) was observed with the lemma “%”: %.
The 2nd highest number of forms (1) was observed with the lemma “&”: &.
The 3rd highest number of forms (1) was observed with the lemma “+”: +.
SYM does not occur with any features.
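The form / lemma ratio can be illustrated with a short sketch. It assumes the ratio is the number of distinct form-lemma pairs divided by the number of distinct lemmas for a tag, which is consistent with the 10 types and 10 lemmas reported for SYM. The function name `form_lemma_ratio` and the verb pairs in the second call are made up for illustration.

```python
from collections import defaultdict


def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma for (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


# Symbols are invariable: every lemma has exactly one form.
print(form_lemma_ratio([("%", "%"), ("&", "&"), ("+", "+")]))  # 1.0

# Hypothetical inflecting words for contrast: 4 forms over 2 lemmas.
print(form_lemma_ratio([
    ("va", "andare"), ("vanno", "andare"), ("andare", "andare"),
    ("è", "essere"),
]))  # 2.0
```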
Relations
SYM nodes are attached to their parents using 10 different relations: it-dep/nmod (63; 63% instances), it-dep/dobj (13; 13% instances), it-dep/name (8; 8% instances), it-dep/nsubj (5; 5% instances), it-dep/conj (3; 3% instances), it-dep/xcomp (3; 3% instances), it-dep/nsubjpass (2; 2% instances), it-dep/mwe (1; 1% instances), it-dep/nummod (1; 1% instances), it-dep/root (1; 1% instances)
Parents of SYM nodes belong to 9 different parts of speech: VERB (52; 52% instances), NOUN (21; 21% instances), ADJ (11; 11% instances), PROPN (8; 8% instances), ADP (2; 2% instances), NUM (2; 2% instances), SYM (2; 2% instances), ADV (1; 1% instances), ROOT (1; 1% instances)
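The two distributions above (incoming relation and part of speech of the parent) could be tallied along these lines. This is a sketch only, assuming 10-column CoNLL-U input with sentences separated by blank lines; `SAMPLE` is an invented fragment and `attachment_stats` is a hypothetical helper.

```python
from collections import Counter

# Hand-made CoNLL-U fragment (hypothetical, not taken from UD_Italian).
SAMPLE = "\n".join([
    "# text = Il 5 % e il 10 %",
    "1\tIl\til\tDET\t_\t_\t3\tdet\t_\t_",
    "2\t5\t5\tNUM\t_\t_\t3\tnummod\t_\t_",
    "3\t%\t%\tSYM\t_\t_\t0\troot\t_\t_",
    "4\te\te\tCONJ\t_\t_\t3\tcc\t_\t_",
    "5\til\til\tDET\t_\t_\t7\tdet\t_\t_",
    "6\t10\t10\tNUM\t_\t_\t7\tnummod\t_\t_",
    "7\t%\t%\tSYM\t_\t_\t3\tconj\t_\t_",
])


def attachment_stats(conllu_text, upos="SYM"):
    """Count incoming relations and parent tags for nodes with the given tag."""
    relations, parent_tags = Counter(), Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in sentence.splitlines()]
        # Keep regular word lines; skip comments, ranges (1-2), empty nodes (1.1).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        tag_of = {r[0]: r[3] for r in rows}  # ID -> UPOS
        tag_of["0"] = "ROOT"                 # HEAD 0 marks the artificial root
        for r in rows:
            if r[3] == upos:
                relations[r[7]] += 1             # DEPREL column
                parent_tags[tag_of[r[6]]] += 1   # tag of the HEAD token
    return relations, parent_tags


relations, parent_tags = attachment_stats(SAMPLE)
print(dict(relations))    # {'root': 1, 'conj': 1}
print(dict(parent_tags))  # {'ROOT': 1, 'SYM': 1}
```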
14 (14%) SYM nodes are leaves.
0 (0%) SYM nodes have one child.
8 (8%) SYM nodes have two children.
78 (78%) SYM nodes have three or more children.
The highest child degree of a SYM node is 5.
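The child-degree breakdown above can be sketched in the same way: count how many tokens name each node as their HEAD, then bucket the SYM nodes by that count. As before, `SAMPLE` is an invented fragment, and `child_degrees` and the bucket labels are hypothetical names chosen for this illustration.

```python
from collections import Counter

# Hand-made CoNLL-U fragment (hypothetical, not taken from UD_Italian).
SAMPLE = "\n".join([
    "# text = Il 5 % e il 10 %",
    "1\tIl\til\tDET\t_\t_\t3\tdet\t_\t_",
    "2\t5\t5\tNUM\t_\t_\t3\tnummod\t_\t_",
    "3\t%\t%\tSYM\t_\t_\t0\troot\t_\t_",
    "4\te\te\tCONJ\t_\t_\t3\tcc\t_\t_",
    "5\til\til\tDET\t_\t_\t7\tdet\t_\t_",
    "6\t10\t10\tNUM\t_\t_\t7\tnummod\t_\t_",
    "7\t%\t%\tSYM\t_\t_\t3\tconj\t_\t_",
])


def child_degrees(conllu_text, upos="SYM"):
    """Return (bucket counts, highest child degree) for nodes with the given tag."""
    buckets, highest = Counter(), 0
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    for sentence in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in sentence.splitlines()]
        # Keep regular word lines; skip comments, ranges (1-2), empty nodes (1.1).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD id -> number of dependents
        for r in rows:
            if r[3] == upos:
                degree = n_children[r[0]]
                buckets[labels.get(degree, "three or more children")] += 1
                highest = max(highest, degree)
    return buckets, highest


buckets, highest = child_degrees(SAMPLE)
print(dict(buckets), highest)
# {'three or more children': 1, 'two children': 1} 4
```

In the sample, the first % governs four tokens (its determiner, its numeral, the conjunction and the second %), while the second % governs two.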
Children of SYM nodes are attached using 13 different relations: it-dep/nummod (84; 29% instances), it-dep/det (83; 29% instances), it-dep/case (57; 20% instances), it-dep/nmod (38; 13% instances), it-dep/advmod (8; 3% instances), it-dep/amod (4; 1% instances), it-dep/punct (3; 1% instances), it-dep/cc (2; 1% instances), it-dep/conj (2; 1% instances), it-dep/acl:relcl (1; 0% instances), it-dep/advcl (1; 0% instances), it-dep/cop (1; 0% instances), it-dep/nsubj (1; 0% instances)
Children of SYM nodes belong to 11 different parts of speech: NUM (84; 29% instances), DET (83; 29% instances), ADP (57; 20% instances), NOUN (35; 12% instances), ADV (8; 3% instances), ADJ (4; 1% instances), PROPN (4; 1% instances), PUNCT (3; 1% instances), VERB (3; 1% instances), CONJ (2; 1% instances), SYM (2; 1% instances)
SYM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]