This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ca/pos issue tracker

SYM: symbol

This document is a placeholder for the language-specific documentation for SYM.


Treebank Statistics (UD_Catalan)

There are 277 SYM lemmas (1%), 274 SYM types (1%) and 4638 SYM tokens (1%). Out of 17 observed tags, the rank of SYM is: 8 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent SYM lemmas: ’, %, 50/100, 10/100, 30/100, 5/100, 1/100, 2/100, 25/100, 20/100

The 10 most frequent SYM types: ’, %, 50%, 10%, 30%, 5%, 40%, 1%, 2%, 25%

The 10 most frequent ambiguous lemmas: (SYM 3820, PUNCT 31), 10/100 (SYM 16, NUM 1), 15/100 (SYM 6, NUM 1), / (PUNCT 40, SYM 5), 75/100 (SYM 4, NUM 1), 40 (NUM 47, SYM 2, NOUN 1), - (PUNCT 950, SYM 1), 10 (NUM 160, NOUN 9, SYM 1), 34 (NUM 12, SYM 1), 50 (NUM 75, SYM 1)

The 10 most frequent ambiguous types: (SYM 3820, PUNCT 32), / (PUNCT 40, SYM 5), - (PUNCT 950, SYM 1)

Morphology

The form / lemma ratio of SYM is 0.989170 (the average of all parts of speech is 1.413295).

The 1st highest number of forms (2) was observed with the lemma “1.82/100”: 1’82%, 1,82%.

The 2nd highest number of forms (2) was observed with the lemma “3.5/100”: 3’5%, 3,5%.

The 3rd highest number of forms (2) was observed with the lemma “6.94/100”: 6’94%, 6,94%.

SYM occurs with 5 features: NumForm (806; 17% instances), NumType (793; 17% instances), PunctType (5; 0% instances), AdvType (1; 0% instances), Gender (1; 0% instances)

SYM occurs with 5 feature-value pairs: AdvType=Tim, Gender=Masc, NumForm=Digit, NumType=Frac, PunctType=Colo

SYM occurs with 6 feature combinations. The most frequent feature combination is _ (3825 tokens). Examples: ’, 2%, -, 4%, 5%

Relations

SYM nodes are attached to their parents using 14 different relations: nmod (4166; 90% instances), dobj (185; 4% instances), advmod (77; 2% instances), appos (77; 2% instances), nsubj (62; 1% instances), conj (44; 1% instances), root (10; 0% instances), acl (4; 0% instances), cc (4; 0% instances), dep (3; 0% instances), advcl (2; 0% instances), parataxis (2; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)

Parents of SYM nodes belong to 17 different parts of speech: VERB (1597; 34% instances), NOUN (1039; 22% instances), PROPN (880; 19% instances), ADJ (390; 8% instances), DET (232; 5% instances), NUM (230; 5% instances), SYM (114; 2% instances), ADV (42; 1% instances), PRON (29; 1% instances), CONJ (27; 1% instances), AUX (15; 0% instances), ADP (11; 0% instances), ROOT (10; 0% instances), X (9; 0% instances), PART (6; 0% instances), PUNCT (4; 0% instances), SCONJ (3; 0% instances)

3942 (85%) SYM nodes are leaves.

133 (3%) SYM nodes have one child.

297 (6%) SYM nodes have two children.

266 (6%) SYM nodes have three or more children.

The highest child degree of a SYM node is 10.

Children of SYM nodes are attached using 20 different relations: nmod (543; 31% instances), det (347; 20% instances), case (336; 19% instances), punct (216; 12% instances), advmod (69; 4% instances), cc (41; 2% instances), conj (40; 2% instances), appos (25; 1% instances), mark (23; 1% instances), cop (22; 1% instances), nsubj (22; 1% instances), amod (18; 1% instances), advcl (15; 1% instances), dobj (15; 1% instances), name (12; 1% instances), acl (10; 1% instances), aux (10; 1% instances), xcomp (5; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances)

Children of SYM nodes belong to 14 different parts of speech: DET (346; 20% instances), ADP (345; 19% instances), NOUN (276; 16% instances), PUNCT (217; 12% instances), PRON (214; 12% instances), SYM (114; 6% instances), ADV (64; 4% instances), PROPN (50; 3% instances), VERB (40; 2% instances), CONJ (38; 2% instances), AUX (28; 2% instances), ADJ (23; 1% instances), SCONJ (13; 1% instances), NUM (3; 0% instances)


SYM in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]