home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: SYM

There are 1502 SYM lemmas (16%), 1502 SYM types (13%) and 4438 SYM tokens (5%). Out of 16 observed tags, the rank of SYM is: 4 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent SYM lemmas: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x

The 10 most frequent SYM types: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x

The 10 most frequent ambiguous lemmas: - (PUNCT 1787, SYM 329), + (SYM 253, PUNCT 1), x (SYM 18, NOUN 2, ADP 1), =) (SYM 20, PUNCT 1), (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 3, SYM 2), # (PROPN 2, SYM 1)

The 10 most frequent ambiguous types: - (PUNCT 1787, SYM 329), + (SYM 253, ADV 10, PUNCT 1), x (SYM 18, ADP 1), (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 2, SYM 2), # (PROPN 2, SYM 1), * (PUNCT 12, SYM 1)

Morphology

The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.238049).

The 1st highest number of forms (1) was observed with the lemma “#”: #.

The 2nd highest number of forms (1) was observed with the lemma “$”: $.

The 3rd highest number of forms (1) was observed with the lemma “\(”: <em>\)</em>.

SYM does not occur with any features.

Relations

SYM nodes are attached to their parents using 19 different relations: parataxis (1700; 38% instances), nmod (1000; 23% instances), advmod (587; 13% instances), obl (529; 12% instances), obj (252; 6% instances), discourse (126; 3% instances), conj (105; 2% instances), appos (43; 1% instances), root (31; 1% instances), orphan (18; 0% instances), cc (14; 0% instances), case (10; 0% instances), nsubj (9; 0% instances), advcl (4; 0% instances), flat:name (3; 0% instances), reparandum (3; 0% instances), acl (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances)

Parents of SYM nodes belong to 12 different parts of speech: VERB (1667; 38% instances), NOUN (1245; 28% instances), PROPN (669; 15% instances), NUM (592; 13% instances), SYM (145; 3% instances), ADJ (41; 1% instances), (31; 1% instances), ADV (28; 1% instances), PRON (8; 0% instances), X (7; 0% instances), AUX (4; 0% instances), INTJ (1; 0% instances)

2374 (53%) SYM nodes are leaves.

1004 (23%) SYM nodes have one child.

697 (16%) SYM nodes have two children.

363 (8%) SYM nodes have three or more children.

The highest child degree of a SYM node is 13.

Children of SYM nodes are attached using 21 different relations: nummod (1666; 45% instances), case (779; 21% instances), punct (600; 16% instances), nmod (140; 4% instances), conj (105; 3% instances), det (98; 3% instances), parataxis (96; 3% instances), nsubj (53; 1% instances), cc (42; 1% instances), advmod (35; 1% instances), acl (32; 1% instances), cop (19; 1% instances), appos (10; 0% instances), discourse (10; 0% instances), vocative (8; 0% instances), amod (6; 0% instances), obl (4; 0% instances), mark (3; 0% instances), obj (3; 0% instances), acl:relcl (1; 0% instances), fixed (1; 0% instances)

Children of SYM nodes belong to 16 different parts of speech: NUM (1691; 46% instances), ADP (779; 21% instances), PUNCT (600; 16% instances), SYM (145; 4% instances), PROPN (119; 3% instances), NOUN (110; 3% instances), DET (98; 3% instances), VERB (48; 1% instances), CCONJ (41; 1% instances), ADV (29; 1% instances), AUX (20; 1% instances), X (11; 0% instances), ADJ (10; 0% instances), SCONJ (4; 0% instances), INTJ (3; 0% instances), PRON (3; 0% instances)