Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: SYM
There are 1502 SYM lemmas (16%), 1502 SYM types (13%) and 4438 SYM tokens (5%).
Out of 16 observed tags, the rank of SYM is: 4 in number of lemmas, 5 in number of types and 8 in number of tokens.
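Counts like the ones above can be recomputed from a treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: the names `read_words`, `tag_summary`, `ROWS` and `SAMPLE` are hypothetical, the input rows are toy data, and it assumes that a "type" is a distinct, case-sensitive word form.

```python
from collections import Counter

def read_words(conllu_text):
    """Yield (form, lemma, upos) for every syntactic word in CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank sentence separator or sentence-level comment
        cols = line.split("\t")
        # Keep plain integer IDs only; skip ranges ("1-2") and empty nodes ("1.1").
        if len(cols) == 10 and cols[0].isdigit():
            yield cols[1], cols[2], cols[3]

def tag_summary(conllu_text, tag):
    """Count distinct lemmas, distinct forms (types) and tokens for one UPOS tag."""
    lemmas, forms = Counter(), Counter()
    for form, lemma, upos in read_words(conllu_text):
        if upos == tag:
            lemmas[lemma] += 1
            forms[form] += 1
    return {
        "lemmas": len(lemmas),
        "types": len(forms),
        "tokens": sum(forms.values()),
        "top_lemmas": [lemma for lemma, _ in lemmas.most_common(10)],
    }

# Toy input, not taken from the treebank; columns are joined with tabs.
ROWS = [
    ("1", "subiu", "subir", "VERB", "_", "_", "0", "root", "_", "_"),
    ("2", "2", "2", "NUM", "_", "_", "3", "nummod", "_", "_"),
    ("3", "%", "%", "SYM", "_", "_", "1", "obj", "_", "_"),
    ("4", "+", "+", "SYM", "_", "_", "3", "conj", "_", "_"),
    ("5", "%", "%", "SYM", "_", "_", "3", "conj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)
summary = tag_summary(SAMPLE, "SYM")
print(summary)
```

Running `tag_summary` once per tag and sorting the results would give the ranks, under the same assumptions.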
The 10 most frequent SYM lemmas: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x
The 10 most frequent SYM types: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x
The 10 most frequent ambiguous lemmas: - (PUNCT 1787, SYM 329), + (SYM 253, PUNCT 1), x (SYM 18, NOUN 2, ADP 1), =) (SYM 20, PUNCT 1), ’ (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 3, SYM 2), # (PROPN 2, SYM 1)
The 10 most frequent ambiguous types: - (PUNCT 1787, SYM 329), + (SYM 253, ADV 10, PUNCT 1), x (SYM 18, NOUN 2, ADP 1), ’ (PUNCT 258, SYM 15), > (PUNCT 20, SYM 4), => (PUNCT 7, SYM 3), WW (SYM 2, PROPN 1), h (NOUN 2, SYM 2), # (PROPN 2, SYM 1), * (PUNCT 12, SYM 1)
- -
- +
  - SYM 253: Petr4 15,68 + 2,35 % Cerveró falando sem parar … #CPIdaPTbras
  - ADV 10: ITUB4 ja negociou + de 300 M ? Ta certo meu sistema aqui ? @ferrisss @dfittarelli @JPedro_Sullivan
  - PUNCT 1: > > > Cemig ( CMIG4 ) : Julgamento de o mandado sobre Jaguara é suspenso > > > ( + ) Cemig ( CMIG4 ) : Julgamento de o mandado … http://t.co/M9A5yw7dU0
- x
  - SYM 18: Spread entre ITUB4 x BBDC4 abrindo novamente . Oportunidades a vista ?
  - NOUN 2: Destaques de BTC ontem foram as Blues #BBAS3 ( alugaram 2 x o que negociou ontem ) , #PETR4 ( tx max 180d ) além de #RENT3 e #KROT3 @ferrisss
  - ADP 1: GFSA3 … 227 com um ritmo muito forte em a venda . Se permanecer assim será difícil o papel fechar em o azul hoje . Briga pesada : 227 x 40 .
- ’
- >
- =>
- WW
- h
- #
- *
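The ambiguity lists above pair each form with its per-tag counts. A minimal sketch of how such a list could be derived is shown here; `ambiguous_types` is a hypothetical helper, the sort order (by count under the tag of interest) is a choice made for this example, and the "%" count is toy data, while the "+" and "x" counts are copied from the list of ambiguous types.

```python
from collections import Counter, defaultdict

def ambiguous_types(tokens, tag):
    """Return forms seen with `tag` and at least one other tag.

    `tokens` is an iterable of (form, upos) pairs. The result is a list of
    (form, {upos: count}) tuples, sorted by the count under `tag`, descending.
    """
    by_form = defaultdict(Counter)
    for form, upos in tokens:
        by_form[form][upos] += 1
    hits = [
        (form, dict(counts))
        for form, counts in by_form.items()
        if tag in counts and len(counts) > 1
    ]
    hits.sort(key=lambda item: -item[1][tag])
    return hits

# "+" and "x" use the counts reported above; the "%" count is a toy value.
tokens = (
    [("x", "SYM")] * 18 + [("x", "NOUN")] * 2 + [("x", "ADP")]
    + [("+", "SYM")] * 253 + [("+", "ADV")] * 10 + [("+", "PUNCT")]
    + [("%", "SYM")] * 3
)
result = ambiguous_types(tokens, "SYM")
for form, counts in result:
    print(form, counts)
```

Forms that occur only as SYM, such as "%" in the toy input, are filtered out because they are unambiguous.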
Morphology
The form / lemma ratio of SYM is 1.000000 (the average of all parts of speech is 1.238049).
The 1st highest number of forms (1) was observed with the lemma “#”: #.
The 2nd highest number of forms (1) was observed with the lemma “$”: $.
The 3rd highest number of forms (1) was observed with the lemma “$$”: $$.
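A ratio of 1.000000 is what 1502 types over 1502 lemmas would give, that is, one form per lemma. The sketch below assumes the ratio is the average number of distinct forms per lemma; the exact definition used to generate this page is not stated here, and `form_lemma_ratio` and the sample pairs are hypothetical.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma for (form, lemma) pairs."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    total_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return total_forms / len(forms_by_lemma)

# Symbols do not inflect, so every lemma has exactly one form.
sym_pairs = [("%", "%"), ("R$", "R$"), ("%", "%")]
# An inflecting class has several forms per lemma (toy verb data).
verb_pairs = [("fala", "falar"), ("falou", "falar"), ("é", "ser")]

sym_ratio = form_lemma_ratio(sym_pairs)
verb_ratio = form_lemma_ratio(verb_pairs)
print(sym_ratio, verb_ratio)
```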
SYM occurs with 1 feature: ExtPos (1; 0% instances)
SYM occurs with 1 feature-value pair: ExtPos=ADV
SYM occurs with 2 feature combinations.
The most frequent feature combination is _ (4437 tokens).
Examples: %, R$, -, +, http://t.co/kgt1YiTbF7, $, http://t.co/zJRs3Eeyz9, o.O, US$, x
Relations
SYM nodes are attached to their parents using 19 different relations: parataxis (1700; 38% instances), nmod (1001; 23% instances), advmod (587; 13% instances), obl (529; 12% instances), obj (251; 6% instances), discourse (126; 3% instances), conj (105; 2% instances), appos (43; 1% instances), root (31; 1% instances), orphan (18; 0% instances), cc (14; 0% instances), case (10; 0% instances), nsubj (9; 0% instances), advcl (4; 0% instances), flat:name (3; 0% instances), reparandum (3; 0% instances), acl (2; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances)
Parents of SYM nodes belong to 12 different parts of speech: VERB (1666; 38% instances), NOUN (1245; 28% instances), PROPN (670; 15% instances), NUM (592; 13% instances), SYM (145; 3% instances), ADJ (41; 1% instances), [no parent: SYM is the sentence root] (31; 1% instances), ADV (28; 1% instances), PRON (8; 0% instances), X (7; 0% instances), AUX (4; 0% instances), INTJ (1; 0% instances)
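Both distributions above come from the HEAD and DEPREL columns of each SYM word. As a rough illustration, the sketch below tallies incoming relations and parent tags from CoNLL-U text. It assumes fully annotated input with integer heads; `parse_sentences`, `attachment_stats` and the "(root)" label are hypothetical names, and the two sentences are toy data.

```python
from collections import Counter

def parse_sentences(conllu_text):
    """Yield each sentence as {word_id: (upos, head, deprel)}."""
    sentence = {}
    for line in conllu_text.splitlines():
        if not line.strip():
            if sentence:  # a blank line closes the current sentence
                yield sentence
                sentence = {}
            continue
        if line.startswith("#"):
            continue  # sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges and empty nodes (non-integer IDs).
        if len(cols) == 10 and cols[0].isdigit():
            sentence[int(cols[0])] = (cols[3], int(cols[6]), cols[7])
    if sentence:
        yield sentence

def attachment_stats(conllu_text, tag):
    """Count incoming relations and parent UPOS tags for words tagged `tag`."""
    relations, parent_pos = Counter(), Counter()
    for sentence in parse_sentences(conllu_text):
        for upos, head, deprel in sentence.values():
            if upos != tag:
                continue
            relations[deprel] += 1
            # Head 0 is the artificial root, which has no part of speech.
            parent_pos[sentence[head][0] if head else "(root)"] += 1
    return relations, parent_pos

# Toy input: "subiu 2 %" and a one-word sentence "o.O".
SENT_1 = [
    ("1", "subiu", "subir", "VERB", "_", "_", "0", "root", "_", "_"),
    ("2", "2", "2", "NUM", "_", "_", "3", "nummod", "_", "_"),
    ("3", "%", "%", "SYM", "_", "_", "1", "obj", "_", "_"),
]
SENT_2 = [("1", "o.O", "o.O", "SYM", "_", "_", "0", "root", "_", "_")]
SAMPLE = "\n\n".join(
    "\n".join("\t".join(row) for row in sent) for sent in (SENT_1, SENT_2)
)
relations, parent_pos = attachment_stats(SAMPLE, "SYM")
print(relations, parent_pos)
```

In this sketch, every word attached as `root` lands in the "(root)" bucket, so the two counts always agree.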
2374 (53%) SYM nodes are leaves.
1004 (23%) SYM nodes have one child.
697 (16%) SYM nodes have two children.
363 (8%) SYM nodes have three or more children.
The highest child degree of a SYM node is 13.
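The four degree counts above sum to the SYM token total: 2374 + 1004 + 697 + 363 = 4438. A histogram of this kind could be computed as in the sketch below, where `child_degree_histogram` is a hypothetical helper, each sentence is simplified to a list of (upos, head) pairs with 1-based heads and 0 for the root, and the sentences are toy data.

```python
from collections import Counter

def child_degree_histogram(sentences, tag):
    """Map number of children -> number of `tag` words with that many children."""
    histogram = Counter()
    for sentence in sentences:
        # Count how many words point at each head position (0 is the root).
        n_children = Counter(head for _, head in sentence if head != 0)
        for position, (upos, _) in enumerate(sentence, start=1):
            if upos == tag:
                histogram[n_children[position]] += 1
    return histogram

sentences = [
    [("VERB", 0), ("NUM", 3), ("SYM", 1)],   # "%" with one nummod child
    [("SYM", 0), ("NUM", 1), ("PUNCT", 1)],  # a symbol with two children
    [("SYM", 0)],                            # a one-word sentence: a leaf
]
histogram = child_degree_histogram(sentences, "SYM")
leaves = histogram[0]
highest_degree = max(histogram)
print(dict(histogram), leaves, highest_degree)
```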
Children of SYM nodes are attached using 21 different relations: nummod (1666; 45% instances), case (779; 21% instances), punct (600; 16% instances), nmod (140; 4% instances), conj (105; 3% instances), det (97; 3% instances), parataxis (96; 3% instances), nsubj (53; 1% instances), cc (42; 1% instances), advmod (35; 1% instances), acl (32; 1% instances), cop (19; 1% instances), appos (10; 0% instances), discourse (10; 0% instances), vocative (8; 0% instances), amod (6; 0% instances), obl (5; 0% instances), mark (3; 0% instances), obj (3; 0% instances), acl:relcl (1; 0% instances), fixed (1; 0% instances)
Children of SYM nodes belong to 16 different parts of speech: NUM (1691; 46% instances), ADP (780; 21% instances), PUNCT (600; 16% instances), SYM (145; 4% instances), PROPN (119; 3% instances), NOUN (110; 3% instances), DET (97; 3% instances), VERB (48; 1% instances), CCONJ (41; 1% instances), ADV (29; 1% instances), AUX (20; 1% instances), X (11; 0% instances), ADJ (10; 0% instances), SCONJ (4; 0% instances), INTJ (3; 0% instances), PRON (3; 0% instances)