
This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-DANTEStocks: POS Tags: X

There are 344 X lemmas (4%), 363 X types (3%) and 1754 X tokens (2%). Out of 16 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: RT, #vale5, #infomoney, #petr4, $LIGT3, _, #ibov, rsrsr, #bovespa, #BR

The 10 most frequent X types: RT, #vale5, #infomoney, #petr4, $LIGT3, #ibov, rsrsr, #bovespa, #BR, #PETR3

The 10 most frequent ambiguous lemmas: RT (X 314, PROPN 5), #vale5 (X 178, PROPN 63), #petr4 (PROPN 115, X 24), $LIGT3 (X 41, PROPN 5), _ (X 31, PUNCT 13, NOUN 4, ADJ 1, ADV 1), #ibov (X 8, PROPN 5), #bovespa (X 4, PROPN 3), #PETR3 (X 20, PROPN 19), $PETR4 (X 17, PROPN 8), #BBDC4 (PROPN 11, X 10)

The 10 most frequent ambiguous types: RT (X 314, PROPN 5), #vale5 (X 178, PROPN 63), #petr4 (PROPN 115, X 24), $LIGT3 (X 41, PROPN 5), #ibov (X 8, PROPN 5), #bovespa (X 4, PROPN 3), #PETR3 (X 20, PROPN 19), $PETR4 (X 17, PROPN 8), aprov (X 16, NOUN 3), Webcast (X 12, PROPN 4)
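A lemma or type counts as ambiguous here when it is observed with more than one tag. The following is a minimal sketch of how such a list could be derived from (item, tag) pairs; it is not the script that generated this page. The function name `ambiguous_items` is hypothetical, the RT counts come from the list above, and the count of 3 for rsrsr is an invented placeholder.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs):
    """Return items (lemmas or types) that occur with more than one UPOS tag."""
    tags_by_item = defaultdict(Counter)
    for item, upos in pairs:
        tags_by_item[item][upos] += 1
    # Keep only items seen with at least two distinct tags.
    return {item: tags for item, tags in tags_by_item.items() if len(tags) > 1}


# RT counts are taken from the list above; the rsrsr count is a placeholder.
pairs = [("RT", "X")] * 314 + [("RT", "PROPN")] * 5 + [("rsrsr", "X")] * 3
result = ambiguous_items(pairs)
print({item: dict(tags) for item, tags in result.items()})
```

In this sketch rsrsr is dropped because it only ever appears as X, while RT is kept with both of its tag counts.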

Morphology

The form / lemma ratio of X is 1.055233 (the average of all parts of speech is 1.238049).
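The reported ratio is consistent with dividing the number of X types (363) by the number of X lemmas (344) given at the top of this page. A small check, assuming that is how the figure is defined (the helper name `form_lemma_ratio` is hypothetical):

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms per lemma."""
    return n_types / n_lemmas


# Counts of X types and X lemmas as reported on this page.
x_ratio = form_lemma_ratio(363, 344)
print(f"{x_ratio:.6f}")  # 1.055233
```

A value close to 1 means that most X lemmas appear in a single form, which fits a category dominated by hashtags and stock tickers.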

The highest number of forms (18) was observed with the lemma “_”: 6, 64, BROKER, Bancotario, abertura, bonificação, diretor, feira, final, lenga, market, niquel, onda, petr4, provento, sal, sena, side.

The 2nd highest number of forms (2) was observed with the lemma “#Petrobras”: #Petrobras, #Petrobrás.

The 3rd highest number of forms (2) was observed with the lemma “cai”: cai, cain.

X occurs with 1 feature: Foreign (118; 7% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _, i.e. no features (1636 tokens). Examples: RT, #vale5, #infomoney, #petr4, $LIGT3, #ibov, rsrsr, #bovespa, #BR, #PETR3

Relations

X nodes are attached to their parents using 23 different relations: parataxis (1341; 76% instances), discourse (181; 10% instances), nmod (81; 5% instances), flat:foreign (34; 2% instances), goeswith (26; 1% instances), dep (22; 1% instances), obj (14; 1% instances), obl (13; 1% instances), nsubj (7; 0% instances), root (7; 0% instances), conj (4; 0% instances), flat:name (4; 0% instances), vocative (4; 0% instances), case (3; 0% instances), appos (2; 0% instances), ccomp (2; 0% instances), fixed (2; 0% instances), list (2; 0% instances), advmod (1; 0% instances), amod (1; 0% instances), obl:agent (1; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)
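Counts like these can be reproduced from a CoNLL-U file by tallying the DEPREL column for every word whose UPOS is X. The sketch below is only an illustration, not the tool used to build this page: `count_relations` is a hypothetical helper, and the three-token sample sentence is invented rather than taken from the treebank.

```python
from collections import Counter

# Invented three-token sentence in CoNLL-U format (not from the treebank).
SAMPLE = (
    "# text = RT #vale5 sobe\n"
    "1\tRT\tRT\tX\t_\t_\t3\tparataxis\t_\t_\n"
    "2\t#vale5\t#vale5\tX\t_\t_\t3\tnsubj\t_\t_\n"
    "3\tsobe\tsubir\tVERB\t_\t_\t0\troot\t_\t_\n"
    "\n"
)


def count_relations(conllu_text: str, upos: str = "X") -> Counter:
    """Count the DEPREL values of all words tagged with the given UPOS."""
    counts = Counter()
    for line in conllu_text.splitlines():
        # Skip sentence separators and comment lines.
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10:
            continue
        token_id, tag, deprel = cols[0], cols[3], cols[7]
        # Skip multiword token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in token_id or "." in token_id:
            continue
        if tag == upos:
            counts[deprel] += 1
    return counts


relations = count_relations(SAMPLE)
for deprel, n in relations.most_common():
    print(deprel, n)
```

Passing a different `upos` value, such as `"VERB"`, gives the same tally for another part of speech.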

Parents of X nodes belong to 12 different parts of speech: VERB (1181; 67% instances), NOUN (356; 20% instances), PROPN (77; 4% instances), X (48; 3% instances), ADJ (32; 2% instances), ADV (18; 1% instances), PRON (11; 1% instances), SYM (11; 1% instances), NUM (9; 1% instances), no part of speech, i.e. the artificial root (7; 0% instances), AUX (2; 0% instances), INTJ (2; 0% instances)

1144 (65%) X nodes are leaves.

194 (11%) X nodes have one child.

332 (19%) X nodes have two children.

84 (5%) X nodes have three or more children.

The highest child degree of an X node is 10.
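The four child-count buckets above cover every X token, so they should add up to the 1754 X tokens reported at the top of the page. A quick arithmetic check, using only figures from this page (the bucket labels are chosen for illustration):

```python
# Number of X nodes by number of children, as reported above.
degree_counts = {"0": 1144, "1": 194, "2": 332, "3+": 84}

total = sum(degree_counts.values())
# Percentage share of each bucket, rounded to the nearest whole percent.
shares = {bucket: round(100 * n / total) for bucket, n in degree_counts.items()}

print(total)   # 1754
print(shares)  # {'0': 65, '1': 11, '2': 19, '3+': 5}
```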

Children of X nodes are attached using 24 different relations: punct (593; 50% instances), nmod (370; 31% instances), conj (40; 3% instances), flat:foreign (34; 3% instances), case (31; 3% instances), det (30; 3% instances), appos (27; 2% instances), parataxis (15; 1% instances), vocative (10; 1% instances), nsubj (7; 1% instances), advmod (6; 1% instances), amod (6; 1% instances), flat:name (4; 0% instances), cop (3; 0% instances), discourse (3; 0% instances), nummod (3; 0% instances), obl (3; 0% instances), cc (2; 0% instances), dep (2; 0% instances), fixed (2; 0% instances), list (2; 0% instances), mark (2; 0% instances), acl (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: PUNCT (593; 50% instances), PROPN (379; 32% instances), NOUN (60; 5% instances), X (48; 4% instances), ADP (31; 3% instances), DET (30; 3% instances), NUM (21; 2% instances), SYM (7; 1% instances), ADJ (6; 1% instances), ADV (6; 1% instances), PRON (4; 0% instances), VERB (4; 0% instances), AUX (3; 0% instances), CCONJ (2; 0% instances), SCONJ (2; 0% instances), INTJ (1; 0% instances)