home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: AUX

There are 58 AUX lemmas (1%), 332 AUX types (1%) and 7344 AUX tokens (2%). Out of 16 observed tags, the rank of AUX is: 5 in number of lemmas, 7 in number of types and 12 in number of tokens.

The 10 most frequent AUX lemmas: ser, _, ter, ir, estar, dever, poder, haver, vir, acabar

The 10 most frequent AUX types: é, foi, ser, foram, será, são, vai, pode, deve, era

The 10 most frequent ambiguous lemmas: ser (AUX 2710, VERB 1080, NOUN 7), _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1), ter (VERB 1049, AUX 492), ir (AUX 392, VERB 76), estar (VERB 856, AUX 304), dever (AUX 257, VERB 70, NOUN 6), poder (AUX 193, NOUN 45, VERB 2), haver (VERB 401, AUX 94), vir (VERB 77, AUX 70), acabar (AUX 56, VERB 44)

The 10 most frequent ambiguous types: é (AUX 1231, VERB 529, ADV 1, PART 1, PROPN 1), foi (AUX 1012, VERB 173, PART 1, PROPN 1), ser (AUX 475, VERB 100, NOUN 7, PROPN 2), foram (AUX 321, VERB 56), será (AUX 223, VERB 62), são (AUX 207, VERB 119, ADJ 5), vai (AUX 220, VERB 14), pode (AUX 202, VERB 4), deve (AUX 154, VERB 6), era (AUX 135, VERB 76, NOUN 6)

Morphology

The form / lemma ratio of AUX is 5.724138 (the average of all parts of speech is 3.372737).

The 1st highest number of forms (57) was observed with the lemma “_”: ’s, Começe, Estivemos, Todas, be, candidatos.Foi, começa, começam, começamos, começo, devem, didn’t, doesn’t, e, era, eram, esta, feito, foi, fomos, for, fora, foram, forem, fosse, fossem, fui, ireii, iremos, iria, iriam, irá, is, para, parem, pode, podem, podemos, possa, possam, possamos, posso, segue, seguem, seria, seriam, sfoi, sào, ta, tendo, tá, veio, vir, vira, virem, was, were.

The 2nd highest number of forms (29) was observed with the lemma “ter”: tem, temos, tenha, tenham, tenhamos, tenho, ter, terei, terem, teremos, teria, teriam, termos, terá, terão, teríamos, teve, tido, tinha, tinham, tive, tivemos, tiver, tiveram, tiverem, tivesse, tê, têm, tínhamos.

The 3rd highest number of forms (21) was observed with the lemma “poder”: Pudemos, podem, podendo, poder, poderei, poderem, poderemos, poderia, poderiam, poderá, poderão, poderíamos, podia, podiam, podíamos, pude, puderam, puderem, pudesse, pudessem, pôde.

AUX does not occur with any features.

Relations

AUX nodes are attached to their parents using 10 different relations: aux (2776; 38% instances), aux:pass (2438; 33% instances), cop (2112; 29% instances), root (5; 0% instances), dep (4; 0% instances), ccomp (3; 0% instances), conj (3; 0% instances), acl:relcl (1; 0% instances), det (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of AUX nodes belong to 11 different parts of speech: VERB (5110; 70% instances), NOUN (1796; 24% instances), PRON (267; 4% instances), PROPN (131; 2% instances), NUM (12; 0% instances), PART (11; 0% instances), (5; 0% instances), ADJ (3; 0% instances), AUX (3; 0% instances), SYM (3; 0% instances), X (3; 0% instances)

7324 (100%) AUX nodes are leaves.

9 (0%) AUX nodes have one child.

3 (0%) AUX nodes have two children.

8 (0%) AUX nodes have three or more children.

The highest child degree of a AUX node is 5.

Children of AUX nodes are attached using 13 different relations: punct (12; 26% instances), nsubj (8; 17% instances), conj (5; 11% instances), nmod (4; 9% instances), cc (3; 6% instances), expl:pv (3; 6% instances), mark (3; 6% instances), advmod (2; 4% instances), obj (2; 4% instances), xcomp (2; 4% instances), acl (1; 2% instances), aux (1; 2% instances), dep (1; 2% instances)

Children of AUX nodes belong to 11 different parts of speech: PUNCT (12; 26% instances), NOUN (10; 21% instances), CCONJ (6; 13% instances), ADV (3; 6% instances), AUX (3; 6% instances), PART (3; 6% instances), PRON (3; 6% instances), VERB (3; 6% instances), NUM (2; 4% instances), ADJ (1; 2% instances), PROPN (1; 2% instances)