home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-BR: POS Tags: PART

There are 1 PART lemmas (5%), 74 PART types (0%) and 748 PART tokens (0%). Out of 14 observed tags, the rank of PART is: 9 in number of lemmas, 12 in number of types and 13 in number of tokens.

The 10 most frequent PART lemmas: _

The 10 most frequent PART types: se, ex, vice, pré, auto, claro, latino, pós, recém, ai

The 10 most frequent ambiguous lemmas: _ (NOUN 57316, PUNCT 42033, PROPN 32948, ADP 30871, VERB 29700, DET 26122, ADJ 15107, CCONJ 10984, ADV 9773, NUM 8491, PRON 7392, AUX 5242, PART 748, X 539)

The 10 most frequent ambiguous types: se (PRON 755, PART 392, CCONJ 186, ADP 3, PROPN 1), ex (PART 145, NOUN 1, X 1), vice (PART 45, NOUN 11, ADJ 3), pré (PART 34, ADJ 1), claro (ADJ 28, PART 5, NOUN 2), latino (PART 7, ADJ 3), recém (PART 5, ADV 1), ai (PART 3, ADV 2), (ADV 13, PART 1), bem (ADV 140, NOUN 6, PART 2)

Morphology

The form / lemma ratio of PART is 74.000000 (the average of all parts of speech is 1851.578947).

The 1st highest number of forms (74) was observed with the lemma “_”: ’s, Agora, Avante, Cara, Desculpe, Nè, Ok, Olá, Oxalá, Sucesso, afro, ai, alvi, ante, anti, ar, arqui, atenção, auto, aí, bem, claro, co, contra, cyber, eba, então, ex, extra, foi, franco, germano, greco, grão, hein, hélio, in, infanto, infra, inter, intra, ir, latino, lá, mamilo, micro, multi, on, pan, para, pois, prático, pré, pró, pós, pô, público, recém, rs, s, se, su, sub, supra, tele, to, tá, ultra, utz, vice, viu, á, ão, é.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 18 different relations: expl:pv (398; 53% instances), nmod (83; 11% instances), nsubj (51; 7% instances), conj (47; 6% instances), dep (38; 5% instances), amod (36; 5% instances), appos (34; 5% instances), obj (27; 4% instances), root (12; 2% instances), advmod (7; 1% instances), nsubj:pass (6; 1% instances), mark (2; 0% instances), parataxis (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), cop (1; 0% instances), flat (1; 0% instances), iobj (1; 0% instances)

Parents of PART nodes belong to 10 different parts of speech: VERB (488; 65% instances), NOUN (139; 19% instances), PROPN (40; 5% instances), ADJ (39; 5% instances), PART (18; 2% instances), (12; 2% instances), PRON (5; 1% instances), ADV (4; 1% instances), AUX (2; 0% instances), NUM (1; 0% instances)

459 (61%) PART nodes are leaves.

18 (2%) PART nodes have one child.

19 (3%) PART nodes have two children.

252 (34%) PART nodes have three or more children.

The highest child degree of a PART node is 9.

Children of PART nodes are attached using 22 different relations: punct (373; 30% instances), flat (256; 21% instances), det (166; 13% instances), nmod (106; 9% instances), appos (98; 8% instances), case (90; 7% instances), conj (35; 3% instances), amod (28; 2% instances), cc (28; 2% instances), acl:relcl (12; 1% instances), acl:part (11; 1% instances), cop (11; 1% instances), det:poss (8; 1% instances), nsubj (8; 1% instances), advmod (5; 0% instances), nummod (3; 0% instances), advcl (1; 0% instances), expl:pv (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Children of PART nodes belong to 12 different parts of speech: PUNCT (373; 30% instances), NOUN (296; 24% instances), PROPN (178; 14% instances), DET (174; 14% instances), ADP (90; 7% instances), VERB (37; 3% instances), ADJ (33; 3% instances), CCONJ (29; 2% instances), PART (18; 1% instances), ADV (6; 0% instances), NUM (5; 0% instances), PRON (5; 0% instances)