This page pertains to UD version 2.

Treebank Statistics: UD_Portuguese-GSD: POS Tags: DET

There are 6 DET lemmas (0%), 125 DET types (0%) and 47519 DET tokens (15%). Out of 16 observed tags, the rank of DET is: 10 in number of lemmas, 11 in number of types and 3 in number of tokens.
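Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence, `read_tokens` and `pos_counts` are hypothetical names introduced for illustration, and it assumes that types are counted case-sensitively over the FORM column.

```python
from collections import Counter

# Tiny hand-made CoNLL-U fragment (hypothetical, not taken from the treebank).
SAMPLE = """\
# text = O menino viu a casa.
1\tO\to\tDET\t_\t_\t2\tdet\t_\t_
2\tmenino\tmenino\tNOUN\t_\t_\t3\tnsubj\t_\t_
3\tviu\tver\tVERB\t_\t_\t0\troot\t_\t_
4\ta\to\tDET\t_\t_\t5\tdet\t_\t_
5\tcasa\tcasa\tNOUN\t_\t_\t3\tobj\t_\t_
6\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_
"""

def read_tokens(conllu_text):
    """Yield the 10 columns of every ordinary token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        yield cols

def pos_counts(conllu_text, upos):
    """Return (lemma counts, type counts, total token count) for one UPOS tag."""
    lemmas, types = Counter(), Counter()
    total = 0
    for cols in read_tokens(conllu_text):
        total += 1
        if cols[3] == upos:        # column 4 is UPOS
            types[cols[1]] += 1    # column 2 is FORM
            lemmas[cols[2]] += 1   # column 3 is LEMMA
    return lemmas, types, total

lemmas, types, total = pos_counts(SAMPLE, "DET")
print(len(lemmas), len(types), sum(types.values()), total)  # 1 2 2 6
```

In the sample, the two DET tokens share one lemma but have two distinct forms, so DET accounts for 2 of 6 tokens.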

All 6 DET lemmas, ordered by frequency: o, _, um, este, seu, a

The 10 most frequent DET types: o, a, os, as, um, uma, sua, seu, seus, cada

The 3 ambiguous lemmas, ordered by frequency: _ (PROPN 32806, ADP 9506, NUM 8462, PRON 7364, DET 4461, NOUN 3563, AUX 2298, CCONJ 1840, PUNCT 1596, VERB 1247, SYM 1008, PART 746, ADJ 703, X 526, ADV 231, SCONJ 1), um (DET 3305, NOUN 1), a (ADP 2336, DET 1)

The 10 most frequent ambiguous types: o (DET 16553, PRON 226, ADP 1, PROPN 1, X 1), a (DET 13305, ADP 3852, PRON 64, VERB 5, X 2, CCONJ 1, PROPN 1), os (DET 3843, PRON 36, PROPN 1, X 1), as (DET 2472, PRON 15, ADP 13), um (DET 1701, PRON 175, NUM 120, NOUN 1), uma (DET 1630, NUM 89, PRON 87), sua (DET 513, PRON 2), seu (DET 422, PRON 1), cada (DET 134, PRON 2), outros (DET 116, PRON 40)

Morphology

The form / lemma ratio of DET is 20.833333 (the average of all parts of speech is 3.372737).
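The ratio above is consistent with dividing the number of DET types (125) by the number of DET lemmas (6). A short check of that arithmetic, assuming this is how the figure is defined (`form_lemma_ratio` is a hypothetical helper name):

```python
def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_forms / n_lemmas

ratio = form_lemma_ratio(125, 6)
print(f"{ratio:.6f}")  # 20.833333
```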

The highest number of forms (125) was observed with the lemma “_”: Duas, Imensas, This, Três, Tua, WesleyA, a, aas, algum, alguma, algumas, alguns, ambas, ambos, aquela, aquele, aqueles, as, bastante, cada, casa, certa, certas, certo, certos, cuja, cujas, cujo, cujos, dado, de, demais, determinada, determinadas, determinado, determinados, diferente, diferentes, diversas, diversos, e, el, essa, essas, esse, esses, esta, estas, este, estes, flagrante, la, le, les, los, mais, menos, meu, meus, minha, minhas, muita, muitas, muito, muitos, múltiplos, nenhum, nenhuma, nossa, nossas, nosso, nossos, numerosas, nível, o, oa, onze, os, ourtos, outra, outras, outro, outros, pouca, poucas, pouco, poucos, pouquíssimos, quais, quaisquer, qual, qualquer, quantas, quantos, que, quão, seu, seus, sua, suas, sui, tais, tal, tanta, tantas, tanto, tantos, the, toda, todas, todo, todos, um, uma, umas, uns, varias, varios, vossa, vosso, várias, vários, your, à, às.

The 2nd highest number of forms (4) was observed with the lemma “o”: a, as, o, os.

The 3rd highest number of forms (2) was observed with the lemma “um”: um, uma.

DET occurs with 6 features: Gender (21301; 45% instances), Number (21301; 45% instances), PronType (21299; 45% instances), Definite (21298; 45% instances), Foreign (2; 0% instances), Poss (1; 0% instances)

DET occurs with 9 feature-value pairs: Definite=Def, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Poss=Yes, PronType=Art, PronType=Prs

DET occurs with 8 feature combinations. The most frequent feature combination is _ (no features; 26216 tokens). Examples: o, a, os, um, uma, as, sua, seu, seus, cada

Relations

DET nodes are attached to their parents using 13 different relations: det (45825; 96% instances), det:poss (1607; 3% instances), mark (34; 0% instances), dep (15; 0% instances), fixed (15; 0% instances), case (8; 0% instances), conj (5; 0% instances), nmod (2; 0% instances), nsubj (2; 0% instances), obj (2; 0% instances), root (2; 0% instances), advmod (1; 0% instances), cc (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (36504; 77% instances), PROPN (9351; 20% instances), VERB (377; 1% instances), PRON (352; 1% instances), NUM (323; 1% instances), ADV (277; 1% instances), PART (177; 0% instances), ADJ (60; 0% instances), ADP (51; 0% instances), SYM (19; 0% instances), X (15; 0% instances), DET (9; 0% instances), CCONJ (2; 0% instances), no part of speech, i.e. the artificial root (2; 0% instances)

47467 (100%) DET nodes are leaves.

15 (0%) DET nodes have one child.

34 (0%) DET nodes have two children.

3 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 6.
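The leaf and child-degree figures above can be derived from the HEAD column alone. The sketch below illustrates one way to do so on a hand-made sentence; the data and the name `child_degree_histogram` are hypothetical and not part of the tooling behind this page.

```python
from collections import Counter

# Hypothetical one-sentence fragment as (ID, UPOS, HEAD) triples; HEAD 0 is the root.
SENTENCE = [
    (1, "DET", 2),
    (2, "NOUN", 3),
    (3, "VERB", 0),
    (4, "DET", 5),
    (5, "NOUN", 3),
]

def child_degree_histogram(sentence, upos):
    """Map number of children -> how many nodes with the given UPOS have that many."""
    # Each token contributes one child to its head.
    n_children = Counter(head for _, _, head in sentence)
    # A Counter returns 0 for nodes that never appear as a head (leaves).
    return Counter(
        n_children[tok_id] for tok_id, pos, _ in sentence if pos == upos
    )

hist = child_degree_histogram(SENTENCE, "DET")
print(dict(hist))  # {0: 2}
```

Here both DET nodes are leaves, so the histogram has a single entry for degree 0. On a full treebank the same function would be applied per sentence and the histograms summed.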

Children of DET nodes are attached using 9 different relations: fixed (66; 69% instances), conj (7; 7% instances), cc (6; 6% instances), det (4; 4% instances), punct (4; 4% instances), case (3; 3% instances), nmod (3; 3% instances), cop (1; 1% instances), parataxis (1; 1% instances)

Children of DET nodes belong to 11 different parts of speech: NOUN (37; 39% instances), CCONJ (32; 34% instances), DET (9; 9% instances), ADP (5; 5% instances), PUNCT (4; 4% instances), ADJ (3; 3% instances), ADV (1; 1% instances), NUM (1; 1% instances), PRON (1; 1% instances), PROPN (1; 1% instances), VERB (1; 1% instances)