Treebank Statistics: UD_Portuguese-Porttinari: POS Tags: DET
There are 31 DET lemmas (0%), 88 DET types (0%) and 24362 DET tokens (14%).
Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 10 in number of types and 3 in number of tokens.
The 10 most frequent DET lemmas: o, um, seu, esse, este, outro, todo, meu, algum, mesmo
The 10 most frequent DET types: o, a, os, as, um, uma, sua, seu, esse, essa
The 10 most frequent ambiguous lemmas: o (DET 18936, PRON 762), um (DET 2281, NUM 309, PRON 48), seu (DET 758, PRON 8), esse (DET 531, PRON 60), este (DET 294, PRON 29), outro (DET 239, PRON 83), todo (DET 219, PRON 37, ADJ 4, NOUN 4), meu (DET 150, PRON 4), algum (DET 145, PRON 31), mesmo (DET 125, ADV 99, PRON 16)
The 10 most frequent ambiguous types: o (DET 7194, PRON 471), a (DET 6463, ADP 2025, PRON 114, PROPN 3), os (DET 1859, PRON 77), as (DET 1324, PRON 48), um (DET 1200, NUM 155, PRON 32), uma (DET 973, NUM 116, PRON 8), sua (DET 280, PRON 3), seu (DET 212, PRON 4), esse (DET 189, PRON 13), essa (DET 158, PRON 14)
- o
- a
- DET 6463: Viver a vida real , em vez de ficar com o celular em a mão .
- ADP 2025: Parecer de a Advocacia-Geral enviado a o STF é favorável a Aécio .
- PRON 114: A o mesmo tempo , Hefner adorava a celebridade , a sua e a de os outros .
- PROPN 3: Abriu com “ I’ve Got You Under My Skin “ e teve ótimos momentos , como “ The Lady is a Tramp “ e “ They Can’t Take “ .
- os
- as
- um
- uma
- sua
- seu
- esse
- essa
Morphology
The form / lemma ratio of DET is 2.838710 (the average of all parts of speech is 1.491519).
The 1st highest number of forms (4) was observed with the lemma “algum”: algum, alguma, algumas, alguns.
The 2nd highest number of forms (4) was observed with the lemma “aquele”: aquela, aquelas, aquele, aqueles.
The 3rd highest number of forms (4) was observed with the lemma “cujo”: cuja, cujas, cujo, cujos.
DET occurs with 7 features: PronType (24362; 100% instances), Number (24244; 100% instances), Gender (24062; 99% instances), Definite (21217; 87% instances), Person (1012; 4% instances), Poss (1012; 4% instances), ExtPos (57; 0% instances)
DET occurs with 17 feature-value pairs: Definite=Def, Definite=Ind, ExtPos=ADV, ExtPos=SCONJ, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel
DET occurs with 40 feature combinations.
The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (8086 tokens).
Examples: o
Relations
DET nodes are attached to their parents using 6 different relations: det (24123; 99% instances), fixed (176; 1% instances), advmod (52; 0% instances), mark (5; 0% instances), conj (4; 0% instances), obj (2; 0% instances)
Parents of DET nodes belong to 12 different parts of speech: NOUN (20342; 83% instances), PROPN (3393; 14% instances), ADP (175; 1% instances), X (157; 1% instances), PRON (121; 0% instances), NUM (101; 0% instances), VERB (32; 0% instances), ADJ (20; 0% instances), SYM (11; 0% instances), ADV (5; 0% instances), DET (4; 0% instances), SCONJ (1; 0% instances)
24296 (100%) DET nodes are leaves.
28 (0%) DET nodes have one child.
35 (0%) DET nodes have two children.
3 (0%) DET nodes have three or more children.
The highest child degree of a DET node is 4.
Children of DET nodes are attached using 4 different relations: fixed (94; 87% instances), punct (6; 6% instances), cc (4; 4% instances), conj (4; 4% instances)
Children of DET nodes belong to 6 different parts of speech: NOUN (56; 52% instances), ADV (33; 31% instances), PUNCT (6; 6% instances), SCONJ (5; 5% instances), CCONJ (4; 4% instances), DET (4; 4% instances)