
This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: DET

There are 89 DET lemmas (0%), 153 DET types (0%) and 60887 DET tokens (14%). Out of 17 observed tags, the rank of DET is: 8 in number of lemmas, 9 in number of types and 3 in number of tokens.
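Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the five-token sample are illustrative assumptions, and the sketch lowercases word forms when counting types, which may or may not match how the official statistics treat case.

```python
from collections import Counter

def pos_counts(conllu_text, upos="DET"):
    """Return (lemma, type, token) counts for one UPOS tag in CoNLL-U text."""
    lemmas, types = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas[lemma] += 1
            types[form.lower()] += 1  # assumption: types are case-insensitive
    return len(lemmas), len(types), sum(types.values())

# Hypothetical sample sentence, not taken from the treebank.
SAMPLE = "\n".join([
    "# text = La casa y el perro",
    "1\tLa\tel\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tcasa\tcasa\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\ty\ty\tCCONJ\t_\t_\t5\tcc\t_\t_",
    "4\tel\tel\tDET\t_\t_\t5\tdet\t_\t_",
    "5\tperro\tperro\tNOUN\t_\t_\t2\tconj\t_\t_",
])

print(pos_counts(SAMPLE))  # (1, 2, 2): one lemma, two types, two tokens
```

In the sample, both determiners share the lemma "el" but differ in form, which is why the lemma count is lower than the type count.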

The 10 most frequent DET lemmas: el, uno, su, este, otro, todo, ese, alguno, varios, mucho

The 10 most frequent DET types: el, la, los, un, las, una, su, sus, este, esta

The 10 most frequent ambiguous lemmas: el (DET 43535, PRON 1), uno (DET 7651, PRON 539, NUM 109, ADJ 2, NOUN 2, PROPN 2, X 1), su (DET 3998, ADJ 1), este (DET 1645, PRON 446, NOUN 45, PROPN 11), otro (DET 699, PRON 208, ADJ 4, NOUN 1, PROPN 1), todo (DET 624, PRON 247, PROPN 5, ADJ 1, NOUN 1), ese (DET 417, PRON 77), alguno (DET 276, PRON 75, ADJ 10, NOUN 8), varios (DET 242, PRON 5, ADJ 4), mucho (ADV 585, DET 220, PRON 148, ADJ 3, PROPN 2, X 1)

The 10 most frequent ambiguous types: el (DET 17640, PRON 1), la (DET 13334, PRON 399), los (DET 4939, PRON 246), un (DET 3886, NUM 53), las (DET 3146, PRON 100), una (DET 3269, PRON 181, NUM 35, NOUN 2, X 1), sus (DET 978, ADJ 1), este (DET 565, PRON 61, NOUN 35, AUX 4, VERB 1), esta (DET 454, AUX 36, PRON 31, VERB 23), otras (DET 217, PRON 27, ADJ 1, PROPN 1)

Morphology

The form / lemma ratio of DET is 1.719101 (the average of all parts of speech is 1.291521).
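The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas given at the top of this page. A short check, with variable names chosen only for illustration:

```python
# Figures quoted on this page for DET in UD_Spanish-GSD.
det_types = 153
det_lemmas = 89

ratio = det_types / det_lemmas
print(f"{ratio:.6f}")  # 1.719101
```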

The highest number of forms (9) was observed with the lemma “este”: esta, estas, este, estos, está, ésta, éstas, éste, éstos.

The 2nd highest number of forms (8) was observed with the lemma “el”: a, al, el, en, l’, la, las, los.

The 3rd highest number of forms (6) was observed with the lemma “alguno”: algun, alguna, algunas, alguno, algunos, algún.

DET occurs with 11 features: PronType (60884; 100% instances), Number (60713; 100% instances), Gender (56083; 92% instances), Definite (51196; 84% instances), Poss (4461; 7% instances), Person (4361; 7% instances), NumType (408; 1% instances), Number[psor] (361; 1% instances), Typo (22; 0% instances), Foreign (17; 0% instances), Degree (3; 0% instances)

DET occurs with 25 feature-value pairs: Definite=Def, Definite=Ind, Degree=Abs, Foreign=Yes, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Typo=Yes

DET occurs with 100 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (19653 tokens). Examples: el
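A feature combination such as the one above is the content of the FEATS column in CoNLL-U: pairs separated by `|`, each of the form `Name=Value`, with multiple values for one feature separated by commas. The helper below is a minimal sketch with a hypothetical name (`parse_feats`); it splits comma-separated values into a list, whereas the statistics on this page count a string like `PronType=Int,Rel` as a single feature-value pair.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into a dict of feature name -> list of values."""
    if feats == "_":
        return {}  # "_" marks an empty feature set
    result = {}
    for pair in feats.split("|"):
        name, _, values = pair.partition("=")
        result[name] = values.split(",")
    return result

most_frequent = parse_feats("Definite=Def|Gender=Masc|Number=Sing|PronType=Art")
print(most_frequent["PronType"])  # ['Art']
```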

Relations

DET nodes are attached to their parents using 23 different relations: det (60574; 99% instances), obl (106; 0% instances), nsubj (48; 0% instances), conj (39; 0% instances), nmod (33; 0% instances), root (19; 0% instances), obj (12; 0% instances), appos (10; 0% instances), dep (10; 0% instances), fixed (8; 0% instances), mark (6; 0% instances), flat (4; 0% instances), parataxis (4; 0% instances), acl:relcl (2; 0% instances), advcl (2; 0% instances), compound (2; 0% instances), nsubj:pass (2; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), obl:agent (1; 0% instances), obl:arg (1; 0% instances), orphan (1; 0% instances), xcomp (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (51039; 84% instances), PROPN (6224; 10% instances), NUM (1034; 2% instances), PRON (826; 1% instances), SYM (683; 1% instances), VERB (479; 1% instances), X (218; 0% instances), ADJ (211; 0% instances), ADV (86; 0% instances), ADP (42; 0% instances), DET (21; 0% instances), no part of speech, i.e. the artificial root node (19; 0% instances), CCONJ (5; 0% instances)

60637 (100%) DET nodes are leaves.

120 (0%) DET nodes have one child.

63 (0%) DET nodes have two children.

67 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 7.
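The four child-degree counts above partition the DET tokens, so they should sum to the 60887 tokens reported at the top of the page, and the rounded shares explain why everything except leaves shows as 0%. A small check, with dictionary keys chosen only for illustration:

```python
# Child-degree counts for DET nodes as quoted on this page.
degree_counts = {
    "leaf": 60637,
    "one child": 120,
    "two children": 63,
    "three or more children": 67,
}

total = sum(degree_counts.values())
# Percentages rounded to whole numbers, as on this page.
shares = {name: round(100 * count / total) for name, count in degree_counts.items()}

print(total)   # 60887
print(shares)  # {'leaf': 100, 'one child': 0, 'two children': 0, 'three or more children': 0}
```

Unrounded, leaves account for about 99.6% of DET nodes; the remaining 250 nodes with children make up roughly 0.4%.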

Children of DET nodes are attached using 21 different relations: nmod (103; 20% instances), punct (83; 16% instances), case (63; 12% instances), acl:relcl (48; 9% instances), cop (38; 7% instances), cc (34; 6% instances), nsubj (32; 6% instances), advmod (29; 6% instances), conj (25; 5% instances), fixed (14; 3% instances), amod (12; 2% instances), advcl (11; 2% instances), acl (9; 2% instances), mark (8; 2% instances), appos (4; 1% instances), det (4; 1% instances), dep (2; 0% instances), flat (2; 0% instances), aux (1; 0% instances), compound (1; 0% instances), nummod (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: NOUN (96; 18% instances), PUNCT (83; 16% instances), VERB (69; 13% instances), ADP (54; 10% instances), PROPN (47; 9% instances), CCONJ (42; 8% instances), AUX (39; 7% instances), ADV (32; 6% instances), DET (21; 4% instances), ADJ (16; 3% instances), SCONJ (12; 2% instances), PRON (9; 2% instances), NUM (3; 1% instances), SYM (1; 0% instances)