home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Spanish-GSD: POS Tags: DET

There are 93 DET lemmas (0%), 149 DET types (0%) and 60857 DET tokens (14%). Out of 17 observed tags, the rank of DET is: 8 in number of lemmas, 9 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: el, uno, su, este, otro, todo, ese, alguno, varios, mucho

The 10 most frequent DET types: el, la, los, un, las, una, su, sus, este, esta

The 10 most frequent ambiguous lemmas: uno (DET 7651, PRON 540, NUM 108, ADJ 2, NOUN 2, PROPN 2, X 1), su (DET 3998, ADJ 1), este (DET 1643, PRON 446, NOUN 45, PROPN 11), otro (DET 699, PRON 208, ADJ 3, NOUN 1, PROPN 1, X 1), todo (DET 621, PRON 247, PROPN 6, ADJ 1, NOUN 1, PART 1, VERB 1), ese (DET 417, PRON 77), alguno (DET 275, PRON 75, ADJ 10, NOUN 8), varios (DET 242, PRON 5, ADJ 4), mucho (ADV 585, DET 220, PRON 148, ADJ 3, PROPN 2, X 1), mi (DET 174, PRON 10, X 3)

The 10 most frequent ambiguous types: la (DET 13333, PRON 398), los (DET 4939, PRON 245), un (DET 3886, NUM 53), las (DET 3146, PRON 100), una (DET 3269, PRON 181, NUM 35, NOUN 2, X 1), sus (DET 978, ADJ 1), este (DET 565, PRON 61, NOUN 35, AUX 4, VERB 1), esta (DET 454, AUX 36, PRON 31, VERB 23), otras (DET 217, PRON 27, ADJ 1, PROPN 1), otros (DET 212, PRON 71, ADJ 1)

Morphology

The form / lemma ratio of DET is 1.602151 (the average of all parts of speech is 1.278515).

The 1st highest number of forms (8) was observed with the lemma “este”: esta, estas, este, estos, ésta, éstas, éste, éstos.

The 2nd highest number of forms (5) was observed with the lemma “alguno”: alguna, algunas, alguno, algunos, algún.

The 3rd highest number of forms (5) was observed with the lemma “uno”: un, una, unas, uno, unos.

DET occurs with 8 features: PronType (60857; 100% instances), Number (60694; 100% instances), Gender (56063; 92% instances), Definite (51173; 84% instances), Poss (4458; 7% instances), Person (4361; 7% instances), NumType (408; 1% instances), Degree (3; 0% instances)

DET occurs with 19 feature-value pairs: Definite=Def, Definite=Ind, Degree=Abs, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 75 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (19652 tokens). Examples: el

Relations

DET nodes are attached to their parents using 22 different relations: det (60547; 99% instances), advmod (84; 0% instances), nsubj (48; 0% instances), conj (38; 0% instances), nmod (33; 0% instances), obl (23; 0% instances), root (20; 0% instances), obj (12; 0% instances), appos (10; 0% instances), dep (10; 0% instances), fixed (8; 0% instances), mark (6; 0% instances), parataxis (4; 0% instances), acl:relcl (2; 0% instances), advcl (2; 0% instances), compound (2; 0% instances), flat (2; 0% instances), nsubj:pass (2; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), nummod (1; 0% instances)

Parents of DET nodes belong to 16 different parts of speech: NOUN (50952; 84% instances), PROPN (6254; 10% instances), NUM (1034; 2% instances), PRON (827; 1% instances), SYM (682; 1% instances), VERB (303; 0% instances), X (222; 0% instances), ADJ (221; 0% instances), SCONJ (166; 0% instances), ADV (89; 0% instances), ADP (59; 0% instances), (20; 0% instances), DET (18; 0% instances), CCONJ (5; 0% instances), AUX (4; 0% instances), PUNCT (1; 0% instances)

60621 (100%) DET nodes are leaves.

107 (0%) DET nodes have one child.

65 (0%) DET nodes have two children.

64 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 8.

Children of DET nodes are attached using 21 different relations: nmod (103; 20% instances), punct (66; 13% instances), case (61; 12% instances), acl:relcl (47; 9% instances), cop (39; 8% instances), cc (35; 7% instances), nsubj (32; 6% instances), advmod (29; 6% instances), conj (26; 5% instances), fixed (14; 3% instances), amod (12; 2% instances), advcl (11; 2% instances), acl (8; 2% instances), mark (8; 2% instances), appos (4; 1% instances), dep (2; 0% instances), det (2; 0% instances), flat (2; 0% instances), aux (1; 0% instances), compound (1; 0% instances), nummod (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: NOUN (98; 19% instances), VERB (67; 13% instances), PUNCT (66; 13% instances), ADP (52; 10% instances), PROPN (47; 9% instances), CCONJ (42; 8% instances), AUX (39; 8% instances), ADV (32; 6% instances), ADJ (18; 4% instances), DET (18; 4% instances), SCONJ (12; 2% instances), PRON (9; 2% instances), NUM (3; 1% instances), SYM (1; 0% instances)