This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home es/pos issue tracker

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Spanish)

There are 95 DET lemmas (0%), 153 DET types (0%) and 60872 DET tokens (14%). Out of 16 observed tags, the rank of DET is: 9 in number of lemmas, 9 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: el, uno, su, este, otro, todo, ese, alguno, varios, mucho

The 10 most frequent DET types: el, la, los, un, las, una, su, sus, este, esta

The 10 most frequent ambiguous lemmas: uno (DET 7651, PRON 540, NUM 108, NOUN 2, PROPN 2, ADJ 2, X 1), su (DET 3998, ADJ 1), este (DET 1643, PRON 446, VERB 47, NOUN 45, AUX 24, PROPN 11), otro (DET 699, PRON 208, ADJ 3, X 1, NOUN 1, PROPN 1), todo (DET 621, PRON 247, PROPN 6, VERB 1, ADJ 1, PART 1, NOUN 1), ese (DET 417, PRON 77), alguno (DET 275, PRON 75, ADJ 10, NOUN 8), varios (DET 242, PRON 5, ADJ 4), mucho (ADV 585, DET 220, PRON 148, ADJ 3, PROPN 2, X 1), mi (DET 174, PRON 10, X 3)

The 10 most frequent ambiguous types: la (DET 13333, PRON 398), los (DET 4939, PRON 245), un (DET 3886, NUM 53), las (DET 3146, PRON 100), una (DET 3269, PRON 181, NUM 35, NOUN 2, X 1), sus (DET 978, ADJ 1), este (DET 565, PRON 61, NOUN 35, VERB 3, AUX 2), esta (DET 454, VERB 41, PRON 31, AUX 18), otras (DET 217, PRON 27, PROPN 1, ADJ 1), otros (DET 212, PRON 71, ADJ 1)

Morphology

The form / lemma ratio of DET is 1.610526 (the average of all parts of speech is 1.255739).

The 1st highest number of forms (8) was observed with the lemma “este”: esta, estas, este, estos, ésta, éstas, éste, éstos.

The 2nd highest number of forms (5) was observed with the lemma “alguno”: alguna, algunas, alguno, algunos, algún.

The 3rd highest number of forms (5) was observed with the lemma “uno”: un, una, unas, uno, unos.

DET occurs with 10 features: es-feat/PronType (60872; 100% instances), es-feat/Number (60709; 100% instances), es-feat/Gender (56064; 92% instances), es-feat/Definite (51173; 84% instances), es-feat/Poss (4458; 7% instances), es-feat/Person (4377; 7% instances), es-feat/NumType (408; 1% instances), es-feat/Case (15; 0% instances), es-feat/Degree (3; 0% instances), es-feat/PrepCase (2; 0% instances)

DET occurs with 23 feature-value pairs: Case=Acc, Case=Acc,Dat, Case=Dat, Definite=Def, Definite=Ind, Degree=Abs, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PrepCase=Npr, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Tot

DET occurs with 79 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (19652 tokens). Examples: el

Relations

DET nodes are attached to their parents using 22 different relations: es-dep/det (60559; 99% instances), es-dep/advmod (86; 0% instances), es-dep/nmod (54; 0% instances), es-dep/nsubj (48; 0% instances), es-dep/conj (38; 0% instances), es-dep/root (20; 0% instances), es-dep/dobj (12; 0% instances), es-dep/appos (10; 0% instances), es-dep/dep (10; 0% instances), es-dep/mwe (8; 0% instances), es-dep/mark (6; 0% instances), es-dep/parataxis (4; 0% instances), es-dep/case (3; 0% instances), es-dep/acl (2; 0% instances), es-dep/acl:relcl (2; 0% instances), es-dep/compound (2; 0% instances), es-dep/name (2; 0% instances), es-dep/nsubjpass (2; 0% instances), es-dep/advcl (1; 0% instances), es-dep/csubj (1; 0% instances), es-dep/iobj (1; 0% instances), es-dep/nummod (1; 0% instances)

Parents of DET nodes belong to 16 different parts of speech: NOUN (50953; 84% instances), PROPN (6269; 10% instances), NUM (1034; 2% instances), PRON (827; 1% instances), SYM (682; 1% instances), VERB (300; 0% instances), X (222; 0% instances), ADJ (221; 0% instances), SCONJ (166; 0% instances), ADV (89; 0% instances), ADP (61; 0% instances), ROOT (20; 0% instances), DET (18; 0% instances), CONJ (5; 0% instances), AUX (4; 0% instances), PUNCT (1; 0% instances)

60651 (100%) DET nodes are leaves.

103 (0%) DET nodes have one child.

62 (0%) DET nodes have two children.

56 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 22 different relations: es-dep/nmod (103; 22% instances), es-dep/case (61; 13% instances), es-dep/punct (57; 12% instances), es-dep/acl:relcl (47; 10% instances), es-dep/cop (39; 8% instances), es-dep/nsubj (32; 7% instances), es-dep/advmod (28; 6% instances), es-dep/conj (24; 5% instances), es-dep/cc (20; 4% instances), es-dep/mwe (14; 3% instances), es-dep/amod (12; 3% instances), es-dep/advcl (11; 2% instances), es-dep/acl (8; 2% instances), es-dep/mark (8; 2% instances), es-dep/appos (4; 1% instances), es-dep/dep (2; 0% instances), es-dep/det (2; 0% instances), es-dep/name (2; 0% instances), es-dep/aux (1; 0% instances), es-dep/compound (1; 0% instances), es-dep/neg (1; 0% instances), es-dep/nummod (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: VERB (104; 22% instances), NOUN (97; 20% instances), PUNCT (57; 12% instances), ADP (50; 10% instances), PROPN (47; 10% instances), ADV (33; 7% instances), CONJ (29; 6% instances), DET (18; 4% instances), ADJ (17; 4% instances), SCONJ (12; 3% instances), PRON (9; 2% instances), NUM (3; 1% instances), AUX (1; 0% instances), SYM (1; 0% instances)


Treebank Statistics (UD_Spanish-AnCora)

There are 59 DET lemmas (0%), 122 DET types (0%) and 85533 DET tokens (15%). Out of 17 observed tags, the rank of DET is: 10 in number of lemmas, 10 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: el, uno, su, este, todo, otro, ese, la, alguno, mismo

The 10 most frequent DET types: el, la, los, un, las, una, su, sus, lo, este

The 10 most frequent ambiguous lemmas: el (DET 62637, PRON 1), uno (DET 9472, PRON 739, NOUN 12, NUM 6, X 1), su (DET 4875, PRON 43), este (DET 1989, PRON 341, NOUN 12, ADJ 3, PROPN 1), todo (DET 882, PRON 414, NOUN 2), otro (DET 742, PRON 316, ADJ 20), ese (DET 709, PRON 275, NOUN 2), alguno (DET 343, PRON 147, ADJ 9), mismo (DET 314, ADJ 82, PRON 79, ADV 7), mucho (ADV 694, DET 270, PRON 89, ADJ 1)

The 10 most frequent ambiguous types: la (DET 18367, PRON 310), los (DET 7827, PRON 125), un (DET 5220, PRON 24, NUM 17), las (DET 4936, PRON 107), una (DET 3729, PRON 149, NUM 2, VERB 2, NOUN 1), su (DET 3504, PRON 21), sus (DET 1247, PRON 24), lo (DET 1164, PRON 783), este (DET 804, PRON 13, NOUN 11, ADJ 3), esta (DET 629, PRON 5)

Morphology

The form / lemma ratio of DET is 2.067797 (the average of all parts of speech is 1.501056).

The 1st highest number of forms (5) was observed with the lemma “alguno”: alguna, algunas, alguno, algunos, algún.

The 2nd highest number of forms (5) was observed with the lemma “el”: el, la, las, lo, los.

The 3rd highest number of forms (5) was observed with the lemma “uno”: un, una, unas, uno, unos.

DET occurs with 8 features: es-feat/PronType (85533; 100% instances), es-feat/Number (85467; 100% instances), es-feat/Gender (78722; 92% instances), es-feat/Definite (73130; 85% instances), es-feat/Person (5406; 6% instances), es-feat/Poss (5406; 6% instances), es-feat/Number[psor] (495; 1% instances), es-feat/NumType (16; 0% instances)

DET occurs with 20 feature-value pairs: Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PronType=Tot

DET occurs with 50 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (28409 tokens). Examples: el, lo

Relations

DET nodes are attached to their parents using 19 different relations: es-dep/det (84195; 98% instances), es-dep/mwe (1044; 1% instances), es-dep/advmod (61; 0% instances), es-dep/conj (55; 0% instances), es-dep/cop (47; 0% instances), es-dep/compound (46; 0% instances), es-dep/nmod (20; 0% instances), es-dep/mark (17; 0% instances), es-dep/case (9; 0% instances), es-dep/root (8; 0% instances), es-dep/dobj (6; 0% instances), es-dep/nsubj (6; 0% instances), es-dep/ccomp (5; 0% instances), es-dep/name (5; 0% instances), es-dep/acl (2; 0% instances), es-dep/appos (2; 0% instances), es-dep/cc (2; 0% instances), es-dep/dep (2; 0% instances), es-dep/advcl (1; 0% instances)

Parents of DET nodes belong to 17 different parts of speech: NOUN (66842; 78% instances), PROPN (9792; 11% instances), VERB (1677; 2% instances), NUM (1597; 2% instances), PRON (1480; 2% instances), ADJ (1421; 2% instances), DET (1356; 2% instances), ADP (876; 1% instances), SYM (317; 0% instances), ADV (131; 0% instances), AUX (24; 0% instances), ROOT (8; 0% instances), CONJ (5; 0% instances), PART (3; 0% instances), X (2; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)

82794 (97%) DET nodes are leaves.

2088 (2%) DET nodes have one child.

348 (0%) DET nodes have two children.

303 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 26 different relations: es-dep/det (1311; 34% instances), es-dep/nummod (753; 19% instances), es-dep/case (543; 14% instances), es-dep/nmod (224; 6% instances), es-dep/punct (212; 5% instances), es-dep/mwe (113; 3% instances), es-dep/name (93; 2% instances), es-dep/mark (92; 2% instances), es-dep/advmod (90; 2% instances), es-dep/amod (86; 2% instances), es-dep/appos (76; 2% instances), es-dep/nsubj (75; 2% instances), es-dep/dobj (67; 2% instances), es-dep/compound (53; 1% instances), es-dep/cc (32; 1% instances), es-dep/conj (27; 1% instances), es-dep/acl (18; 0% instances), es-dep/advcl (9; 0% instances), es-dep/cop (8; 0% instances), es-dep/neg (7; 0% instances), es-dep/dep (6; 0% instances), es-dep/parataxis (6; 0% instances), es-dep/aux (4; 0% instances), es-dep/csubj (2; 0% instances), es-dep/xcomp (2; 0% instances), es-dep/ccomp (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: DET (1356; 35% instances), NUM (758; 19% instances), ADP (557; 14% instances), NOUN (299; 8% instances), PROPN (251; 6% instances), PUNCT (213; 5% instances), ADV (141; 4% instances), ADJ (113; 3% instances), SCONJ (91; 2% instances), VERB (42; 1% instances), PRON (36; 1% instances), CONJ (28; 1% instances), AUX (15; 0% instances), SYM (10; 0% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]