This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home nl/pos issue tracker

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Dutch)

There are 43 DET lemmas (0%), 42 DET types (0%) and 21850 DET tokens (10%). Out of 16 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 4 in number of tokens.

The 10 most frequent DET lemmas: de, een, het, veel, meer, der, weinig, meest, minder, zoveel

The 10 most frequent DET types: de, een, het, veel, meer, der, vele, meeste, weinig, minder

The 10 most frequent ambiguous lemmas: de (DET 12552, ADP 353, PROPN 19, ADJ 1, PRON 1, X 1), een (DET 4476, X 50, NUM 21, PROPN 3, CONJ 2), het (DET 4283, PRON 1155, X 222, PROPN 8), veel (DET 154, PRON 142), meer (PRON 122, ADV 116, DET 98, X 25, NOUN 4, ADJ 1, PROPN 1), der (PROPN 72, DET 63, X 7), weinig (PRON 45, DET 36, X 1), meest (DET 34, ADV 34, X 10, PRON 6), minder (PRON 47, DET 25, AUX 1, VERB 1), zoveel (DET 21, PRON 14)

The 10 most frequent ambiguous types: de (DET 10964, ADP 353, PROPN 13), een (DET 4196, X 50, NUM 21, PROPN 2), het (DET 3802, PRON 793, X 222, PROPN 8), veel (PRON 122, DET 108), meer (ADV 116, PRON 115, DET 92, X 25), der (PROPN 72, DET 66, X 7), vele (DET 46, PRON 7), meeste (DET 34, PRON 3), weinig (PRON 43, DET 32, X 1), minder (PRON 59, DET 25)

Morphology

The form / lemma ratio of DET is 0.976744 (the average of all parts of speech is 1.258498).

The 1st highest number of forms (3) was observed with the lemma “min”: min, minder, mindere.

The 2nd highest number of forms (2) was observed with the lemma “de”: de, der.

The 3rd highest number of forms (2) was observed with the lemma “een”: ‘n, een.

DET occurs with 10 features: PronType (21850; 100% instances), Definite (21436; 98% instances), Number (4536; 21% instances), Gender (4323; 20% instances), Degree (410; 2% instances), NumType (410; 2% instances), Case (169; 1% instances), Number[psor] (2; 0% instances), Person (2; 0% instances), Poss (2; 0% instances)

DET occurs with 22 feature-value pairs: Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Com, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Person=1, Poss=Yes, PronType=Art, PronType=Ind, PronType=Prs, PronType=Tot

DET occurs with 19 feature combinations. The most frequent feature combination is Definite=Def|PronType=Art (12554 tokens). Examples: de

Relations

DET nodes are attached to their parents using 21 different relations: det (21263; 97% instances), det:nummod (410; 2% instances), nsubj (65; 0% instances), dobj (35; 0% instances), root (15; 0% instances), advmod (14; 0% instances), appos (13; 0% instances), dep (10; 0% instances), conj (6; 0% instances), nmod (3; 0% instances), acl (2; 0% instances), cc (2; 0% instances), compound (2; 0% instances), iobj (2; 0% instances), parataxis (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), aux (1; 0% instances), name (1; 0% instances), neg (1; 0% instances), xcomp (1; 0% instances)

Parents of DET nodes belong to 16 different parts of speech: NOUN (18960; 87% instances), PROPN (1683; 8% instances), VERB (435; 2% instances), ADJ (293; 1% instances), X (284; 1% instances), NUM (65; 0% instances), PRON (60; 0% instances), AUX (21; 0% instances), ADV (18; 0% instances), ROOT (15; 0% instances), DET (9; 0% instances), ADP (2; 0% instances), CONJ (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)

21561 (99%) DET nodes are leaves.

182 (1%) DET nodes have one child.

58 (0%) DET nodes have two children.

49 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 9.

Children of DET nodes are attached using 24 different relations: nmod (136; 27% instances), advmod (113; 22% instances), case (56; 11% instances), punct (38; 7% instances), advcl (29; 6% instances), cc (24; 5% instances), conj (22; 4% instances), cop (17; 3% instances), neg (13; 3% instances), nsubj (10; 2% instances), mark (9; 2% instances), compound (7; 1% instances), det (6; 1% instances), appos (5; 1% instances), amod (4; 1% instances), dobj (4; 1% instances), aux (3; 1% instances), ccomp (3; 1% instances), csubj (3; 1% instances), dep (3; 1% instances), name (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: NOUN (136; 27% instances), ADV (67; 13% instances), ADP (50; 10% instances), PRON (40; 8% instances), PUNCT (38; 7% instances), NUM (33; 6% instances), VERB (32; 6% instances), ADJ (30; 6% instances), AUX (25; 5% instances), PROPN (23; 4% instances), CONJ (18; 4% instances), DET (9; 2% instances), X (6; 1% instances), SCONJ (5; 1% instances)


Treebank Statistics (UD_Dutch-LassySmall)

There are 32 DET lemmas (0%), 49 DET types (0%) and 11229 DET tokens (11%). Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 4 in number of tokens.

The 10 most frequent DET lemmas: de, het, een, zijn, deze, haar, hun, dit, die, veel

The 10 most frequent DET types: de, het, een, zijn, deze, haar, hun, dit, alle, geen

The 10 most frequent ambiguous lemmas: de (DET 5884, PROPN 73, X 6), het (DET 2199, PRON 211, PROPN 2), zijn (AUX 1326, DET 423, VERB 177, PRON 22), deze (DET 204, PRON 44), haar (DET 153, PRON 29, NOUN 4), dit (PRON 77, DET 77), die (PRON 411, DET 66), veel (PRON 248, DET 66), geen (DET 65, PRON 3), al (ADV 74, DET 63, PRON 22)

The 10 most frequent ambiguous types: de (DET 4905, PROPN 73, X 6), het (DET 1866, PRON 126), een (DET 1598, NUM 45), zijn (DET 388, AUX 219, VERB 57, PRON 11), deze (DET 137, PRON 20), haar (DET 146, PRON 17, NOUN 4), dit (DET 49, PRON 34), alle (DET 58, PRON 9), geen (DET 60, PRON 2), die (PRON 404, DET 58)

Morphology

The form / lemma ratio of DET is 1.531250 (the average of all parts of speech is 1.179900).

The 1st highest number of forms (5) was observed with the lemma “de”: ’s, de, der, des, dé.

The 2nd highest number of forms (2) was observed with the lemma “beide”: beide, beider.

The 3rd highest number of forms (2) was observed with the lemma “die”: die, diens.

DET occurs with 1 features: Definite (9757; 87% instances)

DET occurs with 2 feature-value pairs: Definite=Def, Definite=Ind

DET occurs with 3 feature combinations. The most frequent feature combination is Definite=Def (8091 tokens). Examples: de, het, der, ‘s, ‘t, des, dé

Relations

DET nodes are attached to their parents using 13 different relations: det (10683; 95% instances), mwe (288; 3% instances), root (61; 1% instances), conj (45; 0% instances), nmod (41; 0% instances), appos (39; 0% instances), nsubj (30; 0% instances), parataxis (19; 0% instances), amod (10; 0% instances), dobj (8; 0% instances), acl (3; 0% instances), advcl (1; 0% instances), compound (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (9067; 81% instances), PROPN (1087; 10% instances), ADJ (467; 4% instances), VERB (209; 2% instances), NUM (95; 1% instances), ADP (90; 1% instances), X (62; 1% instances), ROOT (61; 1% instances), DET (53; 0% instances), ADV (14; 0% instances), PRON (14; 0% instances), SYM (9; 0% instances), SCONJ (1; 0% instances)

10972 (98%) DET nodes are leaves.

75 (1%) DET nodes have one child.

48 (0%) DET nodes have two children.

134 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 16.

Children of DET nodes are attached using 19 different relations: mwe (495; 52% instances), punct (196; 21% instances), conj (55; 6% instances), nmod (41; 4% instances), case (28; 3% instances), name (27; 3% instances), nummod (25; 3% instances), acl (21; 2% instances), cc (17; 2% instances), parataxis (15; 2% instances), appos (14; 1% instances), advmod (4; 0% instances), amod (4; 0% instances), mark (4; 0% instances), cop (3; 0% instances), det (3; 0% instances), nsubj (2; 0% instances), advcl (1; 0% instances), neg (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: NOUN (269; 28% instances), PUNCT (202; 21% instances), PROPN (123; 13% instances), ADP (85; 9% instances), ADJ (69; 7% instances), NUM (67; 7% instances), DET (53; 6% instances), CONJ (25; 3% instances), VERB (23; 2% instances), SYM (13; 1% instances), ADV (11; 1% instances), PRON (8; 1% instances), AUX (3; 0% instances), SCONJ (3; 0% instances), X (2; 0% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]