DET: determiner
This document is a placeholder for the language-specific documentation for DET.
Treebank Statistics (UD_Dutch)
There are 43 DET lemmas (0%), 42 DET types (0%) and 21850 DET tokens (10%).
Out of 16 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent DET lemmas: de, een, het, veel, meer, der, weinig, meest, minder, zoveel
The 10 most frequent DET types: de, een, het, veel, meer, der, vele, meeste, weinig, minder
The 10 most frequent ambiguous lemmas: de (DET 12552, ADP 353, PROPN 19, ADJ 1, PRON 1, X 1), een (DET 4476, X 50, NUM 21, PROPN 3, CONJ 2), het (DET 4283, PRON 1155, X 222, PROPN 8), veel (DET 154, PRON 142), meer (PRON 122, ADV 116, DET 98, X 25, NOUN 4, ADJ 1, PROPN 1), der (PROPN 72, DET 63, X 7), weinig (PRON 45, DET 36, X 1), meest (DET 34, ADV 34, X 10, PRON 6), minder (PRON 47, DET 25, AUX 1, VERB 1), zoveel (DET 21, PRON 14)
The 10 most frequent ambiguous types: de (DET 10964, ADP 353, PROPN 13), een (DET 4196, X 50, NUM 21, PROPN 2), het (DET 3802, PRON 793, X 222, PROPN 8), veel (PRON 122, DET 108), meer (ADV 116, PRON 115, DET 92, X 25), der (PROPN 72, DET 66, X 7), vele (DET 46, PRON 7), meeste (DET 34, PRON 3), weinig (PRON 43, DET 32, X 1), minder (PRON 59, DET 25)
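The ambiguity counts above pair each lemma (or word form) with the tags it received. A minimal sketch of how such counts could be tallied from CoNLL-U data, assuming the standard ten-column, tab-separated format. The sample rows and the `tag_counts_by_lemma` helper are invented for illustration and are not taken from the treebank or from the tool that produced this page:

```python
from collections import Counter, defaultdict

# Hypothetical CoNLL-U token lines (tab-separated, 10 columns); not real treebank data.
SAMPLE = "\n".join([
    "1\tHet\thet\tDET\t_\t_\t2\tdet\t_\t_",
    "2\thuis\thuis\tNOUN\t_\t_\t0\troot\t_\t_",
    "",
    "1\tHet\thet\tPRON\t_\t_\t2\tnsubj\t_\t_",
    "2\tregent\tregenen\tVERB\t_\t_\t0\troot\t_\t_",
])

def tag_counts_by_lemma(conllu_text):
    """Map each lemma to a Counter of the UPOS tags it occurs with."""
    counts = defaultdict(Counter)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        lemma, upos = cols[2], cols[3]
        counts[lemma][upos] += 1
    return counts

counts = tag_counts_by_lemma(SAMPLE)
# A lemma is ambiguous if it occurs with more than one tag.
ambiguous = {lemma: dict(c) for lemma, c in counts.items() if len(c) > 1}
print(ambiguous)
```

Swapping `cols[2]` for a lowercased `cols[1]` would give the same tally per word form (type) instead of per lemma.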
Morphology
The form / lemma ratio of DET is 0.976744 (the average of all parts of speech is 1.258498).
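The reported ratio is consistent with dividing the number of DET types by the number of DET lemmas listed in the treebank statistics (42 and 43). A quick check, assuming that is how the figure is derived; the page itself does not state the formula:

```python
# Counts taken from the UD_Dutch statistics on this page.
det_types = 42
det_lemmas = 43

# Assumption: form/lemma ratio = number of distinct forms / number of distinct lemmas.
ratio = det_types / det_lemmas
print(f"{ratio:.6f}")
```

A value below 1 means fewer distinct forms than lemmas were counted for this tag.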
The 1st highest number of forms (3) was observed with the lemma “min”: min, minder, mindere.
The 2nd highest number of forms (2) was observed with the lemma “de”: de, der.
The 3rd highest number of forms (2) was observed with the lemma “een”: ‘n, een.
DET occurs with 10 features: PronType (21850; 100% instances), Definite (21436; 98% instances), Number (4536; 21% instances), Gender (4323; 20% instances), Degree (410; 2% instances), NumType (410; 2% instances), Case (169; 1% instances), Number[psor] (2; 0% instances), Person (2; 0% instances), Poss (2; 0% instances)
DET occurs with 22 feature-value pairs: Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Gender=Com, Gender=Fem, Gender=Masc, Gender=Neut, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Person=1, Poss=Yes, PronType=Art, PronType=Ind, PronType=Prs, PronType=Tot
DET occurs with 19 feature combinations.
The most frequent feature combination is Definite=Def|PronType=Art (12554 tokens).
Examples: de
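A feature combination such as Definite=Def|PronType=Art is a list of feature-value pairs joined by `|`, as in the FEATS column of CoNLL-U. A small sketch of splitting one into its pairs; `parse_feats` is a hypothetical helper name, not part of any UD tool:

```python
def parse_feats(feats):
    """Split a UD FEATS string such as 'Definite=Def|PronType=Art' into a dict."""
    if feats == "_":  # CoNLL-U uses '_' for an empty feature set
        return {}
    # Split on the first '=' only, so names like Number[psor] stay intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))

combo = parse_feats("Definite=Def|PronType=Art")
print(combo)
```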
Relations
DET nodes are attached to their parents using 21 different relations: det (21263; 97% instances), det:nummod (410; 2% instances), nsubj (65; 0% instances), dobj (35; 0% instances), root (15; 0% instances), advmod (14; 0% instances), appos (13; 0% instances), dep (10; 0% instances), conj (6; 0% instances), nmod (3; 0% instances), acl (2; 0% instances), cc (2; 0% instances), compound (2; 0% instances), iobj (2; 0% instances), parataxis (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), aux (1; 0% instances), name (1; 0% instances), neg (1; 0% instances), xcomp (1; 0% instances)
Parents of DET nodes belong to 16 different parts of speech: NOUN (18960; 87% instances), PROPN (1683; 8% instances), VERB (435; 2% instances), ADJ (293; 1% instances), X (284; 1% instances), NUM (65; 0% instances), PRON (60; 0% instances), AUX (21; 0% instances), ADV (18; 0% instances), ROOT (15; 0% instances), DET (9; 0% instances), ADP (2; 0% instances), CONJ (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances), SYM (1; 0% instances)
21561 (99%) DET nodes are leaves.
182 (1%) DET nodes have one child.
58 (0%) DET nodes have two children.
49 (0%) DET nodes have three or more children.
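The four child-count buckets partition the DET tokens, so they should add up to the 21850 tokens reported for UD_Dutch. A short consistency check, assuming the percentages are rounded to the nearest whole number (the rounding rule is not stated on the page):

```python
# Child-count buckets for DET nodes, copied from the UD_Dutch figures on this page.
buckets = {"leaves": 21561, "one child": 182, "two children": 58, "three or more": 49}

total = sum(buckets.values())  # should equal the number of DET tokens
# Assumption: shares are rounded to the nearest integer percent.
shares = {name: round(100 * n / total) for name, n in buckets.items()}
print(total, shares)
```

This also explains the 0% entries: 58 and 49 nodes are each well under half a percent of the total.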
The highest child degree of a DET node is 9.
Children of DET nodes are attached using 24 different relations: nmod (136; 27% instances), advmod (113; 22% instances), case (56; 11% instances), punct (38; 7% instances), advcl (29; 6% instances), cc (24; 5% instances), conj (22; 4% instances), cop (17; 3% instances), neg (13; 3% instances), nsubj (10; 2% instances), mark (9; 2% instances), compound (7; 1% instances), det (6; 1% instances), appos (5; 1% instances), amod (4; 1% instances), dobj (4; 1% instances), aux (3; 1% instances), ccomp (3; 1% instances), csubj (3; 1% instances), dep (3; 1% instances), name (2; 0% instances), parataxis (2; 0% instances), xcomp (2; 0% instances), acl (1; 0% instances)
Children of DET nodes belong to 14 different parts of speech: NOUN (136; 27% instances), ADV (67; 13% instances), ADP (50; 10% instances), PRON (40; 8% instances), PUNCT (38; 7% instances), NUM (33; 6% instances), VERB (32; 6% instances), ADJ (30; 6% instances), AUX (25; 5% instances), PROPN (23; 4% instances), CONJ (18; 4% instances), DET (9; 2% instances), X (6; 1% instances), SCONJ (5; 1% instances)
Treebank Statistics (UD_Dutch-LassySmall)
There are 32 DET lemmas (0%), 49 DET types (0%) and 11229 DET tokens (11%).
Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 12 in number of types and 4 in number of tokens.
The 10 most frequent DET lemmas: de, het, een, zijn, deze, haar, hun, dit, die, veel
The 10 most frequent DET types: de, het, een, zijn, deze, haar, hun, dit, alle, geen
The 10 most frequent ambiguous lemmas: de (DET 5884, PROPN 73, X 6), het (DET 2199, PRON 211, PROPN 2), zijn (AUX 1326, DET 423, VERB 177, PRON 22), deze (DET 204, PRON 44), haar (DET 153, PRON 29, NOUN 4), dit (PRON 77, DET 77), die (PRON 411, DET 66), veel (PRON 248, DET 66), geen (DET 65, PRON 3), al (ADV 74, DET 63, PRON 22)
The 10 most frequent ambiguous types: de (DET 4905, PROPN 73, X 6), het (DET 1866, PRON 126), een (DET 1598, NUM 45), zijn (DET 388, AUX 219, VERB 57, PRON 11), deze (DET 137, PRON 20), haar (DET 146, PRON 17, NOUN 4), dit (DET 49, PRON 34), alle (DET 58, PRON 9), geen (DET 60, PRON 2), die (PRON 404, DET 58)
Morphology
The form / lemma ratio of DET is 1.531250 (the average of all parts of speech is 1.179900).
The 1st highest number of forms (5) was observed with the lemma “de”: ’s, de, der, des, dé.
The 2nd highest number of forms (2) was observed with the lemma “beide”: beide, beider.
The 3rd highest number of forms (2) was observed with the lemma “die”: die, diens.
DET occurs with 1 feature: Definite (9757; 87% instances)
DET occurs with 2 feature-value pairs: Definite=Def, Definite=Ind
DET occurs with 3 feature combinations.
The most frequent feature combination is Definite=Def (8091 tokens).
Examples: de, het, der, ‘s, ‘t, des, dé
Relations
DET nodes are attached to their parents using 13 different relations: det (10683; 95% instances), mwe (288; 3% instances), root (61; 1% instances), conj (45; 0% instances), nmod (41; 0% instances), appos (39; 0% instances), nsubj (30; 0% instances), parataxis (19; 0% instances), amod (10; 0% instances), dobj (8; 0% instances), acl (3; 0% instances), advcl (1; 0% instances), compound (1; 0% instances)
Parents of DET nodes belong to 13 different parts of speech: NOUN (9067; 81% instances), PROPN (1087; 10% instances), ADJ (467; 4% instances), VERB (209; 2% instances), NUM (95; 1% instances), ADP (90; 1% instances), X (62; 1% instances), ROOT (61; 1% instances), DET (53; 0% instances), ADV (14; 0% instances), PRON (14; 0% instances), SYM (9; 0% instances), SCONJ (1; 0% instances)
10972 (98%) DET nodes are leaves.
75 (1%) DET nodes have one child.
48 (0%) DET nodes have two children.
134 (1%) DET nodes have three or more children.
The highest child degree of a DET node is 16.
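The child degree of a node is the number of tokens whose HEAD points to it. A minimal sketch of deriving leaf counts and the highest degree for DET nodes from one sentence; the `tokens` triples are invented for illustration and do not come from the treebank:

```python
from collections import Counter

# Hypothetical sentence as (ID, UPOS, HEAD) triples; invented for illustration.
tokens = [
    (1, "DET", 2),
    (2, "NOUN", 3),
    (3, "VERB", 0),
    (4, "DET", 3),
    (5, "ADP", 6),
    (6, "NOUN", 4),
    (7, "PUNCT", 3),
]

# Count how many tokens name each ID as their head (HEAD 0 is the artificial root).
children = Counter(head for _, _, head in tokens if head != 0)

# Child degree of every DET node; leaves never appear as a head, so they count 0.
det_degrees = [children[tok_id] for tok_id, upos, _ in tokens if upos == "DET"]
print(det_degrees, max(det_degrees))
```

Token IDs restart in every sentence, so a treebank-wide count would need to run this per sentence and pool the degrees.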
Children of DET nodes are attached using 19 different relations: mwe (495; 52% instances), punct (196; 21% instances), conj (55; 6% instances), nmod (41; 4% instances), case (28; 3% instances), name (27; 3% instances), nummod (25; 3% instances), acl (21; 2% instances), cc (17; 2% instances), parataxis (15; 2% instances), appos (14; 1% instances), advmod (4; 0% instances), amod (4; 0% instances), mark (4; 0% instances), cop (3; 0% instances), det (3; 0% instances), nsubj (2; 0% instances), advcl (1; 0% instances), neg (1; 0% instances)
Children of DET nodes belong to 15 different parts of speech: NOUN (269; 28% instances), PUNCT (202; 21% instances), PROPN (123; 13% instances), ADP (85; 9% instances), ADJ (69; 7% instances), NUM (67; 7% instances), DET (53; 6% instances), CONJ (25; 3% instances), VERB (23; 2% instances), SYM (13; 1% instances), ADV (11; 1% instances), PRON (8; 1% instances), AUX (3; 0% instances), SCONJ (3; 0% instances), X (2; 0% instances)