
DET: determiner

Definition

We follow the definition for DET proposed in the universal scheme.

Note, however, that numerals are not yet consistently annotated as NUM and are sometimes marked as DET.

For demonstratives such as ce …-là and ce …-ci (as in cet homme-ci, cette femme-là “this man, that woman”), the first part of the determiner is annotated as DET, and the clitics ci and là (which are split from the noun) are marked as PART.
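The demonstrative rule above can be pictured as a tagged token sequence. The following is only a minimal sketch: the surface form of the split clitic ("-ci" here) and the helper name `forms_with_tag` are assumptions made for illustration and are not taken from the treebank.

```python
# Hypothetical tokenization of "cet homme-ci". The exact surface form of the
# split clitic ("-ci" vs. "ci") is an assumption, not taken from the treebank.
tokens = [
    ("cet", "DET"),     # first part of the demonstrative determiner
    ("homme", "NOUN"),  # the noun the clitic was split from
    ("-ci", "PART"),    # the clitic, tagged PART according to the rule above
]

def forms_with_tag(tokens, upos):
    """Return the word forms that carry the given universal POS tag."""
    return [form for form, tag in tokens if tag == upos]

det_forms = forms_with_tag(tokens, "DET")
part_forms = forms_with_tag(tokens, "PART")
print(det_forms, part_forms)
```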

Examples


Treebank Statistics (UD_French)

There is 1 DET lemma (6%), 92 DET types (0%) and 61780 DET tokens (15%). Out of 17 observed tags, DET ranks 6th in number of lemmas, 11th in number of types and 3rd in number of tokens.

The 10 most frequent DET lemmas: _

The 10 most frequent DET types: le, la, les, l’, un, une, des, son, sa, cette

The 10 most frequent ambiguous lemmas: _ (NOUN 73641, ADP 64129, DET 61780, PUNCT 44312, VERB 36183, PROPN 31663, ADJ 22616, PRON 17750, ADV 13108, NUM 10834, CONJ 10138, AUX 8952, SCONJ 2908, PART 1668, X 1056, SYM 486, INTJ 267)
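The token share and token rank reported for DET can be recomputed from the per-tag counts listed for the lemma “_”. This sketch assumes that those counts cover every token in the treebank, which seems plausible because “_” is the only lemma reported, but the page does not state it explicitly.

```python
# Per-tag counts copied from the "ambiguous lemmas" line above.
# Assumption: they cover all tokens, since "_" is the only lemma reported.
tag_counts = {
    "NOUN": 73641, "ADP": 64129, "DET": 61780, "PUNCT": 44312,
    "VERB": 36183, "PROPN": 31663, "ADJ": 22616, "PRON": 17750,
    "ADV": 13108, "NUM": 10834, "CONJ": 10138, "AUX": 8952,
    "SCONJ": 2908, "PART": 1668, "X": 1056, "SYM": 486, "INTJ": 267,
}

total_tokens = sum(tag_counts.values())
det_share = tag_counts["DET"] / total_tokens

# Rank tags by descending token count; rank 1 is the most frequent tag.
ranked_tags = sorted(tag_counts, key=tag_counts.get, reverse=True)
det_rank = ranked_tags.index("DET") + 1

print(f"DET share: {det_share:.0%}, rank by tokens: {det_rank}")
```

Under that assumption the result agrees with the 15% share and the rank of 3 given in the statistics.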

The 10 most frequent ambiguous types: le (DET 13821, PRON 287, PROPN 4), la (DET 9727, PRON 108, PROPN 4, NOUN 1, ADV 1), les (DET 8766, PRON 129), l’ (DET 6356, PRON 256, INTJ 18, PROPN 2), un (DET 3950, PRON 189, NUM 57, PROPN 1), une (DET 3382, PRON 111, NUM 59, NOUN 3, SCONJ 2), des (DET 1720, PROPN 4), son (DET 1392, NOUN 17, AUX 3), ce (DET 539, PRON 332, SCONJ 2, X 1), de (ADP 26367, DET 435, INTJ 101, PROPN 39, X 2, PRON 1)

Morphology

The form / lemma ratio of DET is 92.000000 (the average of all parts of speech is 2777.470588).
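The reported ratio is consistent with dividing the number of distinct forms by the number of distinct lemmas (92 forms over 1 lemma). The sketch below assumes that definition; the function name `form_lemma_ratio` and the sample pairs are hypothetical.

```python
def form_lemma_ratio(pairs):
    """Distinct word forms divided by distinct lemmas (assumed definition)."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Hypothetical sample: three distinct forms that all share the lemma "_".
sample = [("le", "_"), ("la", "_"), ("les", "_"), ("le", "_")]
sample_ratio = form_lemma_ratio(sample)

# The figures reported for DET: 92 forms and 1 lemma.
reported_ratio = 92 / 1
print(f"{sample_ratio:.6f} {reported_ratio:.6f}")
```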

The highest number of forms (92) was observed with the lemma “_”: #cette, His, Los, No, Nul, Quelles, Some, a, an, aucun, aucune, aucunes, autres, beaucoup, ce, certain, certaine, certaines, certains, ces, cet, cette, chaque, cinq, d’, de, des, deux, die, différent, différentes, différents, divers, diverses, du, fin, in, l, l’, la, ladite, las, le, les, leur, leurs, là, ma, mes, mi, mon, my, nombreuses, nos, nostris, notre, of, plusieurs, postes, quatre, quel, quelle, quelque, quelques, quels, sa, se, second, ses, seuls, six, son, sont, suis, tel, telle, telles, tes, the, those, ton, tous, tout, toute, toutes, trois, un, une, vos, votre, your, L’.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 13 different relations: fr-dep/det (56974; 92% instances), fr-dep/nmod:poss (4354; 7% instances), fr-dep/expl (142; 0% instances), fr-dep/mwe (139; 0% instances), fr-dep/neg (108; 0% instances), fr-dep/dep (28; 0% instances), fr-dep/compound (11; 0% instances), fr-dep/conj (9; 0% instances), fr-dep/case (8; 0% instances), fr-dep/advmod (4; 0% instances), fr-dep/advcl (1; 0% instances), fr-dep/name (1; 0% instances), fr-dep/nmod (1; 0% instances)

Parents of DET nodes belong to 15 different parts of speech: NOUN (53637; 87% instances), PROPN (6840; 11% instances), PRON (331; 1% instances), ADV (308; 0% instances), ADJ (198; 0% instances), NUM (160; 0% instances), X (132; 0% instances), VERB (106; 0% instances), ADP (34; 0% instances), SYM (18; 0% instances), DET (12; 0% instances), INTJ (1; 0% instances), PART (1; 0% instances), PUNCT (1; 0% instances), SCONJ (1; 0% instances)
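The percentages in these lists appear to be rounded to whole numbers, which is why small but nonzero counts show up as “0% instances”. The following sketch recomputes a few parent percentages under that assumption (rounding to the nearest integer), with the DET token total taken from the statistics above.

```python
# Parent part-of-speech counts copied from the list above (first four entries).
parent_counts = {"NOUN": 53637, "PROPN": 6840, "PRON": 331, "ADV": 308}
total_det = 61780  # number of DET tokens reported in the treebank statistics

# Assumption: percentages are rounded to the nearest whole number.
parent_percent = {
    pos: round(100 * count / total_det) for pos, count in parent_counts.items()
}
print(parent_percent)
```

With nearest-integer rounding, PRON (331 instances) comes out at 1% while ADV (308 instances) falls just below the threshold and is shown as 0%, matching the list.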

61710 (100%) DET nodes are leaves.

49 (0%) DET nodes have one child.

17 (0%) DET nodes have two children.

4 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 16 different relations: fr-dep/mwe (19; 19% instances), fr-dep/nmod (14; 14% instances), fr-dep/punct (14; 14% instances), fr-dep/cc (11; 11% instances), fr-dep/case (9; 9% instances), fr-dep/conj (9; 9% instances), fr-dep/advmod (7; 7% instances), fr-dep/det (5; 5% instances), fr-dep/dobj (3; 3% instances), fr-dep/expl (2; 2% instances), fr-dep/mark (2; 2% instances), fr-dep/advcl (1; 1% instances), fr-dep/amod (1; 1% instances), fr-dep/appos (1; 1% instances), fr-dep/dep (1; 1% instances), fr-dep/nmod:poss (1; 1% instances)

Children of DET nodes belong to 12 different parts of speech: ADV (22; 22% instances), NOUN (18; 18% instances), ADP (15; 15% instances), PUNCT (14; 14% instances), DET (12; 12% instances), CONJ (10; 10% instances), PROPN (3; 3% instances), SCONJ (2; 2% instances), ADJ (1; 1% instances), PRON (1; 1% instances), VERB (1; 1% instances), X (1; 1% instances)
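The child-degree counts and the child relation counts above can be cross-checked against each other. This is only a consistency sketch using the numbers reported on this page; the variable names are illustrative.

```python
# Child-degree counts for DET nodes, as reported above.
leaves, one_child, two_children, three_plus = 61710, 49, 17, 4
total_det = leaves + one_child + two_children + three_plus

# Counts per child relation, in the order listed above.
child_relation_counts = [19, 14, 14, 11, 9, 9, 7, 5, 3, 2, 2, 1, 1, 1, 1, 1]
total_children = sum(child_relation_counts)

# Children not accounted for by the one-child and two-children nodes must
# belong to the nodes with three or more children.
children_in_three_plus = total_children - one_child - 2 * two_children

# Each of those nodes has between 3 children and the reported maximum of 6.
max_degree = 6
consistent = 3 * three_plus <= children_in_three_plus <= max_degree * three_plus
print(total_det, total_children, children_in_three_plus, consistent)
```

The degree counts add up to the 61780 DET tokens, and the 4 nodes with three or more children share 17 children, which is compatible with a maximum child degree of 6.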


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]