This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home gl/pos issue tracker

DET: determiner

This document is a placeholder for the language-specific documentation for DET.


Treebank Statistics (UD_Galician)

There are 29 DET lemmas (0%), 86 DET types (1%) and 23906 DET tokens (17%). Out of 15 observed tags, the rank of DET is: 8 in number of lemmas, 7 in number of types and 2 in number of tokens.

The 10 most frequent DET lemmas: o, un, este, seu, outro, todo, mesmo, cada, algún, ese

The 10 most frequent DET types: o, a, os, as, un, unha, este, súa, esta, seu

The 10 most frequent ambiguous lemmas: o (DET 18186, PRON 273), un (DET 2077, PRON 42), este (DET 954, PRON 1), mesmo (DET 157, ADV 52), aquel (DET 106, PRON 1), moito (DET 66, ADV 22), propio (DET 43, ADJ 28), certo (DET 33, ADJ 9), pouco (ADV 19, DET 19), que (PRON 1909, SCONJ 1195, DET 14)

The 10 most frequent ambiguous types: o (DET 6283, PRON 98), a (DET 6114, ADP 2083, PRON 53), os (DET 2686, PRON 93), as (DET 1881, PRON 27), un (DET 1008, PRON 25), unha (DET 935, PRON 16), esta (DET 240, PRON 1), mesmo (DET 84, ADV 52), aqueles (DET 51, PRON 1), propia (DET 19, ADJ 11)

Morphology

The form / lemma ratio of DET is 2.965517 (the average of all parts of speech is 1.518322).

The 1st highest number of forms (4) was observed with the lemma “algún”: algunha, algunhas, algún, algúns.

The 2nd highest number of forms (4) was observed with the lemma “aquel”: aquel, aquela, aquelas, aqueles.

The 3rd highest number of forms (4) was observed with the lemma “certo”: certa, certas, certo, certos.

DET does not occur with any features.

Relations

DET nodes are attached to their parents using 16 different relations: det (23730; 99% instances), nsubj (42; 0% instances), dep (33; 0% instances), amod (23; 0% instances), nmod (17; 0% instances), advmod (16; 0% instances), dobj (15; 0% instances), ccomp (7; 0% instances), cop (6; 0% instances), aux (5; 0% instances), punct (4; 0% instances), mark (3; 0% instances), cc (2; 0% instances), case (1; 0% instances), foreign (1; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (20516; 86% instances), VERB (798; 3% instances), NUM (542; 2% instances), DET (503; 2% instances), ADJ (436; 2% instances), PROPN (404; 2% instances), ADP (326; 1% instances), PRON (312; 1% instances), ADV (45; 0% instances), PUNCT (14; 0% instances), SCONJ (6; 0% instances), INTJ (2; 0% instances), PART (1; 0% instances), ROOT (1; 0% instances)

23181 (97%) DET nodes are leaves.

678 (3%) DET nodes have one child.

42 (0%) DET nodes have two children.

5 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.

Children of DET nodes are attached using 12 different relations: det (535; 69% instances), nummod (101; 13% instances), cc (38; 5% instances), case (31; 4% instances), advmod (21; 3% instances), dep (18; 2% instances), punct (14; 2% instances), nmod (12; 2% instances), dobj (4; 1% instances), amod (2; 0% instances), ccomp (1; 0% instances), mark (1; 0% instances)

Children of DET nodes belong to 12 different parts of speech: DET (503; 65% instances), NUM (129; 17% instances), ADP (34; 4% instances), CONJ (32; 4% instances), ADV (27; 3% instances), ADJ (21; 3% instances), PUNCT (20; 3% instances), PRON (8; 1% instances), INTJ (1; 0% instances), NOUN (1; 0% instances), PART (1; 0% instances), VERB (1; 0% instances)


Treebank Statistics (UD_Galician-TreeGal)

There are 30 DET lemmas (1%), 87 DET types (2%) and 3978 DET tokens (16%). Out of 15 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 2 in number of tokens.

The 10 most frequent DET lemmas: o, un, seu, este, todo, outro, ese, noso, moito, mesmo

The 10 most frequent DET types: o, a, os, as, un, unha, súa, seu, esta, lo

The 10 most frequent ambiguous lemmas: o (DET 2885, PRON 115), un (DET 452, PRON 26, NUM 11), seu (DET 175, PRON 2), este (DET 101, PRON 29), todo (DET 61, PRON 22), outro (DET 55, PRON 20), ese (DET 44, PRON 24), noso (DET 31, PRON 1), moito (ADV 41, DET 26, PRON 7), mesmo (DET 22, ADV 11, PRON 8)

The 10 most frequent ambiguous types: o (DET 1094, PRON 42), a (DET 1035, ADP 400, PRON 23), os (DET 384, PRON 9), as (DET 274, PRON 8), un (DET 253, PRON 15, NUM 10), unha (DET 179, PRON 11, NUM 1), seu (DET 48, PRON 2), esta (DET 46, PRON 4), lo (DET 45, PRON 20), este (DET 37, PRON 6)

Morphology

The form / lemma ratio of DET is 2.900000 (the average of all parts of speech is 1.374140).

The 1st highest number of forms (8) was observed with the lemma “o”: a, as, la, las, lo, los, o, os.

The 2nd highest number of forms (5) was observed with the lemma “moito”: moita, moitas, moito, moitos, moitísimas.

The 3rd highest number of forms (5) was observed with the lemma “seu”: seu, seus, sua, súa, súas.

DET occurs with 7 features: Gender (3978; 100% instances), Number (3978; 100% instances), PronType (3762; 95% instances), Definite (3337; 84% instances), gl-feat/Number[psor] (217; 5% instances), Person (217; 5% instances), Poss (217; 5% instances)

DET occurs with 17 feature-value pairs: Definite=Def, Definite=Ind, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel

DET occurs with 37 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (1140 tokens). Examples: o, lo, os

Relations

DET nodes are attached to their parents using 7 different relations: det (3968; 100% instances), nmod (3; 0% instances), dep (2; 0% instances), nummod (2; 0% instances), conj (1; 0% instances), iobj (1; 0% instances), mark (1; 0% instances)

Parents of DET nodes belong to 8 different parts of speech: NOUN (3328; 84% instances), PROPN (364; 9% instances), PRON (165; 4% instances), ADJ (61; 2% instances), NUM (37; 1% instances), VERB (21; 1% instances), ADV (1; 0% instances), SYM (1; 0% instances)

3958 (99%) DET nodes are leaves.

13 (0%) DET nodes have one child.

6 (0%) DET nodes have two children.

1 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 3.

Children of DET nodes are attached using 8 different relations: case (12; 43% instances), advmod (4; 14% instances), nmod (3; 11% instances), acl (2; 7% instances), cc (2; 7% instances), conj (2; 7% instances), neg (2; 7% instances), punct (1; 4% instances)

Children of DET nodes belong to 8 different parts of speech: ADP (12; 43% instances), ADV (6; 21% instances), NOUN (3; 11% instances), CONJ (2; 7% instances), VERB (2; 7% instances), ADJ (1; 4% instances), PROPN (1; 4% instances), PUNCT (1; 4% instances)


DET in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]