home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Greek-GUD: POS Tags: DET

There are 40 DET lemmas (1%), 129 DET types (3%) and 3795 DET tokens (15%). Out of 17 observed tags, the rank of DET is: 6 in number of lemmas, 6 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: ο, ένας, αυτός, άλλος, κάποιος, όλος, κάτι, κανένας, μόνος, τίποτα

The 10 most frequent DET types: το, την, ο, η, τα, τον, τη, του, οι, της

The 10 most frequent ambiguous lemmas: ένας (DET 227, NUM 11, SCONJ 1), αυτός (DET 132, PRON 15), άλλος (DET 87, PRON 5, ADV 1), κάποιος (DET 59, PRON 1), όλος (DET 50, ADJ 1, PRON 1), κάτι (DET 45, ADV 1, PRON 1), κανένας (DET 45, PRON 1), τίποτα (DET 25, PRON 3), ίδιος (DET 23, ADJ 1), ποιος (DET 17, PRON 9, SCONJ 3)

The 10 most frequent ambiguous types: το (DET 588, PRON 103), την (DET 432, PRON 33), τα (DET 226, PRON 61), τον (DET 235, PRON 99), τη (DET 193, PRON 15), του (PRON 218, DET 163), της (DET 113, PRON 78), τις (DET 107, PRON 3), μια (DET 97, SCONJ 3, NUM 1), ένα (DET 71, NUM 4)

Morphology

The form / lemma ratio of DET is 3.225000 (the average of all parts of speech is 1.660999).

The 1st highest number of forms (15) was observed with the lemma “ο”: αι, η, ο, οι, τ’, τα, τη, την, της, τις, το, τον, του, τους, των.

The 2nd highest number of forms (11) was observed with the lemma “αυτός”: αυτά, αυτές, αυτή, αυτήν, αυτοί, αυτού, αυτούς, αυτό, αυτόν, αυτός, αυτών.

The 3rd highest number of forms (9) was observed with the lemma “άλλος”: άλλα, άλλαι, άλλες, άλλη, άλλο, άλλοι, άλλος, άλλους, άλλων.

DET occurs with 6 features: PronType (3777; 100% instances), Gender (3738; 98% instances), Number (3738; 98% instances), Case (3737; 98% instances), Definite (3153; 83% instances), Degree (4; 0% instances)

DET occurs with 19 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, PronType=Art, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot

DET occurs with 123 feature combinations. The most frequent feature combination is Case=Acc|Definite=Def|Gender=Fem|Number=Sing|PronType=Art (628 tokens). Examples: την, τη

Relations

DET nodes are attached to their parents using 19 different relations: det (3426; 90% instances), nsubj (163; 4% instances), obj (87; 2% instances), obl (49; 1% instances), advcl (13; 0% instances), conj (11; 0% instances), root (9; 0% instances), advmod (8; 0% instances), nsubj:pass (7; 0% instances), dislocated (5; 0% instances), nmod (4; 0% instances), flat (3; 0% instances), amod (2; 0% instances), ccomp (2; 0% instances), nsubj:outer (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances), fixed (1; 0% instances), parataxis (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (2458; 65% instances), PROPN (679; 18% instances), VERB (315; 8% instances), ADJ (167; 4% instances), DET (84; 2% instances), NUM (27; 1% instances), ADV (20; 1% instances), PRON (13; 0% instances), X (11; 0% instances), (9; 0% instances), AUX (6; 0% instances), INTJ (3; 0% instances), ADP (2; 0% instances), SCONJ (1; 0% instances)

3596 (95%) DET nodes are leaves.

129 (3%) DET nodes have one child.

46 (1%) DET nodes have two children.

24 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 18 different relations: det (78; 25% instances), acl:relcl (56; 18% instances), case (53; 17% instances), punct (36; 11% instances), nmod (23; 7% instances), cc (16; 5% instances), flat (15; 5% instances), amod (8; 3% instances), cop (6; 2% instances), advmod (5; 2% instances), conj (5; 2% instances), obl (4; 1% instances), xcomp (4; 1% instances), mark (3; 1% instances), nsubj (3; 1% instances), advcl (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: DET (84; 26% instances), VERB (58; 18% instances), ADP (49; 15% instances), PUNCT (36; 11% instances), CCONJ (19; 6% instances), NOUN (18; 6% instances), PRON (14; 4% instances), ADJ (13; 4% instances), ADV (11; 3% instances), AUX (7; 2% instances), SCONJ (4; 1% instances), PROPN (3; 1% instances), NUM (1; 0% instances), PART (1; 0% instances)