home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Greek-GUD: POS Tags: DET

There are 38 DET lemmas (1%), 125 DET types (3%) and 3797 DET tokens (15%). Out of 17 observed tags, the rank of DET is: 6 in number of lemmas, 6 in number of types and 3 in number of tokens.

The 10 most frequent DET lemmas: ο, ένας, αυτός, άλλος, κάποιος, όλος, κάτι, κανένας, μόνος, τίποτα

The 10 most frequent DET types: το, την, ο, η, τα, τον, τη, του, οι, της

The 10 most frequent ambiguous lemmas: ένας (DET 220, NUM 18), άλλος (DET 92, ADV 1), εγώ (PRON 1256, DET 18), λίγος (DET 8, ADJ 1), τι (PRON 43, SCONJ 19, DET 8, ADV 6), τόσο (ADV 9, DET 7), ό,τι (DET 6, PRON 1), μισός (DET 5, ADJ 1), ποιος (SCONJ 14, PRON 10, DET 5), μόνο (ADV 21, DET 3)

The 10 most frequent ambiguous types: το (DET 588, PRON 103), την (DET 432, PRON 33), τα (DET 226, PRON 61), τον (DET 235, PRON 99), τη (DET 193, PRON 15), του (PRON 218, DET 163), της (DET 113, PRON 78), τις (DET 107, PRON 3), μια (DET 95, NUM 3, SCONJ 3), ένα (DET 71, NUM 4)

Morphology

The form / lemma ratio of DET is 3.289474 (the average of all parts of speech is 1.674109).

The 1st highest number of forms (15) was observed with the lemma “ο”: αι, η, ο, οι, τ’, τα, τη, την, της, τις, το, τον, του, τους, των.

The 2nd highest number of forms (11) was observed with the lemma “αυτός”: αυτά, αυτές, αυτή, αυτήν, αυτοί, αυτού, αυτούς, αυτό, αυτόν, αυτός, αυτών.

The 3rd highest number of forms (9) was observed with the lemma “άλλος”: άλλα, άλλαι, άλλες, άλλη, άλλο, άλλοι, άλλος, άλλους, άλλων.

DET occurs with 7 features: PronType (3790; 100% instances), Gender (3744; 99% instances), Case (3743; 99% instances), Number (3743; 99% instances), Definite (3156; 83% instances), Person (18; 0% instances), Degree (2; 0% instances)

DET occurs with 20 feature-value pairs: Case=Acc, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=3, PronType=Art, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int, PronType=Neg, PronType=Rel, PronType=Tot

DET occurs with 116 feature combinations. The most frequent feature combination is Case=Acc|Definite=Def|Gender=Fem|Number=Sing|PronType=Art (628 tokens). Examples: την, τη

Relations

DET nodes are attached to their parents using 19 different relations: det (3412; 90% instances), nsubj (140; 4% instances), obj (98; 3% instances), obl (45; 1% instances), root (16; 0% instances), advcl (13; 0% instances), dislocated (12; 0% instances), conj (11; 0% instances), nmod (8; 0% instances), nsubj:pass (8; 0% instances), fixed (7; 0% instances), nsubj:outer (7; 0% instances), advmod (6; 0% instances), appos (3; 0% instances), ccomp (3; 0% instances), flat (3; 0% instances), amod (2; 0% instances), parataxis (2; 0% instances), orphan (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (2461; 65% instances), PROPN (682; 18% instances), VERB (309; 8% instances), ADJ (167; 4% instances), DET (88; 2% instances), NUM (29; 1% instances), ADV (18; 0% instances), (16; 0% instances), PRON (9; 0% instances), X (9; 0% instances), ADP (6; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)

3588 (94%) DET nodes are leaves.

131 (3%) DET nodes have one child.

43 (1%) DET nodes have two children.

35 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 10.

Children of DET nodes are attached using 20 different relations: det (80; 22% instances), acl:relcl (57; 16% instances), case (52; 14% instances), punct (44; 12% instances), nmod (21; 6% instances), cc (20; 6% instances), cop (16; 4% instances), flat (15; 4% instances), amod (11; 3% instances), advmod (10; 3% instances), conj (8; 2% instances), nsubj (7; 2% instances), obl (6; 2% instances), mark (4; 1% instances), xcomp (3; 1% instances), csubj (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), nummod (1; 0% instances), orphan (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: DET (88; 24% instances), VERB (64; 18% instances), ADP (47; 13% instances), PUNCT (44; 12% instances), CCONJ (24; 7% instances), NOUN (22; 6% instances), AUX (16; 4% instances), ADJ (15; 4% instances), ADV (14; 4% instances), PRON (14; 4% instances), PROPN (4; 1% instances), SCONJ (4; 1% instances), PART (2; 1% instances), INTJ (1; 0% instances), NUM (1; 0% instances)