Treebank Statistics: UD_Greek-GUD: POS Tags: DET
There are 38 DET
lemmas (1%), 125 DET
types (3%) and 3797 DET
tokens (15%).
Out of 17 observed tags, the rank of DET
is: 6 in number of lemmas, 6 in number of types and 3 in number of tokens.
The 10 most frequent DET
lemmas: ο, ένας, αυτός, άλλος, κάποιος, όλος, κάτι, κανένας, μόνος, τίποτα
The 10 most frequent DET
types: το, την, ο, η, τα, τον, τη, του, οι, της
The 10 most frequent ambiguous lemmas: ένας (DET 220, NUM 18), άλλος (DET 92, ADV 1), εγώ (PRON 1256, DET 18), λίγος (DET 8, ADJ 1), τι (PRON 43, SCONJ 19, DET 8, ADV 6), τόσο (ADV 9, DET 7), ό,τι (DET 6, PRON 1), μισός (DET 5, ADJ 1), ποιος (SCONJ 14, PRON 10, DET 5), μόνο (ADV 21, DET 3)
The 10 most frequent ambiguous types: το (DET 588, PRON 103), την (DET 432, PRON 33), τα (DET 226, PRON 61), τον (DET 235, PRON 99), τη (DET 193, PRON 15), του (PRON 218, DET 163), της (DET 113, PRON 78), τις (DET 107, PRON 3), μια (DET 95, NUM 3, SCONJ 3), ένα (DET 71, NUM 4)
- το
- την
- τα
- τον
- τη
- του
- της
- τις
- μια
- ένα
Morphology
The form / lemma ratio of DET
is 3.289474 (the average of all parts of speech is 1.674109).
The 1st highest number of forms (15) was observed with the lemma “ο”: αι, η, ο, οι, τ’, τα, τη, την, της, τις, το, τον, του, τους, των.
The 2nd highest number of forms (11) was observed with the lemma “αυτός”: αυτά, αυτές, αυτή, αυτήν, αυτοί, αυτού, αυτούς, αυτό, αυτόν, αυτός, αυτών.
The 3rd highest number of forms (9) was observed with the lemma “άλλος”: άλλα, άλλαι, άλλες, άλλη, άλλο, άλλοι, άλλος, άλλους, άλλων.
DET
occurs with 7 features: PronType (3790; 100% instances), Gender (3744; 99% instances), Case (3743; 99% instances), Number (3743; 99% instances), Definite (3156; 83% instances), Person (18; 0% instances), Degree (2; 0% instances)
DET
occurs with 20 feature-value pairs: Case=Acc
, Case=Gen
, Case=Nom
, Definite=Def
, Definite=Ind
, Degree=Cmp
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Person=3
, PronType=Art
, PronType=Dem
, PronType=Emp
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Rel
, PronType=Tot
DET
occurs with 116 feature combinations.
The most frequent feature combination is Case=Acc|Definite=Def|Gender=Fem|Number=Sing|PronType=Art
(628 tokens).
Examples: την, τη
Relations
DET
nodes are attached to their parents using 19 different relations: det (3412; 90% instances), nsubj (140; 4% instances), obj (98; 3% instances), obl (45; 1% instances), root (16; 0% instances), advcl (13; 0% instances), dislocated (12; 0% instances), conj (11; 0% instances), nmod (8; 0% instances), nsubj:pass (8; 0% instances), fixed (7; 0% instances), nsubj:outer (7; 0% instances), advmod (6; 0% instances), appos (3; 0% instances), ccomp (3; 0% instances), flat (3; 0% instances), amod (2; 0% instances), parataxis (2; 0% instances), orphan (1; 0% instances)
Parents of DET
nodes belong to 13 different parts of speech: NOUN (2461; 65% instances), PROPN (682; 18% instances), VERB (309; 8% instances), ADJ (167; 4% instances), DET (88; 2% instances), NUM (29; 1% instances), ADV (18; 0% instances), (16; 0% instances), PRON (9; 0% instances), X (9; 0% instances), ADP (6; 0% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)
3588 (94%) DET
nodes are leaves.
131 (3%) DET
nodes have one child.
43 (1%) DET
nodes have two children.
35 (1%) DET
nodes have three or more children.
The highest child degree of a DET
node is 10.
Children of DET
nodes are attached using 20 different relations: det (80; 22% instances), acl:relcl (57; 16% instances), case (52; 14% instances), punct (44; 12% instances), nmod (21; 6% instances), cc (20; 6% instances), cop (16; 4% instances), flat (15; 4% instances), amod (11; 3% instances), advmod (10; 3% instances), conj (8; 2% instances), nsubj (7; 2% instances), obl (6; 2% instances), mark (4; 1% instances), xcomp (3; 1% instances), csubj (2; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), nummod (1; 0% instances), orphan (1; 0% instances)
Children of DET
nodes belong to 15 different parts of speech: DET (88; 24% instances), VERB (64; 18% instances), ADP (47; 13% instances), PUNCT (44; 12% instances), CCONJ (24; 7% instances), NOUN (22; 6% instances), AUX (16; 4% instances), ADJ (15; 4% instances), ADV (14; 4% instances), PRON (14; 4% instances), PROPN (4; 1% instances), SCONJ (4; 1% instances), PART (2; 1% instances), INTJ (1; 0% instances), NUM (1; 0% instances)