home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ancient_Greek-Perseus: POS Tags: DET

There are 6 DET lemmas (0%), 52 DET types (0%) and 11831 DET tokens (6%). Out of 15 observed tags, the rank of DET is: 14 in number of lemmas, 11 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: ὁ, ὀ, οὗ, ὅς, μήν, τίς

The 10 most frequent DET types: τῶν, τὴν, ὁ, τὸν, τὸ, τῆς, τοῦ, τοὺς, οἱ, τοῖς

The 10 most frequent ambiguous lemmas: (DET 11811, PRON 2252, ADJ 1, X 1), οὗ (ADV 6, PRON 3, DET 2), ὅς (PRON 1352, ADJ 22, DET 2), μήν (PART 36, ADV 28, DET 1, NOUN 1), τίς (PRON 235, ADV 8, ADJ 7, X 7, DET 1)

The 10 most frequent ambiguous types: τῶν (DET 1436, PRON 111), τὴν (DET 1345, PRON 118), (DET 947, PRON 13), τὸν (DET 941, PRON 365, ADJ 1), τὸ (DET 904, PRON 71, ADJ 1), τῆς (DET 882, PRON 22, ADJ 1), τοῦ (DET 752, PRON 119, ADJ 1), τοὺς (DET 718, PRON 74), οἱ (DET 626, PRON 441), τοῖς (DET 524, PRON 34)

Morphology

The form / lemma ratio of DET is 8.666667 (the average of all parts of speech is 3.010372).

The 1st highest number of forms (51) was observed with the lemma “ὁ”: αἱ, αἳ, αἵ, οἱ, οἳ, οἵ, τά, τάν, τάς, τήν, ταῖς, τούς, τοὶ, τοὺς, τοῖν, τοῖο, τοῖς, τοῖσι, τοῖσιν, τοῦ, τό, τόν, τὰ, τὰν, τὰς, τὴν, τὸ, τὸν, τὼ, τᾶν, τᾶς, τᾷ, τῆ, τῆς, τῇ, τῇσι, τῳ, τῶ, τῶν, τῷ, χοἰ, χἠ, χὠ, ἁ, ἃ, ἡ, ἣ, ἥ, ὁ, ὃ, ὅ.

The 2nd highest number of forms (2) was observed with the lemma “ὅς”: τοῦ, ὃ.

The 3rd highest number of forms (1) was observed with the lemma “μήν”: μὴν.

DET occurs with 3 features: Case (11810; 100% instances), Number (11808; 100% instances), Gender (11802; 100% instances)

DET occurs with 11 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Dual, Number=Plur, Number=Sing

DET occurs with 37 feature combinations. The most frequent feature combination is Case=Acc|Gender=Fem|Number=Sing (1385 tokens). Examples: τὴν, τήν, τὰν, τάν

Relations

DET nodes are attached to their parents using 4 different relations: det (11812; 100% instances), advmod (15; 0% instances), xcomp (3; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 9 different parts of speech: NOUN (9097; 77% instances), VERB (1288; 11% instances), ADJ (1236; 10% instances), PRON (140; 1% instances), ADV (52; 0% instances), DET (14; 0% instances), NUM (2; 0% instances), PART (1; 0% instances), (1; 0% instances)

11239 (95%) DET nodes are leaves.

426 (4%) DET nodes have one child.

123 (1%) DET nodes have two children.

43 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 7.

Children of DET nodes are attached using 16 different relations: nmod (329; 40% instances), punct (255; 31% instances), case (114; 14% instances), advmod (34; 4% instances), amod (25; 3% instances), cc (17; 2% instances), det (13; 2% instances), conj (8; 1% instances), obl (8; 1% instances), acl (7; 1% instances), xcomp (5; 1% instances), appos (3; 0% instances), nummod (3; 0% instances), cop (2; 0% instances), nsubj (2; 0% instances), advcl (1; 0% instances)

Children of DET nodes belong to 12 different parts of speech: PUNCT (255; 31% instances), NOUN (232; 28% instances), ADP (113; 14% instances), ADJ (62; 8% instances), ADV (40; 5% instances), PRON (39; 5% instances), VERB (38; 5% instances), CCONJ (18; 2% instances), DET (14; 2% instances), PART (10; 1% instances), NUM (3; 0% instances), AUX (2; 0% instances)