Treebank Statistics: UD_Greek-Cretan: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
1860 tokens (43%) have a non-empty value of Gender.
761 types (55%) occur at least once with a non-empty value of Gender.
593 lemmas (59%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: DET (635; 15% instances), NOUN (601; 14% instances), PRON (314; 7% instances), PROPN (148; 3% instances), ADJ (122; 3% instances), NUM (22; 1% instances), VERB (13; 0% instances), INTJ (5; 0% instances).
DET
635 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (572; 90%), Definite=Def (544; 86%), Number=Sing (491; 77%).
DET tokens may have the following values of Gender:
Fem(188; 30% of non-emptyGender): η, τη, την, μια, τσι, οι, τση, τσ’, της, άλλεςMasc(249; 39% of non-emptyGender): ο, του, τον, το, αυτός, τσοι, οι, ένας, τσ’, τ’Neut(198; 31% of non-emptyGender): το, τα, τ’, ό,τι, ένα, άλλα, του, αυτό, των, ίδιαEMPTY(6): κάτι, το, Τάδε
| Paradigm ο | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Number=Sing|PronType=Art | τον, το | τη, την | το, τ' |
| Case=Acc|Definite=Def|Number=Plur|PronType=Art | τσοι, τους | τσι, τσ' | τα, τ' |
| Case=Acc|Number=Sing|PronType=Art | τ' | ||
| Case=Acc|Number=Plur | τ', τα | ||
| Case=Gen|Definite=Def|Number=Sing|PronType=Art | του, τ' | τση, της | του, τ' |
| Case=Gen|Definite=Def|Number=Plur|PronType=Art | τω | των | |
| Case=Nom|Definite=Def|Number=Sing|PronType=Art | ο | η | το, τ' |
| Case=Nom|Definite=Def|Number=Plur|PronType=Art | οι | οι | τα, τ' |
NOUN
601 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (430; 72%), Case=Acc (358; 60%).
NOUN tokens may have the following values of Gender:
Fem(207; 34% of non-emptyGender): θεια, χάρη, κερά, παρέα, ώρα, γυναίκα, γυναίκες, κορφή, μύτη, νύχταMasc(168; 28% of non-emptyGender): βασιλιάς, κύρη, σταυρό, γέρο, γερο, γιατρέ, κουμπάρος, κόσμο, φαμέγιο, ΑϊNeut(226; 38% of non-emptyGender): σπίτι, χρόνια, νερό, γάλα, παιδί, πόδια, τέκνο, κρίματά, μεσάνυχτα, μπεγίριEMPTY(1): ποξημερώματα
| Paradigm χρόνος | Masc | Neut |
|---|---|---|
| Case=Acc|Number=Plur | χρόνια | |
| Case=Gen|Number=Plur | χρονώ | |
| Case=Nom|Number=Sing | χρόνος | |
| Case=Nom|Number=Plur | χρόνια |
Gender seems to be lexical feature of NOUN. 97% lemmas (361) occur only with one value of Gender.
PRON
314 PRON tokens (75% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (273; 87%), PronType=Prs (263; 84%), Poss=EMPTY (235; 75%), Person=3 (216; 69%).
PRON tokens may have the following values of Gender:
Fem(89; 28% of non-emptyGender): τηνε, τζη, την, που, τση, ντηνε, τσι, ντως, τη, τηςMasc(152; 48% of non-emptyGender): ντου, του, τονε, που, τον, ντονε, τσοι, μου, σου, ΠοιοςNeut(73; 23% of non-emptyGender): το, είντα, τα, που, ντως, ό,τι, έτονα, εσείς, κιάνα, ντουςEMPTY(104): μου, μας, σου, εγώ, ντως, με, μαςε, σε, τως, ‘μάς
| Paradigm εγώ | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing|Person=1 | με, τονε | τηνε | |
| Case=Acc|Number=Sing|Person=2 | σε | ||
| Case=Acc|Number=Sing|Person=3 | τονε, τον, ντονε, ντον, τηνε | τηνε, την, ντηνε, τη | το |
| Case=Acc|Number=Sing | τονε | ντηνε, τηνε | το |
| Case=Acc|Number=Plur|Person=3 | τσοι | τσι | τα |
| Case=Acc|Number=Plur | τσι | ||
| Case=Gen|Number=Sing|Person=1|Poss=Yes | μου | ||
| Case=Gen|Number=Sing|Person=1 | μου | τση | |
| Case=Gen|Number=Sing|Person=2|Poss=Yes | σου | ||
| Case=Gen|Number=Sing|Person=2 | σου, σ' | ||
| Case=Gen|Number=Sing|Person=3|Poss=Yes | ντου, του | τζη, της | |
| Case=Gen|Number=Sing|Person=3 | του, ντου, τ', ντονε | τζη, Τση, της | τ' |
| Case=Gen|Number=Sing|Poss=Yes | ντου, του | τζη | |
| Case=Gen|Number=Sing | ντου | τζη | |
| Case=Gen|Number=Plur|Person=1|Poss=Yes | μας | ||
| Case=Gen|Number=Plur|Person=2 | σας | ||
| Case=Gen|Number=Plur|Person=3|Poss=Yes | τους, τως | τωνε | ντως |
| Case=Gen|Number=Plur|Person=3 | τωνε | ντως | |
| Case=Gen|Number=Plur|Poss=Yes | ντως | ντως | |
| Case=Gen|Number=Plur | ντους | ||
| Case=Nom|Number=Plur|Person=2 | εσείς |
PROPN
148 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (145; 98%), Degree=EMPTY (122; 82%), Case=Nom (97; 66%).
PROPN tokens may have the following values of Gender:
Fem(29; 20% of non-emptyGender): Μελεμενιά, Πινελιά, Δεσποινιώ, Κυριακή, Ανεζώ, Αννιά, Αρχουντού, Ασπασία, Βενετιά, ΔεσποινιώςMasc(105; 71% of non-emptyGender): Δρόσος, Μαθιός, Θιος, Νικολής, Προκόπης, Νικολάρος, Στελιανός, Νικολή, Θεού, ΝικόληςNeut(14; 9% of non-emptyGender): Σηφαλιό, Γαρεφαλιό, Μιχαλιό, Ανεζώ, Δεσποινιό, Δεσποινιώ, Μαρουλιώ, Μελπονιό, Παρίσα, Φιτσανά
| Paradigm Δέσποινα | Fem | Neut |
|---|---|---|
| Case=Gen | Δεσποινιώς | |
| Case=Nom | Δεσποινιώ | Δεσποινιό, Δεσποινιώ |
Gender seems to be lexical feature of PROPN. 95% lemmas (57) occur only with one value of Gender.
ADJ
122 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (92; 75%).
ADJ tokens may have the following values of Gender:
Fem(32; 26% of non-emptyGender): αριστερή, μεγάλη, ούλη, Μεγάλης, Πεταχτούλα, Φαρμακόγλωσσες, άνομες, ανθρώπινη, γεμάτη, δεκαεφτάχρονηMasc(48; 39% of non-emptyGender): μεγάλος, φρέσκος, φτωχός, Κεφαλάς, Μαυροκακομοίρη, Μικιοί, Πασίχαρος, Πονόψυχος, Τίμιου, άγνωστοςNeut(42; 34% of non-emptyGender): δίκαια, καλό, παχουλά, ροδοκόκκινα, Βοστρυχωτά, Μικρό, αγαπημένα, αδιανόητο, αθάνατο, αμέτρηταEMPTY(2): μαρέ, μπλε
| Paradigm καλός | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing | καλό | ||
| Case=Acc|Number=Plur | καλά | ||
| Case=Nom|Number=Sing | καλός | καλή | Καλό |
| Case=Voc|Number=Sing | καλέ |
Gender seems to be lexical feature of ADJ. 93% lemmas (91) occur only with one value of Gender.
NUM
22 NUM tokens (88% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (22; 100%), Number=Plur (22; 100%), Case=Acc (17; 77%).
NUM tokens may have the following values of Gender:
Fem(5; 23% of non-emptyGender): δυο, τρειςMasc(6; 27% of non-emptyGender): Εκατό, διακόσιους, εκατόν, πέντε, πενήντα, σαράνταNeut(11; 50% of non-emptyGender): πέντε, Σαράντα, δεκαεφτά, δυο, δύο, είκοσι, ογδόντα, οχτακόσα, τρία, χίλιαEMPTY(3): εννιά, σαράντα, τριάντα
| Paradigm δύο | Fem | Neut |
|---|---|---|
| Case=Acc | δυο | δυο |
| Case=Nom | δύο |
VERB
13 VERB tokens (2% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Aspect=EMPTY (13; 100%), Mood=EMPTY (13; 100%), Person=EMPTY (13; 100%), Tense=EMPTY (13; 100%), VerbForm=Part (12; 92%), Voice=Pass (12; 92%), Number=Sing (9; 69%).
VERB tokens may have the following values of Gender:
Fem(2; 15% of non-emptyGender): Ευλογημένη, μπροκωμένηMasc(8; 62% of non-emptyGender): βαφτισμένος, δασκαλεμένοι, διαλεγώνα, κουρελιασμένος, κουρελοντυμένος, μυρωμένος, ξεσυγυρισμένους, φτιαγμένοNeut(3; 23% of non-emptyGender): αυλακιασμένα, δεμένα, μεστωμένοEMPTY(678): κάνει, είπε, λέει, λέω, πήγε, ήκανε, πάει, πήρε, πεις, ‘πε
Gender seems to be lexical feature of VERB. 100% lemmas (13) occur only with one value of Gender.
INTJ
5 INTJ tokens (15% of all INTJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which INTJ and Gender co-occurred: Number=Sing (5; 100%), Case=Voc (4; 80%).
INTJ tokens may have the following values of Gender:
Fem(1; 20% of non-emptyGender): μωρήMasc(3; 60% of non-emptyGender): μωρέ, μπρεNeut(1; 20% of non-emptyGender): λοιπόςEMPTY(29): μα, Ε, μπρε, ντα, άντε, λοιπόν, μωρέ, να, Αμήν, λοιπό
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (403; 98%),
PROPN –[det]–> DET (135; 99%),
NOUN –[amod]–> ADJ (39; 100%),
ADJ –[det]–> DET (29; 100%),
NOUN –[nummod]–> NUM (12; 92%),
ADJ –[conj]–> ADJ (9; 100%),
PROPN –[compound]–> NOUN (8; 100%),
ADJ –[nsubj]–> NOUN (7; 100%),
DET –[det]–> DET (6; 100%),
NOUN –[appos]–> PROPN (5; 83%).