Treebank Statistics: UD_Cappadocian-TueCL: Features: Gender
This feature is universal.
It occurs with 3 different values: Fem, Masc, Neut.
1731 tokens (42%) have a non-empty value of Gender.
509 types (51%) occur at least once with a non-empty value of Gender.
400 lemmas (56%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (679; 16% instances), DET (651; 16% instances), PRON (279; 7% instances), ADJ (59; 1% instances), NUM (56; 1% instances), PROPN (7; 0% instances).
NOUN
679 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (546; 80%), Case=Acc (368; 54%).
NOUN tokens may have the following values of Gender:
Fem(92; 14% of non-emptyGender): στράτα, ημέρα, Μαρκάλτσα, Ντιλπέρτσα, ψυσή, εβή, εβήτζα, εβίτζα, μερέ, ρούχαMasc(189; 28% of non-emptyGender): αωπός, ασλάνος, νομάτ, αράπ, αωπό, ταρό, τερζής, τσουφαλάς, φτάλμε, dόστοιNeut(398; 59% of non-emptyGender): φσάχι, άβγο, λαχτόρι, σοιρίδι, μεντζιλίσι, ποτάμι, ξύο, τζαναβάρα, κορτζόκκο, τσίκκινEMPTY(4): Τζερετζή, τοκτόρ, τσαρούχα, χαϊβάνα
| Paradigm φσάχ | Masc | Neut |
|---|---|---|
| Case=Acc|Number=Sing | φσάχι | |
| Case=Acc|Number=Plur | φσάχα | |
| Case=Nom|Number=Sing | φσάχι, φσόκκο | |
| Case=Nom|Number=Plur | φσάχα | |
| Case=Voc|Number=Sing | φσάχι |
Gender seems to be lexical feature of NOUN. 93% lemmas (253) occur only with one value of Gender.
DET
651 DET tokens (99% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (611; 94%), Number=Sing (525; 81%), Case=Acc (388; 60%), Definite=Def (382; 59%).
DET tokens may have the following values of Gender:
Fem(81; 12% of non-emptyGender): η, τη, την, α, τα, αν, αμ, οι, τις, ΚάταMasc(145; 22% of non-emptyGender): ο, τον, τα, ον, του, οι, το, αν, dα, έναNeut(425; 65% of non-emptyGender): το, τα, ο, α, του, τ΄, τ’, αν, ατό, τωνEMPTY(7): ο, α, ε, ον, τα, τζ’
| Paradigm το | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Definite=Def|Number=Sing|PronType=Art | το, ο | τα | το, ο, α, τ’, ον, τα |
| Case=Acc|Definite=Def|Number=Plur|PronType=Art | α, τ’ | τα, α, dα, ο, τ’ | |
| Case=Acc|Number=Sing | ο | ||
| Case=Acc|Number=Sing|PronType=Art | τ’ | το, ο, τα, τ΄, τ’, 'τ, α, τ', τε | |
| Case=Acc|Number=Plur | τα | ||
| Case=Acc|Number=Plur|PronType=Art | τα, α, τ’ | ||
| Case=Gen|Definite=Def|Number=Sing|PronType=Art | του | του | |
| Case=Gen|Definite=Def|Number=Plur|PronType=Art | των | ||
| Case=Gen|Number=Sing | του | ||
| Case=Gen|Number=Sing|PronType=Art | του | ||
| Case=Gen|Number=Plur|PronType=Art | Των, ων | ||
| Case=Nom|Definite=Def|Number=Sing | το | ||
| Case=Nom|Definite=Def|Number=Sing|PronType=Art | το, dα, Τό, τ’ | ||
| Case=Nom|Definite=Def|Number=Plur|PronType=Art | ο | τα, το | |
| Case=Nom|Number=Sing|PronType=Art | το | το, τ΄ | |
| Case=Nom|Number=Plur|PronType=Art | τα, τ΄ | ||
| Case=Voc|Number=Sing|PronType=Art | τα |
PRON
279 PRON tokens (85% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Poss=EMPTY (234; 84%), Number=Sing (217; 78%), Person=EMPTY (190; 68%), PronType=Prs (164; 59%).
PRON tokens may have the following values of Gender:
Fem(16; 6% of non-emptyGender): ατέ, κάτα, τα, τούτη, dα, εμέν’, σου, τούτηννας, τούτιναMasc(78; 28% of non-emptyGender): του, γω, τουν, Σεις, ατός, μας, μου, αdέ, με, συNeut(185; 66% of non-emptyGender): του, με, dα, μες, μας, σε, τα, πα, συ, τουνEMPTY(48): σου, με, μου, του, συ, τουν, γω, σα, ‘γω, Πώτς
| Paradigm εγώ | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc|Number=Sing|Person=1|PronType=Prs | με | ||
| Case=Acc|Number=Sing|PronType=Prs | με, μένα | εμέν’ | με, μένα, μέν, μεν |
| Case=Acc|Number=Plur|Person=1|PronType=Prs | μας | μες, μας | |
| Case=Gen|Number=Sing|Person=3|Poss=Yes|PronType=Prs | μου | ||
| Case=Gen|Number=Sing|PronType=Prs | μου | μου, μ΄ | |
| Case=Gen|Number=Plur|Person=2|Poss=Yes|PronType=Prs | μας | ||
| Case=Nom|Number=Sing | μας | ||
| Case=Nom|Number=Sing|Person=1|PronType=Prs | γω | ||
| Case=Nom|Number=Sing|PronType=Prs | γω, ’Γω | ||
| Case=Nom|Number=Plur|Person=2|PronType=Prs | Σεις | ||
| Case=Voc|Number=Sing | μου |
ADJ
59 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Number=Sing (43; 73%), Case=Acc (33; 56%).
ADJ tokens may have the following values of Gender:
Fem(5; 8% of non-emptyGender): άβου, αρά, γκιορέ, ιράσταMasc(22; 37% of non-emptyGender): άβου, καό, μπρο, παλό, τίπκε, άλεϊ, απμένον, αχιλλούς, κιορ, λειψόNeut(32; 54% of non-emptyGender): άβου, δομαίνο, ζόρι, πίσι, ’πομεινά, gουμουσόνα, άβο, αλτουνώνα, βυνατό, δραEMPTY(1): τσιπ
| Paradigm άβ | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | άβου | άβο, άβου | |
| Case=Gen | άβου | ||
| Case=Nom | άβου | άβου | άβου |
Gender seems to be lexical feature of ADJ. 95% lemmas (40) occur only with one value of Gender.
NUM
56 NUM tokens (97% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (44; 79%), Case=Acc (38; 68%), Number=Sing (35; 63%).
NUM tokens may have the following values of Gender:
Fem(14; 25% of non-emptyGender): μία, οφτά, τρία, α, δύο, δώδεκαMasc(7; 13% of non-emptyGender): αν, έξι, α, πένdε, τέσσερα, τρίαNeut(35; 63% of non-emptyGender): τρία, δύο, αν, δεύτερο, εν, α, ε, πρωτινά, πρωτινό, τρίτονEMPTY(2): α, ‘τζα
| Paradigm α | Masc | Fem | Neut |
|---|---|---|---|
| Case=Acc | αν | α | |
| Case=Acc|NumType=Card | α | ||
| Case=Nom | α | α | αν |
PROPN
7 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.
The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (7; 100%).
PROPN tokens may have the following values of Gender:
Fem(3; 43% of non-emptyGender): Καβάρη, ΠαναγίαMasc(2; 29% of non-emptyGender): ΚαβάρηNeut(2; 29% of non-emptyGender): Γενιτζερίουν, μεντζιλίσι
| Paradigm Καβάρης | Masc | Fem |
|---|---|---|
| Case=Acc | Καβάρη | Καβάρη |
| Case=Gen | Καβάρη |
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (495; 93%),
NOUN –[amod]–> ADJ (43; 90%),
NOUN –[nummod]–> NUM (37; 82%),
NOUN –[nmod]–> NOUN (30; 58%),
NOUN –[conj]–> NOUN (26; 67%),
NUM –[det]–> DET (26; 100%),
ADJ –[det]–> DET (16; 94%),
PRON –[det]–> DET (15; 100%),
NOUN –[det]–> PRON (6; 55%),
NOUN –[nmod]–> DET (5; 71%).