Treebank Statistics: UD_Irish-Cadhan: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem
, Masc
.
1570 tokens (33%) have a non-empty value of Gender
.
1081 types (58%) occur at least once with a non-empty value of Gender
.
759 lemmas (69%) occur at least once with a non-empty value of Gender
.
The feature is used with 6 part-of-speech tags: NOUN (980; 20% instances), PROPN (196; 4% instances), ADP (107; 2% instances), ADJ (103; 2% instances), DET (99; 2% instances), PRON (85; 2% instances).
NOUN
980 NOUN tokens (86% of all NOUN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which NOUN
and Gender
co-occurred: VerbForm=EMPTY (980; 100%), Number=Sing (847; 86%), Case=Nom (718; 73%), Form=EMPTY (666; 68%), Definite=EMPTY (616; 63%).
NOUN
tokens may have the following values of Gender
:
Fem
(344; 35% of non-emptyGender
): beatha, leith, bliadhna, cuid, oidhche, réir, thoil, laimh, linn, láimhMasc
(636; 65% of non-emptyGender
): lá, duine, fhios, saoghal, bith, fear, la, ainm, creidimh, macEMPTY
(160): bheith, chur, dhéanamh, cur, tan, tabhairt, teacht, éis, ais, déanamh
Paradigm cor | Masc | Fem |
---|---|---|
Case=Gen|Form=Ecl|NounType=Weak|Number=Plur | gcor | |
Case=Nom|Form=Len|Number=Sing | chor | choir |
Gender
seems to be lexical feature of NOUN
. 100% lemmas (546) occur only with one value of Gender
.
PROPN
196 PROPN tokens (86% of all PROPN
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PROPN
and Gender
co-occurred: Definite=Def (177; 90%), Number=Sing (175; 89%), Form=EMPTY (158; 81%), Case=Nom (100; 51%).
PROPN
tokens may have the following values of Gender
:
Fem
(46; 23% of non-emptyGender
): Éireann, Éirinn, Danann, Eireann, Uladh, n-Éirinn, Callain, Casga, Chill, ChnámhchoillMasc
(150; 77% of non-emptyGender
): Iósa, Dia, Dé, Sacsaibh, Ursula, Beare, Bheannchair, Dhia, Dhía, IosaEMPTY
(31): Buck, Comhghall, Hanmer, Bangor, Bhuck, Chairbre, Cuidithe, Cúna, Dyea, Hibernia
Paradigm Ulaidh | Masc | Fem |
---|---|---|
Uladh | Uladh |
Gender
seems to be lexical feature of PROPN
. 99% lemmas (126) occur only with one value of Gender
.
ADP
107 ADP tokens (15% of all ADP
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADP
and Gender
co-occurred: Number=Sing (107; 100%), Person=3 (107; 100%).
ADP
tokens may have the following values of Gender
:
Fem
(9; 8% of non-emptyGender
): aici, uirre, dhi, di, lei, léithi, ríaMasc
(98; 92% of non-emptyGender
): ann, aige, air, ‘na, na, d’á, dó, as, da, dhóEMPTY
(626): ar, ag, i, do, le, ó, go, re, gan, dochum
Paradigm ar | Masc | Fem |
---|---|---|
air | uirre |
ADJ
103 ADJ tokens (49% of all ADJ
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which ADJ
and Gender
co-occurred: Degree=EMPTY (103; 100%), Number=Sing (93; 90%), Case=Nom (85; 83%), Form=EMPTY (84; 82%).
ADJ
tokens may have the following values of Gender
:
Fem
(24; 23% of non-emptyGender
): ghloin, mhaith, shuthoin, aintréin, bheg, bhán, breagh, buidhe, cobhartha, direachMasc
(79; 77% of non-emptyGender
): beag, mór, dil, maith, Caoimh, Caoin, Sasanach, aisteach, allta, amhainEMPTY
(108): maith, mó, amháin, iomdha, ionann, mór, buaine, cóir, geal, mhór
Paradigm mór | Masc | Fem |
---|---|---|
Form=Len | mhór | |
mór |
DET
99 DET tokens (22% of all DET
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which DET
and Gender
co-occurred: Number=Sing (98; 99%), Case=EMPTY (53; 54%), Definite=EMPTY (53; 54%), Person=3 (53; 54%), Poss=Yes (53; 54%), PronType=EMPTY (53; 54%).
DET
tokens may have the following values of Gender
:
Fem
(28; 28% of non-emptyGender
): na, a, n-a, náMasc
(71; 72% of non-emptyGender
): a, an, n-a, doEMPTY
(351): an, na, gach, mo, a, do, eile, so, aon, sin
Paradigm an | Masc | Fem |
---|---|---|
an | na |
PRON
85 PRON tokens (41% of all PRON
tokens) have a non-empty value of Gender
.
The most frequent other feature values with which PRON
and Gender
co-occurred: Number=Sing (85; 100%), Person=3 (85; 100%), PronType=EMPTY (80; 94%).
PRON
tokens may have the following values of Gender
:
Fem
(13; 15% of non-emptyGender
): sí, í, si, siseMasc
(72; 85% of non-emptyGender
): sé, é, hé, se, e, seision, eisean, seiseanEMPTY
(124): sin, féin, mé, siad, tú, iad, so, a, sibh, tusa
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender
:
NOUN –[amod]–> ADJ (91; 87%),
NOUN –[conj]–> NOUN (46; 68%),
PROPN –[conj]–> PROPN (15; 100%),
PROPN –[appos]–> NOUN (14; 88%),
PROPN –[nmod]–> PROPN (10; 53%),
NOUN –[appos]–> PROPN (9; 69%),
PROPN –[flat:name]–> PROPN (9; 82%),
PROPN –[amod]–> ADJ (5; 100%),
PROPN –[conj]–> NOUN (5; 56%),
PROPN –[nmod]–> NOUN (5; 56%).