home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-IDT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

42680 tokens (37%) have a non-empty value of Gender. 11526 types (77%) occur at least once with a non-empty value of Gender. 6457 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (28077; 24% instances), PROPN (5161; 4% instances), ADJ (3489; 3% instances), DET (2413; 2% instances), ADP (1795; 2% instances), PRON (1736; 1% instances), AUX (9; 0% instances).

NOUN

28077 NOUN tokens (85% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (28063; 100%), Number=Sing (22713; 81%), Case=Nom (21610; 77%), Form=EMPTY (20263; 72%), Definite=EMPTY (15513; 55%).

NOUN tokens may have the following values of Gender:

Paradigm básMascFem
Case=Gen|Definite=Def|Form=Ecl|Number=Singmbáis
Case=Gen|Definite=Def|Form=Len|Number=Singbháis
Case=Gen|Form=Len|Number=Singbháis
Case=Gen|NounType=Strong|Number=PlurBásanna
Case=Gen|Number=Singbáis
Case=Nom|Definite=Def|Form=Ecl|Number=Singmbás
Case=Nom|Definite=Def|Form=Len|Number=Singbhás
Case=Nom|Definite=Def|Number=Singbás, b(h)ás
Case=Nom|Definite=Def|Number=Sing|Typo=Yesbas
Case=Nom|Form=Len|Number=Singbhás
Case=Nom|Form=Len|Number=Plurbhásanna
Case=Nom|Number=Singbás
Case=Nom|Number=Plurbásanna

Gender seems to be lexical feature of NOUN. 99% lemmas (4225) occur only with one value of Gender.

PROPN

5161 PROPN tokens (87% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Definite=Def (5143; 100%), Number=Sing (4761; 92%), Form=EMPTY (3701; 72%).

PROPN tokens may have the following values of Gender:

Paradigm CiarraíMascFem
Case=Gen|Form=LenChiarraí
Case=Nom|Form=LenChiarraíChiarraí
Case=NomCiarraí
Form=LenChiarraí
Ciarraí

Gender seems to be lexical feature of PROPN. 98% lemmas (1668) occur only with one value of Gender.

ADJ

3489 ADJ tokens (54% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (3489; 100%), VerbForm=EMPTY (3489; 100%), Case=Nom (3080; 88%), Form=EMPTY (2885; 83%), NounType=EMPTY (2426; 70%), Number=Sing (2415; 69%).

ADJ tokens may have the following values of Gender:

Paradigm mórMascFem
Case=Gen|Form=Len|Number=Singmhóir
Case=Gen|NounType=Strong|Number=Plurmóramóra
Case=Gen|NounType=Weak|Number=Plurmór
Case=Gen|Number=SingMóirMóire, Móir
Case=Nom|Form=Len|NounType=NotSlender|Number=Plurmhóra
Case=Nom|Form=Len|NounType=Slender|Number=Plurmhóra
Case=Nom|Form=Len|Number=Singmhórmhór
Case=Nom|NounType=NotSlender|Number=Plurmóramóra
Case=Nom|Number=Singmórmór
Number=SingMór

DET

2413 DET tokens (23% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2413; 100%), Case=Gen (1883; 78%), Definite=Def (1883; 78%), Person=EMPTY (1883; 78%), Poss=EMPTY (1883; 78%), PronType=Art (1883; 78%).

DET tokens may have the following values of Gender:

Paradigm aFem,MascMascFem
Form=Ecln-a
aaa

ADP

1795 ADP tokens (10% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: Number=Sing (1795; 100%), Person=3 (1795; 100%), PronType=EMPTY (1644; 92%).

ADP tokens may have the following values of Gender:

Paradigm iFem,MascMascFem
anninti
Poss=Yesinaina, 'na, naina
Typo=Yesan

PRON

1736 PRON tokens (48% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1736; 100%), Person=3 (1736; 100%), PronType=EMPTY (1702; 98%).

PRON tokens may have the following values of Gender:

AUX

9 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Form=EMPTY (9; 100%), Polarity=EMPTY (9; 100%), Tense=Pres (9; 100%), VerbForm=Cop (9; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (3950; 51%), NOUN –[amod]–> ADJ (3217; 88%), NOUN –[conj]–> NOUN (1183; 55%), PROPN –[det]–> DET (545; 56%), PROPN –[flat:name]–> PROPN (298; 75%), PROPN –[conj]–> PROPN (152; 56%), PROPN –[amod]–> ADJ (130; 92%), PROPN –[nmod]–> NOUN (112; 51%), NOUN –[compound]–> NOUN (101; 64%), ADJ –[conj]–> ADJ (94; 96%).