home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-IDT: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

42686 tokens (37%) have a non-empty value of Gender. 11521 types (77%) occur at least once with a non-empty value of Gender. 6435 lemmas (72%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (28277; 24% instances), PROPN (4957; 4% instances), ADJ (3494; 3% instances), DET (2418; 2% instances), ADP (1795; 2% instances), PRON (1736; 1% instances), AUX (9; 0% instances).

NOUN

28277 NOUN tokens (85% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: VerbForm=EMPTY (28264; 100%), Number=Sing (22905; 81%), Case=Nom (21730; 77%), Form=EMPTY (20366; 72%), Definite=EMPTY (15537; 55%).

NOUN tokens may have the following values of Gender:

Paradigm básMascFem
Case=Gen|Definite=Def|Form=Ecl|Number=Singmbáis
Case=Gen|Definite=Def|Form=Len|Number=Singbháis
Case=Gen|Form=Len|Number=Singbháis
Case=Gen|NounType=Strong|Number=PlurBásanna
Case=Gen|Number=Singbáis
Case=Nom|Definite=Def|Form=Ecl|Number=Singmbás
Case=Nom|Definite=Def|Form=Len|Number=Singbhás
Case=Nom|Definite=Def|Number=Singbás, b(h)ás
Case=Nom|Definite=Def|Number=Sing|Typo=Yesbas
Case=Nom|Form=Len|Number=Singbhás
Case=Nom|Form=Len|Number=Plurbhásanna
Case=Nom|Number=Singbás
Case=Nom|Number=Plurbásanna

Gender seems to be lexical feature of NOUN. 99% lemmas (4226) occur only with one value of Gender.

PROPN

4957 PROPN tokens (87% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Definite=Def (4941; 100%), Number=Sing (4570; 92%), Form=EMPTY (3599; 73%).

PROPN tokens may have the following values of Gender:

Paradigm CiarraíMascFem
Case=Gen|Form=LenChiarraí
Case=Nom|Form=LenChiarraíChiarraí
Case=NomCiarraí
Form=LenChiarraí
Ciarraí

Gender seems to be lexical feature of PROPN. 98% lemmas (1637) occur only with one value of Gender.

ADJ

3494 ADJ tokens (54% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: Degree=EMPTY (3494; 100%), VerbForm=EMPTY (3494; 100%), Case=Nom (3085; 88%), Form=EMPTY (2888; 83%), NounType=EMPTY (2429; 70%), Number=Sing (2419; 69%).

ADJ tokens may have the following values of Gender:

Paradigm mórMascFem
Case=Gen|Form=Len|Number=Singmhóir
Case=Gen|NounType=Strong|Number=Plurmóramóra
Case=Gen|NounType=Weak|Number=Plurmór
Case=Gen|Number=SingMóirMóire, Móir
Case=Nom|Form=Len|NounType=NotSlender|Number=Plurmhóra
Case=Nom|Form=Len|NounType=Slender|Number=Plurmhóra
Case=Nom|Form=Len|Number=Singmhórmhór
Case=Nom|NounType=NotSlender|Number=Plurmóramóra
Case=Nom|Number=Singmórmór
Number=SingMór

DET

2418 DET tokens (24% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (2418; 100%), Case=Gen (1886; 78%), Definite=Def (1886; 78%), Person=EMPTY (1886; 78%), Poss=EMPTY (1886; 78%), PronType=Art (1886; 78%).

DET tokens may have the following values of Gender:

Paradigm aFem,MascMascFem
Form=Ecln-a
aaa

ADP

1795 ADP tokens (10% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: Number=Sing (1795; 100%), Person=3 (1795; 100%), PronType=EMPTY (1644; 92%).

ADP tokens may have the following values of Gender:

Paradigm iFem,MascMascFem
anninti
Poss=Yesinaina, 'na, naina
Typo=Yesan

PRON

1736 PRON tokens (48% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (1736; 100%), Person=3 (1736; 100%), PronType=EMPTY (1702; 98%).

PRON tokens may have the following values of Gender:

AUX

9 AUX tokens (1% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Form=EMPTY (9; 100%), Polarity=EMPTY (9; 100%), Tense=Pres (9; 100%), VerbForm=Cop (9; 100%).

AUX tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (4026; 51%), NOUN –[amod]–> ADJ (3226; 88%), NOUN –[conj]–> NOUN (1196; 55%), PROPN –[det]–> DET (529; 58%), PROPN –[flat:name]–> PROPN (297; 75%), PROPN –[conj]–> PROPN (145; 58%), PROPN –[amod]–> ADJ (124; 91%), NOUN –[compound]–> NOUN (101; 64%), ADJ –[conj]–> ADJ (94; 96%), NOUN –[appos]–> NOUN (92; 60%).