Treebank Statistics: UD_Tagalog-TRG: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
24 tokens (3%) have a non-empty value of Gender.
14 types (6%) occur at least once with a non-empty value of Gender.
14 lemmas (8%) occur at least once with a non-empty value of Gender.
The feature is used with 4 part-of-speech tags: PROPN (18; 2% of instances), NOUN (4; 1% of instances), ADJ (1; 0% of instances), DET (1; 0% of instances).
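The overview counts above can be reproduced from a CoNLL-U file with a short script. The following is a minimal sketch, not the script that generated this page: the function name `feature_counts` and the `TOY` fragment are hypothetical, and the toy annotation is made up for illustration rather than taken from the treebank. It assumes the standard 10-column CoNLL-U layout, with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

def feature_counts(conllu_text, feature="Gender"):
    """Count tokens per UPOS tag that carry a non-empty value of `feature`."""
    with_value = Counter()  # UPOS -> tokens that have the feature
    totals = Counter()      # UPOS -> all tokens
    values = Counter()      # feature value -> tokens
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10:
            continue  # not a token row
        tok_id, upos, feats = cols[0], cols[3], cols[5]
        if "-" in tok_id or "." in tok_id:
            continue  # skip multiword token ranges and empty nodes
        totals[upos] += 1
        if feats == "_":
            continue  # no features on this token
        pairs = dict(p.split("=", 1) for p in feats.split("|"))
        if feature in pairs:
            with_value[upos] += 1
            values[pairs[feature]] += 1
    return with_value, totals, values

# Hypothetical toy input (illustrative only, not from UD_Tagalog-TRG).
TOY = "\n".join([
    "# toy fragment",
    "1\tMaestra\tmaestra\tNOUN\t_\tGender=Fem\t0\troot\t_\t_",
    "2\tsi\tsi\tADP\t_\tCase=Nom\t3\tcase\t_\t_",
    "3\tMaria\tMaria\tPROPN\t_\tGender=Fem\t1\tnsubj\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_",
])

if __name__ == "__main__":
    with_value, totals, values = feature_counts(TOY)
    for upos, n in with_value.most_common():
        share = round(100 * n / totals[upos])
        print(f"{upos}: {n} tokens with Gender ({share}% of all {upos} tokens)")
```

Dividing the per-tag count by the total for that tag gives the "% of all PROPN tokens" style of figure used in the per-tag sections; dividing by the overall token count gives the "% of instances" figures in the overview.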
PROPN
18 PROPN tokens (78% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (4; 22% of non-empty Gender): Linda, Maria, Mary, Rosa
Masc (14; 78% of non-empty Gender): Juan, Pedro, John, Bill
EMPTY (5): Maynila, City, Pasay
NOUN
4 NOUN tokens (3% of all NOUN tokens) have a non-empty value of Gender.
NOUN tokens may have the following values of Gender:
Fem (2; 50% of non-empty Gender): Biyuda, maestra
Masc (2; 50% of non-empty Gender): Biyudo, maestro
EMPTY (155): bata, pagkain, babae, nanay, libro, titser, bahay, bangka, banko, bigas
ADJ
1 ADJ token (5% of all ADJ tokens) has a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (1; 100%).
ADJ tokens may have the following values of Gender:
Fem (1; 100% of non-empty Gender): Komika
EMPTY (18): Mabuti, Malapit, bago, Bagong, Interesante, Maganda, Masagwa, Matalino, Matamis, Napakaano
DET
1 DET token (9% of all DET tokens) has a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Number=EMPTY (1; 100%), PronType=Emp (1; 100%).
DET tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): mismo
EMPTY (10): mga, lahat
Relations with Agreement in Gender
The most frequent relations (up to 10 are listed) in which the parent and child nodes agree in Gender:
NOUN –[nsubj]–> NOUN (2; 100%),
PROPN –[nmod]–> DET (1; 100%).
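Agreement counts of this kind can be computed by comparing each token's Gender value with that of its head. The sketch below is illustrative only: `gender_agreement` and `TOY_ROWS` are hypothetical names, the toy rows are made up rather than taken from the treebank, and it assumes that the percentage is the share of agreeing pairs among pairs in which both nodes have a Gender value. That reading of the percentage is an assumption, not something this page states.

```python
from collections import Counter

def gender_agreement(sentence_rows, feature="Gender"):
    """Count parent/child pairs in one sentence that agree in `feature`.

    sentence_rows: list of 10-column CoNLL-U token rows (lists of strings).
    Returns (agree, both), keyed by (parent UPOS, deprel, child UPOS).
    """
    def feat(row):
        if row[5] == "_":
            return None
        return dict(p.split("=", 1) for p in row[5].split("|")).get(feature)

    by_id = {row[0]: row for row in sentence_rows}
    agree, both = Counter(), Counter()
    for child in sentence_rows:
        parent = by_id.get(child[6])
        if parent is None:
            continue  # root (head 0) or head not found in this sentence
        pv, cv = feat(parent), feat(child)
        if pv is None or cv is None:
            continue  # only pairs where both nodes have the feature
        key = (parent[3], child[7], child[3])
        both[key] += 1
        if pv == cv:
            agree[key] += 1
    return agree, both

# Hypothetical toy sentence (illustrative only, not from UD_Tagalog-TRG).
TOY_ROWS = [line.split("\t") for line in [
    "1\tBiyuda\tbiyuda\tNOUN\t_\tGender=Fem\t0\troot\t_\t_",
    "2\tang\tang\tADP\t_\tCase=Nom\t3\tcase\t_\t_",
    "3\tmaestra\tmaestra\tNOUN\t_\tGender=Fem\t1\tnsubj\t_\t_",
]]

if __name__ == "__main__":
    agree, both = gender_agreement(TOY_ROWS)
    for (p_upos, deprel, c_upos), n in agree.most_common(10):
        pct = round(100 * n / both[(p_upos, deprel, c_upos)])
        print(f"{p_upos} -[{deprel}]-> {c_upos} ({n}; {pct}%)")
```

Summing the two counters over all sentences of a treebank and keeping the ten largest entries of `agree` would give a list in the same shape as the one above.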