
This page pertains to UD version 2.

Treebank Statistics: UD_Tamil-MWTT: Features: Gender

This feature is universal. It occurs with 4 different values: Com, Fem, Masc, Neut.

619 tokens (24%) have a non-empty value of Gender. 269 types (32%) occur at least once with a non-empty value of Gender. 127 lemmas (29%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: VERB (351; 14% instances), PRON (153; 6% instances), AUX (71; 3% instances), NOUN (38; 1% instances), ADJ (2; 0% instances), ADV (2; 0% instances), PROPN (2; 0% instances).
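Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout; the sample sentence and the names `SAMPLE`, `parse_feats` and `gender_counts` are hypothetical illustrations, not data or code from UD_Tamil-MWTT.

```python
from collections import Counter

# Tiny hypothetical CoNLL-U sample (not taken from UD_Tamil-MWTT).
SAMPLE = """\
1\tஅவள்\tஅவள்\tPRON\t_\tCase=Nom|Gender=Fem|Number=Sing\t2\tnsubj\t_\t_
2\tஇருக்கிறாள்\tஇரு\tVERB\t_\tGender=Fem|Number=Sing|Person=3|Tense=Pres\t0\troot\t_\t_
3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts(conllu_text):
    """Return (total tokens, Counter of UPOS tags for tokens carrying Gender)."""
    total = 0
    by_upos = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        total += 1
        if "Gender" in parse_feats(cols[5]):
            by_upos[cols[3]] += 1
    return total, by_upos


total, by_upos = gender_counts(SAMPLE)
share = sum(by_upos.values()) / total  # fraction of tokens with a Gender value
```

Dividing each per-tag count by `total` would give percentages comparable to those quoted per part-of-speech tag, assuming they are taken over all tokens in the treebank.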

VERB

351 VERB tokens (69% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Person=3 (324; 92%), Mood=EMPTY (309; 88%), Polarity=EMPTY (309; 88%), VerbForm=EMPTY (309; 88%), Number=Sing (299; 85%), Tense=Past (211; 60%).
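The value EMPTY in these lists appears to stand for tokens on which the other feature is not set at all. A minimal sketch of such a co-occurrence count, under that assumption, is given here; the token list and the function name `cooccurrence` are hypothetical.

```python
from collections import Counter

# Hypothetical tokens as (UPOS, features) pairs; not real treebank data.
TOKENS = [
    ("VERB", {"Gender": "Masc", "Number": "Sing", "Person": "3", "Tense": "Past"}),
    ("VERB", {"Gender": "Neut", "Number": "Sing", "Person": "3", "Mood": "Ind"}),
    ("VERB", {"Number": "Sing", "Person": "1"}),  # no Gender: ignored
]


def cooccurrence(tokens, upos, feature, others):
    """Count values of `others` on `upos` tokens that carry `feature`.

    A feature that is absent on a token is counted as the pseudo-value EMPTY.
    """
    counts = Counter()
    n = 0
    for tag, feats in tokens:
        if tag != upos or feature not in feats:
            continue
        n += 1
        for other in others:
            counts[f"{other}={feats.get(other, 'EMPTY')}"] += 1
    return n, counts


n, counts = cooccurrence(TOKENS, "VERB", "Gender", ["Person", "Mood", "Tense"])
```

Each count divided by `n` corresponds to the percentage shown after the semicolon.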

VERB tokens may have the following values of Gender:

Paradigm இரு (Gender values: Masc, Fem, Neut, Com)
Mood=Ind|Number=Plur|Person=3|Polarity=Pos|Polite=Form|Tense=Pres|VerbForm=Fin|Voice=Act: இருக்கின்றன
Number=Sing|Person=1|Tense=Pres: இருக்கிறேன்
Number=Sing|Person=3|Tense=Fut: இருப்பான்
Number=Sing|Person=3|Tense=Past: இருந்தான், இருந்தது
Number=Sing|Person=3|Tense=Pres: இருக்கிறான், இருக்கிறாள், இருக்கிறது
Number=Plur|Person=3|Tense=Pres: இருக்கின்றன, இருக்கிறார்கள்

PRON

153 PRON tokens (89% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (127; 83%), Animacy=EMPTY (117; 76%), PronType=EMPTY (108; 71%), Case=Nom (86; 56%).

PRON tokens may have the following values of Gender:

Paradigm அவள் (Gender values: Masc, Fem)
Case=Acc: அவளை
Case=Dat: அவளுக்கு
Case=Nom: அவள், அவள்

Gender seems to be a lexical feature of PRON. 95% of lemmas (19) occur with only one value of Gender.
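The lexical-feature observation rests on how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; the observation list and the name `single_valued_share` are hypothetical, and the exact threshold used to call a feature lexical is not stated on this page.

```python
from collections import defaultdict

# Hypothetical (lemma, Gender value) observations for one part of speech.
OBSERVATIONS = [
    ("அவன்", "Masc"), ("அவன்", "Masc"),
    ("அது", "Neut"),
    ("அவள்", "Fem"), ("அவள்", "Masc"),  # one lemma seen with two values
]


def single_valued_share(observations):
    """Return (lemmas with exactly one Gender value, all lemmas seen)."""
    values = defaultdict(set)
    for lemma, value in observations:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, lemmas = single_valued_share(OBSERVATIONS)
share = single / lemmas  # close to 1.0 suggests the feature is lexical
```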

AUX

71 AUX tokens (83% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=EMPTY (69; 97%), Person=3 (68; 96%), Polarity=EMPTY (67; 94%), Number=Sing (64; 90%).

AUX tokens may have the following values of Gender:

Paradigm இரு (Gender values: Masc, Neut, Com)
Number=Sing|Polite=Form|Tense=Pres: இருக்கிறார்
Number=Sing|Tense=Fut: இருப்பான், இருக்கும்
Number=Sing|Tense=Past: இருந்தான்
Number=Sing|Tense=Pres: இருக்கிறான், இருக்கிறது
Number=Plur|Tense=Fut: இருப்பார்கள்
Number=Plur|Tense=Pres: இருக்கிறார்கள்

NOUN

38 NOUN tokens (7% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (38; 100%), Number=Sing (33; 87%), Case=Nom (20; 53%).

NOUN tokens may have the following values of Gender:

Gender seems to be a lexical feature of NOUN. 100% of lemmas (22) occur with only one value of Gender.

ADJ

2 ADJ tokens (6% of all ADJ tokens) have a non-empty value of Gender.

ADJ tokens may have the following values of Gender:

ADV

2 ADV tokens (1% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Case=EMPTY (2; 100%), Number=Sing (2; 100%), Person=EMPTY (2; 100%).

ADV tokens may have the following values of Gender:

PROPN

2 PROPN tokens (1% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Number=Sing (2; 100%), Person=3 (2; 100%), Polite=EMPTY (2; 100%).

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The most frequent relations where the parent and child nodes agree in Gender (up to 10 are listed): VERB –[nsubj]–> PRON (37; 53%), PRON –[nsubj]–> PRON (6; 100%), VERB –[obl]–> PRON (3; 75%), VERB –[nsubj:nc]–> PRON (2; 67%), VERB –[xcomp]–> NOUN (2; 100%), PRON –[nmod]–> PRON (1; 100%), PROPN –[obl]–> PROPN (1; 100%).
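Agreement counts of this kind can be sketched as below. This is an illustration only: it assumes the percentage is taken over parent-child pairs in which both nodes carry a Gender value, which this page does not state, and the sentence and the name `agreement` are hypothetical.

```python
from collections import Counter

# Hypothetical sentence: (id, head, UPOS, deprel, Gender or None).
SENTENCE = [
    (1, 2, "PRON", "nsubj", "Fem"),
    (2, 0, "VERB", "root", "Fem"),
    (3, 2, "PRON", "obl", "Masc"),
    (4, 2, "NOUN", "obj", None),
]


def agreement(sentence):
    """Count agreeing and comparable parent-child pairs per relation type."""
    by_id = {tok[0]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for _, head, upos, deprel, gender in sentence:
        parent = by_id.get(head)  # None for the root, whose head is 0
        # Assumption: only pairs where both nodes have Gender are comparable.
        if parent is None or gender is None or parent[4] is None:
            continue
        key = f"{parent[2]} -[{deprel}]-> {upos}"
        total[key] += 1
        if gender == parent[4]:
            agree[key] += 1
    return agree, total


agree, total = agreement(SENTENCE)
```

For each relation, `agree[key]` corresponds to the first number in the parentheses and `agree[key] / total[key]` to the percentage, under the stated assumption.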