home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Tamil-TTB: Features: Gender

This feature is universal. It occurs with 3 different values: Com, Masc, Neut.

5292 tokens (55%) have a non-empty value of Gender. 2503 types (70%) occur at least once with a non-empty value of Gender. 1505 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (2753; 29% instances), PROPN (1370; 14% instances), AUX (477; 5% instances), VERB (425; 4% instances), PRON (236; 2% instances), NUM (16; 0% instances), PART (15; 0% instances).

NOUN

2753 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (2753; 100%), Number=Sing (2146; 78%), Case=Nom (1808; 66%).

NOUN tokens may have the following values of Gender:

Paradigm மக்கள்NeutCom
Animacy=Anim|Case=Acc|Number=Plurமக்களை, மக்களைக்
Animacy=Anim|Case=Dat|Number=Plurமக்களுக்கு
Animacy=Anim|Case=Gen|Number=Plurமக்களின்
Animacy=Anim|Case=Loc|Number=Plurமக்களிடம்
Animacy=Anim|Case=Nom|Number=Plurமக்கள்
Case=Dat|Number=Singமக்களுக்க்
Case=Dat|Number=Sing|Polite=Formமக்களுக்குப்

Gender seems to be lexical feature of NOUN. 98% lemmas (818) occur only with one value of Gender.

PROPN

1370 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (1370; 100%), Number=Sing (1337; 98%), Polite=EMPTY (1106; 81%), Case=Nom (839; 61%).

PROPN tokens may have the following values of Gender:

Paradigm தமிழர்NeutCom
Animacy=Anim|Case=Acc|Number=Plurதமிழர்களை, தமிழர்களைச்
Animacy=Anim|Case=Dat|Number=Plurதமிழர்களுக்க், தமிழர்களுக்கு, தமிழர்களுக்குத்
Animacy=Anim|Case=Nom|Number=Plurதமிழர்களின்
Case=Loc|Number=Plurதமிழர்களிடம்
Case=Nom|Number=Singதமிழர்

Gender seems to be lexical feature of PROPN. 99% lemmas (547) occur only with one value of Gender.

AUX

477 AUX tokens (75% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Polarity=Pos (466; 98%), Person=3 (460; 96%), Mood=Ind (419; 88%), VerbForm=Fin (419; 88%), Voice=Act (401; 84%), Polite=EMPTY (392; 82%), Number=Sing (355; 74%).

AUX tokens may have the following values of Gender:

Paradigm உள்NeutCom
Animacy=Anim|Mood=Ind|Number=Sing|Person=1|VerbForm=Finஉள்ளேன்
Animacy=Anim|Mood=Ind|Number=Plur|Person=1|VerbForm=Finஉள்ளோம்
Animacy=Anim|Mood=Ind|Number=Plur|Person=3|VerbForm=Finஉள்ளனர்
Case=Acc|Number=Sing|Person=3|VerbForm=Gerஉள்ளதைய்
Case=Ins|Number=Sing|Person=3|VerbForm=Gerஉள்ளதால்
Case=Nom|Number=Sing|Person=3|VerbForm=Gerஉள்ளத், உள்ளது
Mood=Ind|Number=Sing|Person=3|Polite=Form|VerbForm=Finஉள்ளார்
Mood=Ind|Number=Sing|Person=3|VerbForm=Finஉள்ளது
Mood=Ind|Number=Plur|Person=3|VerbForm=Finஉள்ளன

VERB

425 VERB tokens (34% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Polarity=Pos (423; 100%), Voice=Act (422; 99%), Person=3 (414; 97%), Number=Sing (351; 83%), Case=EMPTY (288; 68%), VerbForm=Fin (288; 68%), Mood=Ind (287; 68%), Polite=EMPTY (282; 66%), Tense=Past (232; 55%).

VERB tokens may have the following values of Gender:

Paradigm தெரிவிNeutCom
Case=Dat|Number=Sing|Tense=Past|VerbForm=Gerதெரிவித்ததற்க்
Mood=Ind|Number=Sing|Polite=Form|Tense=Past|VerbForm=Finதெரிவித்தார்
Mood=Ind|Number=Sing|Tense=Past|VerbForm=Finதெரிவித்தது
Mood=Ind|Number=Sing|Tense=Pres|VerbForm=Finதெரிவிக்கிறது
Mood=Ind|Number=Plur|Polite=Form|Tense=Past|VerbForm=Finதெரிவித்தனர்
Mood=Ind|Number=Plur|Polite=Form|Tense=Pres|VerbForm=Finதெரிவிக்கின்றனர்
Mood=Ind|Number=Plur|Tense=Past|VerbForm=Finதெரிவித்தன

PRON

236 PRON tokens (100% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (222; 94%), Person=3 (196; 83%), Number=Sing (175; 74%), Polite=EMPTY (166; 70%), Animacy=EMPTY (135; 57%), Case=Nom (134; 57%).

PRON tokens may have the following values of Gender:

Gender seems to be lexical feature of PRON. 100% lemmas (25) occur only with one value of Gender.

NUM

16 NUM tokens (6% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (16; 100%), NumForm=Digit (9; 56%).

NUM tokens may have the following values of Gender:

Gender seems to be lexical feature of NUM. 100% lemmas (13) occur only with one value of Gender.

PART

15 PART tokens (2% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: Polarity=Pos (15; 100%), VerbForm=Ger (15; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (485; 70%), NOUN –[nmod]–> PROPN (470; 68%), PROPN –[nmod]–> NOUN (114; 79%), NOUN –[conj]–> NOUN (89; 93%), PROPN –[conj]–> PROPN (77; 94%), NOUN –[obl]–> NOUN (51; 86%), PROPN –[conj]–> NOUN (15; 79%), NOUN –[nsubj]–> NOUN (14; 52%), NOUN –[nsubj]–> PROPN (14; 70%), NUM –[nmod]–> NOUN (9; 90%).