This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

Gender: gender

This document is a placeholder for the language-specific documentation for Gender.


Treebank Statistics (UD_Tamil)

This feature is universal. It occurs with 3 different values: Com, Masc, Neut.

5292 tokens (55%) have a non-empty value of Gender. 2504 types (70%) occur at least once with a non-empty value of Gender. 1505 lemmas (74%) occur at least once with a non-empty value of Gender. The feature is used with 7 part-of-speech tags: NOUN (2753; 29% instances), PROPN (1370; 14% instances), AUX (476; 5% instances), VERB (426; 4% instances), PRON (236; 2% instances), NUM (16; 0% instances), PART (15; 0% instances).
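Counts of this kind can be reproduced from a treebank file in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function names (`parse_feats`, `iter_tokens`, `gender_coverage`) and the sample rows are hypothetical, and the column positions assume the standard 10-column CoNLL-U layout, with the part-of-speech tag in column 4 and FEATS in column 6.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Gender=Neut' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def iter_tokens(conllu_text):
    """Yield the 10 columns of every ordinary token line.

    Comment lines, blank lines, and lines whose ID is not a plain integer
    (multiword ranges such as '1-2') are skipped.
    """
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            yield cols

def gender_coverage(conllu_text):
    """Return {tag: (tokens_with_Gender, all_tokens)} per part-of-speech tag."""
    total, with_gender = Counter(), Counter()
    for cols in iter_tokens(conllu_text):
        tag = cols[3]
        total[tag] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[tag] += 1
    return {tag: (with_gender[tag], total[tag]) for tag in total}

# Hypothetical three-token sentence; the annotation is illustrative only.
SAMPLE_ROWS = [
    ("1", "மக்கள்", "மக்கள்", "NOUN", "_",
     "Animacy=Anim|Case=Nom|Gender=Com|Number=Plur|Person=3", "2", "nsubj", "_", "_"),
    ("2", "உள்ளனர்", "உள்", "AUX", "_",
     "Animacy=Anim|Gender=Com|Mood=Ind|Number=Plur|Person=3|VerbForm=Fin", "0", "root", "_", "_"),
    ("3", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in SAMPLE_ROWS)

coverage = gender_coverage(SAMPLE)
```

Dividing the first number of each pair by the second gives the per-tag share of tokens with a non-empty Gender value.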

NOUN

2753 NOUN tokens (100% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Person=3 (2753; 100%), Number=Sing (2146; 78%), Case=Nom (1808; 66%).

NOUN tokens may have the following values of Gender:

Paradigm மக்கள் (columns: Neut, Com)
Animacy=Anim|Case=Acc|Number=Plur: மக்களைக், மக்களை
Animacy=Anim|Case=Dat|Number=Plur: மக்களுக்கு
Animacy=Anim|Case=Gen|Number=Plur: மக்களின்
Animacy=Anim|Case=Loc|Number=Plur: மக்களிடம்
Animacy=Anim|Case=Nom|Number=Plur: மக்கள்
Case=Dat|Number=Sing: மக்களுக்க்
Case=Dat|Number=Sing|Polite=Pol: மக்களுக்குப்

Gender appears to be a lexical feature of NOUN: 98% of lemmas (818) occur with only one value of Gender.
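One way to check a claim of this kind is to collect the set of Gender values seen for each lemma and count the lemmas whose set has exactly one member. The sketch below assumes that reading of the statistic; the function name `single_valued_lemmas` and the token list are hypothetical, and `LEMMA_B` is a placeholder rather than a real treebank lemma.

```python
from collections import defaultdict

def single_valued_lemmas(tokens, tag):
    """Return (lemmas_with_one_Gender_value, lemmas_with_any_Gender_value).

    tokens is an iterable of (lemma, part_of_speech, feats_dict) triples.
    Only tokens with the given tag and a Gender feature are considered.
    """
    values = defaultdict(set)
    for lemma, pos, feats in tokens:
        if pos == tag and "Gender" in feats:
            values[lemma].add(feats["Gender"])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)

# Hypothetical tokens; the Gender values are illustrative only.
TOKENS = [
    ("மக்கள்", "NOUN", {"Gender": "Com"}),
    ("மக்கள்", "NOUN", {"Gender": "Neut"}),
    ("LEMMA_B", "NOUN", {"Gender": "Neut"}),
    ("LEMMA_B", "NOUN", {"Gender": "Neut"}),
    ("தமிழர்", "PROPN", {"Gender": "Com"}),
]

single, total = single_valued_lemmas(TOKENS, "NOUN")
```

A share close to 100% suggests that Gender is fixed per lemma rather than inflectional for that part of speech.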

PROPN

1370 PROPN tokens (100% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Person=3 (1370; 100%), Number=Sing (1337; 98%), Polite=EMPTY (1106; 81%), Case=Nom (839; 61%).
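Entries such as Polite=EMPTY presumably count tokens that carry Gender but have no value for the other feature. A minimal sketch of such a co-occurrence count follows, under that assumption; the function name `cooccurring_values`, the `tracked` parameter, and the sample feature sets are hypothetical, and how the original statistics chose which features to report is not stated here.

```python
from collections import Counter

def cooccurring_values(feature_dicts, tracked):
    """Count Feature=Value pairs among tokens that carry Gender.

    feature_dicts is an iterable of dicts, one per token.
    tracked lists the feature names to report; a tracked feature that is
    absent from a token is counted as 'Feature=EMPTY'.
    Returns (counts, number_of_tokens_with_Gender).
    """
    counts = Counter()
    n_gender = 0
    for feats in feature_dicts:
        if "Gender" not in feats:
            continue
        n_gender += 1
        for name in tracked:
            counts[f"{name}={feats.get(name, 'EMPTY')}"] += 1
    return counts, n_gender

# Hypothetical feature sets for three tokens; the last one lacks Gender.
FEATS = [
    {"Gender": "Neut", "Person": "3", "Number": "Sing", "Case": "Nom"},
    {"Gender": "Com", "Person": "3", "Number": "Plur", "Polite": "Pol"},
    {"Case": "Nom"},
]

counts, n_gender = cooccurring_values(FEATS, ["Person", "Polite"])
```

Each count divided by `n_gender` gives a percentage comparable to the ones listed above.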

PROPN tokens may have the following values of Gender:

Paradigm தமிழர் (columns: Neut, Com)
Animacy=Anim|Case=Acc|Number=Plur: தமிழர்களை, தமிழர்களைச்
Animacy=Anim|Case=Dat|Number=Plur: தமிழர்களுக்க், தமிழர்களுக்குத், தமிழர்களுக்கு
Animacy=Anim|Case=Nom|Number=Plur: தமிழர்களின்
Case=Loc|Number=Plur: தமிழர்களிடம்
Case=Nom|Number=Sing: தமிழர்

Gender appears to be a lexical feature of PROPN: 99% of lemmas (547) occur with only one value of Gender.

AUX

476 AUX tokens (76% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Negative=Pos (465; 98%), Person=3 (459; 96%), VerbForm=Fin (418; 88%), Mood=Ind (418; 88%), Voice=Act (400; 84%), Polite=EMPTY (391; 82%), Number=Sing (355; 75%).

AUX tokens may have the following values of Gender:

Paradigm உள் (columns: Neut, Com)
Animacy=Anim|Mood=Ind|Number=Sing|Person=1|VerbForm=Fin: உள்ளேன்
Animacy=Anim|Mood=Ind|Number=Plur|Person=1|VerbForm=Fin: உள்ளோம்
Animacy=Anim|Mood=Ind|Number=Plur|Person=3|VerbForm=Fin: உள்ளனர்
Case=Acc|Number=Sing|Person=3|VerbForm=Ger: உள்ளதைய்
Case=Ins|Number=Sing|Person=3|VerbForm=Ger: உள்ளதால்
Case=Nom|Number=Sing|Person=3|VerbForm=Ger: உள்ளத், உள்ளது
Mood=Ind|Number=Sing|Person=3|Polite=Pol|VerbForm=Fin: உள்ளார்
Mood=Ind|Number=Sing|Person=3|VerbForm=Fin: உள்ளது
Mood=Ind|Number=Plur|Person=3|VerbForm=Fin: உள்ளன

VERB

426 VERB tokens (36% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Negative=Pos (424; 100%), Voice=Act (423; 99%), Person=3 (415; 97%), Number=Sing (351; 82%), Case=EMPTY (289; 68%), VerbForm=Fin (289; 68%), Mood=Ind (288; 68%), Polite=EMPTY (283; 66%), Tense=Past (232; 54%).

VERB tokens may have the following values of Gender:

Paradigm தெரிவி (columns: Neut, Com)
Case=Dat|Number=Sing|Tense=Past|VerbForm=Ger: தெரிவித்ததற்க்
Mood=Ind|Number=Sing|Polite=Pol|Tense=Past|VerbForm=Fin: தெரிவித்தார்
Mood=Ind|Number=Sing|Tense=Past|VerbForm=Fin: தெரிவித்தது
Mood=Ind|Number=Sing|Tense=Pres|VerbForm=Fin: தெரிவிக்கிறது
Mood=Ind|Number=Plur|Polite=Pol|Tense=Past|VerbForm=Fin: தெரிவித்தனர்
Mood=Ind|Number=Plur|Polite=Pol|Tense=Pres|VerbForm=Fin: தெரிவிக்கின்றனர்
Mood=Ind|Number=Plur|Tense=Past|VerbForm=Fin: தெரிவித்தன

PRON

236 PRON tokens (95% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PronType=Prs (222; 94%), Person=3 (196; 83%), Number=Sing (175; 74%), Polite=EMPTY (166; 70%), Animacy=EMPTY (135; 57%), Case=Nom (134; 57%).

PRON tokens may have the following values of Gender:

Gender appears to be a lexical feature of PRON: 100% of lemmas (25) occur with only one value of Gender.

NUM

16 NUM tokens (6% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=EMPTY (9; 56%), NumForm=Digit (9; 56%).

NUM tokens may have the following values of Gender:

Gender appears to be a lexical feature of NUM: 100% of lemmas (13) occur with only one value of Gender.

PART

15 PART tokens (2% of all PART tokens) have a non-empty value of Gender.

The most frequent other feature values with which PART and Gender co-occurred: VerbForm=Ger (15; 100%), Negative=Pos (15; 100%).

PART tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[nmod]–> NOUN (526; 73%), NOUN –[nmod]–> PROPN (473; 69%), PROPN –[nmod]–> NOUN (115; 79%), NOUN –[conj]–> NOUN (89; 95%), PROPN –[conj]–> PROPN (77; 94%), NOUN –[dobj]–> NOUN (19; 58%), NOUN –[nsubj]–> PROPN (14; 70%), NOUN –[nsubj]–> NOUN (14; 52%), PROPN –[conj]–> NOUN (12; 75%), NUM –[nmod]–> NOUN (9; 82%).
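Agreement counts like these can be gathered by walking each dependency tree and comparing the Gender value of every node with that of its head. The sketch below is one possible reading, not the original computation: it assumes that the notation is parent –[relation]–> child, that only pairs where both nodes carry Gender are counted, and that the percentage is the agreeing share of those pairs. The function name `agreement_counts` and the sample sentence are hypothetical.

```python
from collections import Counter

def agreement_counts(sentence):
    """Count parent/child pairs that agree in Gender.

    sentence is a list of dicts with keys id, upos, gender (or None),
    head, and deprel. Returns (agree, seen), two Counters keyed by
    (parent_tag, relation, child_tag). seen counts every pair where both
    nodes have Gender; agree counts those with equal values.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, seen = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:  # the root has head 0, which is not a token
            continue
        if child["gender"] is None or parent["gender"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        seen[key] += 1
        if child["gender"] == parent["gender"]:
            agree[key] += 1
    return agree, seen

# Hypothetical four-token sentence; the annotation is illustrative only.
SENTENCE = [
    {"id": 1, "upos": "PROPN", "gender": "Neut", "head": 2, "deprel": "nmod"},
    {"id": 2, "upos": "NOUN", "gender": "Neut", "head": 4, "deprel": "nsubj"},
    {"id": 3, "upos": "NOUN", "gender": "Com", "head": 2, "deprel": "conj"},
    {"id": 4, "upos": "VERB", "gender": None, "head": 0, "deprel": "root"},
]

agree, seen = agreement_counts(SENTENCE)
```

Summing both Counters over all sentences of a treebank and dividing `agree[key]` by `seen[key]` would give one agreement rate per relation pattern.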


Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]