home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Sinhala-Appuwa: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

61 tokens (9%) have a non-empty value of Gender. 33 types (8%) occur at least once with a non-empty value of Gender. 25 lemmas (7%) occur at least once with a non-empty value of Gender. The feature is used with 3 part-of-speech tags: NOUN (33; 5% instances), PROPN (27; 4% instances), PRON (1; 0% instances).

NOUN

33 NOUN tokens (13% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (32; 97%), Definite=EMPTY (31; 94%), Animacy=EMPTY (28; 85%).

NOUN tokens may have the following values of Gender:

Gender seems to be lexical feature of NOUN. 100% lemmas (17) occur only with one value of Gender.

PROPN

27 PROPN tokens (45% of all PROPN tokens) have a non-empty value of Gender.

The most frequent other feature values with which PROPN and Gender co-occurred: Case=EMPTY (25; 93%), Animacy=EMPTY (20; 74%), Number=Sing (20; 74%).

PROPN tokens may have the following values of Gender:

PRON

1 PRON tokens (4% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Case=Dat (1; 100%), Number=EMPTY (1; 100%), Person=EMPTY (1; 100%), PronType=Prs (1; 100%).

PRON tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[compound]–> PROPN (5; 71%), PROPN –[nsubj]–> NOUN (2; 100%), PROPN –[conj]–> PROPN (1; 100%).