home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Catalan-AnCora: Features: Gender

This feature is universal. It occurs with 3 different values: Fem, Masc, Neut. Some words have combined values of the feature; 1 combinations have been observed: Fem|Masc.

194766 tokens (36%) have a non-empty value of Gender. 14350 types (44%) occur at least once with a non-empty value of Gender. 9829 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 8 part-of-speech tags: NOUN (86128; 16% instances), DET (75857; 14% instances), ADJ (20208; 4% instances), VERB (6815; 1% instances), PRON (3742; 1% instances), NUM (1358; 0% instances), AUX (650; 0% instances), PROPN (8; 0% instances).

NOUN

86128 NOUN tokens (87% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (59389; 69%).

NOUN tokens may have the following values of Gender:

Paradigm casMascFem
Number=Singcas
Number=Plurcasoscas

Gender seems to be lexical feature of NOUN. 99% lemmas (6523) occur only with one value of Gender.

DET

75857 DET tokens (87% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (58300; 77%), Number=Sing (57983; 76%), Definite=Def (53933; 71%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Definite=Def|Foreign=Yes|Number=Sing|PronType=Artel
Definite=Def|Number=Sing|PronType=Artella, L'
Definite=Def|Number=Plur|PronType=Artelsles
Definite=Ind|Number=Sing|PronType=Artla
Number=Sing|PronType=Artella
Number=Plur|Person=3|Poss=Yes|PronType=Prsles
Number=Plur|PronType=Artelsles

ADJ

20208 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (14619; 72%), Number=Sing (14061; 70%).

ADJ tokens may have the following values of Gender:

Paradigm nouMascFem
Number=Singnounova
Number=Plurnousnoves

VERB

6815 VERB tokens (16% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6815; 100%), Person=EMPTY (6815; 100%), Tense=Past (6815; 100%), VerbForm=Part (6815; 100%), Number=Sing (6564; 96%).

VERB tokens may have the following values of Gender:

Paradigm ferMascFem
Number=Singfet
Number=Plurfetes

PRON

3742 PRON tokens (16% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: PrepCase=EMPTY (3742; 100%), Reflex=EMPTY (3742; 100%), Number=Sing (2965; 79%), Case=EMPTY (2545; 68%), Person=EMPTY (2205; 59%).

PRON tokens may have the following values of Gender:

Paradigm ellFem,MascMascFemNeut
Case=Acc|Number=Singl'el, lo, 'l, -lo, lla, -laho, -ho
Case=Acc|Number=Plurles
Number=Singellella
Number=Plurellselles

NUM

1358 NUM tokens (14% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumType=Card (1358; 100%), NumForm=Word (1356; 100%), Number=Plur (891; 66%).

NUM tokens may have the following values of Gender:

Paradigm dosMascFem
Number=Singdos
Number=Plurdosdues
dosdues

AUX

650 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (650; 100%), Number=Sing (650; 100%), Person=EMPTY (650; 100%), Tense=Past (650; 100%), VerbForm=Part (650; 100%).

AUX tokens may have the following values of Gender:

PROPN

8 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (55270; 82%), NOUN –[amod]–> ADJ (14877; 64%), NOUN –[conj]–> NOUN (2733; 53%), DET –[det]–> DET (1202; 79%), NOUN –[appos]–> NOUN (1058; 51%), ADJ –[nsubj]–> NOUN (528; 60%), ADJ –[det]–> DET (458; 59%), ADJ –[conj]–> ADJ (428; 52%), PRON –[nmod]–> NOUN (411; 72%), NOUN –[acl]–> ADJ (127; 60%).