home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Catalan: Features: Gender

This feature is universal. It occurs with 2 different values: Fem, Masc.

193535 tokens (36%) have a non-empty value of Gender. 14362 types (44%) occur at least once with a non-empty value of Gender. 9877 lemmas (42%) occur at least once with a non-empty value of Gender. The feature is used with 11 part-of-speech tags: NOUN (85332; 16% instances), DET (61191; 12% instances), ADJ (20196; 4% instances), ADP (14673; 3% instances), VERB (6669; 1% instances), PRON (3247; 1% instances), NUM (1369; 0% instances), AUX (796; 0% instances), ADV (60; 0% instances), PROPN (1; 0% instances), SYM (1; 0% instances).

NOUN

85332 NOUN tokens (86% of all NOUN tokens) have a non-empty value of Gender.

The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (59388; 70%).

NOUN tokens may have the following values of Gender:

Paradigm casMascFem
Number=Singcas
Number=Plurcasoscas

Gender seems to be lexical feature of NOUN. 99% lemmas (6537) occur only with one value of Gender.

DET

61191 DET tokens (84% of all DET tokens) have a non-empty value of Gender.

The most frequent other feature values with which DET and Gender co-occurred: Number=Sing (47167; 77%), PronType=Art (43634; 71%), Definite=Def (39267; 64%).

DET tokens may have the following values of Gender:

Paradigm elMascFem
Definite=Def|Number=Sing|PronType=Artella, L'
Definite=Def|Number=Plur|PronType=Artelsles
Number=Sing|PronType=Artella
Number=Plur|Person=3|Poss=Yes|PronType=Prsles
Number=Plur|PronType=Artelsles

ADJ

20196 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (14607; 72%), Number=Sing (14049; 70%).

ADJ tokens may have the following values of Gender:

Paradigm nouMascFem
Number=Singnounova
Number=Plurnousnoves

ADP

14673 ADP tokens (17% of all ADP tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Preppron (14673; 100%), Number=Sing (10823; 74%).

ADP tokens may have the following values of Gender:

Gender seems to be lexical feature of ADP. 100% lemmas (14) occur only with one value of Gender.

VERB

6669 VERB tokens (17% of all VERB tokens) have a non-empty value of Gender.

The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (6669; 100%), Person=EMPTY (6669; 100%), Tense=Past (6669; 100%), VerbForm=Part (6669; 100%), Number=Sing (6433; 96%).

VERB tokens may have the following values of Gender:

Paradigm ferMascFem
Number=Singfet
Number=Plurfetes

PRON

3247 PRON tokens (14% of all PRON tokens) have a non-empty value of Gender.

The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2458; 76%), Person=EMPTY (2206; 68%).

PRON tokens may have the following values of Gender:

Paradigm ellMascFem
Case=Acc|Number=Singel, lo, 'l, li, -lola, -la
Case=Acc|Number=Plurels, 'lsles
Number=Singellella
Number=Plurells, elselles

NUM

1369 NUM tokens (15% of all NUM tokens) have a non-empty value of Gender.

The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (1369; 100%), NumType=Card (1369; 100%), Number=Plur (891; 65%).

NUM tokens may have the following values of Gender:

Paradigm dosMascFem
Number=Singdos
Number=Plurdosdues
dos

AUX

796 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.

The most frequent other feature values with which AUX and Gender co-occurred: Mood=EMPTY (796; 100%), Person=EMPTY (796; 100%), Tense=Past (796; 100%), VerbForm=Part (796; 100%), Number=Sing (781; 98%).

AUX tokens may have the following values of Gender:

Gender seems to be lexical feature of AUX. 100% lemmas (59) occur only with one value of Gender.

ADV

60 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.

The most frequent other feature values with which ADV and Gender co-occurred: Polarity=EMPTY (60; 100%).

ADV tokens may have the following values of Gender:

PROPN

1 PROPN tokens (0% of all PROPN tokens) have a non-empty value of Gender.

PROPN tokens may have the following values of Gender:

SYM

1 SYM tokens (0% of all SYM tokens) have a non-empty value of Gender.

The most frequent other feature values with which SYM and Gender co-occurred: NumForm=EMPTY (1; 100%), NumType=EMPTY (1; 100%).

SYM tokens may have the following values of Gender:

Relations with Agreement in Gender

The 10 most frequent relations where parent and child node agree in Gender: NOUN –[det]–> DET (46906; 81%), NOUN –[amod]–> ADJ (14858; 64%), NOUN –[conj]–> NOUN (2642; 53%), DET –[det]–> DET (1177; 81%), NOUN –[appos]–> NOUN (1032; 51%), ADJ –[nsubj]–> NOUN (558; 60%), ADJ –[conj]–> ADJ (427; 52%), PRON –[nmod]–> NOUN (411; 73%), ADJ –[det]–> DET (384; 62%), NOUN –[acl]–> ADJ (148; 60%).