Gender: gender
This document is a placeholder for the language-specific documentation for Gender.
Treebank Statistics (UD_Catalan)
This feature is universal.
It occurs with 2 different values: Fem, Masc.
193692 tokens (35%) have a non-empty value of Gender.
14355 types (44%) occur at least once with a non-empty value of Gender.
9875 lemmas (42%) occur at least once with a non-empty value of Gender.
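Token-level counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page: it assumes the standard ten-column, tab-separated CoNLL-U layout, and the names `parse_feats`, `iter_tokens`, `gender_coverage` and the three-token `SAMPLE` are hypothetical.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def iter_tokens(conllu_text):
    """Yield (form, lemma, upos, feats_dict) for every syntactic word."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3], parse_feats(cols[5])


def gender_coverage(conllu_text):
    """Return (tokens with Gender, all tokens, Counter of Gender values)."""
    values = Counter()
    total = 0
    for _form, _lemma, _upos, feats in iter_tokens(conllu_text):
        total += 1
        if "Gender" in feats:
            values[feats["Gender"]] += 1
    return sum(values.values()), total, values


# Hypothetical sample for illustration; not a sentence from UD_Catalan.
SAMPLE = "\n".join([
    "# text = la ciutat nova",
    "1\tla\tel\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tciutat\tciutat\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "3\tnova\tnou\tADJ\t_\tGender=Fem|Number=Sing\t2\tamod\t_\t_",
])

with_gender, total, values = gender_coverage(SAMPLE)
print(f"{with_gender} of {total} tokens have Gender: {dict(values)}")
```

Type and lemma counts could be derived the same way by collecting the distinct forms or lemmas that occur with `Gender` at least once.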
The feature is used with 11 part-of-speech tags: NOUN (85332; 16% instances), DET (75900; 14% instances), ADJ (20196; 4% instances), VERB (6751; 1% instances), PRON (3247; 1% instances), NUM (1369; 0% instances), AUX (714; 0% instances), ADP (121; 0% instances), ADV (60; 0% instances), PROPN (1; 0% instances), SYM (1; 0% instances).
NOUN
85332 NOUN tokens (86% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (59388; 70%).
NOUN tokens may have the following values of Gender:
Fem (41322; 48% of non-empty Gender): persones, obres, obra, empresa, llei, ciutat, zona, cosa, situació, banda
Masc (44010; 52% of non-empty Gender): anys, milions, any, president, temps, grup, projecte, cas, partit, director
EMPTY (13419): pessetes, any, través, cap, euros, juny, part, partir, dia, terme
Paradigm cas | Masc | Fem |
---|---|---|
Number=Sing | cas | |
Number=Plur | casos | cas |
Gender seems to be a lexical feature of NOUN. 99% of lemmas (6537) occur with only one value of Gender.
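The "lexical feature" observation rests on checking how many lemmas are seen with a single Gender value. A minimal sketch of that check follows; it is an assumption about how such a share could be computed, not the page generator's own code, and the function name `single_value_share` and the input pairs are hypothetical.

```python
from collections import defaultdict


def single_value_share(lemma_gender_pairs):
    """Return (lemmas with exactly one Gender value, lemmas with any Gender value)."""
    seen = defaultdict(set)
    for lemma, gender in lemma_gender_pairs:
        if gender:  # ignore tokens whose Gender is empty
            seen[lemma].add(gender)
    single = sum(1 for genders in seen.values() if len(genders) == 1)
    return single, len(seen)


# Hypothetical observations for illustration; these are not treebank counts.
pairs = [
    ("ciutat", "Fem"), ("ciutat", "Fem"),
    ("any", "Masc"), ("any", None),
    ("cas", "Masc"), ("cas", "Fem"),
]

single, total = single_value_share(pairs)
print(f"{single} of {total} lemmas occur with only one value of Gender")
```

A lemma that shows up with both values, like `cas` in the toy input, counts against the lexical reading; in real data such cases may reflect either genuine variation or annotation noise.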
DET
75900 DET tokens (87% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: PronType=Art (58343; 77%), Definite=Def (58142; 77%), Number=Sing (58128; 77%).
DET tokens may have the following values of Gender:
Fem (33547; 44% of non-empty Gender): la, les, una, seva, aquesta, seves, aquestes, totes, altra, tota
Masc (42353; 56% of non-empty Gender): el, els, un, aquest, seu, seus, aquests, tots, tot, mateix
EMPTY (11405): l’, altres, cap, cada, diferents, qualsevol, qual, nostres, meva, prou
Paradigm el | Masc | Fem |
---|---|---|
Definite=Def\|Number=Sing\|PronType=Art | el | la, L' |
Definite=Def\|Number=Plur\|PronType=Art | els | les |
Number=Plur\|Person=3\|Poss=Yes\|PronType=Prs | | les |
ADJ
20196 ADJ tokens (67% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: VerbForm=EMPTY (14607; 72%), Number=Sing (14049; 70%).
ADJ tokens may have the following values of Gender:
Fem (9118; 45% of non-empty Gender): primera, nova, catalana, noves, política, segona, única, pública, bona, espanyola
Masc (11078; 55% of non-empty Gender): passat, primer, nou, espanyol, nous, català, públic, últims, polític, últim
EMPTY (9849): gran, general, grans, actual, important, social, baix, possible, municipal, anterior
Paradigm nou | Masc | Fem |
---|---|---|
Number=Sing | nou | nova |
Number=Plur | nous | noves |
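A paradigm table like the one above can be assembled by grouping the observed forms of one lemma by Number and Gender. The sketch below is illustrative only: the helper names `build_paradigm` and `render` are hypothetical, and the input tuples simply restate the four forms shown in the table rather than reading them from the treebank.

```python
from collections import defaultdict


def build_paradigm(observations, lemma):
    """Group the forms of one lemma by (Number, Gender), skipping empty Gender."""
    cells = defaultdict(set)
    for form, obs_lemma, number, gender in observations:
        if obs_lemma == lemma and gender:
            cells[(number, gender)].add(form.lower())
    return cells


def render(cells, lemma):
    """Render the grouped forms as a pipe table with Masc and Fem columns."""
    lines = [f"Paradigm {lemma} | Masc | Fem |", "---|---|---|"]
    # Reverse alphabetical order puts "Sing" before "Plur".
    for number in sorted({num for num, _gender in cells}, reverse=True):
        masc = ", ".join(sorted(cells.get((number, "Masc"), ())))
        fem = ", ".join(sorted(cells.get((number, "Fem"), ())))
        lines.append(f"Number={number} | {masc} | {fem} |")
    return "\n".join(lines)


# (form, lemma, Number, Gender) tuples restating the forms in the table above.
observations = [
    ("nou", "nou", "Sing", "Masc"),
    ("nova", "nou", "Sing", "Fem"),
    ("nous", "nou", "Plur", "Masc"),
    ("noves", "nou", "Plur", "Fem"),
]

print(render(build_paradigm(observations, "nou"), "nou"))
```

Cells for which no form was observed stay empty, which is why some paradigms on this page have blank Masc or Fem entries.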
VERB
6751 VERB tokens (16% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=Past (6751; 100%), VerbForm=Part (6751; 100%), Mood=EMPTY (6751; 100%), Person=EMPTY (6751; 100%), Number=Sing (6515; 97%).
VERB tokens may have the following values of Gender:
Fem (332; 5% of non-empty Gender): dictada, aprovada, presentada, considerada, donada, atesa, inclosa, inaugurada, traslladada, coneguda
Masc (6419; 95% of non-empty Gender): fet, explicat, dit, presentat, tingut, estat, assegurat, destacat, demanat, passat
EMPTY (34700): fer, té, ha, està, és, tenir, donar, dir, tenen, arribar
Paradigm fer | Masc | Fem |
---|---|---|
Number=Sing | fet | |
Number=Plur | | fetes |
PRON
3247 PRON tokens (14% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Number=Sing (2458; 76%), Person=EMPTY (2206; 68%).
PRON tokens may have the following values of Gender:
Fem (907; 28% of non-empty Gender): una, la, -la, aquesta, altra, les, unes, -les, ella, algunes
Masc (2340; 72% of non-empty Gender): un, tot, el, ell, uns, -lo, ells, alguns, aquest, tots
EMPTY (20122): que, es, s’, hi, li, -se, on, què, se, això
Paradigm ell | Masc | Fem |
---|---|---|
Case=Acc\|Number=Sing | el, -lo, 'l, li | la, -la |
Case=Acc\|Number=Plur | els, 'ls | les, -les |
Number=Sing | ell | ella |
Number=Plur | ells, els | elles |
NUM
1369 NUM tokens (15% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=EMPTY (1369; 100%), NumType=Card (1369; 100%), Number=Plur (891; 65%).
NUM tokens may have the following values of Gender:
Fem (500; 37% of non-empty Gender): dues, una, mitja, ambdues, desena, tres-centes, Desenes, Vuit-centes, cinquena, dues-centes
Masc (869; 63% of non-empty Gender): dos, un, mig, ambdós, tercer, quart, cinc-cents, 2, centenars, desè
EMPTY (7892): tres, quatre, cent, 10, cinc, sis, 15, 30, 20, vuit
Paradigm dos | Masc | Fem |
---|---|---|
Number=Sing | dos | |
Number=Plur | dos | dues |
dos |
AUX
714 AUX tokens (3% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: Tense=Past (714; 100%), VerbForm=Part (714; 100%), Person=EMPTY (714; 100%), Mood=EMPTY (714; 100%), Number=Sing (699; 98%).
AUX tokens may have the following values of Gender:
Fem (9; 1% of non-empty Gender): aprovada, controlades, declarades, endeutada, investigada, investigades, presentades, remodelada, sostreta
Masc (705; 99% of non-empty Gender): estat, pogut, hagut, començat, volgut, anat, fet, tornat, deixat, arribat
EMPTY (21775): va, ha, és, van, han, ser, són, havia, pot, fa
Gender seems to be a lexical feature of AUX. 100% of lemmas (55) occur with only one value of Gender.
ADP
121 ADP tokens (0% of all ADP tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADP and Gender co-occurred: AdpType=Preppron (121; 100%).
ADP tokens may have the following values of Gender:
Fem (3; 2% of non-empty Gender): da
Masc (118; 98% of non-empty Gender): dels, als, Del, al, do, DEL
EMPTY (87854): de, a, d’, per, en, amb, entre, sobre, segons, des
Gender seems to be a lexical feature of ADP. 100% of lemmas (11) occur with only one value of Gender.
ADV
60 ADV tokens (0% of all ADV tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADV and Gender co-occurred: Negative=EMPTY (60; 100%).
ADV tokens may have the following values of Gender:
Masc (60; 100% of non-empty Gender): més, fins, enfront, entorn, enllà, quant, prop, enmig
EMPTY (15397): no, més, també, ja, després, ahir, molt, avui, només, ara
SYM
1 SYM token (0% of all SYM tokens) has a non-empty value of Gender.
The most frequent other feature values with which SYM and Gender co-occurred: NumForm=EMPTY (1; 100%), NumType=EMPTY (1; 100%).
SYM tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): 1%
EMPTY (4637): ’, %, 50%, 10%, 30%, 5%, 40%, 2%, 25%, 1%
PROPN
1 PROPN token (0% of all PROPN tokens) has a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Masc (1; 100% of non-empty Gender): Justícia
EMPTY (46730): Catalunya, Barcelona, Generalitat, Govern, sant, Ajuntament, Girona, Josep, CiU, PP
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (54766; 82%),
NOUN –[amod]–> ADJ (14858; 64%),
NOUN –[conj]–> NOUN (2649; 53%),
DET –[det]–> DET (1207; 79%),
NOUN –[appos]–> NOUN (1035; 51%),
ADJ –[nsubj]–> NOUN (565; 60%),
ADJ –[det]–> DET (459; 59%),
ADJ –[conj]–> ADJ (427; 52%),
PRON –[nmod]–> NOUN (420; 72%),
NOUN –[acl]–> ADJ (148; 60%).
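Agreement counts like those listed above can be approximated by comparing the Gender of each word with that of its head. The sketch below is an assumption about one way to do this, not the page generator's method: it counts only the agreeing pairs and does not attempt to reproduce the percentages. The names `gender_of`, `agreement_counts` and the sample sentence are hypothetical.

```python
from collections import Counter


def gender_of(feats):
    """Extract the Gender value from a CoNLL-U FEATS string, or None."""
    for pair in feats.split("|"):
        if pair.startswith("Gender="):
            return pair.split("=", 1)[1]
    return None


def agreement_counts(conllu_text):
    """Count (parent UPOS, deprel, child UPOS) triples whose nodes share a Gender value."""
    counts = Counter()
    # Sentences in CoNLL-U are separated by blank lines.
    for block in conllu_text.strip().split("\n\n"):
        words = {}
        for line in block.splitlines():
            cols = line.split("\t")
            if line.startswith("#") or len(cols) != 10 or not cols[0].isdigit():
                continue
            head = int(cols[6]) if cols[6].isdigit() else 0
            # id -> (upos, gender, head id, deprel)
            words[int(cols[0])] = (cols[3], gender_of(cols[5]), head, cols[7])
        for upos, gender, head, deprel in words.values():
            if head == 0 or gender is None or head not in words:
                continue
            parent_upos, parent_gender = words[head][0], words[head][1]
            if parent_gender == gender:
                counts[(parent_upos, deprel, upos)] += 1
    return counts


# Hypothetical sample for illustration; not a sentence from UD_Catalan.
SAMPLE = "\n".join([
    "# text = la ciutat nova",
    "1\tla\tel\tDET\t_\tDefinite=Def|Gender=Fem|Number=Sing|PronType=Art\t2\tdet\t_\t_",
    "2\tciutat\tciutat\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "3\tnova\tnou\tADJ\t_\tGender=Fem|Number=Sing\t2\tamod\t_\t_",
])

for (parent, rel, child), n in agreement_counts(SAMPLE).most_common(10):
    print(f"{parent} –[{rel}]–> {child} ({n})")
```

Pairs in which either node lacks Gender are skipped, so the counts cover only relations where agreement is observable.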
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]