Gender: gender
This document is a placeholder for the language-specific documentation for Gender.
Treebank Statistics (UD_Romanian)
This feature is universal.
It occurs with 2 different values: Fem, Masc.
90602 tokens (41%) have a non-empty value of Gender.
24276 types (77%) occur at least once with a non-empty value of Gender.
12135 lemmas (70%) occur at least once with a non-empty value of Gender.
The feature is used with 8 part-of-speech tags: NOUN (52829; 24% instances), ADJ (14794; 7% instances), DET (10394; 5% instances), VERB (7729; 4% instances), PRON (3174; 1% instances), NUM (901; 0% instances), AUX (475; 0% instances), PROPN (306; 0% instances).
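Counts like the ones above can be reproduced from a treebank in CoNLL-U format, where each token line has ten tab-separated columns, UPOS is the fourth and FEATS the sixth. The following is a minimal sketch, not necessarily how the statistics on this page were generated. The function names and the three-token `SAMPLE` are made up for illustration and are not taken from UD_Romanian.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Acc,Nom|Gender=Fem' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and those with a non-empty Gender."""
    with_gender = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank line between sentences, or a comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        upos = cols[3]
        total[upos] += 1
        if "Gender" in parse_feats(cols[5]):
            with_gender[upos] += 1
    return with_gender, total


# Hand-made toy input (an assumption for illustration, not treebank data).
SAMPLE = "\n".join([
    "# toy example",
    "1\ttimpul\ttimp\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Gender=Masc|Number=Sing\t0\troot\t_\t_",
    "2\tmare\tmare\tADJ\t_\tDefinite=Ind|Degree=Pos|Gender=Masc|Number=Sing\t1\tamod\t_\t_",
    "3\teste\tfi\tAUX\t_\tMood=Ind|Number=Sing|Person=3|Tense=Pres\t1\tcop\t_\t_",
])

if __name__ == "__main__":
    with_gender, total = gender_counts_by_upos(SAMPLE)
    for upos in sorted(total):
        print(upos, with_gender[upos], total[upos])
```

Dividing `with_gender[upos]` by `total[upos]` gives the per-tag share reported in the sections below, such as the 97% for NOUN.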
NOUN
52829 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (38507; 73%), Case=Acc,Nom (28798; 55%), Definite=Def (27175; 51%).
NOUN tokens may have the following values of Gender:
Fem (32551; 62% of non-empty Gender): conformitate, membre, statele, Comisia, parte, față, partea, fața, comisiei, urmă
Masc (20278; 38% of non-empty Gender): ani, timp, cazul, loc, timpul, mod, acord, b, lucru, cadrul
EMPTY (1378): art., a., nr., b., mg, lit., alin., ml, CE, I.
Paradigm timp | Masc | Fem |
---|---|---|
Case=Acc,Nom|Definite=Def|Number=Sing | timpul | |
Case=Acc,Nom|Definite=Def|Number=Sing|Variant=Short | timpu' | |
Case=Acc,Nom|Definite=Def|Number=Plur | timpurile | |
Case=Dat,Gen|Definite=Def|Number=Sing | timpului | |
Definite=Ind|Number=Sing | timp | |
Definite=Ind|Number=Plur | timpuri | |
Gender seems to be a lexical feature of NOUN. 92% of lemmas (7007) occur with only one value of Gender.
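The "lexical feature" observation rests on the share of lemmas that are seen with a single Gender value. A minimal sketch of that computation follows, assuming the (lemma, Gender) pairs have already been extracted for tokens with a non-empty Gender. The function name and the sample pairs are illustrative assumptions, not treebank data.

```python
from collections import defaultdict


def single_gender_share(pairs):
    """Return (lemmas with exactly one Gender value, all lemmas, share).

    `pairs` is an iterable of (lemma, gender) tuples, one per token
    that has a non-empty Gender value.
    """
    values = defaultdict(set)
    for lemma, gender in pairs:
        values[lemma].add(gender)
    n_lemmas = len(values)
    n_single = sum(1 for seen in values.values() if len(seen) == 1)
    share = n_single / n_lemmas if n_lemmas else 0.0
    return n_single, n_lemmas, share


# Invented sample: two lemmas with one value each, one lemma with both.
SAMPLE_PAIRS = [
    ("timp", "Masc"),
    ("timp", "Masc"),
    ("parte", "Fem"),
    ("membru", "Masc"),
    ("membru", "Fem"),
]

if __name__ == "__main__":
    n_single, n_lemmas, share = single_gender_share(SAMPLE_PAIRS)
    print(f"{n_single} of {n_lemmas} lemmas ({share:.0%}) have a single Gender value")
```

A high share suggests that Gender is fixed per lemma rather than inflected, which is what the page reports for NOUN.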
ADJ
14794 ADJ tokens (96% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (14758; 100%), Definite=Ind (13906; 94%), Number=Sing (9889; 67%), Case=EMPTY (9064; 61%).
ADJ tokens may have the following values of Gender:
Fem (9326; 63% of non-empty Gender): mare, europene, necesare, europeană, prezenta, mică, română, naționale, chimice, prezentei
Masc (5468; 37% of non-empty Gender): mare, nou, prezentul, european, general, prezentului, mic, național, românesc, singur
EMPTY (574): asemenea, mari, mici, noi, vechi, standard, anume, românești, roșii, vii
Paradigm mare | Masc | Fem |
---|---|---|
Case=Acc,Nom|Definite=Def|Number=Sing | marele | marea |
Case=Acc,Nom|Definite=Def|Number=Plur | marii | marile |
Case=Acc,Nom|Definite=Ind|Number=Sing | mare | |
Case=Dat,Gen|Definite=Def|Number=Sing | marelui | |
Case=Dat,Gen|Definite=Ind|Number=Sing | mari | |
Definite=Ind|Number=Sing | mare | |
DET
10394 DET tokens (86% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Position=EMPTY (8962; 86%), Number=Sing (8614; 83%), Person=EMPTY (7701; 74%), Poss=EMPTY (7076; 68%), Case=Acc,Nom (6033; 58%), PronType=Ind (5364; 52%).
DET tokens may have the following values of Gender:
Fem (6268; 60% of non-empty Gender): o, a, ale, unei, toate, această, aceste, cele, alte, multe
Masc (4126; 40% of non-empty Gender): un, al, unui, acest, cel, său, ai, același, cei, acestui
EMPTY (1629): lui, lor, orice, unor, fiecare, ei, acestor, niște, tuturor, altor
Paradigm un | Masc | Fem |
---|---|---|
Case=Acc,Nom|Number=Sing | un, -un | o, -o |
Case=Acc,Nom|Number=Plur|Person=3|Position=Prenom | unii | unele |
Case=Dat,Gen|Number=Sing | unui | unei |
VERB
7729 VERB tokens (31% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Tense=EMPTY (7729; 100%), Mood=EMPTY (7729; 100%), VerbForm=Part (7729; 100%), Person=EMPTY (7729; 100%), Number=Sing (5707; 74%).
VERB tokens may have the following values of Gender:
Fem (2779; 36% of non-empty Gender): prevăzute, prevăzută, menționate, stabilite, legate, puse, utilizate, obținute, prezentate, aflate
Masc (4950; 64% of non-empty Gender): fost, avut, făcut, spus, putut, rupt, murit, dat, devenit, luat
EMPTY (17439): este, poate, era, trebuie, sunt, pot, avea, putea, are, privind
Paradigm avea | Masc | Fem |
---|---|---|
Number=Sing | avut | avută |
Number=Plur | avute | |
PRON
3174 PRON tokens (27% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Reflex=EMPTY (3174; 100%), Person=3 (3154; 99%), Variant=EMPTY (2690; 85%), Number=Sing (2205; 69%), Case=Acc,Nom (1977; 62%), PronType=Prs (1674; 53%).
PRON tokens may have the following values of Gender:
Fem (1557; 49% of non-empty Gender): o, le, ea, ceea, aceasta, acestea, una, -o, ele, toate
Masc (1617; 51% of non-empty Gender): el, -l, îl, unul, ei, -i, l-, cel, acesta, cei
EMPTY (8723): se, care, ce, s-, își, -și, și-, îi, -se, i
Paradigm el | Masc | Fem |
---|---|---|
Case=Acc,Nom|Number=Sing|Strength=Strong | el | ea |
Case=Acc,Nom|Number=Plur|Strength=Strong | ei | ele |
Case=Acc|Number=Sing|Strength=Weak | îl | o |
Case=Acc|Number=Sing|Strength=Weak|Variant=Short | -l, l-, l | -o |
Case=Acc|Number=Plur|Strength=Weak | îi | le |
Case=Acc|Number=Plur|Strength=Weak|Variant=Short | -i, i- | le-, -le |
Case=Dat,Gen|Number=Sing|Strength=Strong | lui | ei |
NUM
901 NUM tokens (16% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: NumForm=Word (853; 95%), Number=Plur (484; 54%), NumType=Ord (473; 52%).
NUM tokens may have the following values of Gender:
Fem (551; 61% of non-empty Gender): două, prima, doua, primele, milioane, ambele, mii, treia, ultimele, miliarde
Masc (350; 39% of non-empty Gender): primul, doi, doilea, ultimii, ultimul, unu, primului, amândoi, prim-, treilea
EMPTY (4638): 1, 2, 3, 4, trei, 5, 6, 7, 8, I
Paradigm doi | Masc | Fem |
---|---|---|
Number=Sing|NumType=Ord | doilea, secund | doua |
Number=Plur|NumType=Card | doi | două |
AUX
475 AUX tokens (7% of all AUX tokens) have a non-empty value of Gender.
The most frequent other feature values with which AUX and Gender co-occurred: VerbForm=Part (475; 100%), Mood=EMPTY (475; 100%), Tense=EMPTY (475; 100%), Number=Sing (475; 100%), Person=EMPTY (475; 100%).
AUX tokens may have the following values of Gender:
Masc (475; 100% of non-empty Gender): fost
EMPTY (6132): a, au, fi, este, sunt, va, ar, am, vor, fie
PROPN
306 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Gender.
PROPN tokens may have the following values of Gender:
Fem (246; 80% of non-empty Gender): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, Germaniei
Masc (60; 20% of non-empty Gender): Carpaților, Iașilor, Jiului, Banatul, Iașii, Israelul, Israelului, Aradului, Banatului, Bucureștiului
EMPTY (5617): România, Winston, București, Timișoara, Iași, CEE, Ion, Paris, Alexandru, O’Brien
Gender seems to be a lexical feature of PROPN. 100% of lemmas (88) occur with only one value of Gender.
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[amod]–> ADJ (11509; 96%),
NOUN –[nmod]–> NOUN (8860; 54%),
NOUN –[det]–> DET (7395; 79%),
NOUN –[conj]–> NOUN (2461; 73%),
VERB –[nsubjpass]–> NOUN (891; 58%),
ADJ –[conj]–> ADJ (650; 94%),
NOUN –[amod]–> DET (606; 82%),
VERB –[conj]–> VERB (515; 61%),
ADJ –[nsubj]–> NOUN (386; 93%),
ADJ –[nmod]–> NOUN (377; 52%).
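Agreement figures of this kind can be approximated by walking the dependency edges of each sentence and comparing the Gender of parent and child. The sketch below is one way to do that on CoNLL-U token lines. It assumes that the percentage is taken over edges where both nodes have a non-empty Gender. This page does not state the denominator, so treat that choice, the function names, and the contrived sample sentence as assumptions.

```python
from collections import Counter


def feats_to_dict(feats):
    """Turn a FEATS string such as 'Gender=Fem|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def gender_agreement(sentence_lines):
    """Count agreeing and comparable edges per (parent UPOS, deprel, child UPOS)."""
    nodes = {}
    for line in sentence_lines:
        cols = line.split("\t")
        # Keep only regular tokens with a numeric ID and a numeric HEAD.
        if len(cols) == 10 and cols[0].isdigit() and cols[6].isdigit():
            gender = feats_to_dict(cols[5]).get("Gender")
            nodes[int(cols[0])] = (cols[3], gender, int(cols[6]), cols[7])

    agree, comparable = Counter(), Counter()
    for upos, gender, head, deprel in nodes.values():
        if head == 0 or head not in nodes:
            continue  # the root has no parent to compare with
        parent_upos, parent_gender = nodes[head][0], nodes[head][1]
        if gender is None or parent_gender is None:
            continue  # assumption: only edges with Gender on both ends count
        key = (parent_upos, deprel, upos)
        comparable[key] += 1
        if gender == parent_gender:
            agree[key] += 1
    return agree, comparable


# Contrived sentence (not treebank data): one agreeing and one clashing amod.
SAMPLE_SENTENCE = [
    "1\tcomisia\tcomisie\tNOUN\t_\tGender=Fem|Number=Sing\t0\troot\t_\t_",
    "2\teuropeană\teuropean\tADJ\t_\tGender=Fem|Number=Sing\t1\tamod\t_\t_",
    "3\tmare\tmare\tADJ\t_\tGender=Masc|Number=Sing\t1\tamod\t_\t_",
]

if __name__ == "__main__":
    agree, comparable = gender_agreement(SAMPLE_SENTENCE)
    for key, n in comparable.items():
        print(key, agree[key], n)
```

Summing both counters over all sentences of a treebank and sorting by the agreeing count would give a ranking in the same shape as the list above.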
Gender in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]