Treebank Statistics: UD_Romanian-ArT: Features: Gender
This feature is universal.
It occurs with 2 different values: Fem, Masc.
132 tokens (23%) have a non-empty value of Gender.
107 types (36%) occur at least once with a non-empty value of Gender.
83 lemmas (37%) occur at least once with a non-empty value of Gender.
The feature is used with 6 part-of-speech tags: NOUN (78; 14% instances), PRON (22; 4% instances), DET (14; 2% instances), ADJ (8; 1% instances), NUM (5; 1% instances), VERB (5; 1% instances).
NOUN
78 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Gender.
The most frequent other feature values with which NOUN and Gender co-occurred: Number=Sing (69; 88%), Case=Acc,Nom (57; 73%), Definite=Def (40; 51%).
NOUN tokens may have the following values of Gender:
Fem(41; 53% of non-emptyGender): casă, feata, lamńa, oară, amiroańa, apă, barba, broască, bucăţle, caleaMasc(37; 47% of non-emptyGender): fiĉorlu, ańi, gardu, lucru, ќiro, Araplu, Merlu, Tată, Uvreulu, aluatluEMPTY(1): Muşata-Locului
| Paradigm mer | Masc | Fem |
|---|---|---|
| Case=Acc,Nom|Definite=Def|Number=Sing | Merlu | |
| Definite=Ind|Number=Plur | meari |
Gender seems to be lexical feature of NOUN. 98% lemmas (58) occur only with one value of Gender.
PRON
22 PRON tokens (31% of all PRON tokens) have a non-empty value of Gender.
The most frequent other feature values with which PRON and Gender co-occurred: Person=3 (22; 100%), Reflex=EMPTY (22; 100%), Number=Sing (19; 86%), PronType=Prs (18; 82%), Variant=EMPTY (15; 68%).
PRON tokens may have the following values of Gender:
Fem(10; 45% of non-emptyGender): u, nâsă, -o, O, aestă, liMasc(12; 55% of non-emptyGender): lu, vârnu, -l, Nâs, Nâşi, aestu, lo-, nâsă, năs, ĺ-EMPTY(49): s-, mi, si, ti, tine, ţi, -ĺi, cari, io, se-
| Paradigm el | Masc | Fem |
|---|---|---|
| Case=Acc|Number=Sing|Strength=Strong | u | |
| Case=Acc|Number=Sing|Strength=Weak | u, O | |
| Case=Acc|Number=Sing|Strength=Weak|Variant=Short | lu, -l | -o |
| Case=Acc|Number=Plur|Strength=Weak|Variant=Short | lo- | li |
| Case=Dat|Number=Sing|Strength=Weak|Variant=Short | ĺ- |
DET
14 DET tokens (93% of all DET tokens) have a non-empty value of Gender.
The most frequent other feature values with which DET and Gender co-occurred: Poss=EMPTY (13; 93%), Case=Acc,Nom (12; 86%), Position=EMPTY (12; 86%), Number=Sing (11; 79%), PronType=Ind (11; 79%), Person=EMPTY (8; 57%).
DET tokens may have the following values of Gender:
Fem(7; 50% of non-emptyGender): nă, Ună, alti, câte, ndoauăMasc(7; 50% of non-emptyGender): un, -su, Aestu, multuEMPTY(1): a
| Paradigm un | Masc | Fem |
|---|---|---|
| un | nă, Ună |
ADJ
8 ADJ tokens (89% of all ADJ tokens) have a non-empty value of Gender.
The most frequent other feature values with which ADJ and Gender co-occurred: Degree=Pos (8; 100%), Definite=Ind (7; 88%), Number=Sing (7; 88%), Case=Acc,Nom (6; 75%).
ADJ tokens may have the following values of Gender:
Fem(4; 50% of non-emptyGender): albă, greauă, mari, uscatMasc(4; 50% of non-emptyGender): aleptu, bun, mărli, sănătosEMPTY(1): Ahtare
| Paradigm mare | Masc | Fem |
|---|---|---|
| Definite=Def|Number=Plur | mărli | |
| Definite=Ind|Number=Sing | mari |
NUM
5 NUM tokens (83% of all NUM tokens) have a non-empty value of Gender.
The most frequent other feature values with which NUM and Gender co-occurred: Case=Acc,Nom (5; 100%), NumForm=Word (5; 100%), NumType=Card (5; 100%), Definite=Def (4; 80%), Number=Sing (3; 60%).
NUM tokens may have the following values of Gender:
Fem(3; 60% of non-emptyGender): ună, năMasc(2; 40% of non-emptyGender): doľi, treiľiEMPTY(1): unsprăyinģiţĺi
VERB
5 VERB tokens (4% of all VERB tokens) have a non-empty value of Gender.
The most frequent other feature values with which VERB and Gender co-occurred: Mood=EMPTY (5; 100%), Number=Sing (5; 100%), Person=EMPTY (5; 100%), Tense=EMPTY (5; 100%), VerbForm=Part (5; 100%).
VERB tokens may have the following values of Gender:
Fem(1; 20% of non-emptyGender): ascăpatăMasc(4; 80% of non-emptyGender): faptă, minduit, niapirită, nidatăEMPTY(110): adară, ascapă, ascăpă, aştipta, facă, feaţi, fudzirâ, imna, lucra, mutreaşte
Relations with Agreement in Gender
The 10 most frequent relations where parent and child node agree in Gender:
NOUN –[det]–> DET (13; 93%),
NOUN –[amod]–> ADJ (4; 67%),
ADJ –[nsubj]–> NOUN (1; 100%),
DET –[fixed]–> NOUN (1; 100%),
NOUN –[amod]–> NUM (1; 100%).