Treebank Statistics: UD_Romanian-TueCL: Features: Definite
This feature is universal.
It occurs with 2 different values: Def, Ind.
1053 tokens (24%) have a non-empty value of Definite.
756 types (48%) occur at least once with a non-empty value of Definite.
585 lemmas (51%) occur at least once with a non-empty value of Definite.
The feature is used with 5 part-of-speech tags: NOUN (821; 19% instances), ADJ (224; 5% instances), PROPN (6; 0% instances), DET (1; 0% instances), NUM (1; 0% instances).
NOUN
821 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.
The most frequent other feature values with which NOUN and Definite co-occurred: Typo=EMPTY (732; 89%), Number=Sing (592; 72%), Case=Acc,Nom (475; 58%), Gender=Fem (442; 54%).
NOUN tokens may have the following values of Definite:
Def(300; 37% of non-emptyDefinite): femeia, femeile, fetele, bărbatul, bărbații, bărbaților, femeii, apa, sânii, DiavolulInd(521; 63% of non-emptyDefinite): femeie, femei, bărbat, PUPICI, fată, barbat, bărbați, fund, bani, feteEMPTY(23): BITCH, BRO, Femeia, MILFă, Schadenfreude, baby, butter, cauciuc, crop, decolteu
| Paradigm femeie | Ind | Def |
|---|---|---|
| Case=Acc,Nom|Number=Sing | femeie | femeia |
| Case=Acc,Nom|Number=Sing|Typo=Yes | femei | |
| Case=Acc,Nom|Number=Plur | femeile | |
| Case=Dat,Gen|Number=Sing | femeii | |
| Case=Dat,Gen|Number=Sing|Typo=Yes | femeii | |
| Case=Dat,Gen|Number=Plur | femeilor | |
| Number=Plur | femei |
ADJ
224 ADJ tokens (95% of all ADJ tokens) have a non-empty value of Definite.
The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (204; 91%), Typo=EMPTY (192; 86%), Number=Sing (159; 71%), Gender=Fem (134; 60%), Case=EMPTY (117; 52%).
ADJ tokens may have the following values of Definite:
Def(6; 3% of non-emptyDefinite): fosta, frumușico, grasele, propriilor, scurți, simplulInd(218; 97% of non-emptyDefinite): DULCI, frumoasă, frumoasa, bună, mare, misogini, FRUMOȘI, dulce, urâtă, atentEMPTY(11): așa, hot, sexy, SEXSY, bine, imens, mini, nesexy
| Paradigm scurt | Ind | Def |
|---|---|---|
| Case=Acc|Gender=Masc | scurți | |
| Gender=Fem | scurte |
Definite seems to be lexical feature of ADJ. 98% lemmas (132) occur only with one value of Definite.
PROPN
6 PROPN tokens (8% of all PROPN tokens) have a non-empty value of Definite.
PROPN tokens may have the following values of Definite:
Def(6; 100% of non-emptyDefinite): Doamne, Elenei, Ezada, Maica, Marea, SinnEMPTY(66): România, Mirela, Vaida, Irinel, Maria, @KlausIohannis, @Utilizator_x3, ALEXANDRA, Africa, Alex
DET
1 DET tokens (0% of all DET tokens) have a non-empty value of Definite.
The most frequent other feature values with which DET and Definite co-occurred: Case=Dat,Gen (1; 100%), Gender=EMPTY (1; 100%), Number=Sing (1; 100%), Number[psor]=EMPTY (1; 100%), Person=EMPTY (1; 100%), Position=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=Art (1; 100%).
DET tokens may have the following values of Definite:
Def(1; 100% of non-emptyDefinite): luiEMPTY(211): o, un, asta, mea, toate, multe, mulți, ta, ce, lor
NUM
1 NUM tokens (3% of all NUM tokens) have a non-empty value of Definite.
The most frequent other feature values with which NUM and Definite co-occurred: Gender=Fem (1; 100%), NumForm=Word (1; 100%), NumType=Ord (1; 100%), Number=Sing (1; 100%).
NUM tokens may have the following values of Definite:
Def(1; 100% of non-emptyDefinite): primaEMPTY(30): 10, 2, 3, 9, doi, 1, 112, 12, 12000, 2,5
Relations with Agreement in Definite
The 10 most frequent relations where parent and child node agree in Definite:
NOUN –[amod]–> ADJ (73; 62%),
NOUN –[nmod]–> NOUN (60; 59%),
NOUN –[conj]–> NOUN (35; 73%),
NOUN –[list]–> NOUN (15; 79%),
ADJ –[conj]–> ADJ (10; 83%),
NOUN –[parataxis]–> NOUN (9; 100%),
ADJ –[list]–> ADJ (8; 100%),
ADJ –[obl]–> NOUN (8; 80%),
ADJ –[conj]–> NOUN (5; 100%),
NOUN –[amod]–> NOUN (4; 100%).