home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Romanian-RRT: Features: Definite

This feature is universal. It occurs with 2 different values: Def, Ind.

69044 tokens (32%) have a non-empty value of Definite. 22187 types (70%) occur at least once with a non-empty value of Definite. 10788 lemmas (62%) occur at least once with a non-empty value of Definite. The feature is used with 5 part-of-speech tags: NOUN (52840; 24% instances), ADJ (15006; 7% instances), NUM (450; 0% instances), DET (426; 0% instances), PROPN (322; 0% instances).

NOUN

52840 NOUN tokens (97% of all NOUN tokens) have a non-empty value of Definite.

The most frequent other feature values with which NOUN and Definite co-occurred: Number=Sing (38509; 73%), Gender=Fem (32516; 62%), Case=Acc,Nom (28809; 55%).

NOUN tokens may have the following values of Definite:

Paradigm anIndDef
Case=Acc,Nom|Number=Singanul
Case=Acc,Nom|Number=Pluranii
Case=Dat,Gen|Number=Singanului
Case=Dat,Gen|Number=Pluranilor
Number=Singan
Number=Plurani

ADJ

15006 ADJ tokens (98% of all ADJ tokens) have a non-empty value of Definite.

The most frequent other feature values with which ADJ and Definite co-occurred: Degree=Pos (14966; 100%), Number=Sing (9813; 65%), Case=EMPTY (9402; 63%), Gender=Fem (9199; 61%).

ADJ tokens may have the following values of Definite:

Paradigm mareIndDef
Case=Acc,Nom|Gender=Masc|Number=Singmarele
Case=Acc,Nom|Gender=Masc|Number=Plurmarii
Case=Acc,Nom|Gender=Fem|Number=Singmarea
Case=Acc,Nom|Gender=Fem|Number=Plurmarile
Case=Dat,Gen|Gender=Masc|Number=Singmarelui
Case=Dat,Gen|Gender=Fem|Number=SingmariMarii
Case=Dat,Gen|Number=Plurmarilor
Number=Singmare
Number=Plurmari, Mare

Definite seems to be lexical feature of ADJ. 96% lemmas (3271) occur only with one value of Definite.

NUM

450 NUM tokens (8% of all NUM tokens) have a non-empty value of Definite.

The most frequent other feature values with which NUM and Definite co-occurred: NumForm=Word (450; 100%), NumType=Ord (333; 74%), Gender=Fem (280; 62%), Number=Sing (257; 57%).

NUM tokens may have the following values of Definite:

Paradigm primIndDef
Case=Acc,Nom|Gender=Masc|Number=Singprimul
Case=Acc,Nom|Gender=Masc|Number=Plurprimii
Case=Acc,Nom|Gender=Fem|Number=Singprimăprima
Case=Acc,Nom|Gender=Fem|Number=Plurprimele
Case=Dat,Gen|Gender=Masc|Number=Singprimului
Case=Dat,Gen|Gender=Masc|Number=Plurprimilor
Case=Dat,Gen|Gender=Fem|Number=Singprimeprimei
Case=Dat,Gen|Gender=Fem|Number=Plurprimelor
Gender=Masc|Number=Singprim
Gender=Masc|Number=Sing|Variant=Shortprim-

DET

426 DET tokens (4% of all DET tokens) have a non-empty value of Definite.

The most frequent other feature values with which DET and Definite co-occurred: Person=EMPTY (426; 100%), Position=EMPTY (426; 100%), Poss=EMPTY (426; 100%), PronType=Art (426; 100%), Number=Sing (422; 99%), Case=Dat,Gen (341; 80%), Gender=EMPTY (320; 75%).

DET tokens may have the following values of Definite:

PROPN

322 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Definite.

PROPN tokens may have the following values of Definite:

Paradigm IugoslaviaIndDef
Case=Acc,NomIugoslavie
Case=Dat,GenIugoslaviei

Definite seems to be lexical feature of PROPN. 98% lemmas (102) occur only with one value of Definite.

Relations with Agreement in Definite

The 10 most frequent relations where parent and child node agree in Definite: NOUN –[nmod]–> NOUN (9585; 56%), NOUN –[conj]–> NOUN (3046; 89%), ADJ –[conj]–> ADJ (713; 100%), NOUN –[appos]–> NOUN (279; 51%), ADJ –[conj]–> NOUN (86; 80%), NOUN –[conj]–> ADJ (32; 73%), ADJ –[amod]–> ADJ (28; 67%), NOUN –[acl]–> NOUN (27; 55%), ADJ –[advmod]–> ADJ (25; 96%), ADJ –[xcomp]–> NOUN (24; 73%).