Treebank Statistics: UD_Romanian-SiMoNERo: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
80540 tokens (55%) have a non-empty value of Number
.
16119 types (90%) occur at least once with a non-empty value of Number
.
8598 lemmas (81%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (39911; 27% instances), ADJ (16869; 12% instances), DET (7248; 5% instances), VERB (6500; 4% instances), NUM (4603; 3% instances), AUX (4115; 3% instances), PRON (1281; 1% instances), PROPN (13; 0% instances).
NOUN
39911 NOUN tokens (93% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Fem (24915; 62%), Definite=Def (21800; 55%), Case=Nom (20629; 52%).
NOUN
tokens may have the following values of Number
:
Plur
(10655; 27% of non-emptyNumber
): pacienții, pacienți, ani, cazuri, pacienților, vârstnici, studii, ori, celulele, celuleSing
(29256; 73% of non-emptyNumber
): nivelul, diabet, risc, cazul, insulină, tip, creșterea, tratamentul, tratament, vârstaEMPTY
(2787): mg, IC, vs, HTA, TA, DZ, FA, AVC, dl, EI
Paradigm pacient | Sing | Plur |
---|---|---|
Case=Gen|Definite=Def | pacientului | pacienților |
Case=Nom|Definite=Def | pacientul | pacienții, pacientii |
Definite=Ind | pacient | pacienți, pacienții |
ADJ
16869 ADJ tokens (99% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (16830; 100%), Definite=Ind (16640; 99%), Gender=Fem (10770; 64%), Case=EMPTY (10041; 60%).
ADJ
tokens may have the following values of Number
:
Plur
(5296; 31% of non-emptyNumber
): vârstnici, mici, clinice, mari, adverse, crescute, diferite, frecvente, cardiovasculare, importanteSing
(11573; 69% of non-emptyNumber
): mare, crescut, zaharat, cardiacă, important, renală, cronică, clinic, severă, chirurgicalăEMPTY
(183): precoce, standard, referitoare, asemănătoare, online, postpartum, pre-test, AV, costisitoare, viitoare
Paradigm mare | Sing | Plur |
---|---|---|
Case=Gen|Definite=Def|Gender=Fem | marii | |
Case=Gen|Definite=Ind|Gender=Fem | mari | |
Case=Nom|Definite=Def|Gender=Masc | Marele | |
Case=Nom|Definite=Def|Gender=Fem | marea | marile |
Case=Nom|Definite=Ind|Gender=Fem | mare | |
Definite=Ind | mari | |
Definite=Ind|Gender=Masc | mare |
DET
7248 DET tokens (98% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Position=EMPTY (6078; 84%), Person=EMPTY (5646; 78%), Gender=Fem (4410; 61%), Poss=EMPTY (4237; 58%).
DET
tokens may have the following values of Number
:
Plur
(1590; 22% of non-emptyNumber
): ale, unor, cele, aceste, alte, multe, ai, acestor, toate, aceștiSing
(5658; 78% of non-emptyNumber
): a, o, un, al, unui, acest, această, unei, cel, ceaEMPTY
(176): lor, orice, ei, lui, care, niște, oarecare, ce, oricare
Paradigm al | Sing | Plur |
---|---|---|
Gender=Masc | al | ai |
Gender=Fem | a | ale |
VERB
6500 VERB tokens (64% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Tense=EMPTY (3901; 60%), Mood=EMPTY (3890; 60%), Person=EMPTY (3889; 60%), VerbForm=Part (3889; 60%).
VERB
tokens may have the following values of Number
:
Plur
(2096; 32% of non-emptyNumber
): pot, au, legate, apar, tratați, asociate, fac, diagnosticate, cresc, efectuateSing
(4404; 68% of non-emptyNumber
): poate, are, arătat, privește, crește, demonstrat, produce, face, asociată, rămâneEMPTY
(3708): trebuie, există, reprezintă, prezintă, putea, determină, având, asociază, necesită, sugerează
Paradigm putea | Sing | Plur |
---|---|---|
Gender=Masc|VerbForm=Part | putut | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | pot | putem |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | puteți | |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | putea | puteau |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | poate | pot |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | putem |
NUM
4603 NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (4161; 90%), NumForm=Digit (3890; 85%).
NUM
tokens may have the following values of Number
:
Plur
(550; 12% of non-emptyNumber
): două, trei, ambele, primele, patru, cinci, ultimii, șase, doi, ultimeleSing
(4053; 88% of non-emptyNumber
): 2, 1, 3, 4, 5, 30, 10, 20, 6, 15EMPTY
(2): II-2, III-IV
Paradigm 5 | Sing | Plur |
---|---|---|
5 | 5- |
Number
seems to be lexical feature of NUM
. 100% lemmas (915) occur only with one value of Number
.
AUX
4115 AUX tokens (83% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Person=3 (3694; 90%), VerbForm=Fin (2814; 68%), Mood=EMPTY (2090; 51%), Tense=EMPTY (2090; 51%).
AUX
tokens may have the following values of Number
:
Plur
(1414; 34% of non-emptyNumber
): au, sunt, vor, erau, veți, vom, ați, suntem, sunteți, aSing
(2701; 66% of non-emptyNumber
): este, a, fost, va, era, e, Aș, esti, fii, iEMPTY
(843): fi, fiind, ar, fie, am, nefiind, putea, fie
Paradigm fi | Sing | Plur |
---|---|---|
Gender=Masc|VerbForm=Part | fost | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | sunt | suntem |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | esti | sunteți |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | erau |
Mood=Ind|Person=3|Tense=Pres|Variant=Short|VerbForm=Fin | i | |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | este, e | sunt |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | fim | |
Mood=Sub|Person=2|Tense=Pres|VerbForm=Fin | fii | |
Person=3|VerbForm=Fin | sunt |
PRON
1281 PRON tokens (30% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (1281; 100%), Person=3 (1238; 97%), Strength=EMPTY (978; 76%), Case=Nom (899; 70%), PronType=Dem (770; 60%), Gender=Fem (669; 52%).
PRON
tokens may have the following values of Number
:
Plur
(589; 46% of non-emptyNumber
): acestea, cei, cele, acestora, ele, aceștia, celor, toate, le, liSing
(692; 54% of non-emptyNumber
): ceea, cea, aceasta, cel, aceea, acesta, unul, el, o, acestuiaEMPTY
(2922): care, se, ce, s-, își, și-, sine, oricare, ce-, ceea-ce
Paradigm care | Sing | Plur |
---|---|---|
Gender=Masc | căruia | |
Gender=Fem | căreia | |
cărora |
PROPN
13 PROPN tokens (2% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Sing
(13; 100% of non-emptyNumber
): Americii, Americă, Asiei, Europei, Franței, Greciei, RomânieiEMPTY
(704): Graves-Basedow, Doppler, Rubino, Europa, România, Langerhans, Paulescu, Pendred, Britanie, Esnaola
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (13716; 96%),
NOUN –[nmod]–> NOUN (9481; 56%),
NOUN –[det]–> DET (5452; 80%),
NOUN –[conj]–> NOUN (2988; 70%),
VERB –[nsubj]–> NOUN (1677; 52%),
NOUN –[acl]–> VERB (1429; 59%),
VERB –[aux]–> AUX (949; 56%),
VERB –[nsubj:pass]–> NOUN (887; 73%),
ADJ –[cop]–> AUX (716; 83%),
ADJ –[conj]–> ADJ (645; 95%).