Treebank Statistics: UD_Romanian-RRT: Features: Number
This feature is universal.
It occurs with 2 different values: Plur
, Sing
.
This is a layered feature with the following layers: Number, Number[psor].
111248 tokens (51%) have a non-empty value of Number
.
27650 types (88%) occur at least once with a non-empty value of Number
.
13242 lemmas (77%) occur at least once with a non-empty value of Number
.
The feature is used with 8 part-of-speech tags: NOUN (52271; 24% instances), VERB (15188; 7% instances), ADJ (14769; 7% instances), DET (11218; 5% instances), AUX (6951; 3% instances), NUM (5533; 3% instances), PRON (4996; 2% instances), PROPN (322; 0% instances).
NOUN
52271 NOUN tokens (96% of all NOUN
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NOUN
and Number
co-occurred: Gender=Fem (32471; 62%), Case=Acc,Nom (28805; 55%), Definite=Def (27198; 52%).
NOUN
tokens may have the following values of Number
:
Plur
(13678; 26% of non-emptyNumber
): ani, membre, statele, date, pacienții, informații, zile, ori, ore, condițiileSing
(38593; 74% of non-emptyNumber
): timp, cazul, conformitate, loc, timpul, mod, acord, Comisia, parte, bEMPTY
(1987): art., a., ianuarie, nr., CE, decembrie, b., mg, lit., alin.
Paradigm an | Sing | Plur |
---|---|---|
Case=Acc,Nom|Definite=Def | anul | anii |
Case=Dat,Gen|Definite=Def | anului | anilor |
Definite=Ind | an | ani |
VERB
15188 VERB tokens (66% of all VERB
tokens) have a non-empty value of Number
.
The most frequent other feature values with which VERB
and Number
co-occurred: Tense=EMPTY (7746; 51%), Mood=EMPTY (7634; 50%), Person=EMPTY (7634; 50%), VerbForm=Part (7634; 50%).
VERB
tokens may have the following values of Number
:
Plur
(4449; 29% of non-emptyNumber
): pot, prevăzute, au, luați, menționate, fac, stabilite, legate, sunt, avețiSing
(10739; 71% of non-emptyNumber
): poate, are, avut, avea, era, putea, face, făcut, este, spusEMPTY
(7811): trebuie, putea, există, trebui, având, avea, reprezintă, prezintă, face, aplică
Paradigm putea | Sing | Plur |
---|---|---|
Gender=Masc|VerbForm=Part | putut | |
Mood=Imp|Person=2|VerbForm=Fin | poți | |
Mood=Ind|Person=1|Tense=Past|VerbForm=Fin | puturăm | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | pot | putem |
Mood=Ind|Person=2|Tense=Imp|VerbForm=Fin | puteai | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | poți | puteți |
Mood=Ind|Person=2|VerbForm=Fin | Poți | |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | putea | puteau |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | putu | putură |
Mood=Ind|Person=3|Tense=Pqp|VerbForm=Fin | putuse | |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | poate | pot |
ADJ
14769 ADJ tokens (97% of all ADJ
tokens) have a non-empty value of Number
.
The most frequent other feature values with which ADJ
and Number
co-occurred: Degree=Pos (14729; 100%), Definite=Ind (13875; 94%), Case=EMPTY (9157; 62%), Gender=Fem (8967; 61%).
ADJ
tokens may have the following values of Number
:
Plur
(4949; 34% of non-emptyNumber
): necesare, mari, mici, chimice, diferite, disponibile, specifice, suplimentare, contractante, noiSing
(9820; 66% of non-emptyNumber
): mare, prezentul, nou, prezenta, europene, europeană, european, prezentului, mică, generalEMPTY
(529): asemenea, standard, corespunzătoare, următoare, referitoare, anume, viitoare, așa, n., asemănătoare
Paradigm mare | Sing | Plur |
---|---|---|
Case=Acc,Nom|Definite=Def|Gender=Masc | marele | marii |
Case=Acc,Nom|Definite=Def|Gender=Fem | marea | marile |
Case=Dat,Gen|Definite=Def | marilor | |
Case=Dat,Gen|Definite=Def|Gender=Masc | marelui | |
Case=Dat,Gen|Definite=Def|Gender=Fem | Marii | |
Case=Dat,Gen|Definite=Ind|Gender=Fem | mari | |
Definite=Ind | mare | mari |
DET
11218 DET tokens (93% of all DET
tokens) have a non-empty value of Number
.
The most frequent other feature values with which DET
and Number
co-occurred: Position=EMPTY (9514; 85%), Person=EMPTY (8171; 73%), Poss=EMPTY (7887; 70%), Gender=Fem (6252; 56%), Case=Acc,Nom (6167; 55%), PronType=Ind (5741; 51%).
DET
tokens may have the following values of Number
:
Plur
(2173; 19% of non-emptyNumber
): ale, toate, unor, aceste, cele, alte, multe, ai, câteva, anumiteSing
(9045; 81% of non-emptyNumber
): o, un, a, al, lui, unei, unui, acest, cel, aceastăEMPTY
(807): lui, lor, orice, ei, niște, ce, oarecare, care, -i, cutare
Paradigm un | Sing | Plur |
---|---|---|
Case=Acc,Nom|Gender=Masc|Person=3|Position=Prenom | unii | |
Case=Acc,Nom|Gender=Masc | un, -un | |
Case=Acc,Nom|Gender=Fem|Person=3|Position=Prenom | unele | |
Case=Acc,Nom|Gender=Fem | o, -o | |
Case=Dat,Gen|Gender=Masc | unui | |
Case=Dat,Gen|Gender=Fem | unei | |
Case=Dat,Gen | unor |
AUX
6951 AUX tokens (81% of all AUX
tokens) have a non-empty value of Number
.
The most frequent other feature values with which AUX
and Number
co-occurred: Person=3 (6070; 87%), Tense=EMPTY (4382; 63%), Mood=EMPTY (4380; 63%), VerbForm=EMPTY (3748; 54%).
AUX
tokens may have the following values of Number
:
Plur
(1824; 26% of non-emptyNumber
): au, sunt, vor, erau, ați, vom, veți, sunteți, -au, suntemSing
(5127; 74% of non-emptyNumber
): a, este, fost, era, va, e, ai, fusese, -a, aveaEMPTY
(1614): fi, ar, am, fie, fiind, eram, nefiind, -ar, fiindu, -am
Paradigm fi | Sing | Plur |
---|---|---|
Gender=Masc|VerbForm=Part | fost | |
Mood=Imp|Person=2|VerbForm=Fin | fi, fii | |
Mood=Ind|Person=1|Tense=Pres|VerbForm=Fin | sunt | suntem |
Mood=Ind|Person=2|Tense=Imp|VerbForm=Fin | erai | |
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | ești | sunteți |
Mood=Ind|Person=3|Tense=Imp|VerbForm=Fin | era | erau |
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | fu | fură |
Mood=Ind|Person=3|Tense=Pqp|VerbForm=Fin | fusese | fuseseră |
Mood=Ind|Person=3|Tense=Pres|Variant=Short|VerbForm=Fin | -i, E- | -s |
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | este, e, Sunt, îi | sunt |
Mood=Sub|Person=1|Tense=Pres|VerbForm=Fin | fiu | fim |
Mood=Sub|Person=2|Tense=Pres|VerbForm=Fin | fii | fiți |
VerbForm=Part | este |
NUM
5533 NUM tokens (100% of all NUM
tokens) have a non-empty value of Number
.
The most frequent other feature values with which NUM
and Number
co-occurred: NumType=Card (4812; 87%), Gender=EMPTY (4593; 83%), NumForm=Digit (3796; 69%).
NUM
tokens may have the following values of Number
:
Plur
(1114; 20% of non-emptyNumber
): două, trei, doi, patru, cinci, primele, milioane, șase, opt, ambeleSing
(4419; 80% of non-emptyNumber
): 1, 2, 3, 4, 5, 6, primul, 7, 8, iEMPTY
(16): dintâi, întâi, ý10, ý15, ý5
Paradigm doi | Sing | Plur |
---|---|---|
Foreign=Yes|NumForm=Roman|NumType=Ord | II | |
Gender=Masc|NumForm=Word|NumType=Card | doi | |
Gender=Masc|NumForm=Word|NumType=Ord | doilea, secund | |
Gender=Fem|NumForm=Word|NumType=Card | două | |
Gender=Fem|NumForm=Word|NumType=Ord | doua | |
NumForm=Roman|NumType=Ord | ii |
Number
seems to be lexical feature of NUM
. 97% lemmas (910) occur only with one value of Number
.
PRON
4996 PRON tokens (42% of all PRON
tokens) have a non-empty value of Number
.
The most frequent other feature values with which PRON
and Number
co-occurred: Reflex=EMPTY (4996; 100%), Variant=EMPTY (4048; 81%), Person=3 (3929; 79%), PronType=Prs (3184; 64%).
PRON
tokens may have the following values of Number
:
Plur
(1486; 30% of non-emptyNumber
): le, ne, vă, acestea, ei, ele, toate, cele, noi, ceiSing
(3510; 70% of non-emptyNumber
): el, o, -l, îl, ea, îi, -i, i, ceea, măEMPTY
(6811): se, care, ce, s-, își, -și, și-, -se, dumneavoastră, cine
Paradigm el | Sing | Plur |
---|---|---|
Case=Acc,Nom|Gender=Masc|Strength=Strong | el | ei |
Case=Acc,Nom|Gender=Fem|Strength=Strong | ea | ele |
Case=Acc|Gender=Masc|Strength=Weak | îl | îi, i |
Case=Acc|Gender=Masc|Strength=Weak|Variant=Short | -l, l-, l | -i, i- |
Case=Acc|Gender=Fem|Strength=Weak | o | le |
Case=Acc|Gender=Fem|Strength=Weak|Variant=Short | -o | le-, -le |
Case=Dat,Gen|Gender=Masc|Strength=Strong | lui | |
Case=Dat,Gen|Gender=Fem|Strength=Strong | ei | |
Case=Dat,Gen|Strength=Strong | lor | |
Case=Dat,Gen|Strength=Weak|Variant=Short | i- | |
Case=Dat|Gender=Masc|Strength=Weak|Variant=Short | -i | |
Case=Dat|Strength=Weak | îi, i | le, li |
Case=Dat|Strength=Weak|Variant=Short | -i, i- | le-, -le, -li |
PROPN
322 PROPN tokens (5% of all PROPN
tokens) have a non-empty value of Number
.
PROPN
tokens may have the following values of Number
:
Plur
(15; 5% of non-emptyNumber
): Carpaților, Iașilor, Iașii, Carpații, Nibelungilor, SubcarpațiiSing
(307; 95% of non-emptyNumber
): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, GermanieiEMPTY
(5563): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
Number
seems to be lexical feature of PROPN
. 100% lemmas (104) occur only with one value of Number
.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number
:
NOUN –[amod]–> ADJ (11857; 96%),
NOUN –[nmod]–> NOUN (9963; 59%),
NOUN –[det]–> DET (8599; 83%),
VERB –[nsubj]–> NOUN (3309; 59%),
NOUN –[acl]–> VERB (2816; 67%),
NOUN –[conj]–> NOUN (2727; 81%),
VERB –[aux]–> AUX (2497; 59%),
NOUN –[nummod]–> NUM (1929; 53%),
VERB –[conj]–> VERB (1381; 70%),
VERB –[nsubj:pass]–> NOUN (1230; 72%).