Treebank Statistics: UD_Romanian-RRT: Features: Number
This feature is universal.
It occurs with 2 different values: Plur, Sing.
This is a layered feature with the following layers: Number, Number[psor].
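A layered feature appears in the CoNLL-U FEATS column with the layer name in square brackets, so `Number` and `Number[psor]` can sit side by side on one token. The following is a minimal sketch of how the layers could be separated; the helper names and the example FEATS string are hypothetical and are not taken from the treebank or its tooling.

```python
import re
from typing import Dict, Optional, Tuple


def parse_feats(feats: str) -> Dict[str, str]:
    """Parse a CoNLL-U FEATS string such as 'Case=Acc|Number=Sing' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def split_layer(name: str) -> Tuple[str, Optional[str]]:
    """Split 'Number[psor]' into ('Number', 'psor'); plain names get layer None."""
    match = re.fullmatch(r"(\w+)\[(\w+)\]", name)
    if match:
        return match.group(1), match.group(2)
    return name, None


def number_layers(feats: str) -> Dict[Optional[str], str]:
    """Return the Number value found on each layer of a FEATS string."""
    layers: Dict[Optional[str], str] = {}
    for name, value in parse_feats(feats).items():
        base, layer = split_layer(name)
        if base == "Number":
            layers[layer] = value
    return layers


# Hypothetical FEATS string, used for illustration only.
example = "Gender=Masc|Number=Sing|Number[psor]=Plur|Person=1|Poss=Yes"
print(number_layers(example))  # {None: 'Sing', 'psor': 'Plur'}
```

The key `None` stands for the default layer (plain `Number`), which is the layer the statistics on this page describe.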
111244 tokens (51%) have a non-empty value of Number.
27648 types (88%) occur at least once with a non-empty value of Number.
13242 lemmas (77%) occur at least once with a non-empty value of Number.
The feature is used with 8 part-of-speech tags: NOUN (52269; 24% of all tokens), VERB (15181; 7% of all tokens), ADJ (14768; 7% of all tokens), DET (11217; 5% of all tokens), AUX (6958; 3% of all tokens), NUM (5533; 3% of all tokens), PRON (4996; 2% of all tokens), PROPN (322; 0% of all tokens).
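Counts of this kind can be reproduced from a CoNLL-U file by checking the FEATS column (column 6) of every word line. The sketch below is one possible way to do it, not the script that generated this page; `SAMPLE` is a toy fragment with invented annotation, and `number_by_upos` is a hypothetical helper name.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical annotation, for illustration only).
SAMPLE = "\n".join([
    "# text = anii pot .",
    "1\tanii\tan\tNOUN\t_\tCase=Acc,Nom|Definite=Def|Gender=Masc|Number=Plur\t2\tnsubj\t_\t_",
    "2\tpot\tputea\tVERB\t_\tMood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin\t0\troot\t_\t_",
    "3\t.\t.\tPUNCT\t_\t_\t2\tpunct\t_\t_",
])


def number_by_upos(conllu: str):
    """Return (total word count, Counter of UPOS tags whose tokens carry Number)."""
    total = 0
    with_number = Counter()
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword token ranges (1-2) and empty nodes (1.1)
        total += 1
        feats = cols[5]
        # Match only the plain layer, not Number[psor].
        if feats != "_" and any(f.startswith("Number=") for f in feats.split("|")):
            with_number[cols[3]] += 1
    return total, with_number


total, with_number = number_by_upos(SAMPLE)
for upos, count in with_number.most_common():
    print(upos, count, f"{round(100 * count / total)}% of all tokens")
```

Dividing by the total word count matches the way the percentages above add up to the overall 51% share.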
NOUN
52269 NOUN tokens (96% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Gender=Fem (32471; 62%), Case=Acc,Nom (28805; 55%), Definite=Def (27197; 52%).
NOUN tokens may have the following values of Number:
Plur (13678; 26% of non-empty Number): ani, membre, statele, date, pacienții, informații, zile, ori, ore, condițiile
Sing (38591; 74% of non-empty Number): timp, cazul, conformitate, loc, timpul, mod, acord, Comisia, parte, b
EMPTY (1987): art., a., ianuarie, nr., CE, decembrie, b., mg, lit., alin.
| Paradigm an | Sing | Plur |
|---|---|---|
| Case=Acc,Nom\|Definite=Def | anul | anii |
| Case=Dat,Gen\|Definite=Def | anului | anilor |
| Definite=Ind | an | ani |
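A paradigm table like the one above can be thought of as a grouping of a lemma's word forms by their remaining features, with one column per Number value. The sketch below illustrates that idea using the six forms of *an* listed in the table; the `paradigm` helper is hypothetical, and how the page's generator chooses which features to display is not stated here.

```python
from collections import defaultdict

# (form, lemma, features) triples, copied from the paradigm table of "an".
ROWS = [
    ("anul", "an", {"Case": "Acc,Nom", "Definite": "Def", "Number": "Sing"}),
    ("anii", "an", {"Case": "Acc,Nom", "Definite": "Def", "Number": "Plur"}),
    ("anului", "an", {"Case": "Dat,Gen", "Definite": "Def", "Number": "Sing"}),
    ("anilor", "an", {"Case": "Dat,Gen", "Definite": "Def", "Number": "Plur"}),
    ("an", "an", {"Definite": "Ind", "Number": "Sing"}),
    ("ani", "an", {"Definite": "Ind", "Number": "Plur"}),
]


def paradigm(rows, lemma):
    """Group forms of one lemma by their non-Number features, split by Number."""
    table = defaultdict(lambda: defaultdict(set))
    for form, row_lemma, feats in rows:
        if row_lemma != lemma or "Number" not in feats:
            continue
        # Row label: all features except Number, in alphabetical order.
        rest = "|".join(f"{k}={v}" for k, v in sorted(feats.items()) if k != "Number")
        table[rest][feats["Number"]].add(form)
    return {label: {num: sorted(forms) for num, forms in cols.items()}
            for label, cols in table.items()}


for label, cols in paradigm(ROWS, "an").items():
    print(label, cols.get("Sing", []), cols.get("Plur", []))
```

Rows where one of the two lists is empty correspond to the blank cells in the tables on this page.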
VERB
15181 VERB tokens (66% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: Tense=EMPTY (7745; 51%), Mood=EMPTY (7633; 50%), Person=EMPTY (7632; 50%), VerbForm=Part (7632; 50%).
VERB tokens may have the following values of Number:
Plur (4448; 29% of non-empty Number): pot, prevăzute, au, luați, menționate, fac, stabilite, legate, sunt, aveți
Sing (10733; 71% of non-empty Number): poate, are, avut, avea, era, putea, face, făcut, este, spus
EMPTY (7809): trebuie, putea, există, trebui, având, avea, reprezintă, prezintă, face, aplică
| Paradigm putea | Sing | Plur |
|---|---|---|
| Gender=Masc\|VerbForm=Part | putut | |
| Mood=Imp\|Person=2\|VerbForm=Fin | poți | |
| Mood=Ind\|Person=1\|Tense=Past\|VerbForm=Fin | puturăm | |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | pot | putem |
| Mood=Ind\|Person=2\|Tense=Imp\|VerbForm=Fin | puteai | |
| Mood=Ind\|Person=2\|Tense=Pres\|VerbForm=Fin | poți | puteți |
| Mood=Ind\|Person=2\|VerbForm=Fin | Poți | |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | putea | puteau |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | putu | putură |
| Mood=Ind\|Person=3\|Tense=Pqp\|VerbForm=Fin | putuse | |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | poate | pot |
ADJ
14768 ADJ tokens (97% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Degree=Pos (14728; 100%), Definite=Ind (13873; 94%), Case=EMPTY (9155; 62%), Gender=Fem (8968; 61%).
ADJ tokens may have the following values of Number:
Plur (4950; 34% of non-empty Number): necesare, mari, mici, chimice, diferite, disponibile, specifice, suplimentare, contractante, noi
Sing (9818; 66% of non-empty Number): mare, prezentul, nou, prezenta, europene, europeană, european, prezentului, mică, general
EMPTY (529): asemenea, standard, corespunzătoare, următoare, referitoare, anume, viitoare, așa, n., asemănătoare
| Paradigm mare | Sing | Plur |
|---|---|---|
| Case=Acc,Nom\|Definite=Def\|Gender=Masc | marele | marii |
| Case=Acc,Nom\|Definite=Def\|Gender=Fem | marea | marile |
| Case=Dat,Gen\|Definite=Def | marilor | |
| Case=Dat,Gen\|Definite=Def\|Gender=Masc | marelui | |
| Case=Dat,Gen\|Definite=Def\|Gender=Fem | Marii | |
| Case=Dat,Gen\|Definite=Ind\|Gender=Fem | mari | |
| Definite=Ind | mare | mari |
DET
11217 DET tokens (93% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Position=EMPTY (9514; 85%), Person=EMPTY (8171; 73%), Poss=EMPTY (7886; 70%), Gender=Fem (6251; 56%), Case=Acc,Nom (6166; 55%), PronType=Ind (5740; 51%).
DET tokens may have the following values of Number:
Plur (2173; 19% of non-empty Number): ale, toate, unor, aceste, cele, alte, multe, ai, câteva, anumite
Sing (9044; 81% of non-empty Number): o, un, a, al, lui, unei, unui, acest, cel, această
EMPTY (807): lui, lor, orice, ei, niște, ce, oarecare, care, -i, cutare
| Paradigm un | Sing | Plur |
|---|---|---|
| Case=Acc,Nom\|ExtPos=ADV\|Gender=Masc | un | |
| Case=Acc,Nom\|ExtPos=ADV\|Gender=Fem | o | |
| Case=Acc,Nom\|Gender=Masc\|Person=3\|Position=Prenom | unii | |
| Case=Acc,Nom\|Gender=Masc | un, -un | |
| Case=Acc,Nom\|Gender=Fem\|Person=3\|Position=Prenom | unele | |
| Case=Acc,Nom\|Gender=Fem | o, -o | |
| Case=Dat,Gen\|Gender=Masc | unui | |
| Case=Dat,Gen\|Gender=Fem | unei | |
| Case=Dat,Gen | | unor |
AUX
6958 AUX tokens (81% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: Person=3 (6076; 87%), Tense=EMPTY (4382; 63%), Mood=EMPTY (4380; 63%), VerbForm=EMPTY (3747; 54%).
AUX tokens may have the following values of Number:
Plur (1824; 26% of non-empty Number): au, sunt, vor, erau, ați, vom, veți, sunteți, -au, suntem
Sing (5134; 74% of non-empty Number): a, este, fost, era, va, e, ai, fusese, -a, avea
EMPTY (1616): fi, ar, am, fie, fiind, eram, nefiind, -ar, fiindu, -am
| Paradigm fi | Sing | Plur |
|---|---|---|
| Gender=Masc\|VerbForm=Part | fost | |
| Mood=Imp\|Person=2\|VerbForm=Fin | fi, fii | |
| Mood=Ind\|Person=1\|Tense=Pres\|VerbForm=Fin | sunt | suntem |
| Mood=Ind\|Person=2\|Tense=Imp\|VerbForm=Fin | erai | |
| Mood=Ind\|Person=2\|Tense=Pres\|VerbForm=Fin | ești | sunteți |
| Mood=Ind\|Person=3\|Tense=Imp\|VerbForm=Fin | era | erau |
| Mood=Ind\|Person=3\|Tense=Past\|VerbForm=Fin | fu | fură |
| Mood=Ind\|Person=3\|Tense=Pqp\|VerbForm=Fin | fusese | fuseseră |
| Mood=Ind\|Person=3\|Tense=Pres\|Variant=Short\|VerbForm=Fin | -i, E- | -s |
| Mood=Ind\|Person=3\|Tense=Pres\|VerbForm=Fin | este, e, Sunt, îi | sunt |
| Mood=Sub\|Person=1\|Tense=Pres\|VerbForm=Fin | fiu | fim |
| Mood=Sub\|Person=2\|Tense=Pres\|VerbForm=Fin | fii | fiți |
| VerbForm=Part | este | |
NUM
5533 NUM tokens (100% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (4812; 87%), Gender=EMPTY (4593; 83%), NumForm=Digit (3796; 69%).
NUM tokens may have the following values of Number:
Plur (1114; 20% of non-empty Number): două, trei, doi, patru, cinci, primele, milioane, șase, opt, ambele
Sing (4419; 80% of non-empty Number): 1, 2, 3, 4, 5, 6, primul, 7, 8, i
EMPTY (16): dintâi, întâi, ý10, ý15, ý5
| Paradigm doi | Sing | Plur |
|---|---|---|
| Foreign=Yes\|NumForm=Roman\|NumType=Ord | II | |
| Gender=Masc\|NumForm=Word\|NumType=Card | doi | |
| Gender=Masc\|NumForm=Word\|NumType=Ord | doilea, secund | |
| Gender=Fem\|NumForm=Word\|NumType=Card | două | |
| Gender=Fem\|NumForm=Word\|NumType=Ord | doua | |
| NumForm=Roman\|NumType=Ord | ii | |
Number seems to be a lexical feature of NUM: 97% of lemmas (910) occur with only one value of Number.
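The "lexical feature" observation rests on a simple test: collect the set of Number values seen with each lemma and count how many lemmas have exactly one. A minimal sketch of that test follows; `single_value_share` is a hypothetical helper, and the lemma/value pairs are toy data (the lemma `prim` for *primul*/*primele* is an assumption, not read from the treebank).

```python
from collections import defaultdict


def single_value_share(pairs):
    """pairs: iterable of (lemma, number_value).

    Returns (lemmas with exactly one Number value, all lemmas seen).
    """
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Toy observations; the lemma assignments are assumptions for illustration.
PAIRS = [
    ("trei", "Plur"), ("trei", "Plur"),
    ("patru", "Plur"),
    ("prim", "Sing"), ("prim", "Plur"),
]

single, total = single_value_share(PAIRS)
print(f"{single} of {total} lemmas occur with only one value of Number")
```

A share close to 100% suggests the value is fixed per lemma rather than chosen by inflection.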
PRON
4996 PRON tokens (42% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (4996; 100%), Variant=EMPTY (4048; 81%), Person=3 (3929; 79%), PronType=Prs (3184; 64%).
PRON tokens may have the following values of Number:
Plur (1486; 30% of non-empty Number): le, ne, vă, acestea, ei, ele, toate, cele, noi, cei
Sing (3510; 70% of non-empty Number): el, o, -l, îl, ea, îi, -i, i, ceea, mă
EMPTY (6812): se, care, ce, s-, își, -și, și-, -se, dumneavoastră, cine
| Paradigm el | Sing | Plur |
|---|---|---|
| Case=Acc,Nom\|Gender=Masc\|Strength=Strong | el | ei |
| Case=Acc,Nom\|Gender=Fem\|Strength=Strong | ea | ele |
| Case=Acc\|Gender=Masc\|Strength=Weak | îl | îi, i |
| Case=Acc\|Gender=Masc\|Strength=Weak\|Variant=Short | -l, l-, l | -i, i- |
| Case=Acc\|Gender=Fem\|Strength=Weak | o | le |
| Case=Acc\|Gender=Fem\|Strength=Weak\|Variant=Short | -o | le-, -le |
| Case=Dat,Gen\|Gender=Masc\|Strength=Strong | lui | |
| Case=Dat,Gen\|Gender=Fem\|Strength=Strong | ei | |
| Case=Dat,Gen\|Strength=Strong | lor | |
| Case=Dat,Gen\|Strength=Weak\|Variant=Short | i- | |
| Case=Dat\|Gender=Masc\|Strength=Weak\|Variant=Short | -i | |
| Case=Dat\|Strength=Weak | îi, i | le, li |
| Case=Dat\|Strength=Weak\|Variant=Short | -i, i- | le-, -le, -li |
PROPN
322 PROPN tokens (5% of all PROPN tokens) have a non-empty value of Number.
PROPN tokens may have the following values of Number:
Plur (15; 5% of non-empty Number): Carpaților, Iașilor, Iașii, Carpații, Nibelungilor, Subcarpații
Sing (307; 95% of non-empty Number): României, Moldovei, Dunării, Europei, Franței, Italiei, Norvegiei, Rusiei, Ungariei, Germaniei
EMPTY (5563): România, Winston, București, Timișoara, Iași, Ion, Paris, Alexandru, O’Brien, Moldova
Number seems to be a lexical feature of PROPN: 100% of lemmas (104) occur with only one value of Number.
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (11967; 96%),
NOUN –[nmod]–> NOUN (10055; 59%),
NOUN –[det]–> DET (8617; 83%),
VERB –[nsubj]–> NOUN (3327; 59%),
NOUN –[acl]–> VERB (2819; 67%),
NOUN –[conj]–> NOUN (2729; 81%),
VERB –[aux]–> AUX (2497; 59%),
NOUN –[nummod]–> NUM (1932; 53%),
VERB –[conj]–> VERB (1383; 70%),
VERB –[nsubj:pass]–> NOUN (1230; 72%).
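Agreement counts of this kind can be gathered by walking every dependency edge and comparing the Number values of parent and child. The sketch below assumes that an edge is counted only when both nodes carry a non-empty Number, which this page does not state explicitly; the function name and the toy sentence (including its annotation) are hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """sentence: list of dicts with keys id, head, upos, deprel, number.

    Returns two Counters keyed by (parent UPOS, deprel, child UPOS):
    edges where Number agrees, and all edges where both nodes have Number.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, total = Counter(), Counter()
    for tok in sentence:
        parent = by_id.get(tok["head"])  # None for the root (head 0)
        if parent is None or tok["number"] is None or parent["number"] is None:
            continue
        key = (parent["upos"], tok["deprel"], tok["upos"])
        total[key] += 1
        if tok["number"] == parent["number"]:
            agree[key] += 1
    return agree, total


# Toy sentence with invented annotation; the det edge disagrees on purpose.
SENTENCE = [
    {"id": 1, "head": 2, "upos": "DET", "deprel": "det", "number": "Sing"},
    {"id": 2, "head": 0, "upos": "NOUN", "deprel": "root", "number": "Plur"},
    {"id": 3, "head": 2, "upos": "ADJ", "deprel": "amod", "number": "Plur"},
]

agree, total = agreement_by_relation(SENTENCE)
for key in total:
    parent_upos, deprel, child_upos = key
    share = round(100 * agree[key] / total[key])
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {share}%)")
```

Under a different denominator, such as all edges of the given type, the percentages would come out lower, so the choice should be checked against the generating script before comparing numbers.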