
This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

245321 tokens (56%) have a non-empty value of Number. 76132 types (95%) occur at least once with a non-empty value of Number. 37193 lemmas (89%) occur at least once with a non-empty value of Number. The feature is used with 12 part-of-speech tags: NOUN (113230; 26% instances), ADJ (31453; 7% instances), VERB (25129; 6% instances), PROPN (24962; 6% instances), PRON (22785; 5% instances), AUX (14167; 3% instances), NUM (7015; 2% instances), DET (6487; 1% instances), SYM (72; 0% instances), X (16; 0% instances), ADV (4; 0% instances), CCONJ (1; 0% instances).
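Per-tag counts like those above can be recomputed from a treebank file. The following is a minimal sketch, assuming the data is in the standard ten-column, tab-separated CoNLL-U layout (UPOS in column 4, FEATS in column 6). The function names and the two-sentence `SAMPLE` are hypothetical and are not taken from UD_Estonian-EDT or from the tool that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Sing' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def number_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and those with a non-empty Number."""
    with_number = Counter()
    total = Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence separators and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Number" in parse_feats(cols[5]):
            with_number[upos] += 1
    return with_number, total


# Hypothetical sample, made up for illustration only.
SAMPLE = "\n".join([
    "# text = suured aastad",
    "1\tsuured\tsuur\tADJ\t_\tCase=Nom|Degree=Pos|Number=Plur\t2\tamod\t_\t_",
    "2\taastad\taasta\tNOUN\t_\tCase=Nom|Number=Plur\t0\troot\t_\t_",
    "",
    "# text = ka",
    "1\tka\tka\tADV\t_\t_\t0\troot\t_\t_",
])

with_number, total = number_counts_by_upos(SAMPLE)
for upos in sorted(total):
    print(upos, with_number[upos], total[upos])
```

Dividing `with_number[upos]` by `total[upos]` gives a per-tag share comparable to the "% of all NOUN tokens" figures in the sections below.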

NOUN

113230 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm aasta:
Case=Abl: Sing aastalt
Case=Ade: Sing aastal, aastalgi; Plur aastatel, aastail
Case=All: Sing aastale
Case=Com: Sing aastaga; Plur aastatega
Case=Ela: Sing aastast; Plur aastatest, aastaist
Case=Gen: Sing aasta; Plur aastate
Case=Ill: Sing aastasse; Plur aastatesse
Case=Ine: Sing aastas; Plur aastates
Case=Nom: Sing aasta; Plur aastad
Case=Par: Sing aastat, aastas, aastatki; Plur aastaid
Case=Ter: Sing aastani; Plur aastateni
Case=Tra: Sing aastaks; Plur aastateks

ADJ

31453 ADJ tokens (86% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Tense=EMPTY (27254; 87%), Voice=EMPTY (27227; 87%), VerbForm=EMPTY (27220; 87%), Degree=Pos (25901; 82%).

ADJ tokens may have the following values of Number:

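Paradigm suur:
Case=Abl: Sing suurelt
Case=Add: Sing suurde
Case=Ade: Sing suurel; Plur suurtel
Case=All: Sing suurele; Plur suurtele
Case=Ela: Sing suurest; Plur suurtest
Case=Gen: Sing suure; Plur suurte
Case=Ill: Plur suurtesse
Case=Ine: Sing suures; Plur suurtes
Case=Nom: Sing suur; Plur suured
Case=Par: Sing suurt; Plur suuri
Case=Tra: Sing suureks; Plur suurteks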

VERB

25129 VERB tokens (53% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (25129; 100%), Voice=Act (25128; 100%), Mood=Ind (24332; 97%), Person=3 (21174; 84%), Tense=Pres (13972; 56%).

VERB tokens may have the following values of Number:

Paradigm olema:
Mood=Cnd|Person=3|Tense=Pres: Plur oleksid
Mood=Imp|Person=2|Tense=Pres: Sing ole; Plur Olge
Mood=Imp|Person=3|Tense=Pres: Sing ole; Plur olgu
Mood=Imp|Tense=Pres: olgu
Mood=Ind|Person=1|Tense=Past: Sing olin; Plur olime
Mood=Ind|Person=1|Tense=Pres: Sing olen; Plur oleme, olemegi
Mood=Ind|Person=2|Tense=Past: Sing olid, Olidki
Mood=Ind|Person=2|Tense=Pres: Sing oled; Plur olete
Mood=Ind|Person=3|Tense=Imp: Sing Oli
Mood=Ind|Person=3|Tense=Past: Sing oli, oligi; Plur olid
Mood=Ind|Person=3|Tense=Pres: Sing on, ongi, Ons; Plur on, ongi

PROPN

24962 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm Maa:
Case=Abl: Sing Maalt
Case=Ade: Sing Maal
Case=All: Sing Maale
Case=Com: Sing Maaga
Case=Ela: Sing Maast
Case=Gen: Sing Maa, MAA
Case=Nom: Sing Maa; Plur Maad
Case=Par: Sing Maad

Number seems to be a lexical feature of PROPN. 99% of lemmas (7093) occur with only one value of Number.
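The "lexical feature" observation rests on how many lemmas of a tag occur with only one value of Number. A minimal sketch of that check follows, assuming tokens have already been reduced to `(lemma, upos, feats)` triples. The function name and the sample tokens are hypothetical, and the exact counting rules used to produce the figures on this page are not stated here.

```python
from collections import defaultdict


def single_value_lemmas(tokens, upos, feature="Number"):
    """Return (lemmas with exactly one value, lemmas with any value).

    tokens: iterable of (lemma, upos, feats) where feats is a dict.
    Only tokens of the given UPOS that carry the feature are counted.
    """
    values = defaultdict(set)
    for lemma, tag, feats in tokens:
        if tag == upos and feature in feats:
            values[lemma].add(feats[feature])
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


# Hypothetical tokens, made up for illustration only.
SAMPLE_TOKENS = [
    ("Maa", "PROPN", {"Case": "Nom", "Number": "Sing"}),
    ("Maa", "PROPN", {"Case": "Nom", "Number": "Plur"}),
    ("Tartu", "PROPN", {"Case": "Gen", "Number": "Sing"}),
    ("Eesti", "PROPN", {"Case": "Gen", "Number": "Sing"}),
    ("aasta", "NOUN", {"Case": "Nom", "Number": "Plur"}),
]

single, seen = single_value_lemmas(SAMPLE_TOKENS, "PROPN")
share = single / seen if seen else 0.0
print(single, seen, round(share, 2))
```

A share close to 1 suggests that the value is fixed per lemma rather than inflectional for that part of speech.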

PRON

22785 PRON tokens (100% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=EMPTY (13845; 61%), PronType=Prs (11581; 51%).

PRON tokens may have the following values of Number:

Paradigm tema:
Case=Abl|Person=3|PronType=Prs: Sing temalt, talt; Plur neilt
Case=Ade|Person=3|PronType=Prs: Sing tal, temal, temalgi; Plur neil, nendel
Case=All|Person=3|PronType=Prs: Sing talle, temale; Plur neile, nendele, neilegi, nendelegi
Case=Com|Person=3|PronType=Prs: Sing temaga; Plur nendega
Case=Ela|Person=3|PronType=Prs: Sing temast, tast, te-mast, temastki; Plur neist, nendest, neistki
Case=Gen|Person=3|PronType=Prs: Sing tema, ta, temagi; Plur nende
Case=Gen|PronType=Dem: Plur nende
Case=Ill|Person=3|PronType=Prs: Sing temasse; Plur neisse
Case=Ine|Person=3|PronType=Prs: Sing temas; Plur neis
Case=Nom|Person=3|PronType=Prs: Sing ta, tema, temagi, tagi; Plur nad, nemad, nemadki
Case=Par|Person=3|PronType=Prs: Sing teda, tedagi; Plur neid, neidki
Case=Par|PronType=Dem: Plur neid
Case=Ter|Person=3|PronType=Prs: Sing temani
Case=Tra|Person=3|PronType=Prs: Sing temaks; Plur nendeks
Person=3|PronType=Prs: Sing tal

AUX

14167 AUX tokens (65% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (14167; 100%), Voice=Act (14167; 100%), Polarity=EMPTY (14130; 100%), Mood=Ind (13883; 98%), Person=3 (13226; 93%), Tense=Pres (11138; 79%).

AUX tokens may have the following values of Number:

Paradigm olema:
Mood=Cnd|Person=1|Tense=Pres: Sing oleksin; Plur oleksime
Mood=Cnd|Person=2|Tense=Pres: Sing oleksid
Mood=Cnd|Person=3|Tense=Pres: Sing oleks; Plur oleksid, oleksidki
Mood=Imp|Person=1|Tense=Pres: Plur olgem
Mood=Imp|Person=2|Tense=Pres: Sing ole; Plur Olge
Mood=Imp|Person=3|Tense=Pres: olgu, ole
Mood=Imp|Tense=Pres: olgu
Mood=Ind|Person=1|Tense=Past: Sing olin; Plur olime, olimegi
Mood=Ind|Person=1|Tense=Pres: Sing olen, olengi; Plur oleme, olemegi
Mood=Ind|Person=2|Tense=Past: Sing olid, olidki; Plur olite
Mood=Ind|Person=2|Tense=Pres: Sing oled, oledki; Plur olete, oletegi
Mood=Ind|Person=3|Tense=Past: Sing oli, oligi; Plur olid, olidki
Mood=Ind|Person=3|Tense=Pres: Sing on, ongi, ole, ons; Plur on, ongi

NUM

7015 NUM tokens (76% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (7007; 100%), Case=Nom (4614; 66%), NumForm=Digit (3631; 52%).

NUM tokens may have the following values of Number:

Paradigm miljon:
Case=Abl: Sing miljonilt
Case=Ade: Miljonitel
Case=Com: Sing miljoniga
Case=Ela: Sing miljonist; Plur miljonitest
Case=Gen: Sing miljoni; Plur miljonite
Case=Nom: Sing miljon, miljonit, miljoni, miljonini, miljonile, miljonilt, miljonite, miljonitega, miljoniteni, miljonitest; Plur miljonid
Case=Par: Sing miljonit, miljoni; Plur miljoneid

Number seems to be a lexical feature of NUM. 99% of lemmas (970) occur with only one value of Number.

DET

6487 DET tokens (95% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: PronType=Dem (3337; 51%).

DET tokens may have the following values of Number:

Paradigm see:
Case=Abl|PronType=Dem: Sing sellelt, selleltki; Plur neilt
Case=Ade|PronType=Dem: Sing sel, sellel, Selgi; Plur neil, nendel
Case=All|PronType=Dem: Sing sellele; Plur neile, nendele
Case=Ela|PronType=Dem: Sing sellest, sest; Plur neist, nendest
Case=Gen|Person=3|PronType=Prs: Plur nende
Case=Gen|PronType=Dem: Sing selle; Plur nende
Case=Ill|PronType=Dem: Sing sellesse; Plur neisse, nendesse
Case=Ine|PronType=Dem: Sing selles, ses, Selleski; Plur neis, nendes
Case=Nom|PronType=Dem: Sing see, seegi; Plur need
Case=Par|PronType=Dem: Sing seda; Plur neid, neidki
Case=Tra|PronType=Dem: Sing selleks, Seks; Plur Nendeks

SYM

72 SYM tokens (11% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Abbr=EMPTY (64; 89%), NumForm=Digit (64; 89%), NumType=Card (64; 89%), Case=Nom (46; 64%).

SYM tokens may have the following values of Number:

X

16 X tokens (2% of all X tokens) have a non-empty value of Number.

The most frequent other feature values with which X and Number co-occurred: Abbr=EMPTY (16; 100%), Foreign=EMPTY (16; 100%).

X tokens may have the following values of Number:

Number seems to be a lexical feature of X. 100% of lemmas (13) occur with only one value of Number.

ADV

4 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

ADV tokens may have the following values of Number:

CCONJ

1 CCONJ token (0% of all CCONJ tokens) has a non-empty value of Number.

CCONJ tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (20033; 94%), NOUN –[nmod]–> NOUN (15486; 63%), VERB –[nsubj]–> NOUN (10380; 70%), NOUN –[conj]–> NOUN (6084; 79%), NOUN –[det]–> DET (5835; 93%), NOUN –[nmod]–> PROPN (5075; 74%), VERB –[nsubj]–> PRON (4685; 67%), NOUN –[nummod]–> NUM (4157; 86%), PROPN –[flat]–> PROPN (3836; 92%), NOUN –[acl]–> ADJ (3169; 53%).
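Agreement counts of this kind can be sketched as follows. This is an illustrative approximation, not the page's actual procedure: it assumes each sentence is a list of token dicts with `id`, `upos`, `head`, `deprel` and `feats` keys (all hypothetical names), and it assumes the percentage is taken over parent-child pairs in which both nodes carry a Number value, which the page does not state.

```python
from collections import Counter


def number_agreement(sentence):
    """Count parent-child pairs that agree in Number, per relation type.

    Returns (agree, both): Counters keyed by
    (parent UPOS, deprel, child UPOS). `both` counts pairs in which
    parent and child each have a Number value; `agree` counts the
    subset in which the two values are equal.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree = Counter()
    both = Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root token has head 0 and no parent node
        parent_number = parent["feats"].get("Number")
        child_number = child["feats"].get("Number")
        if parent_number is None or child_number is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_number == child_number:
            agree[key] += 1
    return agree, both


# Hypothetical three-token sentence, made up for illustration only.
SENTENCE = [
    {"id": 1, "upos": "ADJ", "head": 2, "deprel": "amod",
     "feats": {"Number": "Plur"}},
    {"id": 2, "upos": "NOUN", "head": 0, "deprel": "root",
     "feats": {"Number": "Plur"}},
    {"id": 3, "upos": "NOUN", "head": 2, "deprel": "nmod",
     "feats": {"Number": "Sing"}},
]

agree, both = number_agreement(SENTENCE)
for key in sorted(both):
    print(key, agree[key], both[key])
```

Summing the two Counters over all sentences of a treebank and sorting by `agree` would give a ranking in the same shape as the list above, written as parent –[relation]–> child.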