home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

206795 tokens (56%) have a non-empty value of Number. 66174 types (95%) occur at least once with a non-empty value of Number. 32339 lemmas (88%) occur at least once with a non-empty value of Number. The feature is used with 11 part-of-speech tags: NOUN (95526; 26% instances), ADJ (26645; 7% instances), VERB (21741; 6% instances), PROPN (20204; 6% instances), PRON (19408; 5% instances), AUX (12017; 3% instances), NUM (5696; 2% instances), DET (5527; 2% instances), SYM (19; 0% instances), X (11; 0% instances), ADV (1; 0% instances).

NOUN

95526 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm aastaSingPlur
Case=Ablaastalt
Case=Adeaastal, aastalgiaastatel, aastail
Case=Allaastale
Case=Comaastagaaastatega
Case=Elaaastastaastatest, aastaist
Case=Genaastaaastate
Case=Illaastasse
Case=Ineaastasaastates
Case=Nomaastaaastad
Case=Paraastat, aastas, aastatkiaastaid
Case=Teraastaniaastateni
Case=Traaastaksaastateks

ADJ

26645 ADJ tokens (86% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Tense=EMPTY (23122; 87%), Voice=EMPTY (23095; 87%), VerbForm=EMPTY (23088; 87%), Degree=Pos (21950; 82%).

ADJ tokens may have the following values of Number:

Paradigm suurSingPlur
Case=Ablsuurelt
Case=Addsuurde
Case=Adesuurelsuurtel
Case=Allsuurelesuurtele
Case=Elasuurestsuurtest
Case=Gensuuresuurte
Case=Illsuurtesse
Case=Inesuuressuurtes
Case=Nomsuursuured
Case=Parsuurtsuuri
Case=Trasuurekssuurteks

VERB

21741 VERB tokens (53% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (21741; 100%), Voice=Act (21740; 100%), Mood=Ind (21034; 97%), Person=3 (18241; 84%), Tense=Pres (11871; 55%).

VERB tokens may have the following values of Number:

Paradigm saamaSingPlur
Mood=Cnd|Person=1|Tense=Pressaaksinsaaksime
Mood=Cnd|Person=2|Tense=Pressaaksite
Mood=Cnd|Person=3|Tense=Pressaaksid
Mood=Imp|Person=2|Tense=PresSaasaage
Mood=Imp|Person=3|Tense=Pressaagu
Mood=Ind|Person=1|Tense=Impsaime
Mood=Ind|Person=1|Tense=Pastsain, saingisaime
Mood=Ind|Person=1|Tense=Pressaansaame, saamegi
Mood=Ind|Person=2|Tense=Pastsaidsaite
Mood=Ind|Person=2|Tense=Pressaadsaate
Mood=Ind|Person=3|Tense=Impsai
Mood=Ind|Person=3|Tense=Pastsai, saigisaid
Mood=Ind|Person=3|Tense=Pressaabsaavad

PROPN

20204 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm JanSingPlur
Case=AblJanilt
Case=AdeJanil
Case=AllJanile
Case=ComJaniga
Case=GenJani
Case=NomJan
Case=ParJaniJane
Case=TraJaniks

Number seems to be lexical feature of PROPN. 99% lemmas (5770) occur only with one value of Number.

PRON

19408 PRON tokens (100% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=EMPTY (11640; 60%), PronType=Prs (9952; 51%).

PRON tokens may have the following values of Number:

Paradigm temaSingPlur
Case=Abl|Person=3|PronType=Prstemalt, taltneilt
Case=Ade|Person=3|PronType=Prstal, temal, temalgineil, nendel
Case=All|Person=3|PronType=Prstalle, temaleneile, nendele, neilegi, nendelegi
Case=Com|Person=3|PronType=Prstemaganendega
Case=Ela|Person=3|PronType=Prstemast, tast, te-mast, temastkineist, nendest
Case=Gen|Person=3|PronType=Prstema, tanende
Case=Gen|PronType=Demnende
Case=Ill|Person=3|PronType=Prstemasseneisse
Case=Ine|Person=3|PronType=Prstemasneis
Case=Nom|Person=3|PronType=Prsta, tema, temaginad, nemad, nemadki
Case=Par|Person=3|PronType=Prsteda, tedagineid, neidki
Case=Par|PronType=Demneid
Case=Ter|Person=3|PronType=Prstemani
Case=Tra|Person=3|PronType=Prstemaksnendeks
Person=3|PronType=Prstal

AUX

12017 AUX tokens (64% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (12017; 100%), Voice=Act (12017; 100%), Polarity=EMPTY (11980; 100%), Mood=Ind (11773; 98%), Person=3 (11214; 93%), Tense=Pres (9353; 78%).

AUX tokens may have the following values of Number:

Paradigm olemaSingPlur
Mood=Cnd|Person=1|Tense=Presoleksinoleksime
Mood=Cnd|Person=2|Tense=Presoleksid
Mood=Cnd|Person=3|Tense=Presoleksid, oleksidki
Mood=Imp|Person=1|Tense=Presolgem
Mood=Imp|Person=2|Tense=PresoleOlge
Mood=Imp|Person=3|Tense=Presolgu, ole
Mood=Imp|Tense=Presolgu
Mood=Ind|Person=1|Tense=Pastolinolime, olimegi
Mood=Ind|Person=1|Tense=Presolen, olengioleme, olemegi
Mood=Ind|Person=2|Tense=Pastolid, olidkiolite
Mood=Ind|Person=2|Tense=Presoled, oledkiolete, oletegi
Mood=Ind|Person=3|Tense=Pastoli, oligiolid, olidki
Mood=Ind|Person=3|Tense=Preson, ongi, ole, onson, ongi

NUM

5696 NUM tokens (77% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (5692; 100%), Case=Nom (3669; 64%), NumForm=Digit (2867; 50%).

NUM tokens may have the following values of Number:

Paradigm miljonSingPlur
Case=Ablmiljonilt
Case=AdeMiljonitel
Case=Commiljoniga
Case=Elamiljonistmiljonitest
Case=Genmiljonimiljonite
Case=Nommiljon, miljonit, miljoni, miljoneid, miljonilt, miljonini, miljoniteni, miljonitestmiljonid
Case=Parmiljonit, miljonimiljoneid

Number seems to be lexical feature of NUM. 99% lemmas (914) occur only with one value of Number.

DET

5527 DET tokens (96% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: PronType=Dem (2843; 51%).

DET tokens may have the following values of Number:

Paradigm seeSingPlur
Case=Abl|PronType=Demsellelt, selleltkineilt
Case=Ade|PronType=Demsel, sellelneil, nendel
Case=All|PronType=Demselleleneile, nendele
Case=Ela|PronType=Demsellest, sestneist, nendest
Case=Gen|Person=3|PronType=Prsnende
Case=Gen|PronType=Demsellenende
Case=Ill|PronType=Demsellesseneisse
Case=Ine|PronType=Demselles, ses, Selleskineis, nendes
Case=Nom|PronType=Demsee, seegineed
Case=Par|PronType=Demsedaneid, neidki
Case=Tra|PronType=Demselleks, SeksNendeks

SYM

19 SYM tokens (12% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Abbr=EMPTY (11; 58%), NumForm=Digit (11; 58%), NumType=Card (11; 58%).

SYM tokens may have the following values of Number:

X

11 X tokens (3% of all X tokens) have a non-empty value of Number.

The most frequent other feature values with which X and Number co-occurred: Abbr=EMPTY (11; 100%), Foreign=EMPTY (11; 100%).

X tokens may have the following values of Number:

ADV

1 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.

ADV tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (16897; 95%), NOUN –[nmod]–> NOUN (12926; 63%), VERB –[nsubj]–> NOUN (8839; 70%), NOUN –[conj]–> NOUN (5048; 79%), NOUN –[det]–> DET (4919; 93%), NOUN –[nmod]–> PROPN (4153; 74%), VERB –[nsubj]–> PRON (4080; 67%), NOUN –[nummod]–> NUM (3494; 87%), PROPN –[flat]–> PROPN (3033; 93%), NOUN –[acl]–> ADJ (2681; 53%).