
This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

242229 tokens (55%) have a non-empty value of Number. 75207 types (94%) occur at least once with a non-empty value of Number. 36054 lemmas (86%) occur at least once with a non-empty value of Number. The feature is used with 9 part-of-speech tags; each count is followed by its share of all tokens in the treebank: NOUN (113904; 26%), ADJ (30545; 7%), VERB (25233; 6%), PROPN (24978; 6%), PRON (22823; 5%), AUX (14404; 3%), DET (6811; 2%), NUM (3473; 1%), SYM (58; 0%).
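Counts of this kind can be recomputed from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. The function names and the `SAMPLE` sentence are invented for illustration, and the code assumes the standard 10-column CoNLL-U layout with UPOS in column 4 and FEATS in column 6.

```python
from collections import Counter

# Invented toy sentence in CoNLL-U format (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "# text = Suured aastad tulevad.",
    "1\tSuured\tsuur\tADJ\t_\tCase=Nom|Degree=Pos|Number=Plur\t2\tamod\t_\t_",
    "2\taastad\taasta\tNOUN\t_\tCase=Nom|Number=Plur\t3\tnsubj\t_\t_",
    "3\ttulevad\ttulema\tVERB\t_\tMood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin|Voice=Act\t0\troot\t_\t_",
    "4\t.\t.\tPUNCT\t_\t_\t3\tpunct\t_\t_",
])


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Nom|Number=Plur' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_by_upos(conllu_text, feature="Number"):
    """Count, per UPOS tag, all tokens and the tokens that carry `feature`."""
    with_feature, total = Counter(), Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep regular tokens only: this skips comments, blank lines,
        # multiword-token ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        total[cols[3]] += 1
        if feature in parse_feats(cols[5]):
            with_feature[cols[3]] += 1
    return with_feature, total


with_number, total = feature_by_upos(SAMPLE)
for upos in sorted(total):
    share = 100 * with_number[upos] / total[upos]
    print(f"{upos}: {with_number[upos]} of {total[upos]} tokens ({share:.0f}%)")
```

Run over a whole treebank file instead of `SAMPLE`, the two counters give the per-tag counts and the "% of all X tokens" figures reported in the sections below.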

NOUN

113904 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm aasta  Sing                      Plur
Case=Abl        aastalt                   -
Case=Ade        aastal, aastalgi          aastatel, aastail
Case=All        aastale                   -
Case=Com        aastaga                   aastatega
Case=Ela        aastast                   aastatest, aastaist
Case=Gen        aasta                     aastate
Case=Ill        aastasse                  aastatesse
Case=Ine        aastas                    aastates
Case=Nom        aasta                     aastad
Case=Par        aastat, aastas, aastatki  aastaid
Case=Ter        aastani                   aastateni
Case=Tra        aastaks                   aastateks

ADJ

30545 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Tense=EMPTY (26323; 86%), Voice=EMPTY (26297; 86%), VerbForm=EMPTY (26290; 86%), Degree=Pos (26013; 85%).

ADJ tokens may have the following values of Number:

Paradigm suur  Sing     Plur
Case=Abl       suurelt  -
Case=Add       suurde   -
Case=Ade       suurel   suurtel
Case=All       suurele  suurtele
Case=Ela       suurest  suurtest
Case=Gen       suure    suurte
Case=Ill       -        suurtesse
Case=Ine       suures   suurtes
Case=Nom       suur     suured
Case=Par       suurt    suuri
Case=Tra       suureks  suurteks

VERB

25233 VERB tokens (53% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (25233; 100%), Voice=Act (25232; 100%), Mood=Ind (24438; 97%), Person=3 (21251; 84%), Tense=Pres (14041; 56%).

VERB tokens may have the following values of Number:

Paradigm saama                Sing               Plur
Mood=Cnd|Person=1|Tense=Pres  saaksin            saaksime
Mood=Cnd|Person=2|Tense=Pres  -                  saaksite
Mood=Cnd|Person=3|Tense=Pres  -                  saaksid
Mood=Imp|Person=1|Tense=Pres  -                  saagem
Mood=Imp|Person=2|Tense=Pres  Saa                saage
Mood=Imp|Person=3|Tense=Pres  saagu              -
Mood=Ind|Person=1|Tense=Past  sain, sai, saingi  saime, saimegi
Mood=Ind|Person=1|Tense=Pres  saan               saame, saamegi
Mood=Ind|Person=2|Tense=Past  said               saite
Mood=Ind|Person=2|Tense=Pres  saad               saate
Mood=Ind|Person=3|Tense=Past  sai, saigi         said
Mood=Ind|Person=3|Tense=Pres  saab, saabki       saavad

PROPN

24978 PROPN tokens (94% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm Maa  Sing      Plur
Case=Abl      Maalt     -
Case=Ade      Maal      -
Case=All      Maale     -
Case=Com      Maaga     -
Case=Ela      Maast     -
Case=Gen      Maa, MAA  -
Case=Nom      Maa       Maad
Case=Par      Maad      -

Number seems to be a lexical feature of PROPN: 99% of lemmas (7023) occur with only one value of Number.
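The "lexical feature" observation amounts to grouping tokens by lemma and checking how many distinct values of Number each lemma shows. A minimal sketch of that check follows. It is an illustration under assumptions, not the code behind this page: `single_value_share` and the toy rows are hypothetical, and the denominator is taken to be the lemmas that occur at least once with a non-empty value.

```python
from collections import defaultdict

# Invented toy tokens in CoNLL-U format (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "1\tMaa\tMaa\tPROPN\t_\tCase=Nom|Number=Sing\t0\troot\t_\t_",
    "",
    "1\tMaad\tMaa\tPROPN\t_\tCase=Nom|Number=Plur\t0\troot\t_\t_",
    "",
    "1\tTallinn\tTallinn\tPROPN\t_\tCase=Nom|Number=Sing\t0\troot\t_\t_",
    "",
    "1\tTallinna\tTallinn\tPROPN\t_\tCase=Gen|Number=Sing\t0\troot\t_\t_",
])


def single_value_share(conllu_text, upos, feature="Number"):
    """Return (lemmas with exactly one value, lemmas with any value)."""
    values = defaultdict(set)  # lemma -> set of observed feature values
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Comments and blank lines do not split into 10 columns.
        if len(cols) != 10 or not cols[0].isdigit() or cols[3] != upos:
            continue
        for pair in cols[5].split("|"):
            name, _, value = pair.partition("=")
            if name == feature:
                values[cols[2]].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    return single, len(values)


single, seen = single_value_share(SAMPLE, "PROPN")
print(f"{single} of {seen} PROPN lemmas occur with only one value of Number")
```

In the toy data the lemma Maa appears with both Sing and Plur, while Tallinn appears only with Sing, so one of the two lemmas counts as single-valued.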

PRON

22823 PRON tokens (100% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=EMPTY (13799; 60%), PronType=Prs (11703; 51%).

PRON tokens may have the following values of Number:

Paradigm tema                   Sing                             Plur
Case=Abl|Person=3|PronType=Prs  temalt, talt                     neilt
Case=Ade|Person=3|PronType=Prs  tal, temal, temalgi              neil, nendel
Case=All|Person=3|PronType=Prs  talle, temale                    neile, nendele, neilegi, nendelegi
Case=Com|Person=3|PronType=Prs  temaga                           nendega
Case=Ela|Person=3|PronType=Prs  temast, tast, te-mast, temastki  neist, nendest, neistki
Case=Gen|Person=3|PronType=Prs  tema, ta, temagi                 nende
Case=Gen|PronType=Dem           -                                nende
Case=Ill|Person=3|PronType=Prs  temasse                          neisse
Case=Ine|Person=3|PronType=Prs  temas                            neis
Case=Nom|Person=3|PronType=Prs  ta, tema, temagi, tagi           nad, nemad, nemadki
Case=Par|Person=3|PronType=Prs  teda, tedagi                     neid, neidki
Case=Par|PronType=Dem           -                                neid
Case=Ter|Person=3|PronType=Prs  temani                           -
Case=Tra|Person=3|PronType=Prs  temaks                           nendeks
Person=3|PronType=Prs           tal                              -

AUX

14404 AUX tokens (65% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (14404; 100%), Voice=Act (14404; 100%), Polarity=EMPTY (14367; 100%), Mood=Ind (14115; 98%), Person=3 (13447; 93%), Tense=Pres (11348; 79%).

AUX tokens may have the following values of Number:

Paradigm olema                Sing                    Plur
Mood=Cnd|Person=1|Tense=Pres  oleksin                 oleksime
Mood=Cnd|Person=2|Tense=Pres  oleksid                 -
Mood=Cnd|Person=3|Tense=Pres  oleks                   oleksid, oleksidki
Mood=Imp|Person=1|Tense=Pres  -                       olgem
Mood=Imp|Person=2|Tense=Pres  ole                     Olge
Mood=Imp|Person=3|Tense=Pres  ole                     olgu, ole
Mood=Imp|Tense=Pres           olgu                    -
Mood=Ind|Person=1|Tense=Past  olin                    olime, olimegi
Mood=Ind|Person=1|Tense=Pres  olen, olengi            oleme, olemegi
Mood=Ind|Person=2|Tense=Past  olid, olidki            olite
Mood=Ind|Person=2|Tense=Pres  oled, oledki            olete, oletegi
Mood=Ind|Person=3|Tense=Past  oli, oligi              olid, olidki
Mood=Ind|Person=3|Tense=Pres  on, ongi, ole, om, ons  on, ongi

DET

6811 DET tokens (96% of all DET tokens) have a non-empty value of Number.

DET tokens may have the following values of Number:

Paradigm see                    Sing                   Plur
Case=Abl|PronType=Dem           sellelt, selleltki     neilt
Case=Ade|PronType=Dem           sel, sellel, Selgi     neil, nendel
Case=All|PronType=Dem           sellele                neile, nendele
Case=Ela|PronType=Dem           sellest, sest          neist, nendest
Case=Gen|Person=3|PronType=Prs  -                      nende
Case=Gen|PronType=Dem           selle                  nende
Case=Ill|PronType=Dem           sellesse               neisse, nendesse
Case=Ine|PronType=Dem           selles, ses, Selleski  neis, nendes
Case=Nom|PronType=Dem           see, seegi             need
Case=Par|PronType=Dem           seda                   neid, neidki
Case=Tra|PronType=Dem           selleks, Seks          Nendeks

NUM

3473 NUM tokens (38% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (3341; 96%), NumType=Card (3272; 94%).

NUM tokens may have the following values of Number:

Paradigm kolm  Sing           Plur
Case=Abl       kolmelt        -
Case=Ade       kolmel         -
Case=All       kolmele        -
Case=Ela       kolmest        -
Case=Gen       kolme          -
Case=Ine       kolmes         -
Case=Nom       kolm, kolmeni  -
Case=Par       kolme          kolmesid
Case=Ter       kolmeni        -
Case=Tra       kolmeks        -

Number seems to be a lexical feature of NUM: 94% of lemmas (157) occur with only one value of Number.

SYM

58 SYM tokens (8% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Abbr=EMPTY (51; 88%).

SYM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (19071; 90%), NOUN –[nmod]–> NOUN (15442; 63%), VERB –[nsubj]–> NOUN (10412; 70%), NOUN –[det]–> DET (6178; 94%), NOUN –[conj]–> NOUN (6111; 79%), NOUN –[nmod]–> PROPN (5114; 74%), VERB –[nsubj]–> PRON (4737; 66%), PROPN –[flat]–> PROPN (3786; 92%), NOUN –[acl]–> ADJ (3307; 53%), NOUN –[cop]–> AUX (2899; 64%).
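Agreement figures like these can be approximated by walking each dependency tree and comparing the Number value of every child with that of its parent. The sketch below is an assumption-laden illustration rather than the page's actual method: `agreement_counts` and the toy sentences are hypothetical, and the percentage is computed over parent-child pairs in which both nodes carry the feature, which may differ from the denominator used above.

```python
from collections import Counter

# Invented toy sentences in CoNLL-U format (not taken from UD_Estonian-EDT).
SAMPLE = "\n".join([
    "1\tSuured\tsuur\tADJ\t_\tCase=Nom|Degree=Pos|Number=Plur\t2\tamod\t_\t_",
    "2\taastad\taasta\tNOUN\t_\tCase=Nom|Number=Plur\t3\tnsubj\t_\t_",
    "3\ttulevad\ttulema\tVERB\t_\tMood=Ind|Number=Plur|Person=3|Tense=Pres\t0\troot\t_\t_",
    "",
    "1\taastate\taasta\tNOUN\t_\tCase=Gen|Number=Plur\t2\tnmod\t_\t_",
    "2\ttöö\ttöö\tNOUN\t_\tCase=Nom|Number=Sing\t0\troot\t_\t_",
])


def agreement_counts(conllu_text, feature="Number"):
    """Count parent/child pairs that both carry `feature`, and those that agree."""
    agree, both = Counter(), Counter()
    sentence = {}  # token id -> (upos, head id, deprel, feature value or None)

    def flush():
        for upos, head, deprel, value in sentence.values():
            parent = sentence.get(head)  # None for the root (head 0)
            if value is None or parent is None or parent[3] is None:
                continue
            key = (parent[0], deprel, upos)
            both[key] += 1
            if parent[3] == value:
                agree[key] += 1
        sentence.clear()

    # The extra empty line makes sure the last sentence is flushed too.
    for line in conllu_text.splitlines() + [""]:
        if not line.strip():
            flush()
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        feats = dict(p.split("=", 1) for p in cols[5].split("|") if "=" in p)
        head = int(cols[6]) if cols[6].isdigit() else None
        sentence[int(cols[0])] = (cols[3], head, cols[7], feats.get(feature))
    return agree, both


agree, both = agreement_counts(SAMPLE)
for (parent, deprel, child), n in both.most_common():
    hits = agree[(parent, deprel, child)]
    print(f"{parent} -[{deprel}]-> {child} ({hits}; {100 * hits / n:.0f}%)")
```

In the toy data the amod and nsubj pairs agree in Number, while the nmod pair (plural genitive modifier, singular head) does not.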