home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Estonian-EDT: Features: Number

This feature is universal. It occurs with 2 different values: Plur, Sing.

This is a layered feature with the following layers: Number, Number[psor].

242388 tokens (55%) have a non-empty value of Number. 75275 types (94%) occur at least once with a non-empty value of Number. 36135 lemmas (86%) occur at least once with a non-empty value of Number. The feature is used with 9 part-of-speech tags: NOUN (114224; 26% instances), ADJ (30462; 7% instances), VERB (25132; 6% instances), PROPN (24851; 6% instances), PRON (22795; 5% instances), AUX (14525; 3% instances), DET (6879; 2% instances), NUM (3462; 1% instances), SYM (58; 0% instances).

NOUN

114224 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

NOUN tokens may have the following values of Number:

Paradigm aastaSingPlur
Case=Ablaastalt
Case=Adeaastal, aastalgiaastatel, aastail
Case=Allaastale
Case=Comaastagaaastatega
Case=Elaaastastaastatest, aastaist
Case=Genaastaaastate
Case=Illaastasseaastatesse
Case=Ineaastasaastates
Case=Nomaastaaastad
Case=Paraastat, aastas, aastatkiaastaid
Case=Teraastaniaastateni
Case=Traaastaksaastateks

ADJ

30462 ADJ tokens (83% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Tense=EMPTY (26241; 86%), Voice=EMPTY (26213; 86%), VerbForm=EMPTY (26206; 86%), Degree=Pos (26024; 85%).

ADJ tokens may have the following values of Number:

Paradigm suurSingPlur
Case=Ablsuurelt
Case=Addsuurde
Case=Adesuurelsuurtel
Case=Allsuurelesuurtele
Case=Elasuurestsuurtest
Case=Gensuuresuurte
Case=Illsuurtesse
Case=Inesuuressuurtes
Case=Nomsuursuured
Case=Parsuurtsuuri
Case=Trasuurekssuurteks

VERB

25132 VERB tokens (53% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (25132; 100%), Voice=Act (25122; 100%), Mood=Ind (24334; 97%), Person=3 (21166; 84%), Tense=Pres (13969; 56%).

VERB tokens may have the following values of Number:

Paradigm saamaSingPlur
Mood=Cnd|Person=1|Tense=Pressaaksinsaaksime
Mood=Cnd|Person=2|Tense=Pressaaksite
Mood=Cnd|Person=3|Tense=Pressaaksid
Mood=Imp|Person=1|Tense=Pressaagem
Mood=Imp|Person=2|Tense=PresSaasaage
Mood=Imp|Person=3|Tense=Pressaagu
Mood=Ind|Person=1|Tense=Pastsain, sai, saingisaime, saimegi
Mood=Ind|Person=1|Tense=Pressaansaame, saamegi
Mood=Ind|Person=2|Tense=Pastsaidsaite
Mood=Ind|Person=2|Tense=Pressaadsaate
Mood=Ind|Person=3|Tense=Pastsai, saigisaid
Mood=Ind|Person=3|Tense=Pressaab, saabkisaavad

PROPN

24851 PROPN tokens (95% of all PROPN tokens) have a non-empty value of Number.

PROPN tokens may have the following values of Number:

Paradigm MaaSingPlur
Case=AblMaalt
Case=AdeMaal
Case=AllMaale
Case=ComMaaga
Case=ElaMaast
Case=GenMaa, MAA
Case=NomMaaMaad
Case=ParMaad

Number seems to be lexical feature of PROPN. 99% lemmas (6955) occur only with one value of Number.

PRON

22795 PRON tokens (100% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Person=EMPTY (13748; 60%), PronType=Prs (11726; 51%).

PRON tokens may have the following values of Number:

Paradigm temaSingPlur
Case=Abl|Person=3|PronType=Prstemalt, taltneilt
Case=Ade|Person=3|PronType=Prstal, temal, temalgineil, nendel
Case=All|Person=3|PronType=Prstalle, temaleneile, nendele, neilegi, nendelegi
Case=Com|Person=3|PronType=Prstemaganendega
Case=Ela|Person=3|PronType=Prstemast, tast, temastkineist, nendest, neistki
Case=Gen|Person=3|PronType=Prstema, ta, temaginende
Case=Ill|Person=3|PronType=Prstemasseneisse
Case=Ine|Person=3|PronType=Prstemasneis
Case=Nom|Person=3|PronType=Prsta, tema, temagi, taginad, nemad, nemadki
Case=Nom|Person=3|PronType=Prs|Typo=Yesta
Case=Par|Person=3|PronType=Prsteda, tedagineid, neidki
Case=Par|PronType=Demneid
Case=Ter|Person=3|PronType=Prstemani
Case=Tra|Person=3|PronType=Prstemaksnendeks

AUX

14525 AUX tokens (65% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (14525; 100%), Voice=Act (14519; 100%), Polarity=EMPTY (14487; 100%), Mood=Ind (14233; 98%), Person=3 (13550; 93%), Tense=Pres (11436; 79%).

AUX tokens may have the following values of Number:

Paradigm olemaSingPlur
Mood=Cnd|Person=1|Tense=Presoleksinoleksime
Mood=Cnd|Person=2|Tense=Presoleksid
Mood=Cnd|Person=3|Tense=Presoleksoleksid, oleksidki
Mood=Imp|Person=1|Tense=Presolgem
Mood=Imp|Person=2|Tense=PresoleOlge
Mood=Imp|Person=3|Tense=Presolgu, oleolgu, ole
Mood=Imp|Tense=Presolgu
Mood=Ind|Person=1|Tense=Pastolinolime, olimegi
Mood=Ind|Person=1|Tense=Presolen, olengioleme, olemegi
Mood=Ind|Person=2|Tense=Pastolidolite
Mood=Ind|Person=2|Tense=Presoled, oledkiolete, oletegi
Mood=Ind|Person=3|Tense=Past|Typo=Yesoli
Mood=Ind|Person=3|Tense=Pastoli, oligiolid, olidki
Mood=Ind|Person=3|Tense=Preson, ongi, ole, om, onson, ongi

DET

6879 DET tokens (95% of all DET tokens) have a non-empty value of Number.

DET tokens may have the following values of Number:

Paradigm seeSingPlur
Case=Abl|PronType=Demsellelt, selleltkineilt
Case=Ade|PronType=Demsel, sellel, Selgineil, nendel
Case=All|PronType=Demselleleneile, nendele
Case=Ela|PronType=Demsellest, sestneist, nendest
Case=Gen|Person=3|PronType=Prsnende
Case=Gen|PronType=Demsellenende
Case=Ill|PronType=Demsellesseneisse, nendesse
Case=Ine|PronType=Demselles, ses, Selleskineis, nendes
Case=Nom|PronType=Demsee, seegineed
Case=Par|PronType=Demsedaneid, neidki
Case=Tra|PronType=Demselleks, SeksNendeks

NUM

3462 NUM tokens (38% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumForm=Word (3349; 97%), NumType=Card (3255; 94%).

NUM tokens may have the following values of Number:

Paradigm kolmSingPlur
Case=Ablkolmelt
Case=Adekolmel
Case=Allkolmele
Case=Elakolmest
Case=Genkolme
Case=Inekolmes
Case=Nomkolm
Case=Parkolmekolmesid
Case=Terkolmeni
Case=Trakolmeks

Number seems to be lexical feature of NUM. 93% lemmas (137) occur only with one value of Number.

SYM

58 SYM tokens (8% of all SYM tokens) have a non-empty value of Number.

The most frequent other feature values with which SYM and Number co-occurred: Abbr=EMPTY (51; 88%).

SYM tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (18937; 90%), NOUN –[nmod]–> NOUN (15532; 63%), VERB –[nsubj]–> NOUN (10403; 70%), NOUN –[det]–> DET (6305; 94%), NOUN –[conj]–> NOUN (6146; 79%), NOUN –[nmod]–> PROPN (5120; 74%), VERB –[nsubj]–> PRON (4730; 67%), PROPN –[flat]–> PROPN (3701; 92%), NOUN –[acl]–> ADJ (3359; 52%), NOUN –[cop]–> AUX (2911; 64%).