This page pertains to UD version 2.

Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Number

This feature is universal. It occurs with 4 different values: Dual, Plur, Ptan, Sing.

This is a layered feature with the following layers: Number, Number[psor].
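As an illustration of what a layered feature looks like in practice, the following minimal sketch parses a CoNLL-U FEATS string and separates the default layer (`Number`) from a named layer (`Number[psor]`). The function name `parse_feats` and the example string are hypothetical and are not taken from the treebank.

```python
import re

# Matches a feature name with an optional layer in square brackets,
# e.g. "Number" or "Number[psor]".
FEATURE_NAME = re.compile(r"^(?P<name>[A-Za-z0-9]+)(?:\[(?P<layer>[a-z0-9]+)\])?$")


def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into {(name, layer): value}.

    The layer is None for the default layer, so "Number=Sing" is stored
    under ("Number", None) and "Number[psor]=Plur" under ("Number", "psor").
    """
    parsed = {}
    if feats in ("", "_"):  # "_" marks an empty FEATS column
        return parsed
    for pair in feats.split("|"):
        key, value = pair.split("=", 1)
        match = FEATURE_NAME.match(key)
        if match is None:
            raise ValueError(f"unexpected feature name: {key!r}")
        parsed[(match["name"], match["layer"])] = value
    return parsed


# Hypothetical FEATS value of a possessive determiner.
example = parse_feats("Case=Nom|Number=Sing|Number[psor]=Plur|Poss=Yes")
print(example[("Number", None)])    # Sing
print(example[("Number", "psor")])  # Plur
```

Keeping the layer as a separate key component means that statistics for `Number` and `Number[psor]` can be counted independently.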

5886 tokens (53%) have a non-empty value of Number. 3744 types (86%) occur at least once with a non-empty value of Number. 2385 lemmas (78%) occur at least once with a non-empty value of Number. The feature is used with 9 part-of-speech tags: NOUN (2522; 23% of instances), ADJ (1406; 13% of instances), VERB (688; 6% of instances), PROPN (545; 5% of instances), AUX (286; 3% of instances), DET (275; 2% of instances), PRON (131; 1% of instances), NUM (32; 0% of instances), ADV (1; 0% of instances).
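Counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a sketch under assumptions: the function name `number_coverage` is hypothetical, the four-token sample is invented toy data rather than treebank content, and only basic word lines are counted (multiword token ranges and empty nodes are skipped).

```python
from collections import Counter


def number_coverage(conllu_text):
    """Return (total_tokens, Counter of UPOS tags for tokens that have Number)."""
    total = 0
    with_number = Counter()
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges ("1-2") and empty nodes ("1.1")
        total += 1
        feats = cols[5]
        names = set()
        if feats != "_":
            names = {pair.split("=", 1)[0] for pair in feats.split("|")}
        if "Number" in names:  # the layered "Number[psor]" is a different name
            with_number[cols[3]] += 1
    return total, with_number


# Invented toy sentence, for illustration only.
rows = [
    ("1", "serbske", "serbski", "ADJ", "_", "Case=Nom|Gender=Neut|Number=Sing", "2", "amod", "_", "_"),
    ("2", "lěto", "lěto", "NOUN", "_", "Case=Nom|Gender=Neut|Number=Sing", "3", "nsubj", "_", "_"),
    ("3", "ma", "měć", "VERB", "_", "Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin", "0", "root", "_", "_"),
    ("4", ".", ".", "PUNCT", "_", "_", "3", "punct", "_", "_"),
]
sample = "# text = toy sentence\n" + "\n".join("\t".join(row) for row in rows) + "\n"

total, with_number = number_coverage(sample)
covered = sum(with_number.values())
print(f"{covered} of {total} tokens ({covered / total:.0%}) have Number")
for upos, count in with_number.most_common():
    print(upos, count)
```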

NOUN

2522 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.

The most frequent other feature values with which NOUN and Number co-occurred: Animacy=EMPTY (1383; 55%).

NOUN tokens may have the following values of Number:

Paradigm lěto (forms in the order Sing / Dual / Plur; empty cells omitted)
Case=Acc: lěto
Case=Gen: lěta / lět, lětow
Case=Loc: lěće, lětu / lětomaj / lětach
Case=Nom: lěto

ADJ

1406 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.

The most frequent other feature values with which ADJ and Number co-occurred: Animacy=EMPTY (1241; 88%), Voice=EMPTY (1215; 86%), VerbForm=EMPTY (1214; 86%), Degree=EMPTY (894; 64%).

ADJ tokens may have the following values of Number:

Paradigm serbski (forms in the order Sing / Dual / Plur; empty cells omitted)
Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc: serbskej
Case=Acc|Degree=Pos|Gender=Masc: serbski
Case=Acc|Degree=Pos|Gender=Neut: serbske
Case=Acc|Gender=Fem: serbsku
Case=Dat|Gender=Masc: serbskemu
Case=Dat|Gender=Fem: serbskim
Case=Gen|Degree=Pos|Gender=Fem: serbskeje
Case=Gen|Gender=Masc: Serbskeho / serbskich
Case=Gen|Gender=Fem: serbskeje
Case=Ins|Gender=Fem: serbskej, serbsku
Case=Loc|Degree=Pos|Gender=Masc: Serbskim
Case=Loc|Gender=Masc: Serbskim
Case=Loc|Gender=Fem: serbskej
Case=Nom|Degree=Pos|Gender=Masc: Serbski, SERBSKI
Case=Nom|Degree=Pos|Gender=Fem: serbska
Case=Nom|Gender=Fem: serbska
Case=Nom|Gender=Neut: serbske

VERB

688 VERB tokens (84% of all VERB tokens) have a non-empty value of Number.

The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (638; 93%), Mood=Ind (625; 91%), Person=3 (590; 86%), Tense=Pres (431; 63%).

VERB tokens may have the following values of Number:

Paradigm měć (forms in the order Sing / Dual / Plur; empty cells omitted)
Animacy=Inan|Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act: mał
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act: měł
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin: njeměješe
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin: měješe / mějachu
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: ma, nima / matej / maja, nimaja
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|VerbType=Mod: ma, nima / maja
Mood=Ind|Tense=Pres|VerbForm=Fin: maja

PROPN

545 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Number.

The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (281; 52%).

PROPN tokens may have the following values of Number:

Paradigm Wikipedija (forms in the order Sing / Plur; empty cells omitted)
Case=Acc: Wikipediju
Case=Gen: Wikipedije / Wikipedijow
Case=Loc: Wikipediji
Case=Nom: Wikipedija

Number seems to be a lexical feature of PROPN: 99% of lemmas (324) occur with only one value of Number.
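The check behind this observation can be sketched as follows: group the Number values by lemma and count the lemmas that have exactly one. The function name `single_value_share` and the toy tokens are hypothetical, and the sketch assumes that only lemmas with at least one non-empty Number value enter the denominator.

```python
from collections import defaultdict


def single_value_share(tokens, upos="PROPN"):
    """Return (count, share) of lemmas of `upos` seen with exactly one Number value.

    `tokens` is an iterable of (upos, lemma, number) triples, where `number`
    is None when the token has no Number feature.
    """
    values_by_lemma = defaultdict(set)
    for tok_upos, lemma, number in tokens:
        if tok_upos == upos and number is not None:
            values_by_lemma[lemma].add(number)
    if not values_by_lemma:
        return 0, 0.0
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    return single, single / len(values_by_lemma)


# Invented toy tokens, not taken from the treebank.
toy_tokens = [
    ("PROPN", "Wikipedija", "Sing"),
    ("PROPN", "Wikipedija", "Plur"),
    ("PROPN", "Budyšin", "Sing"),
    ("PROPN", "Budyšin", "Sing"),
    ("NOUN", "lěto", "Sing"),    # ignored: different part of speech
    ("PROPN", "UNESCO", None),   # ignored: no Number value
]

count, share = single_value_share(toy_tokens)
print(f"{count} lemma(s), {share:.0%} of PROPN lemmas, occur with one Number value")
```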

AUX

286 AUX tokens (99% of all AUX tokens) have a non-empty value of Number.

The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (284; 99%), Person=3 (282; 99%), Mood=Ind (274; 96%), Voice=EMPTY (241; 84%), Tense=Pres (194; 68%).

AUX tokens may have the following values of Number:

Paradigm być (forms in the order Sing / Dual / Plur; empty cells omitted)
Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act: był
Gender=Fem|Tense=Past|VerbForm=Part|Voice=Act: była
Mood=Cnd|Person=3|VerbForm=Fin: by / bychu
Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin: sy
Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin: njebuchu
Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin: njeje / njejsu, njesu
Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin: budu, budźe
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin: bě, bu / běštej / běchu, buchu
Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass: bu / buštej / buchu
Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin: je / stej, staj / su
Mood=Ind|Person=3|VerbForm=Fin|Voice=Pass: buchu

DET

275 DET tokens (84% of all DET tokens) have a non-empty value of Number.

The most frequent other feature values with which DET and Number co-occurred: Abbr=EMPTY (238; 87%), Number[psor]=EMPTY (230; 84%), Person=EMPTY (229; 83%), Poss=EMPTY (201; 73%), Animacy=EMPTY (185; 67%).

DET tokens may have the following values of Number:

Paradigm kotryž (forms in the order Sing / Dual / Plur; empty cells omitted)
Animacy=Anim|Case=Dat|Gender=Masc: kotrymž
Animacy=Anim|Case=Nom|Gender=Masc: kotryž / kotřiž
Animacy=Inan|Case=Acc|Gender=Masc: kotryž
Animacy=Inan|Case=Gen|Gender=Masc: kotrychž
Animacy=Inan|Case=Loc|Gender=Masc: kotrychž
Animacy=Inan|Case=Nom|Gender=Masc: kotryž, kotrež / kotrež
Case=Gen|Gender=Masc: kotrehož
Case=Gen|Gender=Fem: kotrejež
Case=Ins|Gender=Fem: kotrymiž
Case=Loc|Gender=Masc: kotrymž
Case=Loc|Gender=Fem: kotrejž / kotrychž
Case=Loc|Gender=Neut: kotrychž
Case=Nom|Gender=Masc: kotryž / kotrež
Case=Nom|Gender=Fem: kotraž / kotrež
Case=Nom|Gender=Neut: kotrež / kotrejž / kotrež

PRON

131 PRON tokens (39% of all PRON tokens) have a non-empty value of Number.

The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (131; 100%), Gender=Neut (80; 61%), Person=EMPTY (74; 56%).

PRON tokens may have the following values of Number:

Paradigm wón (forms in the order Sing / Dual / Plur; empty cells omitted)
Animacy=Anim|Case=Nom|Gender=Masc: Woni
Animacy=Inan|Case=Acc|Gender=Masc: je
Animacy=Nhum|Case=Acc|Gender=Masc: jeho
Case=Acc|Gender=Masc: jón, jeho
Case=Acc|Gender=Fem: ju, nju / je
Case=Acc: je
Case=Dat|Gender=Fem: Jej, jeje, njej
Case=Gen|Gender=Masc: nich
Case=Gen|Gender=Fem: njeje
Case=Ins|Gender=Neut: nimi
Case=Loc|Gender=Masc: nim
Case=Loc|Gender=Neut: nim
Case=Nom|Gender=Masc: wón
Case=Nom|Gender=Fem: wona / wone
Case=Nom|Gender=Neut: wono, wone / wone
Case=Nom: Wonej
Gender=Masc: jón

NUM

32 NUM tokens (8% of all NUM tokens) have a non-empty value of Number.

The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (31; 97%).

NUM tokens may have the following values of Number:

ADV

1 ADV token (0% of all ADV tokens) has a non-empty value of Number.

The most frequent other feature values with which ADV and Number co-occurred: Degree=Pos (1; 100%), PronType=EMPTY (1; 100%).

ADV tokens may have the following values of Number:

Relations with Agreement in Number

The 10 most frequent relations where parent and child node agree in Number: NOUN –[amod]–> ADJ (1079; 99%), VERB –[nsubj]–> NOUN (346; 91%), NOUN –[nmod]–> NOUN (310; 58%), VERB –[obl]–> NOUN (234; 56%), NOUN –[conj]–> NOUN (211; 89%), NOUN –[det]–> DET (178; 83%), ADJ –[cop]–> AUX (136; 96%), NOUN –[nmod]–> PROPN (108; 64%), PROPN –[conj]–> PROPN (82; 93%), NOUN –[cop]–> AUX (79; 91%).
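Agreement figures of this kind could be computed along the following lines. This is a sketch, not the procedure that generated the page: the helper names `number_of` and `agreement_by_relation` are hypothetical, the toy sentence is invented, and the denominator is assumed to be the number of parent-child pairs in which both nodes carry a Number value.

```python
from collections import Counter


def number_of(feats):
    """Return the value of the plain Number feature, or None if it is absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Number":  # "Number[psor]" has a different name and is skipped
            return value
    return None


def agreement_by_relation(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing pairs, comparable pairs).

    `sentence` is a list of dicts with the keys id, head, upos, deprel, feats.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, both = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])
        if parent is None:
            continue  # the root has head 0 and therefore no parent token
        parent_num = number_of(parent["feats"])
        child_num = number_of(child["feats"])
        if parent_num is None or child_num is None:
            continue  # agreement is undefined unless both nodes have Number
        key = (parent["upos"], child["deprel"], child["upos"])
        both[key] += 1
        if parent_num == child_num:
            agree[key] += 1
    return {key: (agree[key], both[key]) for key in both}


# Invented toy sentence with one agreeing and one deliberately mismatched pair.
toy_sentence = [
    {"id": 1, "head": 2, "upos": "ADJ", "deprel": "amod", "feats": "Case=Nom|Number=Sing"},
    {"id": 2, "head": 3, "upos": "NOUN", "deprel": "nsubj", "feats": "Case=Nom|Number=Sing"},
    {"id": 3, "head": 0, "upos": "VERB", "deprel": "root", "feats": "Number=Plur|Person=3"},
    {"id": 4, "head": 3, "upos": "PUNCT", "deprel": "punct", "feats": "_"},
]

stats = agreement_by_relation(toy_sentence)
for (parent, deprel, child), (agreeing, comparable) in stats.items():
    print(f"{parent} -[{deprel}]-> {child}: {agreeing} of {comparable} agree")
```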