Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Number
This feature is universal.
It occurs with 4 different values: Dual, Plur, Ptan, Sing.
This is a layered feature with the following layers: Number, Number[psor].
5886 tokens (53%) have a non-empty value of Number.
3744 types (86%) occur at least once with a non-empty value of Number.
2385 lemmas (78%) occur at least once with a non-empty value of Number.
The feature is used with 9 part-of-speech tags: NOUN (2522; 23% instances), ADJ (1406; 13% instances), VERB (688; 6% instances), PROPN (545; 5% instances), AUX (286; 3% instances), DET (275; 2% instances), PRON (131; 1% instances), NUM (32; 0% instances), ADV (1; 0% instances).
NOUN
2522 NOUN tokens (99% of all NOUN tokens) have a non-empty value of Number.
The most frequent other feature values with which NOUN and Number co-occurred: Animacy=EMPTY (1383; 55%).
NOUN tokens may have the following values of Number:
Dual(27; 1% of non-emptyNumber): měsacaj, rěkomaj, Kralej, atomaj, atomow, genusaj, izotopaj, kmjenaj, likwidaj, lětomajPlur(807; 32% of non-emptyNumber): rěčow, kilometrow, nastawki, rostliny, lět, knihi, města, rěče, statow, wobrazyPtan(4; 0% of non-emptyNumber): droždźemi, duri, hody, wikiSing(1684; 67% of non-emptyNumber): l, př, město, rěč, woda, lěta, stolica, lěće, mócnarstwo, pismoEMPTY(15): km, m, CEST, centrum, jan, thumb, čas
| Paradigm lěto | Sing | Dual | Plur |
|---|---|---|---|
| Case=Acc | lěto | ||
| Case=Gen | lěta | lět, lětow | |
| Case=Loc | lěće, lětu | lětomaj | lětach |
| Case=Nom | lěto |
ADJ
1406 ADJ tokens (99% of all ADJ tokens) have a non-empty value of Number.
The most frequent other feature values with which ADJ and Number co-occurred: Animacy=EMPTY (1241; 88%), Voice=EMPTY (1215; 86%), VerbForm=EMPTY (1214; 86%), Degree=EMPTY (894; 64%).
ADJ tokens may have the following values of Number:
Dual(12; 1% of non-emptyNumber): dalšej, fotosynteizskej, přesunjenej, rozbiwanej, rozpušćenej, serbskej, sonantnej, wodźikoweju, wudospołnjatej, znatejPlur(510; 36% of non-emptyNumber): druhich, druhe, ablawtowych, dalše, wjacore, prěnje, wažne, wotpowědne, wědomostnych, ZjednoćenychSing(884; 63% of non-emptyNumber): serbski, serbskeje, Serbskeho, najwjetše, prěni, wulki, wulku, klinowe, serbska, EkscelentnyEMPTY(13): němsko, Awstro, Tibeto, al, dołho, duchowno, hornjo, krótko, online, politisko
| Paradigm serbski | Sing | Dual | Plur |
|---|---|---|---|
| Animacy=Inan|Case=Acc|Degree=Pos|Gender=Masc | serbskej | ||
| Case=Acc|Degree=Pos|Gender=Masc | serbski | ||
| Case=Acc|Degree=Pos|Gender=Neut | serbske | ||
| Case=Acc|Gender=Fem | serbsku | ||
| Case=Dat|Gender=Masc | serbskemu | ||
| Case=Dat|Gender=Fem | serbskim | ||
| Case=Gen|Degree=Pos|Gender=Fem | serbskeje | ||
| Case=Gen|Gender=Masc | Serbskeho | serbskich | |
| Case=Gen|Gender=Fem | serbskeje | ||
| Case=Ins|Gender=Fem | serbskej, serbsku | ||
| Case=Loc|Degree=Pos|Gender=Masc | Serbskim | ||
| Case=Loc|Gender=Masc | Serbskim | ||
| Case=Loc|Gender=Fem | serbskej | ||
| Case=Nom|Degree=Pos|Gender=Masc | Serbski, SERBSKI | ||
| Case=Nom|Degree=Pos|Gender=Fem | serbska | ||
| Case=Nom|Gender=Fem | serbska | ||
| Case=Nom|Gender=Neut | serbske |
VERB
688 VERB tokens (84% of all VERB tokens) have a non-empty value of Number.
The most frequent other feature values with which VERB and Number co-occurred: VerbForm=Fin (638; 93%), Mood=Ind (625; 91%), Person=3 (590; 86%), Tense=Pres (431; 63%).
VERB tokens may have the following values of Number:
Dual(12; 2% of non-emptyNumber): jewjetej, matej, móžetej, nabywaštej, nahrawałoj, přidźělitej, rozšěrištaj, spěchowaštej, słušatej, wotkryłojPlur(236; 34% of non-emptyNumber): su, běchu, maja, eksistuja, móžachu, móžeja, pokazuja, wužiwachu, wužiwaja, hodźaSing(440; 64% of non-emptyNumber): ma, móže, wobsahuje, móžeš, hlej, leži, rěči, dyrbi, wužiwa, hodźiEMPTY(130): nastać, měć, pisać, přełožować, wobkedźbować, čitać, dać, definować, dopokazać, kliknyć
| Paradigm měć | Sing | Dual | Plur |
|---|---|---|---|
| Animacy=Inan|Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | mał | ||
| Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | měł | ||
| Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin | njeměješe | ||
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | měješe | mějachu | |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | ma, nima | matej | maja, nimaja |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin|VerbType=Mod | ma, nima | maja | |
| Mood=Ind|Tense=Pres|VerbForm=Fin | maja |
PROPN
545 PROPN tokens (91% of all PROPN tokens) have a non-empty value of Number.
The most frequent other feature values with which PROPN and Number co-occurred: Gender=Masc (281; 52%).
PROPN tokens may have the following values of Number:
Dual(2; 0% of non-emptyNumber): ŁužicomajPlur(53; 10% of non-emptyNumber): Sumeričanow, Aramejčanow, Assyričenjo, Serbow, Assyričanow, Geuzen, Milčanow, Serbach, Łužičanow, AlpowPtan(4; 1% of non-emptyNumber): Drježdźanach, Drježdźany, Mułkecy, WikachSing(486; 89% of non-emptyNumber): Mezopotamiskeje, Mezopotamiska, Mezopotamiskej, Wikimedia, Łužicy, Europje, Assur, Assyriska, Aššur, BabylonEMPTY(51): Aššur, C, Angeles, Gasche, Los, Tamil, Adl, Beth, Bilād, CET
| Paradigm Wikipedija | Sing | Plur |
|---|---|---|
| Case=Acc | Wikipediju | |
| Case=Gen | Wikipedije | Wikipedijow |
| Case=Loc | Wikipediji | |
| Case=Nom | Wikipedija |
Number seems to be lexical feature of PROPN. 99% lemmas (324) occur only with one value of Number.
AUX
286 AUX tokens (99% of all AUX tokens) have a non-empty value of Number.
The most frequent other feature values with which AUX and Number co-occurred: VerbForm=Fin (284; 99%), Person=3 (282; 99%), Mood=Ind (274; 96%), Voice=EMPTY (241; 84%), Tense=Pres (194; 68%).
AUX tokens may have the following values of Number:
Dual(9; 3% of non-emptyNumber): buštej, stej, běštej, stajPlur(93; 33% of non-emptyNumber): su, buchu, njejsu, běchu, bychu, njebuchu, njesuSing(184; 64% of non-emptyNumber): je, bu, bě, by, njeje, sy, budu, budźe, był, byłaEMPTY(2): być
| Paradigm być | Sing | Dual | Plur |
|---|---|---|---|
| Gender=Masc|Tense=Past|VerbForm=Part|Voice=Act | był | ||
| Gender=Fem|Tense=Past|VerbForm=Part|Voice=Act | była | ||
| Mood=Cnd|Person=3|VerbForm=Fin | by | bychu | |
| Mood=Ind|Person=2|Tense=Pres|VerbForm=Fin | sy | ||
| Mood=Ind|Person=3|Polarity=Neg|Tense=Past|VerbForm=Fin | njebuchu | ||
| Mood=Ind|Person=3|Polarity=Neg|Tense=Pres|VerbForm=Fin | njeje | njejsu, njesu | |
| Mood=Ind|Person=3|Tense=Fut|VerbForm=Fin | budu, budźe | ||
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin | bě, bu | běštej | běchu, buchu |
| Mood=Ind|Person=3|Tense=Past|VerbForm=Fin|Voice=Pass | bu | buštej | buchu |
| Mood=Ind|Person=3|Tense=Pres|VerbForm=Fin | je | stej, staj | su |
| Mood=Ind|Person=3|VerbForm=Fin|Voice=Pass | buchu |
DET
275 DET tokens (84% of all DET tokens) have a non-empty value of Number.
The most frequent other feature values with which DET and Number co-occurred: Abbr=EMPTY (238; 87%), Number[psor]=EMPTY (230; 84%), Person=EMPTY (229; 83%), Poss=EMPTY (201; 73%), Animacy=EMPTY (185; 67%).
DET tokens may have the following values of Number:
Dual(3; 1% of non-emptyNumber): Wobě, jeju, kotrejžPlur(106; 39% of non-emptyNumber): kotrež, tute, wšě, někotrych, swoje, kotrychž, někotre, tutych, wšěch, někotřiSing(166; 60% of non-emptyNumber): n, kotryž, kotraž, tutón, tuta, swoju, kotrež, tute, tutej, tutuEMPTY(51): jeho, jich, wjele, jeje, mnoho, mjenje, najwjace, tróšku, tójšto, wjace
| Paradigm kotryž | Sing | Dual | Plur |
|---|---|---|---|
| Animacy=Anim|Case=Dat|Gender=Masc | kotrymž | ||
| Animacy=Anim|Case=Nom|Gender=Masc | kotryž | kotřiž | |
| Animacy=Inan|Case=Acc|Gender=Masc | kotryž | ||
| Animacy=Inan|Case=Gen|Gender=Masc | kotrychž | ||
| Animacy=Inan|Case=Loc|Gender=Masc | kotrychž | ||
| Animacy=Inan|Case=Nom|Gender=Masc | kotryž, kotrež | kotrež | |
| Case=Gen|Gender=Masc | kotrehož | ||
| Case=Gen|Gender=Fem | kotrejež | ||
| Case=Ins|Gender=Fem | kotrymiž | ||
| Case=Loc|Gender=Masc | kotrymž | ||
| Case=Loc|Gender=Fem | kotrejž | kotrychž | |
| Case=Loc|Gender=Neut | kotrychž | ||
| Case=Nom|Gender=Masc | kotryž | kotrež | |
| Case=Nom|Gender=Fem | kotraž | kotrež | |
| Case=Nom|Gender=Neut | kotrež | kotrejž | kotrež |
PRON
131 PRON tokens (39% of all PRON tokens) have a non-empty value of Number.
The most frequent other feature values with which PRON and Number co-occurred: Reflex=EMPTY (131; 100%), Gender=Neut (80; 61%), Person=EMPTY (74; 56%).
PRON tokens may have the following values of Number:
Dual(1; 1% of non-emptyNumber): WonejPlur(19; 15% of non-emptyNumber): je, wone, kiž, Woni, nam, nich, nimiSing(111; 85% of non-emptyNumber): to, toho, tym, wona, wón, wono, čimž, jón, tomu, jehoEMPTY(204): so, kiž, sej, sobu, sebi
| Paradigm wón | Sing | Dual | Plur |
|---|---|---|---|
| Animacy=Anim|Case=Nom|Gender=Masc | Woni | ||
| Animacy=Inan|Case=Acc|Gender=Masc | je | ||
| Animacy=Nhum|Case=Acc|Gender=Masc | jeho | ||
| Case=Acc|Gender=Masc | jón, jeho | ||
| Case=Acc|Gender=Fem | ju, nju | je | |
| Case=Acc | je | ||
| Case=Dat|Gender=Fem | Jej, jeje, njej | ||
| Case=Gen|Gender=Masc | nich | ||
| Case=Gen|Gender=Fem | njeje | ||
| Case=Ins|Gender=Neut | nimi | ||
| Case=Loc|Gender=Masc | nim | ||
| Case=Loc|Gender=Neut | nim | ||
| Case=Nom|Gender=Masc | wón | ||
| Case=Nom|Gender=Fem | wona | wone | |
| Case=Nom|Gender=Neut | wono, wone | wone | |
| Case=Nom | Wonej | ||
| Gender=Masc | jón |
NUM
32 NUM tokens (8% of all NUM tokens) have a non-empty value of Number.
The most frequent other feature values with which NUM and Number co-occurred: NumType=Card (31; 97%).
NUM tokens may have the following values of Number:
Dual(12; 38% of non-emptyNumber): dwaj, dwěmaj, dweju, dwě, woběmajPlur(4; 13% of non-emptyNumber): Mio, miliardowSing(16; 50% of non-emptyNumber): jedyn, jedna, jednu, jednym, jedneho, jednyEMPTY(350): 2, 1, 6, 4, 3, 5, 7, I, 000, 10
ADV
1 ADV tokens (0% of all ADV tokens) have a non-empty value of Number.
The most frequent other feature values with which ADV and Number co-occurred: Degree=Pos (1; 100%), PronType=EMPTY (1; 100%).
ADV tokens may have the following values of Number:
Plur(1; 100% of non-emptyNumber): wuchodneEMPTY(534): tež, tak, hišće, zwjetša, hač, něhdźe, hižo, tu, wjace, najprjedy
Relations with Agreement in Number
The 10 most frequent relations where parent and child node agree in Number:
NOUN –[amod]–> ADJ (1079; 99%),
VERB –[nsubj]–> NOUN (346; 91%),
NOUN –[nmod]–> NOUN (310; 58%),
VERB –[obl]–> NOUN (234; 56%),
NOUN –[conj]–> NOUN (211; 89%),
NOUN –[det]–> DET (178; 83%),
ADJ –[cop]–> AUX (136; 96%),
NOUN –[nmod]–> PROPN (108; 64%),
PROPN –[conj]–> PROPN (82; 93%),
NOUN –[cop]–> AUX (79; 91%).