Treebank Statistics: UD_Upper_Sorbian-UFAL: Features: Voice
This feature is universal.
It occurs with 2 different values: Act, Pass.
287 tokens (3%) have a non-empty value of Voice.
221 types (5%) occur at least once with a non-empty value of Voice.
174 lemmas (6%) occur at least once with a non-empty value of Voice.
The feature is used with 3 part-of-speech tags: ADJ (191; 2% instances), VERB (51; 0% instances), AUX (45; 0% instances).
ADJ
191 ADJ tokens (13% of all ADJ tokens) have a non-empty value of Voice.
The most frequent other feature values with which ADJ and Voice co-occurred: VerbForm=Part (191; 100%), Degree=EMPTY (186; 97%), Animacy=EMPTY (156; 82%), Case=Nom (132; 69%), Number=Sing (107; 56%).
ADJ tokens may have the following values of Voice:
Act(25; 13% of non-emptyVoice): přiběracu, wušłe, Bywša, Přiběrace, Rozrostowace, Slědowace, běžace, dalokosahace, ekspandowaceho, florěrowacePass(166; 87% of non-emptyVoice): mj, mjenowany, mjenowanych, namakane, rozdźělene, Zjednoćenych, listowany, mjenowane, natwarjene, pisaneEMPTY(1228): serbski, druhe, druhich, najwjetše, prěni, prěnje, serbskeje, Serbskeho, wulki, ablawtowych
Voice seems to be lexical feature of ADJ. 100% lemmas (132) occur only with one value of Voice.
VERB
51 VERB tokens (6% of all VERB tokens) have a non-empty value of Voice.
The most frequent other feature values with which VERB and Voice co-occurred: Tense=Past (51; 100%), Mood=EMPTY (50; 98%), Person=EMPTY (50; 98%), VerbForm=Part (50; 98%), Number=Sing (29; 57%).
VERB tokens may have the following values of Voice:
Act(50; 98% of non-emptyVoice): přewzali, wužiwali, započał, změnili, dodźeržała, eksistowali, ilustrował, kontrolowali, mał, mjenowałPass(1; 2% of non-emptyVoice): buEMPTY(767): ma, leži, móže, wobsahuje, móžeš, su, hlej, maja, rěči, běchu
Voice seems to be lexical feature of VERB. 100% lemmas (42) occur only with one value of Voice.
AUX
45 AUX tokens (16% of all AUX tokens) have a non-empty value of Voice.
The most frequent other feature values with which AUX and Voice co-occurred: Tense=Past (44; 98%), Mood=Ind (43; 96%), Person=3 (43; 96%), VerbForm=Fin (43; 96%), Number=Sing (30; 67%).
AUX tokens may have the following values of Voice:
Act(2; 4% of non-emptyVoice): był, byłaPass(43; 96% of non-emptyVoice): bu, buchu, buštejEMPTY(243): je, su, bě, by, njeje, njejsu, běchu, stej, bu, bychu
| Paradigm być | Act | Pass |
|---|---|---|
| Gender=Masc|Number=Sing|Tense=Past|VerbForm=Part | był | |
| Gender=Fem|Number=Sing|Tense=Past|VerbForm=Part | była | |
| Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin | bu | |
| Mood=Ind|Number=Dual|Person=3|Tense=Past|VerbForm=Fin | buštej | |
| Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Fin | buchu | |
| Mood=Ind|Number=Plur|Person=3|VerbForm=Fin | buchu |
Relations with Agreement in Voice
The 10 most frequent relations where parent and child node agree in Voice:
ADJ –[aux:pass]–> AUX (1; 100%),
VERB –[parataxis]–> ADJ (1; 100%).