Voice
: voice
Voice is a feature of verbs that helps map the traditional syntactic functions, such as subject and object, to semantic roles, such as agent and pacient.
Act
: active voice
The subject of the verb is the doer of the action (agent), the object is affected by the action (pacient).
All finite verb forms and the active/past participles are tagged Voice=Act
.
Examples
- Napadli jsme nepřítele. “We attacked the enemy” (the active participle napadli can be used to form either past tense or conditional mood; here it forms the past tense.)
Pass
: passive voice
The subject of the verb is affected by the action (patient). The doer (agent) is either unexpressed or it appears as an object of the verb.
Only the passive participle is tagged Voice=Pass
.
Examples
- Jsme napadeni nepřítelem. “We are attacked by the enemy” (the passive participle napadeni is used to form passive in all tenses; here it forms the present passive.)
Treebank Statistics (UD_Czech)
This feature is universal.
It occurs with 2 different values: Act
, Pass
.
154790 tokens (10%) have a non-empty value of Voice
.
24977 types (19%) occur at least once with a non-empty value of Voice
.
6347 lemmas (11%) occur at least once with a non-empty value of Voice
.
The feature is used with 3 part-of-speech tags: cs-pos/VERB (139146; 9% instances), cs-pos/AUX (11146; 1% instances), cs-pos/ADJ (4498; 0% instances).
VERB
139146 cs-pos/VERB tokens (84% of all VERB
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which VERB
and Voice
co-occurred: Negative=Pos (125084; 90%), Number=Sing (89367; 64%), Gender=EMPTY (75751; 54%), VerbForm=Fin (75729; 54%), Mood=Ind (75729; 54%), Tense=Pres (74329; 53%).
VERB
tokens may have the following values of Voice
:
Act
(129620; 93% of non-emptyVoice
): je, jsou, má, není, byl, může, bylo, řekl, měl, majíPass
(9526; 7% of non-emptyVoice
): řečeno, přesvědčen, připravena, připraven, otevřena, rozhodnuto, zvolen, uzavřena, uvedeno, založenaEMPTY
(26488): být, mít, získat, stát, hrát, říci, platit, muset, dělat, dostat
Paradigm říci | Act | Pass |
---|---|---|
Animacy=Anim|Aspect=Perf|Gender=Masc|Negative=Neg|Number=Plur|Tense=Past|VerbForm=Part | neřekli | |
Animacy=Anim|Aspect=Perf|Gender=Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part | řekli | |
Animacy=Inan|Aspect=Perf|Gender=Fem,Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part | řekly | |
Aspect=Imp|Negative=Pos|Number=Plur|Tense=Pres|VerbForm=Trans | řkouce | |
Aspect=Perf|Gender=Masc|Negative=Neg|Number=Sing|Tense=Past|VerbForm=Part | neřekl | |
Aspect=Perf|Gender=Masc|Negative=Pos|Number=Sing|Tense=Past|VerbForm=Part | řekl | |
Aspect=Perf|Gender=Fem,Neut|Negative=Neg|Number=Plur,Sing|Tense=Past|VerbForm=Part | neřekla | |
Aspect=Perf|Gender=Fem,Neut|Negative=Pos|Number=Plur,Sing|Tense=Past|VerbForm=Part | řekla | |
Aspect=Perf|Gender=Neut|Negative=Neg|Number=Sing|Tense=Past|VerbForm=Part | neřeklo | |
Aspect=Perf|Gender=Neut|Negative=Pos|Number=Sing|Tense=Past|VerbForm=Part | řeklo | |
Aspect=Perf|Gender=Neut|Negative=Pos|Number=Sing|VerbForm=Part | řečeno | |
Aspect=Perf|Mood=Ind|Negative=Neg|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin | neřeknu | |
Aspect=Perf|Mood=Ind|Negative=Neg|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | neřekne | |
Aspect=Perf|Mood=Ind|Negative=Neg|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | neřeknou | |
Aspect=Perf|Mood=Ind|Negative=Pos|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin | řeknu | |
Aspect=Perf|Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | řekne | |
Aspect=Perf|Mood=Ind|Negative=Pos|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin | řekneme | |
Aspect=Perf|Mood=Ind|Negative=Pos|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin | řeknete | |
Aspect=Perf|Mood=Ind|Negative=Pos|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | řeknou |
AUX
11146 cs-pos/AUX tokens (54% of all AUX
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which AUX
and Voice
co-occurred: Negative=Pos (10350; 93%), Gender=EMPTY (7894; 71%), VerbForm=Fin (7893; 71%), Mood=Ind (7893; 71%), Number=Sing (6212; 56%).
AUX
tokens may have the following values of Voice
:
Act
(11146; 100% of non-emptyVoice
): bude, jsem, jsme, byl, budou, byla, je, bylo, byly, jsouEMPTY
(9649): by, být, bychom, bych, byste, budiž, bys, býti
ADJ
4498 cs-pos/ADJ tokens (2% of all ADJ
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which ADJ
and Voice
co-occurred: Degree=EMPTY (4498; 100%), Negative=Pos (4367; 97%), Number=Sing (2627; 58%), Gender=Masc (2336; 52%).
ADJ
tokens may have the following values of Voice
:
Act
(4498; 100% of non-emptyVoice
): rozhodující, vedoucí, následující, vynikající, týkající, odpovídající, rostoucí, žijící, kupující, následujícíchEMPTY
(176313): první, další, české, nové, druhé, poslední, státní, dalších, možné, vlastní
Voice
seems to be lexical feature of ADJ
. 100% lemmas (940) occur only with one value of Voice
.
Relations with Agreement in Voice
The 10 most frequent relations where parent and child node agree in Voice
:
VERB –[conj]–> VERB (13283; 87%),
VERB –[ccomp]–> VERB (6279; 69%),
VERB –[advcl]–> VERB (5035; 74%),
VERB –[parataxis]–> VERB (822; 75%),
VERB –[appos]–> VERB (167; 88%),
VERB –[advmod]–> VERB (20; 87%),
VERB –[acl]–> VERB (16; 89%),
AUX –[conj]–> AUX (6; 100%),
VERB –[mark]–> VERB (1; 100%).
Treebank Statistics (UD_Czech-CAC)
This feature is universal.
It occurs with 2 different values: Act
, Pass
.
50464 tokens (10%) have a non-empty value of Voice
.
12504 types (20%) occur at least once with a non-empty value of Voice
.
4074 lemmas (14%) occur at least once with a non-empty value of Voice
.
The feature is used with 3 part-of-speech tags: cs-pos/VERB (44479; 9% instances), cs-pos/AUX (3842; 1% instances), cs-pos/ADJ (2143; 0% instances).
VERB
44479 cs-pos/VERB tokens (84% of all VERB
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which VERB
and Voice
co-occurred: Negative=Pos (40906; 92%), Gender=EMPTY (28513; 64%), Mood=Ind (28504; 64%), VerbForm=Fin (28504; 64%), Tense=Pres (28174; 63%), Number=Sing (26436; 59%), Person=3 (24928; 56%), Aspect=EMPTY (22594; 51%).
VERB
tokens may have the following values of Voice
:
Act
(40213; 90% of non-emptyVoice
): je, jsou, má, není, mají, musí, může, bylo, byl, jdePass
(4266; 10% of non-emptyVoice
): řečeno, dosaženo, věnována, dána, provedena, uvedeny, určena, určeny, splněny, zahájenaEMPTY
(8464): být, mít, zajistit, říci, vidět, dělat, řešit, věnovat, použít, provádět
Paradigm dát | Act | Pass |
---|---|---|
Animacy=Anim|Gender=Masc|Negative=Neg|Number=Plur|Tense=Past|VerbForm=Part | nedali | |
Animacy=Anim|Gender=Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part | dali | |
Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Number=Plur|Tense=Past|VerbForm=Part | daly | |
Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Number=Plur|VerbForm=Part | dány | |
Gender=Masc|Negative=Neg|Number=Sing|Tense=Past|VerbForm=Part | nedal | |
Gender=Masc|Negative=Pos|Number=Sing|Tense=Past|VerbForm=Part | dal | |
Gender=Masc|Negative=Pos|Number=Sing|VerbForm=Part | dán | |
Gender=Fem,Neut|Negative=Neg|Number=Plur,Sing|Tense=Past|VerbForm=Part | nedala | |
Gender=Fem,Neut|Negative=Pos|Number=Plur,Sing|Tense=Past|VerbForm=Part | dala | |
Gender=Fem,Neut|Negative=Pos|Number=Plur,Sing|VerbForm=Part | dána | |
Gender=Neut|Negative=Neg|Number=Sing|Tense=Past|VerbForm=Part | nedalo | |
Gender=Neut|Negative=Pos|Number=Sing|Tense=Past|VerbForm=Part | dalo | |
Gender=Neut|Negative=Pos|Number=Sing|VerbForm=Part | dáno | |
Mood=Ind|Negative=Neg|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin | nedám | |
Mood=Ind|Negative=Neg|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | nedá | |
Mood=Ind|Negative=Neg|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin | nedáme | |
Mood=Ind|Negative=Neg|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | nedají | |
Mood=Ind|Negative=Pos|Number=Sing|Person=1|Tense=Pres|VerbForm=Fin | dám | |
Mood=Ind|Negative=Pos|Number=Sing|Person=2|Tense=Pres|VerbForm=Fin | dáš | |
Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | dá | |
Mood=Ind|Negative=Pos|Number=Plur|Person=1|Tense=Pres|VerbForm=Fin | dáme | |
Mood=Ind|Negative=Pos|Number=Plur|Person=2|Tense=Pres|VerbForm=Fin | dáte | |
Mood=Ind|Negative=Pos|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | dají |
AUX
3842 cs-pos/AUX tokens (62% of all AUX
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which AUX
and Voice
co-occurred: Negative=Pos (3647; 95%), Gender=EMPTY (2529; 66%), VerbForm=Fin (2528; 66%), Mood=Ind (2528; 66%), Number=Sing (2018; 53%).
AUX
tokens may have the following values of Voice
:
Act
(3842; 100% of non-emptyVoice
): je, jsme, bude, jsem, bylo, byla, byl, byly, jsou, budouEMPTY
(2315): by, být, bychom, bych, byste, býti, budiž, bys
ADJ
2143 cs-pos/ADJ tokens (3% of all ADJ
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which ADJ
and Voice
co-occurred: Degree=EMPTY (2143; 100%), Negative=Pos (2112; 99%), Number=Sing (1119; 52%), Gender=Masc (1095; 51%).
ADJ
tokens may have the following values of Voice
:
Act
(2143; 100% of non-emptyVoice
): pracujících, rozhodující, pracující, vedoucí, odpovídající, následující, řídící, týkající, vyplývající, rostoucíEMPTY
(68385): další, pracovní, první, jednotlivých, základní, nové, možno, socialistické, různých, každý
Voice
seems to be lexical feature of ADJ
. 100% lemmas (539) occur only with one value of Voice
.
Relations with Agreement in Voice
The 10 most frequent relations where parent and child node agree in Voice
:
VERB –[conj]–> VERB (5068; 87%),
VERB –[advcl]–> VERB (1572; 72%),
VERB –[ccomp]–> VERB (863; 53%),
VERB –[parataxis]–> VERB (221; 67%),
VERB –[appos]–> VERB (27; 87%),
AUX –[conj]–> AUX (9; 100%).
Treebank Statistics (UD_Czech-CLTT)
This feature is universal.
It occurs with 2 different values: Act
, Pass
.
2587 tokens (7%) have a non-empty value of Voice
.
642 types (14%) occur at least once with a non-empty value of Voice
.
339 lemmas (13%) occur at least once with a non-empty value of Voice
.
The feature is used with 3 part-of-speech tags: cs-pos/VERB (2190; 6% instances), cs-pos/ADJ (288; 1% instances), cs-pos/AUX (109; 0% instances).
VERB
2190 cs-pos/VERB tokens (87% of all VERB
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which VERB
and Voice
co-occurred: Negative=Pos (1937; 88%), VerbForm=Fin (1806; 82%), Mood=Ind (1806; 82%), Person=3 (1806; 82%), Gender=EMPTY (1806; 82%), Tense=Pres (1803; 82%), Number=Sing (1335; 61%).
VERB
tokens may have the following values of Voice
:
Act
(1930; 88% of non-emptyVoice
): je, jsou, obsahuje, rozumí, může, uvede, mohou, není, nejsou, použijíPass
(260; 12% of non-emptyVoice
): stanoveno, sestavena, zahrnuty, obchodovány, uvedeny, zavedena, oprávněn, uvedena, vykázány, účtoványEMPTY
(327): vést, použít, být, mít, účtovat, odpisovat, uvést, sestavit, zajistit, provést
Paradigm použít | Act | Pass |
---|---|---|
Animacy=Inan|Gender=Fem,Masc|Negative=Pos|Number=Plur|VerbForm=Part | použity | |
Gender=Fem,Neut|Negative=Pos|Number=Plur,Sing|Tense=Past|VerbForm=Part | použila | |
Gender=Fem,Neut|Negative=Pos|Number=Plur,Sing|VerbForm=Part | použita | |
Gender=Neut|Negative=Pos|Number=Sing|VerbForm=Part | použito | |
Mood=Ind|Negative=Neg|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | nepoužije | |
Mood=Ind|Negative=Neg|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | nepoužijí | |
Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin | použije | |
Mood=Ind|Negative=Pos|Number=Plur|Person=3|Tense=Pres|VerbForm=Fin | použijí |
ADJ
288 cs-pos/ADJ tokens (4% of all ADJ
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which ADJ
and Voice
co-occurred: Degree=EMPTY (288; 100%), Negative=Pos (286; 99%), Number=Sing (177; 61%), Animacy=EMPTY (172; 60%).
ADJ
tokens may have the following values of Voice
:
Act
(288; 100% of non-emptyVoice
): konsolidující, zanikající, následujícího, související, předcházejícímu, týkající, přejímající, předcházející, souvisejících, řídícíchEMPTY
(6251): účetní, účetních, účetního, konsolidované, finanční, účetním, povinny, výroční, právní, jiných
Voice
seems to be lexical feature of ADJ
. 100% lemmas (50) occur only with one value of Voice
.
AUX
109 cs-pos/AUX tokens (64% of all AUX
tokens) have a non-empty value of Voice
.
The most frequent other feature values with which AUX
and Voice
co-occurred: Negative=Pos (86; 79%), Animacy=EMPTY (84; 77%), Gender=EMPTY (58; 53%), VerbForm=Fin (58; 53%), Person=3 (58; 53%), Mood=Ind (58; 53%), Number=Plur (57; 52%).
AUX
tokens may have the following values of Voice
:
Act
(109; 100% of non-emptyVoice
): byly, je, nejsou, byl, jsou, bude, budou, nebyly, byla, byloEMPTY
(61): být, by
Relations with Agreement in Voice
The 10 most frequent relations where parent and child node agree in Voice
:
VERB –[conj]–> VERB (203; 88%),
VERB –[advcl]–> VERB (67; 65%),
VERB –[parataxis]–> VERB (28; 90%),
VERB –[csubjpass]–> VERB (1; 100%),
VERB –[appos]–> VERB (1; 100%).
Voice in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]