home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Bulgarian-BTB: Features: Voice

This feature is universal. It occurs with 2 different values: Act, Pass.

22666 tokens (15%) have a non-empty value of Voice. 7792 types (30%) occur at least once with a non-empty value of Voice. 2932 lemmas (20%) occur at least once with a non-empty value of Voice. The feature is used with 3 part-of-speech tags: VERB (16580; 11% instances), AUX (4600; 3% instances), ADJ (1486; 1% instances).

VERB

16580 VERB tokens (99% of all VERB tokens) have a non-empty value of Voice.

The most frequent other feature values with which VERB and Voice co-occurred: Gender=EMPTY (14758; 89%), Definite=EMPTY (13916; 84%), Mood=Ind (13916; 84%), VerbForm=Fin (13916; 84%), Number=Sing (11553; 70%), Person=3 (11480; 69%), Tense=Pres (9528; 57%), Aspect=Imp (8576; 52%).

VERB tokens may have the following values of Voice:

Paradigm кажаActPass
Definite=Ind|Gender=Masc|Number=Sing|Tense=Past|VerbForm=Partказал
Definite=Ind|Gender=Fem|Number=Sing|Tense=Past|VerbForm=Partказала
Definite=Ind|Gender=Neut|Number=Sing|VerbForm=Partказано
Definite=Ind|Number=Plur|Tense=Past|VerbForm=Partказали
Mood=Ind|Number=Sing|Person=1|Tense=Past|VerbForm=Finказах
Mood=Ind|Number=Sing|Person=1|Tense=Pres|VerbForm=Finкажа
Mood=Ind|Number=Sing|Person=2|Tense=Pres|VerbForm=Finкажеш
Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Finказа
Mood=Ind|Number=Sing|Person=3|Tense=Pres|VerbForm=Finкаже
Mood=Ind|Number=Plur|Person=1|Tense=Past|VerbForm=Finказахме
Mood=Ind|Number=Plur|Person=1|Tense=Pres|VerbForm=Finкажем
Mood=Ind|Number=Plur|Person=2|Tense=Past|VerbForm=Finказахте
Mood=Ind|Number=Plur|Person=2|Tense=Pres|VerbForm=Finкажете
Mood=Ind|Number=Plur|Person=3|Tense=Past|VerbForm=Finказаха
Mood=Ind|Number=Plur|Person=3|Tense=Pres|VerbForm=Finкажат

AUX

4600 AUX tokens (50% of all AUX tokens) have a non-empty value of Voice.

The most frequent other feature values with which AUX and Voice co-occurred: Mood=Ind (4600; 100%), VerbForm=Fin (4354; 95%), Aspect=Imp (4214; 92%), Person=3 (3966; 86%), Tense=Pres (3679; 80%), Number=Sing (3307; 72%).

AUX tokens may have the following values of Voice:

ADJ

1486 ADJ tokens (11% of all ADJ tokens) have a non-empty value of Voice.

The most frequent other feature values with which ADJ and Voice co-occurred: VerbForm=Part (1486; 100%), Degree=Pos (1203; 81%), Aspect=Perf (1057; 71%), Number=Sing (852; 57%), Definite=Ind (816; 55%).

ADJ tokens may have the following values of Voice:

Paradigm следвамActPass
Definite=Def|Degree=Pos|Gender=Masc|Number=Sing|Tense=Presследващия, следващият
Definite=Def|Degree=Pos|Gender=Fem|Number=Sing|Tense=Presследващата
Definite=Def|Degree=Pos|Gender=Neut|Number=Sing|Tense=Presследващото
Definite=Def|Degree=Pos|Number=Plur|Tense=Presследващите
Definite=Ind|Degree=Pos|Gender=Masc|Number=Singследван
Definite=Ind|Gender=Fem|Number=Singследвана
Definite=Ind|Number=Plurследвани

Voice seems to be lexical feature of ADJ. 98% lemmas (661) occur only with one value of Voice.

Relations with Agreement in Voice

The 10 most frequent relations where parent and child node agree in Voice: VERB –[ccomp]–> VERB (1569; 90%), VERB –[conj]–> VERB (1460; 92%), VERB –[advcl]–> VERB (1136; 88%), VERB –[xcomp]–> VERB (628; 94%), VERB –[parataxis]–> VERB (299; 85%), VERB –[csubj]–> VERB (137; 90%), AUX –[ccomp]–> VERB (108; 97%), VERB –[csubj:pass]–> VERB (47; 70%), VERB –[ccomp]–> AUX (22; 100%), AUX –[advcl]–> VERB (19; 95%).