AUX: auxiliary verb
Definition
An auxiliary verb is a verb that accompanies the lexical verb of a verb phrase and expresses grammatical distinctions not carried by the lexical verb, such as person, number, tense, mood, aspect, and voice. In Slovenian, only instances of the verb biti “to be” that accompany lexical verbs are marked as AUX.
Examples
- Tistega večera sem.AUX preveč popil.VERB. “I drank too much that evening.”
- V bolnišnici bodo.AUX uvedli.VERB šolo za starše. “A parenting school will be introduced in the hospital.”
- Kam bi.AUX se lahko zatekla.VERB? “Where could she have hidden?”
Delimitation
Note that where biti is used independently, as a copula or a content verb, it is marked as VERB:
- To je.VERB grozno. “This is horrible.”
- Za nami je.VERB dolga vrsta. “There is a long queue behind us.”
- Vsi smo.AUX bili.VERB zadovoljni. “We were all content.”
Conversion from JOS
In ssj500k, all instances of the verb biti “to be” have been annotated as Type=auxiliary. To separate the actual auxiliary function from other functions, syntax has to be taken into account. Thus, tokens of biti bearing the dependency relation PPart with a main verb are annotated as AUX, while the remaining tokens of biti are annotated as VERB.
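The conversion rule can be sketched roughly as follows. This is a minimal illustration, not the actual conversion script: the function name and the token fields (`lemma`, `jos_deprel`, `head_is_main_verb`) are hypothetical, and `"other"` is a placeholder standing for any JOS relation other than PPart.

```python
from typing import Optional


def convert_biti_upos(token: dict) -> Optional[str]:
    """Return the UD tag for a token of biti, or None for any other lemma.

    The token layout is an assumption made for this sketch.
    """
    if token["lemma"] != "biti":
        return None
    # biti linked to a main verb by the JOS relation PPart is an auxiliary.
    if token["jos_deprel"] == "PPart" and token["head_is_main_verb"]:
        return "AUX"
    # Any other use of biti (copula or content verb) stays a VERB.
    return "VERB"


# "sem" in "Tistega večera sem preveč popil."
aux_token = {"lemma": "biti", "jos_deprel": "PPart", "head_is_main_verb": True}
# "je" in "To je grozno." ("other" is a placeholder relation label)
cop_token = {"lemma": "biti", "jos_deprel": "other", "head_is_main_verb": False}

print(convert_biti_upos(aux_token))  # AUX
print(convert_biti_upos(cop_token))  # VERB
```

The lemma check comes first so that the function can be applied to every token of a sentence without pre-filtering.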
Treebank Statistics (UD_Slovenian)
There is 1 AUX lemma (0%), 30 AUX types (0%) and 7147 AUX tokens (5%).
Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 12 in number of types and 6 in number of tokens.
The 10 most frequent AUX lemmas: biti
The 10 most frequent AUX types: je, so, bi, bo, sem, ni, bodo, sta, smo, niso
The 10 most frequent ambiguous lemmas: biti (AUX 7147, VERB 3788)
The 10 most frequent ambiguous types: je (AUX 3057, VERB 1508, PRON 14), so (AUX 1089, VERB 473), bi (AUX 879, X 1, VERB 1), bo (AUX 376, VERB 171), sem (AUX 358, VERB 35, ADV 8), ni (AUX 247, VERB 239), bodo (AUX 225, VERB 40), sta (AUX 165, VERB 36, X 4), smo (AUX 163, VERB 42), niso (AUX 84, VERB 41)
Morphology
The form / lemma ratio of AUX is 30.000000 (the average of all parts of speech is 1.894262).
The highest number of forms (30) was observed with the lemma “biti”: as, b, bi, bil, bila, bili, bo, bodo, bojo, bom, bomo, bosta, boste, bova, je, ni, nisem, nisi, nismo, niso, nista, niste, nisva, sem, si, smo, so, sta, ste, sva.
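A ratio of this kind can be reproduced from a list of tagged tokens. The sketch below assumes that the ratio is the number of distinct forms per lemma, summed and divided by the number of distinct lemmas for the tag, and that forms are compared case-insensitively. Both points are assumptions about how the statistic is computed, and the function name and the `(form, lemma, upos)` tuple layout are hypothetical.

```python
from collections import defaultdict


def form_lemma_ratio(tokens, upos):
    """Average number of distinct forms per lemma for one part of speech.

    tokens: iterable of (form, lemma, upos) tuples (assumed layout).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma, tag in tokens:
        if tag == upos:
            # Lowercasing is an assumption; the treebank tool may differ.
            forms_by_lemma[lemma].add(form.lower())
    if not forms_by_lemma:
        return 0.0
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma)


sample = [
    ("je", "biti", "AUX"),
    ("so", "biti", "AUX"),
    ("Je", "biti", "AUX"),   # same form as "je" after lowercasing
    ("je", "biti", "VERB"),  # ignored when counting AUX
]
print(form_lemma_ratio(sample, "AUX"))  # 2.0
```

With a single lemma, as for AUX in this treebank, the ratio equals the number of distinct forms of that lemma.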
AUX occurs with 7 features: sl-feat/VerbForm (7147; 100% instances), sl-feat/Mood (7126; 100% instances), sl-feat/Number (6261; 88% instances), sl-feat/Negative (6240; 87% instances), sl-feat/Person (6240; 87% instances), sl-feat/Tense (6240; 87% instances), sl-feat/Gender (21; 0% instances)
AUX occurs with 16 feature-value pairs: Gender=Fem, Gender=Masc, Mood=Cnd, Mood=Ind, Negative=Neg, Negative=Pos, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Pres, VerbForm=Fin, VerbForm=Part
AUX occurs with 29 feature combinations.
The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (3099 tokens).
Examples: je
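Feature combinations are written as `Name=Value` pairs joined by `|`. A small sketch of how such a string could be split into a dictionary is given below. The function name `parse_feats` is hypothetical, and treating `_` as "no features" follows the CoNLL-U convention, which is assumed here rather than stated in this page.

```python
def parse_feats(feats: str) -> dict:
    """Split a 'Name=Value|Name=Value' string into a dictionary."""
    # "_" marks an empty feature set in CoNLL-U (assumed convention).
    if feats in ("", "_"):
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


most_frequent = "Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin"
feats = parse_feats(most_frequent)
print(feats["Person"], feats["Tense"])  # 3 Pres
```

Values stay strings, so `Person=3` yields `"3"` rather than the integer 3.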
Relations
AUX nodes are attached to their parents using 1 relation: sl-dep/aux (7147; 100% instances)
Parents of AUX nodes belong to 6 different parts of speech: VERB (6429; 90% instances), ADJ (532; 7% instances), NOUN (166; 2% instances), PRON (13; 0% instances), PROPN (6; 0% instances), NUM (1; 0% instances)
7146 (100%) AUX nodes are leaves.
0 (0%) AUX nodes have one child.
1 (0%) AUX nodes have two children.
The highest child degree of an AUX node is 2.
Children of AUX nodes are attached using 2 different relations: sl-dep/cc (1; 50% instances), sl-dep/conj (1; 50% instances)
Children of AUX nodes belong to 2 different parts of speech: CONJ (1; 50% instances), VERB (1; 50% instances)
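The leaf and child-degree counts can be derived from head indices alone. The following sketch assumes a sentence is given as `(id, head, upos)` triples with `head == 0` for the root; the function name and the example tree for "Tistega večera sem preveč popil." are illustrative assumptions, not data taken from the treebank.

```python
from collections import Counter


def child_degree_histogram(sentence, upos="AUX"):
    """Map number-of-children -> count of nodes with the given tag.

    sentence: list of (id, head, upos) triples (assumed layout).
    """
    # How many tokens point at each head id.
    n_children = Counter(head for _, head, _ in sentence)
    # A Counter returns 0 for ids that never occur as a head, i.e. leaves.
    return Counter(n_children[tid] for tid, _, tag in sentence if tag == upos)


# Illustrative analysis: every other word depends on "popil" or "večera".
sentence = [
    (1, 2, "DET"),   # Tistega
    (2, 5, "NOUN"),  # večera
    (3, 5, "AUX"),   # sem
    (4, 5, "ADV"),   # preveč
    (5, 0, "VERB"),  # popil
]
print(child_degree_histogram(sentence, "AUX"))   # Counter({0: 1})
print(child_degree_histogram(sentence, "VERB"))  # Counter({3: 1})
```

Summing such histograms over all sentences would give counts of the kind listed above, with key 0 corresponding to leaves.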
Treebank Statistics (UD_Slovenian-SST)
There is 1 AUX lemma (0%), 26 AUX types (0%) and 1256 AUX tokens (4%).
Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 13 in number of types and 11 in number of tokens.
The 10 most frequent AUX lemmas: biti
The 10 most frequent AUX types: je, sem, bi, so, smo, bo, bomo, ni, si, ste
The 10 most frequent ambiguous lemmas: biti (VERB 1348, AUX 1256)
The 10 most frequent ambiguous types: je (VERB 639, AUX 358, PRON 6, INTJ 3), sem (AUX 174, VERB 33, ADV 4), bi (AUX 134, VERB 15, X 1), so (VERB 125, AUX 117, X 2), smo (AUX 88, VERB 20), bo (AUX 79, VERB 54), bomo (AUX 46, VERB 5), ni (VERB 86, AUX 43, X 1), si (PRON 48, AUX 39, VERB 32, X 1), ste (AUX 32, VERB 7)
- ni
  - VERB 86: aha … ja ja … aha ni kaj ja ja
  - AUX 43: od maja meseca pa dokler ni bila slana smo pasli [:voice]
  - X 1: in seveda vlekel sem vprašanja noter in sem gledal samo da izvlečem da popravljam jaz da je tekst pa da jaz popravim kje je treba vejica kje je treba dvopičje ne pa da da jaz delam kaj je eee osebek povedek te pa sem ni mi nekako nisem ni [gap] nisem tu jaz bil cel ne
Morphology
The form / lemma ratio of AUX is 26.000000 (the average of all parts of speech is 1.575031).
The highest number of forms (26) was observed with the lemma “biti”: bi, biti, bo, bodo, bojo, bom, bomo, bosta, boste, bova, boš, je, ni, nisem, nisi, nismo, niso, niste, nisva, sem, si, smo, so, sta, ste, sva.
AUX occurs with 6 features: sl-feat/Mood (1256; 100% instances), sl-feat/VerbForm (1256; 100% instances), sl-feat/Negative (1122; 89% instances), sl-feat/Number (1122; 89% instances), sl-feat/Person (1122; 89% instances), sl-feat/Tense (1122; 89% instances)
AUX occurs with 13 feature-value pairs: Mood=Cnd, Mood=Ind, Negative=Neg, Negative=Pos, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Fut, Tense=Pres, VerbForm=Fin
AUX occurs with 24 feature combinations.
The most frequent feature combination is Mood=Ind|Negative=Pos|Number=Sing|Person=3|Tense=Pres|VerbForm=Fin (359 tokens).
Examples: je, biti
Relations
AUX nodes are attached to their parents using 2 different relations: sl-dep/aux (1217; 97% instances), sl-dep/reparandum (39; 3% instances)
Parents of AUX nodes belong to 9 different parts of speech: VERB (1116; 89% instances), ADJ (72; 6% instances), NOUN (47; 4% instances), X (10; 1% instances), PROPN (4; 0% instances), PRON (3; 0% instances), ADV (2; 0% instances), AUX (1; 0% instances), NUM (1; 0% instances)
1215 (97%) AUX nodes are leaves.
32 (3%) AUX nodes have one child.
4 (0%) AUX nodes have two children.
5 (0%) AUX nodes have three or more children.
The highest child degree of an AUX node is 4.
Children of AUX nodes are attached using 10 different relations: sl-dep/advmod (16; 29% instances), sl-dep/mark (10; 18% instances), sl-dep/reparandum (8; 14% instances), sl-dep/nsubj (7; 13% instances), sl-dep/dobj (6; 11% instances), sl-dep/expl (4; 7% instances), sl-dep/iobj (2; 4% instances), sl-dep/advcl (1; 2% instances), sl-dep/dislocated (1; 2% instances), sl-dep/neg (1; 2% instances)
Children of AUX nodes belong to 10 different parts of speech: PRON (15; 27% instances), ADV (11; 20% instances), SCONJ (10; 18% instances), X (6; 11% instances), CONJ (5; 9% instances), NOUN (4; 7% instances), NUM (2; 4% instances), AUX (1; 2% instances), PART (1; 2% instances), VERB (1; 2% instances)