This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home hr/pos issue tracker

AUX: auxiliary verb

This document is a placeholder for the language-specific documentation for AUX.


Treebank Statistics (UD_Croatian)

There are 4 AUX lemmas (0%), 47 AUX types (0%) and 8989 AUX tokens (6%). Out of 15 observed tags, the rank of AUX is: 15 in number of lemmas, 10 in number of types and 7 in number of tokens.

The 10 most frequent AUX lemmas: biti, htjeti, moći, susretati

The 10 most frequent AUX types: je, su, će, bi, biti, nije, bilo, smo, bio, sam

The 10 most frequent ambiguous lemmas: biti (AUX 8017, ADV 5), htjeti (AUX 970, VERB 35), moći (VERB 407, AUX 1), susretati (VERB 6, AUX 1)

The 10 most frequent ambiguous types: je (AUX 4431, PRON 20), su (AUX 1356, ADP 1), biti (AUX 335, NOUN 1), bilo (AUX 143, PART 16, CONJ 3), sam (AUX 112, ADJ 18), bit (AUX 53, NOUN 2), si (PRON 14, AUX 3), budemo (AUX 1, VERB 1), budete (AUX 1, VERB 1), možeš (VERB 3, AUX 1)

Morphology

The form / lemma ratio of AUX is 11.750000 (the average of all parts of speech is 1.779790).

The 1st highest number of forms (34) was observed with the lemma “biti”: Jesmo, bi, bih, bijaše, bila, bile, bili, bilo, bio, bismo, biste, bit, biti, bude, budemo, budete, budite, budu, je, jesam, jest, jeste, jesu, nije, nisam, nisi, nismo, niste, nisu, sam, si, smo, ste, su.

The 2nd highest number of forms (11) was observed with the lemma “htjeti”: neće, nećemo, nećete, nećeš, neću, će, ćemo, ćete, ćeš, ću, češ.

The 3rd highest number of forms (1) was observed with the lemma “moći”: možeš.

AUX occurs with 7 features: Number (8597; 96% instances), Person (8104; 90% instances), Tense (8103; 90% instances), VerbForm (885; 10% instances), Gender (493; 5% instances), Negative (275; 3% instances), Mood (1; 0% instances)

AUX occurs with 15 feature-value pairs: Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Negative=Neg, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Tense=Imp, Tense=Past, Tense=Pres, VerbForm=Inf, VerbForm=Part

AUX occurs with 26 feature combinations. The most frequent feature combination is Number=Sing|Person=3|Tense=Pres (5248 tokens). Examples: je, će, nije, jest, bude, neće, jeste

Relations

AUX nodes are attached to their parents using 28 different relations: aux (5440; 61% instances), cop (2246; 25% instances), auxpass (826; 9% instances), xcomp (143; 2% instances), root (80; 1% instances), acl (39; 0% instances), conj (38; 0% instances), advcl (28; 0% instances), ccomp (26; 0% instances), discourse (20; 0% instances), case (15; 0% instances), dobj (13; 0% instances), csubj (11; 0% instances), compound (10; 0% instances), parataxis (9; 0% instances), det (8; 0% instances), mwe (8; 0% instances), remnant (7; 0% instances), amod (4; 0% instances), nsubj (4; 0% instances), neg (3; 0% instances), nmod (3; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), dep (1; 0% instances), expl (1; 0% instances), mark (1; 0% instances), nsubjpass (1; 0% instances)

Parents of AUX nodes belong to 12 different parts of speech: VERB (4982; 55% instances), ADJ (2157; 24% instances), NOUN (1337; 15% instances), ADV (144; 2% instances), PRON (112; 1% instances), AUX (99; 1% instances), ROOT (80; 1% instances), PROPN (58; 1% instances), NUM (15; 0% instances), ADP (3; 0% instances), PART (1; 0% instances), PUNCT (1; 0% instances)

8603 (96%) AUX nodes are leaves.

164 (2%) AUX nodes have one child.

43 (0%) AUX nodes have two children.

179 (2%) AUX nodes have three or more children.

The highest child degree of a AUX node is 13.

Children of AUX nodes are attached using 22 different relations: punct (199; 19% instances), xcomp (164; 15% instances), nsubj (133; 12% instances), nmod (127; 12% instances), mark (102; 10% instances), aux (83; 8% instances), advmod (54; 5% instances), conj (39; 4% instances), cc (35; 3% instances), dobj (27; 3% instances), parataxis (23; 2% instances), advcl (19; 2% instances), ccomp (16; 1% instances), discourse (15; 1% instances), neg (10; 1% instances), remnant (7; 1% instances), compound (5; 0% instances), iobj (5; 0% instances), case (4; 0% instances), nsubjpass (2; 0% instances), acl (1; 0% instances), csubj (1; 0% instances)

Children of AUX nodes belong to 13 different parts of speech: NOUN (252; 24% instances), PUNCT (201; 19% instances), ADJ (116; 11% instances), VERB (100; 9% instances), AUX (99; 9% instances), PRON (75; 7% instances), ADV (70; 7% instances), SCONJ (65; 6% instances), CONJ (40; 4% instances), PROPN (27; 3% instances), PART (16; 1% instances), ADP (7; 1% instances), NUM (3; 0% instances)


AUX in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]