home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-IcePaHC: POS Tags: AUX

There are 11 AUX lemmas (0%), 260 AUX types (0%) and 51252 AUX tokens (5%). Out of 16 observed tags, the rank of AUX is: 16 in number of lemmas, 10 in number of types and 8 in number of tokens.

The 10 most frequent AUX lemmas: vera, hafa, munu, skulu, vilja, mega, geta, verða, fá, kunna

The 10 most frequent AUX types: var, er, voru, hafði, vera, væri, hafa, eru, mun, verið

The 10 most frequent ambiguous lemmas: vera (AUX 28989, VERB 411, NOUN 71, SCONJ 17, PRON 2, ADJ 1, ADV 1), hafa (AUX 9132, VERB 1650, ADV 3, NOUN 1), munu (AUX 3288, VERB 83), skulu (AUX 2937, VERB 96), vilja (AUX 2556, VERB 402, NOUN 14, ADV 5), mega (AUX 1898, VERB 83, ADV 8, NOUN 1), geta (AUX 1182, VERB 314, NOUN 11, ADV 1), verða (VERB 3169, AUX 1069, ADJ 2, NOUN 2, ADV 1), (VERB 1542, AUX 183, ADV 3, ADJ 2, ADP 1), kunna (VERB 703, AUX 14, ADV 9, ADJ 1)

The 10 most frequent ambiguous types: var (AUX 9792, ADJ 39, VERB 9, ADV 1), er (SCONJ 8106, AUX 7226, VERB 110, NOUN 2, X 2, ADP 1, ADV 1), voru (AUX 2148, PRON 78, VERB 2), hafði (AUX 2242, VERB 636, ADV 1), vera (AUX 1859, NOUN 7, VERB 2), væri (AUX 1661, VERB 1), hafa (AUX 1662, VERB 210), eru (AUX 1434, VERB 17, SCONJ 1), mun (AUX 1070, VERB 34, NOUN 32, ADV 1), verið (AUX 1106, VERB 64)

Morphology

The form / lemma ratio of AUX is 23.636364 (the average of all parts of speech is 1.842490).

The 1st highest number of forms (53) was observed with the lemma “vera”: Váru, em, emk, en, er, ert, eru, erum, eruð, es, ru, sjá, sják, sé, séim, séið, sém, sénu, sér, sért, sérð, séu, séum, séuð, séð, var, varr, varst, vart, veit, ver, vera, verandi, veri, verir, verið, vert, verum, voru, vorum, vorust, voruð, várust, væra, væri, værim, værir, værið, væru, værum, væruð, vóru, vórum.

The 2nd highest number of forms (42) was observed with the lemma “hafa”: haf, hafa, hafandi, hafast, hafi, hafim, hafinn, hafir, hafist, hafið, hafiður, hafst, haft, hafð, hafða, hafðar, hafði, hafðir, hafðist, hafður, hatði, hef, hefi, hefir, hefoi, heft, hefur, hefða, hefði, hefðim, hefðir, hefðu, hefðum, hefðust, hefðuð, höfu, höfum, höfð, höfðu, höfðum, höfðust, höfðuð.

The 3rd highest number of forms (31) was observed with the lemma “munu”: man, mun, muna, munda, mundi, mundir, mundu, mundum, munduð, muni, munir, munið, munt, muntu, munu, munum, munuð, mynda, myndi, myndim, myndir, myndu, myndum, mynduð, myni, mynim, mynir, mynið, mǿndi, mȯn, mndi.

AUX occurs with 12 features: VerbForm (49772; 97% instances), Voice (49766; 97% instances), Number (45555; 89% instances), Mood (44624; 87% instances), Tense (44618; 87% instances), Person (44589; 87% instances), Case (966; 2% instances), Gender (966; 2% instances), Definite (823; 2% instances), Degree (114; 0% instances), Foreign (27; 0% instances), PronType (18; 0% instances)

AUX occurs with 33 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Mid

AUX occurs with 117 feature combinations. The most frequent feature combination is Mood=Ind|Number=Sing|Person=3|Tense=Past|VerbForm=Fin|Voice=Act (13389 tokens). Examples: var, hafði, vildi, mátti, gat, varð, mundi, fékk, mætti, skildi

Relations

AUX nodes are attached to their parents using 18 different relations: cop (28118; 55% instances), aux (19915; 39% instances), ccomp (1047; 2% instances), acl:relcl (745; 1% instances), advcl (687; 1% instances), acl (217; 0% instances), conj (213; 0% instances), obl (125; 0% instances), dep (73; 0% instances), xcomp (71; 0% instances), amod (21; 0% instances), obj (7; 0% instances), nsubj (5; 0% instances), parataxis (3; 0% instances), appos (2; 0% instances), iobj (1; 0% instances), nmod:poss (1; 0% instances), root (1; 0% instances)

Parents of AUX nodes belong to 16 different parts of speech: VERB (29192; 57% instances), ADJ (6273; 12% instances), NOUN (6189; 12% instances), PRON (3650; 7% instances), ADV (1676; 3% instances), AUX (1123; 2% instances), PROPN (1109; 2% instances), DET (975; 2% instances), PART (470; 1% instances), ADP (327; 1% instances), CCONJ (143; 0% instances), NUM (70; 0% instances), X (31; 0% instances), SCONJ (16; 0% instances), INTJ (7; 0% instances), (1; 0% instances)

45877 (90%) AUX nodes are leaves.

2439 (5%) AUX nodes have one child.

449 (1%) AUX nodes have two children.

2487 (5%) AUX nodes have three or more children.

The highest child degree of a AUX node is 19.

Children of AUX nodes are attached using 28 different relations: punct (2766; 20% instances), nsubj (1995; 14% instances), mark (1978; 14% instances), obl (1685; 12% instances), obj (1315; 9% instances), advmod (1100; 8% instances), aux (657; 5% instances), cop (398; 3% instances), cc (314; 2% instances), xcomp (303; 2% instances), ccomp (268; 2% instances), amod (262; 2% instances), conj (226; 2% instances), advcl (209; 1% instances), compound:prt (138; 1% instances), acl (135; 1% instances), case (112; 1% instances), acl:relcl (85; 1% instances), dep (78; 1% instances), vocative (36; 0% instances), iobj (31; 0% instances), appos (24; 0% instances), expl (10; 0% instances), parataxis (10; 0% instances), discourse (6; 0% instances), nmod:poss (4; 0% instances), nmod (3; 0% instances), flat:foreign (1; 0% instances)

Children of AUX nodes belong to 16 different parts of speech: PUNCT (2766; 20% instances), NOUN (2522; 18% instances), PRON (2027; 14% instances), SCONJ (1984; 14% instances), ADV (1196; 8% instances), AUX (1123; 8% instances), VERB (823; 6% instances), ADJ (380; 3% instances), PROPN (340; 2% instances), CCONJ (323; 2% instances), ADP (322; 2% instances), DET (296; 2% instances), NUM (16; 0% instances), X (13; 0% instances), PART (12; 0% instances), INTJ (6; 0% instances)