
This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-IcePaHC: POS Tags: ADV

There are 2003 ADV lemmas (5%), 2257 ADV types (3%) and 79020 ADV tokens (8%). Out of 16 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent ADV lemmas: ekki, þá, svo, þar, nú, vel, þó, hér, síðan, og

The 10 most frequent ADV types: þá, svo, þar, ekki, nú, eigi, þó, hér, síðan, og

The 10 most frequent ambiguous lemmas: ekki (ADV 8594, NOUN 8, DET 5, VERB 1), þá (ADV 8465, ADP 6, DET 2, NOUN 2, PRON 1), svo (ADV 6597, ADP 1446, PROPN 6, ADJ 4, NOUN 2, X 1), þar (ADV 5752, PRON 1), nú (ADV 4719, INTJ 34, NOUN 1), vel (ADV 1874, ADJ 3, X 2), þó (ADV 1538, SCONJ 712, ADP 7, INTJ 1), hér (ADV 1502, ADP 1, NOUN 1), síðan (ADV 1492, ADP 67, PROPN 1), og (CCONJ 44084, ADV 1457, ADP 1291, SCONJ 17, INTJ 3, X 1)

The 10 most frequent ambiguous types: þá (ADV 6747, PRON 1043, DET 876, VERB 13, ADP 4, NOUN 3), svo (ADV 5966, ADP 1418, ADJ 4, PROPN 3, NOUN 2, X 1), þar (ADV 5006, PRON 1), ekki (ADV 4811, DET 151, NOUN 1, PRON 1), nú (ADV 3637, INTJ 5), eigi (ADV 3148, VERB 49, ADJ 1), þó (ADV 1454, SCONJ 663, ADP 6, VERB 6, INTJ 1), hér (ADV 1280, ADP 1, NOUN 1), síðan (ADV 930, ADP 61), og (CCONJ 41028, ADV 1456, ADP 1291, SCONJ 17, X 1)

Morphology

The form / lemma ratio of ADV is 1.126810 (the average of all parts of speech is 1.842490).
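The reported ratio is consistent with dividing the number of ADV types by the number of ADV lemmas given at the top of this page. The following is a minimal sketch of that arithmetic; the function name `form_lemma_ratio` is hypothetical, and the reading of "forms" as distinct types is an assumption inferred from the numbers, not a documented definition.

```python
# Counts reported on this page for ADV in UD_Icelandic-IcePaHC.
ADV_TYPES = 2257   # distinct word forms tagged ADV
ADV_LEMMAS = 2003  # distinct lemmas tagged ADV


def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct forms per lemma (assumed definition)."""
    if n_lemmas <= 0:
        raise ValueError("n_lemmas must be positive")
    return n_types / n_lemmas


ratio = form_lemma_ratio(ADV_TYPES, ADV_LEMMAS)
print(f"{ratio:.6f}")  # prints 1.126810
```

A ratio close to 1 means that most ADV lemmas occur in a single form, as expected for a largely uninflected class.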

The 1st highest number of forms (17) was observed with the lemma “margur”: fleira, fleiri, fleirum, flest, flesta, flestir, flestum, marga, margan, margar, margir, margra, margt, mart, mörg, mörgu, mörgum.

The 2nd highest number of forms (16) was observed with the lemma “mikill”: meir, meira, meiri, mest, mestan, mestur, mikil, mikill, mikilli, mikillrar, mikinn, mikið, mikla, miklar, miklu, miklum.

The 3rd highest number of forms (15) was observed with the lemma “enginn”: Engan, Engra, ekkert, ekki, enga, engi, engin, enginn, engir, engu, engva, öngum, öngva, öngvan, öngvir.

ADV occurs with 13 features: Degree (5760; 7% instances), Number (4370; 6% instances), Case (4032; 5% instances), Gender (4014; 5% instances), Definite (2958; 4% instances), PronType (866; 1% instances), VerbForm (454; 1% instances), Voice (454; 1% instances), Person (342; 0% instances), Mood (339; 0% instances), Tense (339; 0% instances), Foreign (107; 0% instances), NumType (87; 0% instances)

ADV occurs with 34 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, NumType=Card, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, Tense=Past, Tense=Pres, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbForm=Sup, Voice=Act, Voice=Mid

ADV occurs with 251 feature combinations. The most frequent feature combination is _ (70341 tokens). Examples: þá, svo, þar, ekki, nú, eigi, þó, hér, síðan, og

Relations

ADV nodes are attached to their parents using 19 different relations: advmod (70056; 89% instances), obl (3859; 5% instances), amod (2541; 3% instances), conj (736; 1% instances), root (508; 1% instances), advcl (403; 1% instances), ccomp (343; 0% instances), acl:relcl (190; 0% instances), obj (121; 0% instances), acl (119; 0% instances), xcomp (63; 0% instances), parataxis (29; 0% instances), dep (24; 0% instances), nsubj (17; 0% instances), appos (3; 0% instances), fixed (3; 0% instances), iobj (3; 0% instances), mark (1; 0% instances), nmod:poss (1; 0% instances)
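The percentages above appear to be each relation's count divided by the total number of ADV tokens (79020), rounded to a whole percent. A short sketch of that calculation follows, using a few of the counts listed above; the helper name `percent_share` is hypothetical, and the rounding rule is an assumption that happens to reproduce the listed figures.

```python
# Total ADV tokens and a few relation counts, copied from this page.
TOTAL_ADV_TOKENS = 79020
relation_counts = {
    "advmod": 70056,
    "obl": 3859,
    "root": 508,
    "ccomp": 343,
}


def percent_share(count: int, total: int) -> int:
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)


shares = {rel: percent_share(n, TOTAL_ADV_TOKENS)
          for rel, n in relation_counts.items()}
for rel, pct in shares.items():
    print(f"{rel}: {pct}%")
```

This also explains entries shown as "0% instances": their share is below 0.5% and rounds down to zero.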

Parents of ADV nodes belong to 16 different parts of speech: VERB (58314; 74% instances), NOUN (5998; 8% instances), ADJ (4952; 6% instances), ADV (3423; 4% instances), PRON (1911; 2% instances), AUX (1196; 2% instances), DET (1103; 1% instances), PROPN (1066; 1% instances), no part of speech, i.e. the artificial root node (508; 1% instances), CCONJ (175; 0% instances), ADP (162; 0% instances), PART (85; 0% instances), NUM (82; 0% instances), X (28; 0% instances), SCONJ (9; 0% instances), INTJ (8; 0% instances)

60351 (76%) ADV nodes are leaves.

13669 (17%) ADV nodes have one child.

3141 (4%) ADV nodes have two children.

1859 (2%) ADV nodes have three or more children.

The highest child degree of an ADV node is 14.
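Child-degree figures like the ones above can be recomputed from CoNLL-U data by counting, for each ADV token, how many tokens name it as their head. The sketch below is one way to do this and is not the script that generated this page. The function name `child_degree_histogram` is hypothetical, and the toy sentence with its annotation is made up for illustration and is not taken from the treebank.

```python
from collections import Counter


def child_degree_histogram(conllu_text: str, upos: str = "ADV") -> Counter:
    """Map number of children -> number of nodes with the given UPOS."""
    histogram = Counter()
    for block in conllu_text.strip().split("\n\n"):
        tags = {}             # token id -> UPOS
        children = Counter()  # head id -> number of dependents
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            # Skip multiword-token ranges (1-2) and empty nodes (1.1).
            if not cols[0].isdigit():
                continue
            tags[cols[0]] = cols[3]   # column 4: UPOS
            children[cols[6]] += 1    # column 7: HEAD
        for tok_id, tag in tags.items():
            if tag == upos:
                histogram[children[tok_id]] += 1
    return histogram


# Toy sentence (illustrative annotation only): id, form, lemma, upos, head, deprel.
rows = [
    ("1", "hann", "hann", "PRON", "2", "nsubj"),
    ("2", "kom", "koma", "VERB", "0", "root"),
    ("3", "ekki", "ekki", "ADV", "2", "advmod"),
    ("4", "þar", "þar", "ADV", "2", "advmod"),
    ("5", "inn", "inn", "ADV", "4", "advmod"),
]
sample = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
    for i, form, lemma, tag, head, rel in rows
)

hist = child_degree_histogram(sample)
print(dict(hist))   # leaves are counted under key 0
print(max(hist))    # highest child degree among ADV nodes
```

Applied to the full treebank files, the histogram entry for 0 would correspond to the leaf count, and the largest key to the highest child degree.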

Children of ADV nodes are attached using 31 different relations: punct (6298; 23% instances), advcl (3990; 14% instances), obl (3286; 12% instances), advmod (3144; 11% instances), case (2737; 10% instances), cop (1289; 5% instances), cc (1280; 5% instances), mark (1077; 4% instances), nsubj (927; 3% instances), amod (582; 2% instances), fixed (532; 2% instances), conj (515; 2% instances), acl:relcl (380; 1% instances), xcomp (286; 1% instances), aux (235; 1% instances), ccomp (211; 1% instances), compound:prt (205; 1% instances), acl (181; 1% instances), obj (161; 1% instances), nmod:poss (76; 0% instances), det (53; 0% instances), discourse (48; 0% instances), expl (45; 0% instances), vocative (36; 0% instances), dep (27; 0% instances), appos (22; 0% instances), nummod (17; 0% instances), iobj (7; 0% instances), nmod (7; 0% instances), flat:foreign (2; 0% instances), parataxis (2; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: PUNCT (6298; 23% instances), VERB (4372; 16% instances), ADV (3423; 12% instances), ADP (3037; 11% instances), NOUN (2440; 9% instances), AUX (1676; 6% instances), SCONJ (1584; 6% instances), PRON (1488; 5% instances), CCONJ (1322; 5% instances), DET (780; 3% instances), ADJ (713; 3% instances), PROPN (361; 1% instances), NUM (53; 0% instances), PART (51; 0% instances), INTJ (47; 0% instances), X (13; 0% instances)