home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Finnish-FTB: POS Tags: ADV

There are 1370 ADV lemmas (6%), 1699 ADV types (4%) and 10204 ADV tokens (6%). Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 6 in number of tokens.

The 10 most frequent ADV lemmas: nyt, sitten, aina, siellä, paljon, täällä, hyvin, miten, vähän, niin

The 10 most frequent ADV types: nyt, aina, paljon, siellä, täällä, hyvin, miten, niin, vähän, sitten

The 10 most frequent ambiguous lemmas: nyt (ADV 479, PART 125), sitten (ADV 260, PART 133, ADP 58), aina (ADV 204, PART 2), niin (PART 635, ADV 154), siinä (ADV 90, PART 5), taas (ADV 83, PART 32), yli (ADV 75, ADP 29), ennen (ADP 81, ADV 73), mukaan (ADP 147, ADV 73), näin (ADV 63, PART 28)

The 10 most frequent ambiguous types: nyt (ADV 338, PART 118), aina (ADV 184, PART 2), niin (PART 454, ADV 137), sitten (ADV 99, PART 98, ADP 57), sit (ADV 79, PRON 1), tässä (ADV 62, DET 47, PRON 1), taas (ADV 76, PART 32), siinä (PRON 71, ADV 62, DET 23, PART 4), yli (ADV 70, ADP 29), miksi (ADV 32, PRON 2)

Morphology

The form / lemma ratio of ADV is 1.240146 (the average of all parts of speech is 2.048675).

The 1st highest number of forms (11) was observed with the lemma “siellä”: Sielki, Siellähän, siel, siell, siellä, sielläkin, sielläkö, sielä, siäl, siälä, siäläh.

The 2nd highest number of forms (9) was observed with the lemma “täällä”: Täällähän, Täälläkin, tiällä, tääl, tääll, täällä, täälläkään, täälä, tääläk.

The 3rd highest number of forms (8) was observed with the lemma “miksi”: Miksikähän, Miksikö, Miksiköhän, Mix, miks, miksi, miksihän, miksipä.

ADV occurs with 6 features: PronType (2320; 23% instances), Style (518; 5% instances), Degree (460; 5% instances), Person[psor] (430; 4% instances), Clitic (220; 2% instances), Number[psor] (103; 1% instances)

ADV occurs with 23 feature-value pairs: Clitic=Han, Clitic=Han,Ka, Clitic=Han,Ko, Clitic=Ka, Clitic=Ka,S, Clitic=Kaan, Clitic=Kin, Clitic=Ko, Clitic=Ko,S, Clitic=Pa, Clitic=S, Degree=Cmp, Degree=Sup, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3, PronType=Dem, PronType=Ind, PronType=Int, PronType=Rel, Style=Coll

ADV occurs with 62 feature combinations. The most frequent feature combination is _ (6632 tokens). Examples: nyt, aina, paljon, hyvin, vähän, sitten, pois, liian, oikein, heti

Relations

ADV nodes are attached to their parents using 12 different relations: advmod (9273; 91% instances), mark (238; 2% instances), compound:prt (191; 2% instances), conj (164; 2% instances), root (91; 1% instances), fixed (86; 1% instances), expl (74; 1% instances), dep (40; 0% instances), advcl (34; 0% instances), nmod (9; 0% instances), acl (3; 0% instances), ccomp (1; 0% instances)

Parents of ADV nodes belong to 15 different parts of speech: VERB (7594; 74% instances), NOUN (883; 9% instances), ADJ (762; 7% instances), ADV (496; 5% instances), PRON (112; 1% instances), NUM (93; 1% instances), (91; 1% instances), PROPN (84; 1% instances), DET (26; 0% instances), PART (25; 0% instances), ADP (23; 0% instances), SCONJ (6; 0% instances), X (5; 0% instances), INTJ (3; 0% instances), PUNCT (1; 0% instances)

6870 (67%) ADV nodes are leaves.

2619 (26%) ADV nodes have one child.

584 (6%) ADV nodes have two children.

131 (1%) ADV nodes have three or more children.

The highest child degree of a ADV node is 7.

Children of ADV nodes are attached using 21 different relations: punct (2084; 49% instances), advmod (826; 20% instances), nmod (266; 6% instances), fixed (230; 5% instances), conj (173; 4% instances), advcl (146; 3% instances), cc (134; 3% instances), acl (125; 3% instances), amod (79; 2% instances), mark (63; 1% instances), det (42; 1% instances), nummod (11; 0% instances), case (10; 0% instances), dep (5; 0% instances), obj (5; 0% instances), aux (4; 0% instances), cop (4; 0% instances), discourse (3; 0% instances), nsubj (3; 0% instances), nsubj:cop (3; 0% instances), vocative (2; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: PUNCT (2084; 49% instances), PART (524; 12% instances), ADV (496; 12% instances), NOUN (343; 8% instances), SCONJ (201; 5% instances), VERB (184; 4% instances), CCONJ (136; 3% instances), ADJ (114; 3% instances), DET (42; 1% instances), PROPN (34; 1% instances), PRON (32; 1% instances), NUM (11; 0% instances), ADP (10; 0% instances), AUX (4; 0% instances), INTJ (3; 0% instances)