home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: ADV

There are 339 ADV lemmas (5%), 366 ADV types (4%) and 4587 ADV tokens (5%). Out of 17 observed tags, the rank of ADV is: 5 in number of lemmas, 5 in number of types and 8 in number of tokens.

The 10 most frequent ADV lemmas: an, a-mach, suas, cho, a, dìreach, math, ann, cuideachd, a-steach

The 10 most frequent ADV types: an, a-mach, cho, a, dìreach, suas, ann, cuideachd, math, a-steach

The 10 most frequent ambiguous lemmas: an (DET 5347, ADP 2934, ADV 311, PART 218, PRON 95, ADJ 27, SCONJ 18, INTJ 2, NOUN 1, X 1), a-mach (ADV 188, NOUN 5, ADJ 1), suas (ADV 176, ADJ 3), a (PART 3253, DET 592, PRON 427, ADP 279, ADV 142, ADJ 51, SCONJ 7, X 6, INTJ 4, PROPN 3, CCONJ 2), dìreach (ADV 135, ADJ 5, INTJ 1), math (ADJ 146, ADV 135, NOUN 7), ann (ADV 131, ADP 1, X 1), cuideachd (ADV 130, NOUN 7), a-steach (ADV 119, NOUN 12), co-dhiubh (ADV 111, SCONJ 2)

The 10 most frequent ambiguous types: an (DET 2298, ADP 1699, ADV 293, PART 212, PRON 94, AUX 37, ADJ 27, SCONJ 15, NOUN 1, X 1), a-mach (ADV 188, NOUN 5, ADJ 1), a (PART 3247, DET 597, PRON 427, ADP 262, ADV 141, ADJ 51, SCONJ 7, X 6, CCONJ 2, PROPN 2, INTJ 1), dìreach (ADV 133, ADJ 5, INTJ 1), ann (ADP 438, ADV 131, X 1), cuideachd (ADV 129, NOUN 2), math (ADV 124, ADJ 73, NOUN 5), a-steach (ADV 119, NOUN 12), co-dhiubh (ADV 107, SCONJ 1), a-staigh (ADV 94, ADJ 16)

Morphology

The form / lemma ratio of ADV is 1.079646 (the average of all parts of speech is 1.317448).

The 1st highest number of forms (2) was observed with the lemma “aithghearr”: dh’aithghearr, dh’aithghearr.

The 2nd highest number of forms (2) was observed with the lemma “an”: ‘n, an.

The 3rd highest number of forms (2) was observed with the lemma “bliadhna”: bhliadhna, bliadhna.

ADV occurs with 5 features: AdvType (4585; 100% instances), ExtPos (558; 12% instances), CleftType (14; 0% instances), Foreign (11; 0% instances), Degree (1; 0% instances)

ADV occurs with 7 feature-value pairs: AdvType=Loc, AdvType=Man, AdvType=Tim, CleftType=Adv, Degree=Cmp,Sup, ExtPos=ADV, Foreign=Yes

ADV occurs with 16 feature combinations. The most frequent feature combination is AdvType=Man (1794 tokens). Examples: cho, dìreach, cuideachd, math, co-dhiubh, idir, còmhla, ma-thà, seachad, leòr

Relations

ADV nodes are attached to their parents using 15 different relations: advmod (3558; 78% instances), fixed (723; 16% instances), xcomp:pred (200; 4% instances), conj (47; 1% instances), root (26; 1% instances), advcl (8; 0% instances), parataxis (7; 0% instances), ccomp (6; 0% instances), discourse (4; 0% instances), compound (2; 0% instances), reparandum (2; 0% instances), case (1; 0% instances), csubj:cop (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances)

Parents of ADV nodes belong to 14 different parts of speech: VERB (1589; 35% instances), NOUN (1411; 31% instances), ADV (866; 19% instances), ADJ (376; 8% instances), PROPN (162; 4% instances), PRON (121; 3% instances), (26; 1% instances), X (11; 0% instances), INTJ (8; 0% instances), NUM (6; 0% instances), ADP (4; 0% instances), DET (4; 0% instances), PART (2; 0% instances), AUX (1; 0% instances)

3406 (74%) ADV nodes are leaves.

884 (19%) ADV nodes have one child.

247 (5%) ADV nodes have two children.

50 (1%) ADV nodes have three or more children.

The highest child degree of a ADV node is 7.

Children of ADV nodes are attached using 27 different relations: fixed (716; 46% instances), mark:prt (363; 23% instances), advmod (125; 8% instances), punct (72; 5% instances), conj (47; 3% instances), cc (46; 3% instances), obl (41; 3% instances), case (28; 2% instances), cop (27; 2% instances), parataxis (20; 1% instances), csubj:cleft (14; 1% instances), advcl (10; 1% instances), discourse (10; 1% instances), obl:unmarked (10; 1% instances), amod (6; 0% instances), ccomp (6; 0% instances), csubj:cop (5; 0% instances), advcl:relcl (4; 0% instances), xcomp (4; 0% instances), compound (2; 0% instances), mark (2; 0% instances), nummod (2; 0% instances), reparandum (2; 0% instances), xcomp:pred (2; 0% instances), det (1; 0% instances), flat (1; 0% instances), obj (1; 0% instances)

Children of ADV nodes belong to 16 different parts of speech: ADV (866; 55% instances), PART (367; 23% instances), PUNCT (72; 5% instances), VERB (70; 4% instances), CCONJ (46; 3% instances), NOUN (44; 3% instances), AUX (27; 2% instances), ADP (26; 2% instances), PRON (18; 1% instances), INTJ (10; 1% instances), ADJ (8; 1% instances), PROPN (6; 0% instances), NUM (3; 0% instances), SCONJ (2; 0% instances), DET (1; 0% instances), X (1; 0% instances)