home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: DET

There are 19 DET lemmas (0%), 36 DET types (0%) and 6540 DET tokens (7%). Out of 17 observed tags, the rank of DET is: 14 in number of lemmas, 13 in number of types and 6 in number of tokens.

The 10 most frequent DET lemmas: an, a, sin, seo, uile, mo, do, gach, ud, sa

The 10 most frequent DET types: an, na, a’, a, am, a’, nan, sin, seo, ‘n

The 10 most frequent ambiguous lemmas: an (DET 5270, ADP 2915, ADV 301, PART 217, PRON 94, PROPN 31, SCONJ 16, ADJ 10, NOUN 2, X 1), a (PART 3234, DET 593, PRON 429, ADP 280, ADV 140, PROPN 38, ADJ 21, SCONJ 6, X 6, INTJ 4, CCONJ 3, NOUN 1, SYM 1), sin (PRON 354, DET 156, ADV 106, VERB 1), seo (PRON 193, DET 131, ADV 77), uile (DET 90, ADJ 12, ADV 9, NOUN 1), mo (DET 82, PRON 16), do (ADP 870, PART 218, DET 68, PRON 6, X 1), sa (DET 25, ADV 1, CCONJ 1), ar (DET 24, PRON 13, ADP 2), ur (DET 8, PRON 3)

The 10 most frequent ambiguous types: an (DET 2294, ADP 1663, ADV 293, PART 211, PRON 94, AUX 37, PROPN 20, SCONJ 15, ADJ 10, NOUN 2, X 1), na (DET 1170, ADP 70, PART 65, PRON 59, PROPN 18, CCONJ 12, X 3), a’ (DET 815, PART 142, PROPN 11), a (PART 3228, DET 599, PRON 429, ADP 263, ADV 139, PROPN 38, ADJ 20, SCONJ 6, X 6, CCONJ 3, INTJ 1, NOUN 1), am (DET 293, ADP 169, PART 77, NOUN 12, ADV 9, ADJ 1, SCONJ 1), a’ (PART 1262, DET 273, PROPN 30, ADP 10, ADV 2), nan (DET 178, PART 14, ADP 11, PROPN 6, SCONJ 1), sin (PRON 348, DET 156, ADV 68), seo (PRON 188, DET 131, ADV 59), ‘n (DET 129, ADP 12, ADV 10, PART 3, SCONJ 2, PRON 1)

Morphology

The form / lemma ratio of DET is 1.894737 (the average of all parts of speech is 1.302531).

The 1st highest number of forms (13) was observed with the lemma “an”: ‘m, ‘n, a, a’, am, an, a’, na, nam, nan, ‘m, ‘n, ’n.

The 2nd highest number of forms (4) was observed with the lemma “do”: d’, do, d’, t’.

The 3rd highest number of forms (3) was observed with the lemma “mo”: m’, mo, m’.

DET occurs with 8 features: PronType (6540; 100% instances), Number (5955; 91% instances), Gender (5045; 77% instances), Definite (4964; 76% instances), Case (1279; 20% instances), Person (884; 14% instances), Poss (884; 14% instances), Foreign (3; 0% instances)

DET occurs with 14 feature-value pairs: Case=Gen, Definite=Def, Foreign=Yes, Gender=Fem, Gender=Masc, Number=Dual, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Prs

DET occurs with 28 feature combinations. The most frequent feature combination is Definite=Def|Gender=Masc|Number=Sing|PronType=Art (1651 tokens). Examples: an, a’, am, ‘n, a’, ‘m, ‘n, ’n, nam

Relations

DET nodes are attached to their parents using 13 different relations: det (5530; 85% instances), nmod:poss (608; 9% instances), obj (243; 4% instances), fixed (89; 1% instances), nsubj:pass (27; 0% instances), flat:name (24; 0% instances), reparandum (6; 0% instances), obl (5; 0% instances), conj (3; 0% instances), xcomp:pred (2; 0% instances), advmod (1; 0% instances), appos (1; 0% instances), root (1; 0% instances)

Parents of DET nodes belong to 12 different parts of speech: NOUN (6019; 92% instances), PROPN (261; 4% instances), DET (90; 1% instances), PRON (49; 1% instances), ADP (46; 1% instances), X (28; 0% instances), NUM (17; 0% instances), PART (14; 0% instances), VERB (10; 0% instances), ADJ (4; 0% instances), (1; 0% instances), SYM (1; 0% instances)

6431 (98%) DET nodes are leaves.

100 (2%) DET nodes have one child.

3 (0%) DET nodes have two children.

6 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 4.

Children of DET nodes are attached using 11 different relations: fixed (93; 74% instances), case (12; 10% instances), flat (6; 5% instances), advmod (4; 3% instances), flat:name (3; 2% instances), cc (2; 2% instances), cop (2; 2% instances), nsubj (1; 1% instances), parataxis (1; 1% instances), punct (1; 1% instances), reparandum (1; 1% instances)

Children of DET nodes belong to 10 different parts of speech: DET (90; 71% instances), ADP (14; 11% instances), NOUN (7; 6% instances), ADV (4; 3% instances), PROPN (4; 3% instances), AUX (2; 2% instances), CCONJ (2; 2% instances), ADJ (1; 1% instances), PRON (1; 1% instances), PUNCT (1; 1% instances)