
This page pertains to UD version 2.

Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: X

There are 175 X lemmas (3%), 175 X types (2%) and 424 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 16 in number of tokens.
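As a rough illustration of how counts like these can be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one tag from CoNLL-U text. It assumes the standard 10-column, tab-separated CoNLL-U layout and treats types as case-sensitive; the function name `upos_counts` and the toy rows are hypothetical and are not taken from the treebank or from the tool that generated this page.

```python
def upos_counts(conllu_text, upos="X"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy data (made up for illustration): ID, FORM, LEMMA, UPOS.
rows = [
    ("1", "the", "the", "X"),
    ("2", "Isles", "isles", "X"),
    ("3", "the", "the", "X"),
    ("4", "agus", "agus", "CCONJ"),
]
sample = "\n".join(
    "\t".join([i, form, lemma, tag, "_", "_", "0", "dep", "_", "_"])
    for i, form, lemma, tag in rows
)
print(upos_counts(sample))
```

The percentages on this page would then be each count divided by the corresponding total over all tags, which this sketch does not compute.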

The 10 most frequent X lemmas: [?], the, a, of, on, and, i, in, isles, na

The 10 most frequent X types: [?], the, a, of, on, I, Isles, and, in, na

The 10 most frequent ambiguous lemmas: [?] (X 204, NOUN 2), the (X 10, DET 4, PROPN 1), a (PART 3234, DET 593, PRON 429, ADP 280, ADV 140, PROPN 38, ADJ 21, SCONJ 6, X 6, INTJ 4, CCONJ 3, NOUN 1, SYM 1), of (X 4, NOUN 1), on (SCONJ 7, X 4, ADP 1), and (X 3, NUM 1), i (PRON 826, X 3, NOUN 1), na (PART 67, PRON 59, ADP 52, PROPN 18, CCONJ 12, X 3), ann (ADV 129, PRON 2, X 2), bhunkhouse (X 2, NOUN 1)

The 10 most frequent ambiguous types: [?] (X 204, NOUN 2), the (X 9, DET 1, PROPN 1), a (PART 3228, DET 599, PRON 429, ADP 263, ADV 139, PROPN 38, ADJ 20, SCONJ 6, X 6, CCONJ 3, INTJ 1, NOUN 1), of (X 4, NOUN 1), on (SCONJ 7, X 4, ADP 1), and (X 3, NUM 1), na (DET 1170, ADP 70, PART 65, PRON 59, PROPN 18, CCONJ 12, X 3), ann (ADP 436, ADV 129, PRON 2, X 2), bhunkhouse (X 2, NOUN 1), no (CCONJ 180, X 2, INTJ 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.302531).

The 1st highest number of forms (1) was observed with the lemma “”: </em>.

The 2nd highest number of forms (1) was observed with the lemma “Domhnaill”: Dhomhnaill.

The 3rd highest number of forms (1) was observed with the lemma “Gillean”: Gillean.
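The form / lemma ratio and the per-lemma form counts above can be sketched as follows. This assumes the ratio is the number of distinct (form, lemma) pairs divided by the number of distinct lemmas, which is consistent with 175 types over 175 lemmas giving 1.000000, but the exact definition used by the generating tool is not stated on this page. The function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(pairs):
    """pairs: iterable of (form, lemma) tuples for tokens of one UPOS tag.

    Returns (ratio, forms_by_lemma), where forms_by_lemma maps each lemma
    to the set of distinct forms observed with it.
    """
    forms_by_lemma = {}
    for form, lemma in pairs:
        forms_by_lemma.setdefault(lemma, set()).add(form)
    if not forms_by_lemma:
        return 0.0, forms_by_lemma  # no tokens: define the ratio as 0
    n_forms = sum(len(forms) for forms in forms_by_lemma.values())
    return n_forms / len(forms_by_lemma), forms_by_lemma


# Two lemmas from this page, each seen with a single form.
pairs = [
    ("Dhomhnaill", "Domhnaill"),
    ("Gillean", "Gillean"),
    ("Gillean", "Gillean"),  # repeated tokens do not add new forms
]
ratio, forms = form_lemma_ratio(pairs)
print(f"{ratio:.6f}")
```

Sorting `forms` by the size of each set would give a ranking like the "highest number of forms" lines; when every lemma has exactly one form, as here, the order of that ranking carries no information.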

X occurs with 1 feature: Foreign (183; 43% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is _ (241 tokens). Examples: [?], na, a, a:, ann, b:, cuag, si, ISA, Malpa:

Relations

X nodes are attached to their parents using 18 different relations: dep (128; 30% instances), flat:foreign (101; 24% instances), obl (30; 7% instances), root (29; 7% instances), nsubj (23; 5% instances), xcomp:pred (20; 5% instances), reparandum (17; 4% instances), obj (15; 4% instances), conj (13; 3% instances), nmod (12; 3% instances), discourse (10; 2% instances), appos (9; 2% instances), case (4; 1% instances), flat (4; 1% instances), parataxis (3; 1% instances), advcl (2; 0% instances), ccomp (2; 0% instances), fixed (2; 0% instances)

Parents of X nodes belong to 11 different parts of speech: VERB (147; 35% instances), NOUN (110; 26% instances), X (108; 25% instances), [artificial root, no part of speech] (29; 7% instances), PROPN (13; 3% instances), PRON (8; 2% instances), ADJ (3; 1% instances), ADP (2; 0% instances), PART (2; 0% instances), ADV (1; 0% instances), NUM (1; 0% instances)

287 (68%) X nodes are leaves.

55 (13%) X nodes have one child.

35 (8%) X nodes have two children.

47 (11%) X nodes have three or more children.

The highest child degree of an X node is 7.
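The leaf and child-count figures above add up to the 424 X tokens (287 + 55 + 35 + 47). A minimal sketch of how such a child-degree breakdown could be computed from the HEAD column of one sentence is shown below. The function name `child_degree_histogram` and the toy sentence are hypothetical; a real run would accumulate the histogram over every sentence in the treebank.

```python
from collections import Counter


def child_degree_histogram(tokens, upos="X"):
    """tokens: list of (id, head, upos) tuples for one sentence, ids as ints.

    Returns a Counter keyed by number of children, with the key 3
    standing for "three or more".
    """
    # Count how many tokens point at each head id.
    n_children = Counter(head for _, head, _ in tokens)
    hist = Counter()
    for tok_id, _, tag in tokens:
        if tag == upos:
            hist[min(n_children[tok_id], 3)] += 1
    return hist


# Toy sentence (made up): token 1 is the root and has two children.
sentence = [(1, 0, "X"), (2, 1, "X"), (3, 1, "DET"), (4, 2, "NOUN")]
hist = child_degree_histogram(sentence)
print(dict(hist))
```

Here `hist[0]` corresponds to leaves, `hist[1]` and `hist[2]` to nodes with one and two children, and `hist[3]` to nodes with three or more.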

Children of X nodes are attached using 27 different relations: flat:foreign (101; 32% instances), case (45; 14% instances), punct (36; 11% instances), det (29; 9% instances), cc (13; 4% instances), advmod (12; 4% instances), mark:prt (11; 3% instances), amod (8; 3% instances), dep (7; 2% instances), conj (6; 2% instances), cop (6; 2% instances), nsubj (6; 2% instances), acl:relcl (5; 2% instances), discourse (5; 2% instances), parataxis (5; 2% instances), csubj:cleft (4; 1% instances), flat (4; 1% instances), advcl (2; 1% instances), appos (2; 1% instances), obj (2; 1% instances), reparandum (2; 1% instances), vocative (2; 1% instances), xcomp (2; 1% instances), fixed (1; 0% instances), mark (1; 0% instances), nummod (1; 0% instances), xcomp:pred (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (108; 34% instances), ADP (43; 13% instances), PUNCT (36; 11% instances), DET (28; 9% instances), NOUN (18; 6% instances), VERB (17; 5% instances), CCONJ (13; 4% instances), PART (13; 4% instances), ADV (12; 4% instances), ADJ (8; 3% instances), PRON (7; 2% instances), AUX (6; 2% instances), INTJ (5; 2% instances), PROPN (3; 1% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)