home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-GC: POS Tags: X

There are 149 X lemmas (1%), 150 X types (1%) and 200 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: the, of, að, sem, vera, í, Love, and, is, á

The 10 most frequent X types: the, of, að, sem, í, Love, and, er, is, verið

The 10 most frequent ambiguous lemmas: the (X 6, NOUN 2, PROPN 1), of (ADV 27, X 8, PROPN 2), (PART 2062, SCONJ 1497, ADP 431, ADV 139, X 6, VERB 2), sem (SCONJ 1553, ADP 31, ADV 27, X 6, PROPN 1), vera (AUX 2480, VERB 1353, X 5, ADV 2, NOUN 1), í (ADP 3479, ADV 269, X 4, ADJ 1, NOUN 1, PROPN 1), Love (X 3, PROPN 1), and (PROPN 2, X 2), á (ADP 2096, ADV 244, NOUN 18, SCONJ 12, VERB 7, X 3, PROPN 2), er (SCONJ 31, ADV 5, VERB 3, X 2)

The 10 most frequent ambiguous types: the (X 6, NOUN 2, PROPN 1), of (ADV 26, X 8, PROPN 2), (PART 2051, SCONJ 1496, ADP 413, ADV 129, X 6, VERB 1), sem (SCONJ 1552, ADP 28, ADV 26, X 6, PROPN 1), í (ADP 3313, ADV 255, X 4, PROPN 1), Love (X 3, PROPN 1), and (PROPN 2, X 2), er (AUX 933, VERB 546, SCONJ 30, ADV 6, X 3), verið (AUX 266, VERB 108, X 3, NOUN 1), á (ADP 2211, ADV 238, VERB 85, SCONJ 12, NOUN 6, X 3, PROPN 2, PART 1)

Morphology

The form / lemma ratio of X is 1.006711 (the average of all parts of speech is 1.434754).

The 1st highest number of forms (3) was observed with the lemma “vera”: er, var, verið.

The 2nd highest number of forms (2) was observed with the lemma “ár”: ár, ára.

The 3rd highest number of forms (1) was observed with the lemma “A”: A.

X does not occur with any features.

Relations

X nodes are attached to their parents using 11 different relations: flat:foreign (120; 60% instances), dep (39; 20% instances), obl (24; 12% instances), conj (4; 2% instances), nsubj (4; 2% instances), advcl (3; 2% instances), root (2; 1% instances), case (1; 1% instances), mark (1; 1% instances), nmod:poss (1; 1% instances), obj (1; 1% instances)

Parents of X nodes belong to 9 different parts of speech: X (107; 54% instances), NOUN (40; 20% instances), VERB (40; 20% instances), ADJ (4; 2% instances), ADV (3; 2% instances), PROPN (2; 1% instances), (2; 1% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)

116 (58%) X nodes are leaves.

58 (29%) X nodes have one child.

12 (6%) X nodes have two children.

14 (7%) X nodes have three or more children.

The highest child degree of a X node is 22.

Children of X nodes are attached using 15 different relations: flat:foreign (105; 64% instances), case (15; 9% instances), punct (14; 8% instances), cc (7; 4% instances), advmod (4; 2% instances), conj (4; 2% instances), amod (3; 2% instances), nsubj (3; 2% instances), advcl (2; 1% instances), obl (2; 1% instances), xcomp (2; 1% instances), ccomp (1; 1% instances), cop (1; 1% instances), fixed (1; 1% instances), mark (1; 1% instances)

Children of X nodes belong to 11 different parts of speech: X (107; 65% instances), ADP (16; 10% instances), PUNCT (14; 8% instances), CCONJ (7; 4% instances), NOUN (6; 4% instances), VERB (5; 3% instances), ADV (4; 2% instances), ADJ (3; 2% instances), AUX (1; 1% instances), PRON (1; 1% instances), SCONJ (1; 1% instances)