home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Icelandic-IcePaHC: POS Tags: X

There are 1193 X lemmas (3%), 1219 X types (2%) and 2277 X tokens (0%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: anno, dominus, item, in, sankti, et, majst, trankival, etc, darius

The 10 most frequent X types: anno, item, in, domini, Dominus, et, Majst, Trankival, sankti, etc

The 10 most frequent ambiguous lemmas: anno (X 142, NOUN 17, PROPN 5), dominus (X 82, PROPN 40), item (X 60, ADV 4), in (X 53, ADV 1), sankti (PROPN 55, X 40, ADJ 2, NOUN 2), et (X 30, CCONJ 1, NOUN 1, SCONJ 1), majst (X 25, PROPN 12), trankival (X 25, PROPN 16), darius (PROPN 105, X 15, ADJ 2, NOUN 1), kristur (PROPN 519, X 15, NOUN 4)

The 10 most frequent ambiguous types: anno (NOUN 5, X 5), item (X 23, ADV 4), in (X 50, DET 38, ADV 1, NOUN 1), domini (X 9, PROPN 1), Dominus (PROPN 29, X 29), et (X 24, VERB 3, CCONJ 1, NOUN 1, SCONJ 1), Majst (X 25, PROPN 12), Trankival (X 25, PROPN 13), sankti (PROPN 48, X 23), sanktus (PROPN 15, X 14, ADJ 2)

Morphology

The form / lemma ratio of X is 1.021794 (the average of all parts of speech is 1.856953).

The 1st highest number of forms (8) was observed with the lemma “kristur”: Christi, Christum, Christus, Kristo, Kristum, Kristus, Kristí, kristi.

The 2nd highest number of forms (6) was observed with the lemma “jesús”: Iesu, Iesus, Jesu, Jesus, Jesú, Jesúm.

The 3rd highest number of forms (5) was observed with the lemma “dominus”: Dominus, domine, domini, domino, dominum.

X occurs with 8 features: Foreign (2108; 93% instances), Case (161; 7% instances), Number (161; 7% instances), Definite (159; 7% instances), Gender (153; 7% instances), Degree (25; 1% instances), VerbForm (2; 0% instances), Voice (2; 0% instances)

X occurs with 17 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Definite=Ind, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Yes, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, VerbForm=Inf, Voice=Act

X occurs with 42 feature combinations. The most frequent feature combination is Foreign=Yes (2108 tokens). Examples: anno, item, in, domini, Dominus, et, Majst, Trankival, etc, sankti

Relations

X nodes are attached to their parents using 21 different relations: flat:foreign (879; 39% instances), dep (437; 19% instances), obl (299; 13% instances), root (99; 4% instances), nmod:poss (98; 4% instances), conj (91; 4% instances), appos (85; 4% instances), nsubj (82; 4% instances), obj (61; 3% instances), amod (51; 2% instances), xcomp (36; 2% instances), ccomp (19; 1% instances), acl:relcl (9; 0% instances), advcl (8; 0% instances), iobj (7; 0% instances), flat:name (6; 0% instances), vocative (3; 0% instances), acl (2; 0% instances), flat (2; 0% instances), parataxis (2; 0% instances), nmod (1; 0% instances)

Parents of X nodes belong to 14 different parts of speech: VERB (824; 36% instances), X (793; 35% instances), NOUN (332; 15% instances), PROPN (107; 5% instances), (99; 4% instances), PRON (28; 1% instances), ADJ (23; 1% instances), DET (22; 1% instances), ADV (13; 1% instances), AUX (12; 1% instances), CCONJ (10; 0% instances), NUM (10; 0% instances), ADP (3; 0% instances), PART (1; 0% instances)

1042 (46%) X nodes are leaves.

715 (31%) X nodes have one child.

258 (11%) X nodes have two children.

262 (12%) X nodes have three or more children.

The highest child degree of a X node is 17.

Children of X nodes are attached using 28 different relations: flat:foreign (749; 31% instances), punct (521; 21% instances), dep (196; 8% instances), nummod (152; 6% instances), conj (135; 6% instances), appos (102; 4% instances), amod (97; 4% instances), cc (83; 3% instances), nmod:poss (62; 3% instances), case (61; 2% instances), det (58; 2% instances), obl (55; 2% instances), acl:relcl (35; 1% instances), cop (28; 1% instances), advmod (24; 1% instances), nsubj (21; 1% instances), mark (18; 1% instances), parataxis (9; 0% instances), xcomp (8; 0% instances), ccomp (6; 0% instances), flat:name (5; 0% instances), nmod (5; 0% instances), acl (3; 0% instances), advcl (3; 0% instances), compound:prt (3; 0% instances), flat (2; 0% instances), discourse (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (793; 32% instances), PUNCT (521; 21% instances), ADP (237; 10% instances), NOUN (224; 9% instances), NUM (158; 6% instances), ADJ (97; 4% instances), CCONJ (83; 3% instances), VERB (78; 3% instances), DET (65; 3% instances), PRON (60; 2% instances), PROPN (49; 2% instances), AUX (31; 1% instances), ADV (28; 1% instances), SCONJ (18; 1% instances), INTJ (1; 0% instances)