home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: X

There are 438 X lemmas (2%), 438 X types (1%) and 726 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: the, of, and, in, to, you, a, is, for, i

The 10 most frequent X types: the, of, and, in, to, you, a, is, for, i

The 10 most frequent ambiguous lemmas: the (X 31, DET 4), of (X 25, ADP 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 356, X 11), you (X 9, PRON 1), a (X 8, NOUN 7, INTJ 1), is (NOUN 13, X 7, VERB 1), for (ADP 2701, SCONJ 1009, ADV 148, CCONJ 99, X 7), i (ADP 8439, SCONJ 192, X 3, PROPN 1)

The 10 most frequent ambiguous types: the (X 31, DET 4), of (X 25, ADP 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 331, X 11), you (X 9, PRON 1), a (X 8, ADJ 5, NOUN 2, INTJ 1), is (X 7, NOUN 4, VERB 1), for (ADP 2558, SCONJ 985, ADV 143, CCONJ 44, X 7), i (ADP 7662, SCONJ 191, X 3)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.381903).

The 1st highest number of forms (1) was observed with the lemma “32”: 32.

The 2nd highest number of forms (1) was observed with the lemma “34”: 34.

The 3rd highest number of forms (1) was observed with the lemma “Annan”: Annan.

X does not occur with any features.

Relations

X nodes are attached to their parents using 12 different relations: flat:foreign (478; 66% instances), flat:name (152; 21% instances), root (51; 7% instances), compound (9; 1% instances), obj (9; 1% instances), nmod (6; 1% instances), obl (6; 1% instances), appos (5; 1% instances), xcomp (5; 1% instances), conj (2; 0% instances), nsubj (2; 0% instances), acl (1; 0% instances)

Parents of X nodes belong to 9 different parts of speech: X (478; 66% instances), PROPN (156; 21% instances), (51; 7% instances), VERB (20; 3% instances), NOUN (17; 2% instances), ADJ (1; 0% instances), ADP (1; 0% instances), DET (1; 0% instances), PRON (1; 0% instances)

592 (82%) X nodes are leaves.

53 (7%) X nodes have one child.

7 (1%) X nodes have two children.

74 (10%) X nodes have three or more children.

The highest child degree of a X node is 41.

Children of X nodes are attached using 12 different relations: flat:foreign (481; 65% instances), punct (219; 30% instances), case (10; 1% instances), parataxis (10; 1% instances), advmod (3; 0% instances), appos (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), mark (2; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), obl (1; 0% instances)

Children of X nodes belong to 9 different parts of speech: X (478; 65% instances), PUNCT (219; 30% instances), VERB (12; 2% instances), ADP (10; 1% instances), NOUN (7; 1% instances), PROPN (5; 1% instances), ADV (3; 0% instances), SCONJ (2; 0% instances), ADJ (1; 0% instances)