
This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Bokmaal: POS Tags: X

There are 467 X lemmas (2%), 467 X types (1%) and 795 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.
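The lemma, type and token counts above can be reproduced in outline from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample sentence is a made-up fragment, the function names `read_tokens`, `pos_counts` and `rank` are hypothetical, and it assumes that types are counted case-sensitively and that ties in rank are broken by order of first appearance.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment for illustration only; not taken from the treebank.
SAMPLE = """\
1\tHan\than\tPRON\t_\t_\t2\tnsubj\t_\t_
2\tsa\tsi\tVERB\t_\t_\t0\troot\t_\t_
3\tthe\tthe\tX\t_\tForeign=Yes\t2\tobj\t_\t_
4\tend\tend\tX\t_\tForeign=Yes\t3\tflat:foreign\t_\t_
5\t.\t$.\tPUNCT\t_\t_\t2\tpunct\t_\t_
"""

def read_tokens(text):
    """Yield (form, lemma, upos) for each regular token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], cols[3]

def pos_counts(text):
    """Map each UPOS tag to (distinct lemmas, distinct types, tokens)."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(text):
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank(counts, tag, index):
    """1-based rank of `tag` by the count at `index` (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda u: counts[u][index], reverse=True)
    return ordered.index(tag) + 1

counts = pos_counts(SAMPLE)
print(counts["X"], rank(counts, "X", 2))
```

Run over the full treebank instead of the sample, the same counting would be expected to yield figures like the ones reported above, subject to the stated assumptions.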

The 10 most frequent X lemmas: $,, the, of, and, in, to, you, a, i, is

The 10 most frequent X types: ,, the, of, and, in, to, you, a, i, is

The 10 most frequent ambiguous lemmas: $, (PUNCT 11516, X 34), the (X 31, DET 4), of (X 25, ADV 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 356, X 11), you (X 9, PRON 1), a (X 8, NOUN 7, INTJ 1), i (ADP 8577, ADV 54, X 3, PROPN 1), is (NOUN 13, X 7, VERB 1)

The 10 most frequent ambiguous types: , (PUNCT 11516, X 34), the (X 31, DET 4), of (X 25, ADV 1), and (X 20, CCONJ 1, NOUN 1), in (X 16, ADJ 2), to (NUM 331, X 11), you (X 9, PRON 1), a (X 8, ADJ 5, NOUN 2, INTJ 1), i (ADP 7800, ADV 53, X 3), is (X 7, NOUN 4, VERB 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.381641).
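As a rough illustration of the ratio, the sketch below assumes it is the mean number of distinct forms per lemma; the exact definition used to generate this page is not stated here, and the function name `form_lemma_ratio` and the sample pairs are hypothetical. Under that assumption, a ratio of 1.000000 means every X lemma was seen with exactly one form.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Mean number of distinct forms per lemma.

    `pairs` is an iterable of (form, lemma) tuples for one part of speech.
    Assumption: forms are compared exactly as written (case-sensitive).
    """
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return 0.0
    return sum(len(f) for f in forms_by_lemma.values()) / len(forms_by_lemma)

# Each lemma has a single form, so the ratio is 1.0 (as reported for X).
x_like = [(",", "$,"), ("the", "the"), ("the", "the"), ("of", "of")]
# One lemma has two forms, so the ratio rises above 1.0.
inflecting = [("sa", "si"), ("sier", "si"), ("hus", "hus")]
print(form_lemma_ratio(x_like), form_lemma_ratio(inflecting))
```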

The 1st highest number of forms (1) was observed with the lemma “$,”: ,.

The 2nd highest number of forms (1) was observed with the lemma “$-”: -.

The 3rd highest number of forms (1) was observed with the lemma “$.”: ..

X occurs with 1 feature: Foreign (696; 88% instances)

X occurs with 1 feature-value pair: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (696 tokens). Examples: ,, the, and, in, to, you, of, a, i, it

Relations

X nodes are attached to their parents using 13 different relations: flat:foreign (590; 74% instances), flat:name (79; 10% instances), root (47; 6% instances), obl (14; 2% instances), obj (13; 2% instances), ccomp (10; 1% instances), nsubj (10; 1% instances), compound (9; 1% instances), xcomp (8; 1% instances), appos (6; 1% instances), nmod (6; 1% instances), conj (2; 0% instances), nsubj:pass (1; 0% instances)
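The percentages in the list above can be checked against the raw counts. This sketch copies the 13 counts from the list and assumes each share is the count divided by the total number of X tokens, rounded to the nearest whole percent; the names `RELATION_COUNTS` and `shares` are introduced here for illustration only.

```python
# Counts copied from the relation list above.
RELATION_COUNTS = {
    "flat:foreign": 590, "flat:name": 79, "root": 47, "obl": 14,
    "obj": 13, "ccomp": 10, "nsubj": 10, "compound": 9, "xcomp": 8,
    "appos": 6, "nmod": 6, "conj": 2, "nsubj:pass": 1,
}

def shares(counts):
    """Return each relation's share of the total, as a whole percent."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}

total = sum(RELATION_COUNTS.values())  # equals the number of X tokens
percentages = shares(RELATION_COUNTS)
print(total, percentages["flat:foreign"], percentages["conj"])
```

The counts sum to 795, matching the number of X tokens, which is consistent with every X node having exactly one incoming relation.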

Parents of X nodes belong to 8 different parts of speech: X (590; 74% instances), PROPN (83; 10% instances), VERB (54; 7% instances), [no part of speech: the artificial root] (47; 6% instances), NOUN (18; 2% instances), ADJ (1; 0% instances), ADP (1; 0% instances), DET (1; 0% instances)

666 (84%) X nodes are leaves.

18 (2%) X nodes have one child.

8 (1%) X nodes have two children.

103 (13%) X nodes have three or more children.

The highest child degree of an X node is 46.
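The child-degree figures above can be derived from the HEAD column of each sentence. The sketch below is illustrative only: it assumes each sentence is given as a list of (id, head, upos) tuples, and the names `degree_buckets` and `max_degree` and the sample sentence are hypothetical.

```python
from collections import Counter

def child_counts(sentence):
    """Count how many tokens name each token id as their head."""
    return Counter(head for _tok_id, head, _upos in sentence)

def degree_buckets(sentences, tag="X"):
    """Bucket nodes with the given UPOS tag by number of children."""
    buckets = Counter()
    for sentence in sentences:
        children = child_counts(sentence)
        for tok_id, _head, upos in sentence:
            if upos != tag:
                continue
            n = children[tok_id]
            if n == 0:
                buckets["leaf"] += 1
            elif n == 1:
                buckets["one"] += 1
            elif n == 2:
                buckets["two"] += 1
            else:
                buckets["three_or_more"] += 1
    return buckets

def max_degree(sentences, tag="X"):
    """Highest number of children of any node with the given tag."""
    best = 0
    for sentence in sentences:
        children = child_counts(sentence)
        for tok_id, _head, upos in sentence:
            if upos == tag:
                best = max(best, children[tok_id])
    return best

# Made-up sentence: token 1 (X) heads tokens 2, 3 and 4.
sample = [[(1, 0, "X"), (2, 1, "X"), (3, 1, "X"), (4, 1, "PUNCT")]]
print(degree_buckets(sample), max_degree(sample))
```

As a consistency check on the figures reported above, 666 + 18 + 8 + 103 = 795, so the four buckets together cover every X token.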

Children of X nodes are attached using 17 different relations: flat:foreign (590; 64% instances), punct (217; 24% instances), flat:name (55; 6% instances), case (21; 2% instances), appos (4; 0% instances), nmod:poss (4; 0% instances), obl (4; 0% instances), advmod (3; 0% instances), conj (3; 0% instances), nmod (3; 0% instances), acl:relcl (2; 0% instances), amod (2; 0% instances), cop (2; 0% instances), nsubj (2; 0% instances), cc (1; 0% instances), det (1; 0% instances), mark (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: X (590; 64% instances), PUNCT (217; 24% instances), PROPN (59; 6% instances), ADP (22; 2% instances), NOUN (11; 1% instances), ADJ (3; 0% instances), ADV (3; 0% instances), VERB (3; 0% instances), AUX (2; 0% instances), NUM (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances)