home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-Nynorsk: POS Tags: X

There are 713 X lemmas (3%), 713 X types (2%) and 1232 X tokens (0%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: of, the, og, det, en, i, for, som, til, den

The 10 most frequent X types: of, the, og, det, en, i, for, som, til, den

The 10 most frequent ambiguous lemmas: of (X 35, ADP 6), the (X 20, DET 7, NUM 1), og (CCONJ 8213, X 28, ADV 17), det (PRON 5531, DET 1337, X 16), en (X 22, DET 5, ADP 1), i (ADP 9532, ADV 84, X 20, NOUN 3), for (ADP 3646, ADV 205, CCONJ 100, X 18), som (SCONJ 3460, ADP 1330, X 19, ADV 5), til (ADP 4375, ADV 223, SCONJ 40, X 16, PROPN 1), den (DET 1927, PRON 149, X 12, PROPN 1)

The 10 most frequent ambiguous types: of (X 35, ADP 6), the (X 20, DET 7, NUM 1), og (CCONJ 7882, X 28, ADV 16, PART 3), det (PRON 4104, DET 1165, X 16, ADV 1), en (X 22, DET 7, ADP 1), i (ADP 8727, ADV 81, X 20, NOUN 3), for (ADP 3509, ADV 197, CCONJ 52, X 18, VERB 1), som (SCONJ 3434, ADP 1268, X 19, ADV 5), til (ADP 4296, ADV 223, SCONJ 40, X 16, PROPN 1), den (DET 1665, PRON 115, X 12)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.346300).

The 1st highest number of forms (1) was observed with the lemma “$/”: /.

The 2nd highest number of forms (1) was observed with the lemma “-e-”: -e-.

The 3rd highest number of forms (1) was observed with the lemma “07.30”: 07.30.

X occurs with 1 features: Foreign (1021; 83% instances)

X occurs with 1 feature-value pairs: Foreign=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Foreign=Yes (1021 tokens). Examples: det, en, i, og, the, of, som, til, den, for

Relations

X nodes are attached to their parents using 15 different relations: flat:foreign (874; 71% instances), flat:name (165; 13% instances), root (69; 6% instances), obj (23; 2% instances), appos (19; 2% instances), obl (18; 1% instances), ccomp (13; 1% instances), nsubj (13; 1% instances), xcomp (12; 1% instances), compound (10; 1% instances), conj (8; 1% instances), nmod (5; 0% instances), csubj (1; 0% instances), dislocated (1; 0% instances), nsubj:pass (1; 0% instances)

Parents of X nodes belong to 7 different parts of speech: X (881; 72% instances), PROPN (168; 14% instances), (69; 6% instances), VERB (67; 5% instances), NOUN (35; 3% instances), ADJ (9; 1% instances), PRON (3; 0% instances)

1015 (82%) X nodes are leaves.

63 (5%) X nodes have one child.

16 (1%) X nodes have two children.

138 (11%) X nodes have three or more children.

The highest child degree of a X node is 40.

Children of X nodes are attached using 20 different relations: flat:foreign (874; 64% instances), punct (341; 25% instances), flat:name (41; 3% instances), case (26; 2% instances), mark (12; 1% instances), conj (11; 1% instances), cc (7; 1% instances), nmod:poss (6; 0% instances), nsubj (6; 0% instances), obl (6; 0% instances), amod (5; 0% instances), appos (5; 0% instances), cop (5; 0% instances), nmod (4; 0% instances), advmod (3; 0% instances), det (3; 0% instances), acl:relcl (2; 0% instances), advcl (2; 0% instances), parataxis (2; 0% instances), xcomp (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: X (881; 65% instances), PUNCT (341; 25% instances), PROPN (52; 4% instances), ADP (26; 2% instances), NOUN (17; 1% instances), SCONJ (12; 1% instances), CCONJ (8; 1% instances), ADJ (7; 1% instances), VERB (6; 0% instances), AUX (5; 0% instances), DET (3; 0% instances), ADV (2; 0% instances), PRON (2; 0% instances)