home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Norwegian-NynorskLIA: POS Tags: X

There are 219 X lemmas (6%), 227 X types (5%) and 1944 X tokens (4%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: e, em, s-, _, d-, f-, p-, b-, de-, e-

The 10 most frequent X types: e, em, s-, d-, f-, -, p-, b-, de-, e-

The 10 most frequent ambiguous lemmas: _ (X 19, VERB 16, NOUN 9, DET 4, ADV 3, CCONJ 3, PART 3, ADP 2, ADJ 1, PUNCT 1), m (INTJ 36, X 1)

The 10 most frequent ambiguous types: - (X 10, PUNCT 1), m- (X 7, INTJ 1), m (INTJ 36, X 1)

Morphology

The form / lemma ratio of X is 1.036530 (the average of all parts of speech is 1.284871).

The 1st highest number of forms (10) was observed with the lemma “_”: -, -styr, -ure, al-, dadet, erfor, henn, hæmmer, kjempå, s….

The 2nd highest number of forms (2) was observed with the lemma “ee”: e, ee.

The 3rd highest number of forms (1) was observed with the lemma “Arne-”: Arne-.

X occurs with 3 features: Definite (1; 0% instances), Gender (1; 0% instances), Number (1; 0% instances)

X occurs with 3 feature-value pairs: Definite=Ind, Gender=Masc, Number=Sing

X occurs with 2 feature combinations. The most frequent feature combination is _ (1943 tokens). Examples: e, em, s-, d-, f-, -, p-, b-, de-, e-

Relations

X nodes are attached to their parents using 6 different relations: discourse:filler (1872; 96% instances), reparandum (41; 2% instances), root (17; 1% instances), parataxis:deletion (10; 1% instances), obl (2; 0% instances), punct (2; 0% instances)

Parents of X nodes belong to 16 different parts of speech: VERB (462; 24% instances), PRON (344; 18% instances), NOUN (330; 17% instances), ADV (159; 8% instances), ADJ (151; 8% instances), DET (120; 6% instances), INTJ (93; 5% instances), CCONJ (82; 4% instances), PROPN (62; 3% instances), ADP (49; 3% instances), SCONJ (40; 2% instances), NUM (21; 1% instances), (17; 1% instances), PART (10; 1% instances), X (3; 0% instances), AUX (1; 0% instances)

1433 (74%) X nodes are leaves.

498 (26%) X nodes have one child.

11 (1%) X nodes have two children.

2 (0%) X nodes have three or more children.

The highest child degree of a X node is 3.

Children of X nodes are attached using 7 different relations: punct (516; 98% instances), det (3; 1% instances), discourse:filler (3; 1% instances), advmod (1; 0% instances), cc (1; 0% instances), nsubj (1; 0% instances), obl (1; 0% instances)

Children of X nodes belong to 6 different parts of speech: PUNCT (516; 98% instances), DET (3; 1% instances), X (3; 1% instances), CCONJ (2; 0% instances), ADV (1; 0% instances), PRON (1; 0% instances)