home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Danish: POS Tags: X

There are 366 X lemmas (3%), 365 X types (2%) and 439 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: of, MEDARB, en, vivaldi, we, allesammen, are, at, frue, hel

The 10 most frequent X types: of, MEDARB, en, vivaldi, we, allesammen, are, at, fru, hel

The 10 most frequent ambiguous lemmas: en (DET 2186, PRON 105, X 7), at (PART 1215, SCONJ 957, X 3), hel (ADJ 98, X 3), med (ADP 1045, ADV 82, X 3), the (X 3, NOUN 1), af (ADP 1265, ADV 29, X 2), fremfor (X 2, ADP 1), følge (VERB 40, NOUN 12, X 2), i (ADP 2838, ADV 10, X 1), in (X 2, ADJ 1)

The 10 most frequent ambiguous types: en (DET 1395, PRON 49, X 7), at (PART 1209, SCONJ 943, X 3), hel (ADJ 12, X 3), med (ADP 1008, ADV 80, X 3), the (X 3, NOUN 1), af (ADP 1251, ADV 29, X 2), forsknings- (NOUN 2, X 1), fremfor (X 2, ADP 1), følge (NOUN 11, VERB 8, X 2), i (ADP 2622, ADV 10, X 1)

Morphology

The form / lemma ratio of X is 0.997268 (the average of all parts of speech is 1.355946).

The 1st highest number of forms (1) was observed with the lemma “04099l”: 04099l.

The 2nd highest number of forms (1) was observed with the lemma “12all”: 12all.

The 3rd highest number of forms (1) was observed with the lemma “16-ventilet”: 16-ventilet.

X occurs with 2 features: Foreign (111; 25% instances), Abbr (32; 7% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is _ (296 tokens). Examples: MEDARB, en, vivaldi, allesammen, at, hel, med, Statskundskab/Samfundsfag, af, aller

Relations

X nodes are attached to their parents using 18 different relations: advmod (95; 22% instances), nmod (60; 14% instances), dep (59; 13% instances), amod (45; 10% instances), conj (40; 9% instances), obj (25; 6% instances), nsubj (23; 5% instances), root (21; 5% instances), obl (18; 4% instances), list (13; 3% instances), flat (10; 2% instances), nmod:poss (10; 2% instances), appos (6; 1% instances), cc (4; 1% instances), mark (4; 1% instances), fixed (3; 1% instances), obl:tmod (2; 0% instances), obl:loc (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: NOUN (108; 25% instances), VERB (94; 21% instances), X (71; 16% instances), PROPN (47; 11% instances), NUM (33; 8% instances), (21; 5% instances), ADJ (20; 5% instances), PRON (17; 4% instances), SYM (11; 3% instances), ADV (8; 2% instances), AUX (4; 1% instances), PART (3; 1% instances), ADP (2; 0% instances)

216 (49%) X nodes are leaves.

130 (30%) X nodes have one child.

56 (13%) X nodes have two children.

37 (8%) X nodes have three or more children.

The highest child degree of a X node is 9.

Children of X nodes are attached using 22 different relations: conj (75; 19% instances), punct (64; 16% instances), dep (53; 13% instances), case (51; 13% instances), nmod (41; 10% instances), cc (17; 4% instances), obj (15; 4% instances), amod (13; 3% instances), nsubj (12; 3% instances), advmod (10; 3% instances), advcl (8; 2% instances), nmod:poss (8; 2% instances), flat (6; 2% instances), appos (5; 1% instances), list (5; 1% instances), fixed (3; 1% instances), mark (3; 1% instances), acl:relcl (2; 1% instances), nummod (2; 1% instances), expl (1; 0% instances), obl:loc (1; 0% instances), vocative (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: NOUN (106; 27% instances), X (71; 18% instances), PUNCT (64; 16% instances), ADP (51; 13% instances), PROPN (34; 9% instances), ADJ (17; 4% instances), ADV (15; 4% instances), CCONJ (13; 3% instances), VERB (13; 3% instances), NUM (4; 1% instances), PRON (4; 1% instances), SCONJ (2; 1% instances), SYM (2; 1% instances)