home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Danish-DDT: POS Tags: X

There are 284 X lemmas (2%), 283 X types (2%) and 343 X tokens (0%). Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: of, MEDARB, en, we, are, frue, km/t., med, the, vind.

The 10 most frequent X types: of, MEDARB, en, we, are, fru, km/t., med, the, vind.

The 10 most frequent ambiguous lemmas: en (DET 2186, PRON 103, X 5, ADV 2, NUM 2), med (ADP 1045, ADV 82, X 3), the (X 3, NOUN 1), vivaldi (X 3, ADV 1), følge (VERB 40, NOUN 12, X 2), hel (ADJ 98, X 2, ADV 1), in (X 2, ADJ 1), la (INTJ 2, X 2), mindre (ADV 11, X 2), K. (PROPN 6, X 1)

The 10 most frequent ambiguous types: en (DET 1395, PRON 48, X 5, ADV 2, NUM 1), med (ADP 1008, ADV 80, X 3), the (X 3, NOUN 1), vivaldi (X 3, ADV 1), forsknings- (NOUN 2, X 1), følge (NOUN 11, VERB 8, X 2), hel (ADJ 12, X 2, ADV 1), in (X 2, ADJ 1), la (INTJ 2, X 2), mindre (ADJ 26, ADV 11, X 2)

Morphology

The form / lemma ratio of X is 0.996479 (the average of all parts of speech is 1.355884).

The 1st highest number of forms (1) was observed with the lemma “04099l”: 04099l.

The 2nd highest number of forms (1) was observed with the lemma “12all”: 12all.

The 3rd highest number of forms (1) was observed with the lemma “16-ventilet”: 16-ventilet.

X occurs with 2 features: Foreign (101; 29% instances), Abbr (31; 9% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is _ (211 tokens). Examples: MEDARB, en, med, vivaldi, Statskundskab/Samfundsfag, aller, forsknings-, følge, hel, lys-

Relations

X nodes are attached to their parents using 17 different relations: nmod (60; 17% instances), dep (59; 17% instances), amod (45; 13% instances), conj (41; 12% instances), obj (25; 7% instances), nsubj (23; 7% instances), root (22; 6% instances), obl (17; 5% instances), list (11; 3% instances), flat (10; 3% instances), nmod:poss (10; 3% instances), appos (6; 2% instances), cc (4; 1% instances), mark (4; 1% instances), fixed (3; 1% instances), obl:tmod (2; 1% instances), obl:lmod (1; 0% instances)

Parents of X nodes belong to 11 different parts of speech: NOUN (77; 22% instances), X (61; 18% instances), PROPN (46; 13% instances), VERB (42; 12% instances), NUM (33; 10% instances), (22; 6% instances), ADJ (18; 5% instances), PRON (17; 5% instances), ADV (15; 4% instances), SYM (11; 3% instances), PART (1; 0% instances)

185 (54%) X nodes are leaves.

93 (27%) X nodes have one child.

36 (10%) X nodes have two children.

29 (8%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 21 different relations: punct (71; 24% instances), conj (59; 20% instances), dep (48; 16% instances), nmod (27; 9% instances), cc (17; 6% instances), amod (12; 4% instances), nsubj (10; 3% instances), obj (8; 3% instances), list (7; 2% instances), flat (6; 2% instances), advmod (5; 2% instances), nmod:poss (5; 2% instances), appos (4; 1% instances), nummod (3; 1% instances), advcl (2; 1% instances), case (2; 1% instances), advmod:lmod (1; 0% instances), det (1; 0% instances), expl (1; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: NOUN (77; 26% instances), PUNCT (71; 24% instances), X (61; 21% instances), PROPN (33; 11% instances), ADJ (13; 4% instances), CCONJ (13; 4% instances), ADV (7; 2% instances), NUM (4; 1% instances), VERB (4; 1% instances), ADP (2; 1% instances), PRON (2; 1% instances), SYM (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances)