Treebank Statistics: UD_Danish-DDT: POS Tags: X
There are 284 X lemmas (2%), 283 X types (2%) and 343 X tokens (0%).
Out of 17 observed tags, the rank of X is: 7 in number of lemmas, 7 in number of types and 15 in number of tokens.
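As a rough sketch of how lemma, type and token counts of this kind could be derived from a CoNLL-U file, the snippet below tallies them per UPOS tag and ranks one tag by token count. The three-token sample and the names `upos_counts` and `rank_by_tokens` are hypothetical, and treating word forms case-sensitively is an assumption; the treebank's own statistics script may normalize differently.

```python
from collections import defaultdict

# Hypothetical three-token CoNLL-U fragment, for illustration only.
SAMPLE = """\
1\tthe\tthe\tX\t_\tForeign=Yes\t0\troot\t_\t_
2\tThe\tthe\tX\t_\tForeign=Yes\t1\tflat\t_\t_
3\thus\thus\tNOUN\t_\t_\t1\tdep\t_\t_
"""

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms compared case-sensitively
        tokens[upos] += 1
    return {t: (len(lemmas[t]), len(types[t]), tokens[t]) for t in tokens}

def rank_by_tokens(counts, tag):
    """Rank of `tag` among all tags by token count (1 = most frequent)."""
    return 1 + sum(1 for t in counts if counts[t][2] > counts[tag][2])

counts = upos_counts(SAMPLE)
x_rank = rank_by_tokens(counts, "X")
```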
The 10 most frequent X lemmas: of, MEDARB, en, we, are, frue, km/t., med, the, vind.
The 10 most frequent X types: of, MEDARB, en, we, are, fru, km/t., med, the, vind.
The 10 most frequent ambiguous lemmas: en (DET 2186, PRON 103, X 5, ADV 2, NUM 2), med (ADP 1045, ADV 82, X 3), the (X 3, NOUN 1), vivaldi (X 3, ADV 1), følge (VERB 40, NOUN 12, X 2), hel (ADJ 98, X 2, ADV 1), in (X 2, ADJ 1), la (INTJ 2, X 2), mindre (ADV 11, X 2), K. (PROPN 6, X 1)
The 10 most frequent ambiguous types: en (DET 1395, PRON 48, X 5, ADV 2, NUM 1), med (ADP 1008, ADV 80, X 3), the (X 3, NOUN 1), vivaldi (X 3, ADV 1), forsknings- (NOUN 2, X 1), følge (NOUN 11, VERB 8, X 2), hel (ADJ 12, X 2, ADV 1), in (X 2, ADJ 1), la (INTJ 2, X 2), mindre (ADJ 26, ADV 11, X 2)
- en
  - DET 1395: H.L. Hansen var en udsædvanlig og frodig personlighed .
  - PRON 48: Mit hus er et lille skovløberhus bag en af de røde låger .
  - X 5: Jeg tror , jeg er blevet mødt med mere en nysgerrig velvillighed . “
  - ADV 2: ” Afghanerne er en fantastisk folk .
  - NUM 1: Bestyrelsesformanden fik halvandet års fængsel og en bøde på en million kroner for mandagsvig , skattesvig af særlig grov karakter og overtrædelse af aktieselskabsloven og bank- og sparekasseloven .
- med
- the
- vivaldi
- forsknings-
  - NOUN 2: Bemærkninger til forslag til folketingsbeslutning om et bioteknisk forsknings- og udviklingsprogram , fremsat 21. marts 1986 af Jytte Hilden m.fl. .
  - X 1: Men nu får vi altså en chance , både ESA og NASA åbner mulighed for at få små robuste forsknings- og undervisningsdrabanter med som blaffere ved de kommercielle opsendelser , “ siger John Jørgensen .
- følge
  - NOUN 11: Hertil kommer besparelser som følge af reduceret fravær på arbejdet m.v. .
  - VERB 8: Biologer udstyrer truede dyr med radiosendere , så de kan følge dem konstant og få viden , der kan redde dyrene
  - X 2: ” Den udtalelse er i følge Jan Carlzon selv fremsat under en diskussion om faren for , at medier drager for hastige konklusioner .
- hel
- in
- la
- mindre
  - ADJ 26: Skær courgette og agurk i mindre stykker på tværs .
  - ADV 11: Dertil kommer en lang række forskellige mere eller mindre officielle rejser .
  - X 2: I mange arbejdsfunktioner i såvel den offentlige som den private sektor vil det ikke være muligt at give medarbejderen fri til at deltage i uddannelse , med mindre der ansættes erstatningsarbejdskraft i kursusperioden .
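The notion of an ambiguous type used in the list above (a word form observed with more than one tag) can be sketched as follows. The `pairs` data and the function name `ambiguous_types` are hypothetical; a real run would read (form, UPOS) pairs from the treebank.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) pairs, for illustration only.
pairs = [("en", "DET"), ("en", "DET"), ("en", "X"), ("med", "ADP"), ("hus", "NOUN")]

def ambiguous_types(pairs):
    """Map each form seen with more than one tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in pairs:
        by_form[form][upos] += 1
    # Keep only forms with at least two distinct tags, most frequent tag first.
    return {f: c.most_common() for f, c in by_form.items() if len(c) > 1}

result = ambiguous_types(pairs)
```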
Morphology
The form / lemma ratio of X is 0.996479 (the average of all parts of speech is 1.355884).
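The reported ratio is consistent with dividing the number of X types by the number of X lemmas given at the top of this page. That reading is an assumption about how the figure was computed, and the short check below only confirms that the numbers agree.

```python
n_types = 283   # distinct X word forms reported on this page
n_lemmas = 284  # distinct X lemmas reported on this page

ratio = n_types / n_lemmas
formatted = f"{ratio:.6f}"
print(formatted)  # prints 0.996479
```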
The 1st highest number of forms (1) was observed with the lemma “04099l”: 04099l.
The 2nd highest number of forms (1) was observed with the lemma “12all”: 12all.
The 3rd highest number of forms (1) was observed with the lemma “16-ventilet”: 16-ventilet.
X occurs with 2 features: Foreign (101; 29% instances), Abbr (31; 9% instances)
X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes
X occurs with 3 feature combinations.
The most frequent feature combination is _ (211 tokens).
Examples: MEDARB, en, med, vivaldi, Statskundskab/Samfundsfag, aller, forsknings-, følge, hel, lys-
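A minimal sketch of reading the CoNLL-U FEATS column and recomputing the shares above. It assumes that no X token carries both Foreign=Yes and Abbr=Yes, which is consistent with the totals (211 + 101 + 31 = 343) but is not stated explicitly on this page. The helper name `parse_feats` is hypothetical.

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a dict; '_' means no features."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Token counts per feature combination, taken from the figures in this section
# (assumption: the three combinations are exactly these).
combos = {"_": 211, "Foreign=Yes": 101, "Abbr=Yes": 31}
total = sum(combos.values())
shares = {combo: round(100 * n / total) for combo, n in combos.items()}
```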
Relations
X nodes are attached to their parents using 17 different relations: nmod (60; 17% instances), dep (59; 17% instances), amod (45; 13% instances), conj (41; 12% instances), obj (25; 7% instances), nsubj (23; 7% instances), root (22; 6% instances), obl (17; 5% instances), list (11; 3% instances), flat (10; 3% instances), nmod:poss (10; 3% instances), appos (6; 2% instances), cc (4; 1% instances), mark (4; 1% instances), fixed (3; 1% instances), obl:tmod (2; 1% instances), obl:lmod (1; 0% instances)
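The percentages in this list can be reproduced from the raw counts, which sum to the 343 X tokens. The sketch below assumes each percentage is the count divided by that total and rounded to the nearest integer.

```python
# Incoming relation counts for X nodes, copied from the list above.
relations = {
    "nmod": 60, "dep": 59, "amod": 45, "conj": 41, "obj": 25, "nsubj": 23,
    "root": 22, "obl": 17, "list": 11, "flat": 10, "nmod:poss": 10,
    "appos": 6, "cc": 4, "mark": 4, "fixed": 3, "obl:tmod": 2, "obl:lmod": 1,
}

total = sum(relations.values())  # should equal the number of X tokens
percent = {rel: round(100 * n / total) for rel, n in relations.items()}
```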
Parents of X nodes belong to 11 different parts of speech: NOUN (77; 22% instances), X (61; 18% instances), PROPN (46; 13% instances), VERB (42; 12% instances), NUM (33; 10% instances), (22; 6% instances), ADJ (18; 5% instances), PRON (17; 5% instances), ADV (15; 4% instances), SYM (11; 3% instances), PART (1; 0% instances)
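The entry with no tag name (22 instances) has the same count as the root relation, so it plausibly stands for X nodes that are sentence roots and therefore have no parent. This is an inference from the numbers, not something the page states. The sketch below shows how such an empty entry could arise when looking up parent tags; the toy sentence and the function name `parent_upos_of` are hypothetical.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
sentence = [(1, "X", 0), (2, "X", 1), (3, "NOUN", 1), (4, "X", 3)]

def parent_upos_of(sentence, tag):
    """Count the UPOS tags of the parents of all nodes tagged `tag`."""
    upos_by_id = {node_id: upos for node_id, upos, _ in sentence}
    # The root has no parent, so its parent tag is recorded as an empty string.
    return Counter(
        upos_by_id.get(head, "") for _, upos, head in sentence if upos == tag
    )

parents = parent_upos_of(sentence, "X")
```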
185 (54%) X nodes are leaves.
93 (27%) X nodes have one child.
36 (10%) X nodes have two children.
29 (8%) X nodes have three or more children.
The highest child degree of an X node is 12.
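A short sketch of how the child-degree buckets above could be computed from the HEAD column, followed by a check that the four reported buckets cover all 343 X tokens. The toy sentence and the name `child_degree_buckets` are hypothetical.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) triples; head 0 marks the root.
sentence = [(1, "X", 0), (2, "X", 1), (3, "NOUN", 1), (4, "X", 3)]

def child_degree_buckets(sentence, tag):
    """Count nodes tagged `tag` by number of children; key 3 means 3 or more."""
    n_children = Counter(head for _, _, head in sentence)
    buckets = Counter()
    for node_id, upos, _ in sentence:
        if upos == tag:
            buckets[min(n_children[node_id], 3)] += 1
    return buckets

buckets = child_degree_buckets(sentence, "X")

# Reported buckets for X: leaves, one child, two children, three or more.
reported_total = 185 + 93 + 36 + 29
```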
Children of X nodes are attached using 21 different relations: punct (71; 24% instances), conj (59; 20% instances), dep (48; 16% instances), nmod (27; 9% instances), cc (17; 6% instances), amod (12; 4% instances), nsubj (10; 3% instances), obj (8; 3% instances), list (7; 2% instances), flat (6; 2% instances), advmod (5; 2% instances), nmod:poss (5; 2% instances), appos (4; 1% instances), nummod (3; 1% instances), advcl (2; 1% instances), case (2; 1% instances), advmod:lmod (1; 0% instances), det (1; 0% instances), expl (1; 0% instances), mark (1; 0% instances), vocative (1; 0% instances)
Children of X nodes belong to 14 different parts of speech: NOUN (77; 26% instances), PUNCT (71; 24% instances), X (61; 21% instances), PROPN (33; 11% instances), ADJ (13; 4% instances), CCONJ (13; 4% instances), ADV (7; 2% instances), NUM (4; 1% instances), VERB (4; 1% instances), ADP (2; 1% instances), PRON (2; 1% instances), SYM (2; 1% instances), DET (1; 0% instances), SCONJ (1; 0% instances)
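Both breakdowns describe the same set of child nodes, so their totals should agree. The check below, using only the counts listed above, confirms that each sums to 291 children.

```python
# Children of X nodes by relation, copied from the list above.
child_relations = {
    "punct": 71, "conj": 59, "dep": 48, "nmod": 27, "cc": 17, "amod": 12,
    "nsubj": 10, "obj": 8, "list": 7, "flat": 6, "advmod": 5, "nmod:poss": 5,
    "appos": 4, "nummod": 3, "advcl": 2, "case": 2, "advmod:lmod": 1,
    "det": 1, "expl": 1, "mark": 1, "vocative": 1,
}

# Children of X nodes by part of speech, copied from the list above.
child_upos = {
    "NOUN": 77, "PUNCT": 71, "X": 61, "PROPN": 33, "ADJ": 13, "CCONJ": 13,
    "ADV": 7, "NUM": 4, "VERB": 4, "ADP": 2, "PRON": 2, "SYM": 2,
    "DET": 1, "SCONJ": 1,
}

total_by_relation = sum(child_relations.values())
total_by_upos = sum(child_upos.values())
```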