X
: other
This document is a placeholder for the language-specific documentation
for X
.
Treebank Statistics (UD_Dutch)
There are 1358 X
lemmas (6%), 1356 X
types (5%) and 4635 X
tokens (2%).
Out of 16 observed tags, the rank of X
is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.
The 10 most frequent X
lemmas: van, het, op, flo, voor, met, ten, aan, een, onder
The 10 most frequent X
types: van, het, op, flo, voor, met, ten, aan, een, onder
The 10 most frequent ambiguous lemmas: van (ADP 5616, X 384, PROPN 200, ADV 88), het (DET 4283, PRON 1155, X 222, PROPN 8), op (ADP 1586, ADV 196, X 154, PROPN 3, ADJ 1, SCONJ 1), voor (ADP 1429, ADV 122, X 102, PROPN 24, SCONJ 4, ADJ 2, NOUN 1, VERB 1), met (ADP 1403, X 86, ADV 4), ten (X 95, ADP 4), aan (ADP 842, ADV 174, X 72, PROPN 5), een (DET 4476, X 50, NUM 21, PROPN 3, CONJ 2), onder (ADP 159, X 47, ADV 7, NOUN 1), te (ADP 1878, ADV 117, X 46)
The 10 most frequent ambiguous types: van (ADP 5516, X 384, PROPN 199, ADV 87), het (DET 3802, PRON 793, X 222, PROPN 8), op (ADP 1444, ADV 196, X 152, PROPN 3), voor (ADP 1301, ADV 121, X 102, PROPN 24, SCONJ 4), met (ADP 1295, X 86), ten (X 95, ADP 2), aan (ADP 795, ADV 174, X 72, PROPN 5), een (DET 4196, X 50, NUM 21, PROPN 2), onder (ADP 131, X 47, ADV 7), te (ADP 1868, ADV 117, X 46)
- van
- het
- op
- voor
- ADP 1301: Boris Vascovic hield de hoop voor Smederevo levend .
- ADV 121: Ono bereidde beide treffers van Yanasigawa voor .
- X 102: voor het geval er iemand wordt gepakt of doorslaat
- PROPN 24: Hoeveel heeft IBM voor Lotus betaald ?
- SCONJ 4: Caris heeft een lange aanloop gehad voor hij besloot schilder te worden .
- met
- ten
- aan
- een
- onder
- te
Morphology
The form / lemma ratio of X
is 0.998527 (the average of all parts of speech is 1.258498).
The 1st highest number of forms (4) was observed with the lemma “of”: jaartje, keer, maand, of.
The 2nd highest number of forms (2) was observed with the lemma “Europees”: Europees, Europese.
The 3rd highest number of forms (1) was observed with the lemma “’n”: ‘n.
X
occurs with 17 features: Number (3582; 77% instances), Degree (1188; 26% instances), Gender (613; 13% instances), Definite (522; 11% instances), PronType (406; 9% instances), Case (301; 6% instances), VerbForm (191; 4% instances), Person (128; 3% instances), Tense (127; 3% instances), Mood (102; 2% instances), Aspect (74; 2% instances), Subcat (57; 1% instances), Variant (40; 1% instances), VerbType (23; 0% instances), Foreign (16; 0% instances), Poss (15; 0% instances), Reflex (4; 0% instances)
X
occurs with 37 feature-value pairs: Aspect=Imp
, Case=Dat
, Case=Gen
, Case=Nom
, Definite=Def
, Degree=Cmp
, Degree=Pos
, Degree=Sup
, Foreign=Foreign
, Gender=Com
, Gender=Neut
, Mood=Imp
, Mood=Ind
, Mood=Sub
, Number=Plur
, Number=Plur,Sing
, Number=Sing
, Person=1
, Person=2
, Person=3
, Poss=Yes
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Prs
, PronType=Rel
, Reflex=Yes
, Subcat=Intr
, Subcat=Tran
, Tense=Past
, Tense=Pres
, Variant=Short
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
, VerbType=Aux,Cop
, VerbType=Mod
X
occurs with 116 feature combinations.
The most frequent feature combination is Number=Sing
(1868 tokens).
Examples: van, op, flo, met, ten, het, aan, ter, voor, een
Relations
X
nodes are attached to their parents using 22 different relations: compound (2859; 62% instances), advmod (580; 13% instances), nmod (367; 8% instances), compound:prt (247; 5% instances), nsubj (120; 3% instances), dobj (105; 2% instances), root (101; 2% instances), mark (61; 1% instances), appos (60; 1% instances), conj (43; 1% instances), dep (28; 1% instances), cc (20; 0% instances), acl (10; 0% instances), aux (6; 0% instances), parataxis (6; 0% instances), advcl (5; 0% instances), ccomp (5; 0% instances), xcomp (5; 0% instances), cop (3; 0% instances), case (2; 0% instances), amod (1; 0% instances), name (1; 0% instances)
Parents of X
nodes belong to 16 different parts of speech: X (2256; 49% instances), VERB (685; 15% instances), ADP (521; 11% instances), NOUN (472; 10% instances), AUX (276; 6% instances), ROOT (101; 2% instances), NUM (73; 2% instances), ADJ (66; 1% instances), PRON (60; 1% instances), PROPN (47; 1% instances), CONJ (26; 1% instances), ADV (25; 1% instances), PUNCT (11; 0% instances), SCONJ (9; 0% instances), DET (6; 0% instances), SYM (1; 0% instances)
2885 (62%) X
nodes are leaves.
521 (11%) X
nodes have one child.
456 (10%) X
nodes have two children.
773 (17%) X
nodes have three or more children.
The highest child degree of a X
node is 30.
Children of X
nodes are attached using 26 different relations: compound (2600; 57% instances), case (320; 7% instances), det (278; 6% instances), dobj (264; 6% instances), punct (261; 6% instances), advmod (163; 4% instances), nmod (159; 3% instances), mark (103; 2% instances), cop (87; 2% instances), nsubj (59; 1% instances), conj (49; 1% instances), dep (35; 1% instances), advcl (34; 1% instances), cc (34; 1% instances), appos (25; 1% instances), xcomp (22; 0% instances), ccomp (13; 0% instances), aux (10; 0% instances), parataxis (9; 0% instances), acl (6; 0% instances), csubj (5; 0% instances), neg (4; 0% instances), nummod (3; 0% instances), amod (2; 0% instances), compound:prt (1; 0% instances), det:nummod (1; 0% instances)
Children of X
nodes belong to 15 different parts of speech: X (2256; 50% instances), ADP (531; 12% instances), NOUN (355; 8% instances), DET (284; 6% instances), PUNCT (280; 6% instances), NUM (206; 5% instances), PROPN (139; 3% instances), VERB (104; 2% instances), AUX (97; 2% instances), ADV (92; 2% instances), PRON (76; 2% instances), ADJ (64; 1% instances), CONJ (30; 1% instances), SCONJ (28; 1% instances), SYM (5; 0% instances)
Treebank Statistics (UD_Dutch-LassySmall)
There are 384 X
lemmas (3%), 384 X
types (2%) and 640 X
tokens (1%).
Out of 17 observed tags, the rank of X
is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand
The 10 most frequent X
types: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand
The 10 most frequent ambiguous lemmas: sp.a (X 22, PROPN 3), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), les (X 2, NOUN 2), VVKSM (X 7, NOUN 1), de (DET 5884, PROPN 73, X 6), nr. (X 7, NOUN 2), la (PROPN 5, X 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4)
The 10 most frequent ambiguous types: sp.a (X 22, PROPN 1), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), VVKSM (X 7, NOUN 1), de (DET 4905, PROPN 73, X 6), nr. (X 7, NOUN 2), la (X 5, PROPN 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4, DET 4), MR (X 3, PROPN 2)
- sp.a
- o.a.
- ca.
- VVKSM
- de
- nr.
- la
- VGC
- des
- MR
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.179900).
The 1st highest number of forms (1) was observed with the lemma “–foto’s”: –foto’s.
The 2nd highest number of forms (1) was observed with the lemma “-Berchem”: -Berchem.
The 3rd highest number of forms (1) was observed with the lemma “-Congres”: -Congres.
X
does not occur with any features.
Relations
X
nodes are attached to their parents using 13 different relations: nmod (245; 38% instances), mwe (164; 26% instances), root (55; 9% instances), appos (54; 8% instances), conj (52; 8% instances), parataxis (20; 3% instances), nsubj (15; 2% instances), dobj (13; 2% instances), advcl (5; 1% instances), cc (5; 1% instances), mark (5; 1% instances), acl (4; 1% instances), amod (3; 0% instances)
Parents of X
nodes belong to 13 different parts of speech: NOUN (158; 25% instances), PROPN (158; 25% instances), X (131; 20% instances), VERB (69; 11% instances), ROOT (55; 9% instances), ADJ (33; 5% instances), NUM (14; 2% instances), PUNCT (9; 1% instances), SYM (5; 1% instances), ADV (2; 0% instances), DET (2; 0% instances), PRON (2; 0% instances), SCONJ (2; 0% instances)
406 (63%) X
nodes are leaves.
50 (8%) X
nodes have one child.
58 (9%) X
nodes have two children.
126 (20%) X
nodes have three or more children.
The highest child degree of a X
node is 14.
Children of X
nodes are attached using 19 different relations: mwe (144; 19% instances), punct (114; 15% instances), conj (97; 13% instances), case (72; 9% instances), cc (70; 9% instances), nmod (59; 8% instances), det (58; 7% instances), name (51; 7% instances), appos (23; 3% instances), parataxis (23; 3% instances), amod (14; 2% instances), nummod (10; 1% instances), acl (9; 1% instances), advmod (8; 1% instances), mark (7; 1% instances), cop (6; 1% instances), nsubj (6; 1% instances), dobj (3; 0% instances), advcl (1; 0% instances)
Children of X
nodes belong to 16 different parts of speech: X (131; 17% instances), PUNCT (117; 15% instances), PROPN (116; 15% instances), NOUN (99; 13% instances), ADP (76; 10% instances), CONJ (69; 9% instances), DET (62; 8% instances), NUM (31; 4% instances), ADJ (25; 3% instances), VERB (14; 2% instances), ADV (9; 1% instances), SYM (7; 1% instances), AUX (6; 1% instances), PART (5; 1% instances), SCONJ (5; 1% instances), PRON (3; 0% instances)
X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]