This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home nl/pos issue tracker

X: other

This document is a placeholder for the language-specific documentation for X.


Treebank Statistics (UD_Dutch)

There are 1358 X lemmas (6%), 1356 X types (5%) and 4635 X tokens (2%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: van, het, op, flo, voor, met, ten, aan, een, onder

The 10 most frequent X types: van, het, op, flo, voor, met, ten, aan, een, onder

The 10 most frequent ambiguous lemmas: van (ADP 5616, X 384, PROPN 200, ADV 88), het (DET 4283, PRON 1155, X 222, PROPN 8), op (ADP 1586, ADV 196, X 154, PROPN 3, ADJ 1, SCONJ 1), voor (ADP 1429, ADV 122, X 102, PROPN 24, SCONJ 4, ADJ 2, NOUN 1, VERB 1), met (ADP 1403, X 86, ADV 4), ten (X 95, ADP 4), aan (ADP 842, ADV 174, X 72, PROPN 5), een (DET 4476, X 50, NUM 21, PROPN 3, CONJ 2), onder (ADP 159, X 47, ADV 7, NOUN 1), te (ADP 1878, ADV 117, X 46)

The 10 most frequent ambiguous types: van (ADP 5516, X 384, PROPN 199, ADV 87), het (DET 3802, PRON 793, X 222, PROPN 8), op (ADP 1444, ADV 196, X 152, PROPN 3), voor (ADP 1301, ADV 121, X 102, PROPN 24, SCONJ 4), met (ADP 1295, X 86), ten (X 95, ADP 2), aan (ADP 795, ADV 174, X 72, PROPN 5), een (DET 4196, X 50, NUM 21, PROPN 2), onder (ADP 131, X 47, ADV 7), te (ADP 1868, ADV 117, X 46)

Morphology

The form / lemma ratio of X is 0.998527 (the average of all parts of speech is 1.258498).

The 1st highest number of forms (4) was observed with the lemma “of”: jaartje, keer, maand, of.

The 2nd highest number of forms (2) was observed with the lemma “Europees”: Europees, Europese.

The 3rd highest number of forms (1) was observed with the lemma “’n”: ‘n.

X occurs with 17 features: Number (3582; 77% instances), Degree (1188; 26% instances), Gender (613; 13% instances), Definite (522; 11% instances), PronType (406; 9% instances), Case (301; 6% instances), VerbForm (191; 4% instances), Person (128; 3% instances), Tense (127; 3% instances), Mood (102; 2% instances), Aspect (74; 2% instances), Subcat (57; 1% instances), Variant (40; 1% instances), VerbType (23; 0% instances), Foreign (16; 0% instances), Poss (15; 0% instances), Reflex (4; 0% instances)

X occurs with 37 feature-value pairs: Aspect=Imp, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, Reflex=Yes, Subcat=Intr, Subcat=Tran, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbType=Aux,Cop, VerbType=Mod

X occurs with 116 feature combinations. The most frequent feature combination is Number=Sing (1868 tokens). Examples: van, op, flo, met, ten, het, aan, ter, voor, een

Relations

X nodes are attached to their parents using 22 different relations: compound (2859; 62% instances), advmod (580; 13% instances), nmod (367; 8% instances), compound:prt (247; 5% instances), nsubj (120; 3% instances), dobj (105; 2% instances), root (101; 2% instances), mark (61; 1% instances), appos (60; 1% instances), conj (43; 1% instances), dep (28; 1% instances), cc (20; 0% instances), acl (10; 0% instances), aux (6; 0% instances), parataxis (6; 0% instances), advcl (5; 0% instances), ccomp (5; 0% instances), xcomp (5; 0% instances), cop (3; 0% instances), case (2; 0% instances), amod (1; 0% instances), name (1; 0% instances)

Parents of X nodes belong to 16 different parts of speech: X (2256; 49% instances), VERB (685; 15% instances), ADP (521; 11% instances), NOUN (472; 10% instances), AUX (276; 6% instances), ROOT (101; 2% instances), NUM (73; 2% instances), ADJ (66; 1% instances), PRON (60; 1% instances), PROPN (47; 1% instances), CONJ (26; 1% instances), ADV (25; 1% instances), PUNCT (11; 0% instances), SCONJ (9; 0% instances), DET (6; 0% instances), SYM (1; 0% instances)

2885 (62%) X nodes are leaves.

521 (11%) X nodes have one child.

456 (10%) X nodes have two children.

773 (17%) X nodes have three or more children.

The highest child degree of a X node is 30.

Children of X nodes are attached using 26 different relations: compound (2600; 57% instances), case (320; 7% instances), det (278; 6% instances), dobj (264; 6% instances), punct (261; 6% instances), advmod (163; 4% instances), nmod (159; 3% instances), mark (103; 2% instances), cop (87; 2% instances), nsubj (59; 1% instances), conj (49; 1% instances), dep (35; 1% instances), advcl (34; 1% instances), cc (34; 1% instances), appos (25; 1% instances), xcomp (22; 0% instances), ccomp (13; 0% instances), aux (10; 0% instances), parataxis (9; 0% instances), acl (6; 0% instances), csubj (5; 0% instances), neg (4; 0% instances), nummod (3; 0% instances), amod (2; 0% instances), compound:prt (1; 0% instances), det:nummod (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (2256; 50% instances), ADP (531; 12% instances), NOUN (355; 8% instances), DET (284; 6% instances), PUNCT (280; 6% instances), NUM (206; 5% instances), PROPN (139; 3% instances), VERB (104; 2% instances), AUX (97; 2% instances), ADV (92; 2% instances), PRON (76; 2% instances), ADJ (64; 1% instances), CONJ (30; 1% instances), SCONJ (28; 1% instances), SYM (5; 0% instances)


Treebank Statistics (UD_Dutch-LassySmall)

There are 384 X lemmas (3%), 384 X types (2%) and 640 X tokens (1%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand

The 10 most frequent X types: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand

The 10 most frequent ambiguous lemmas: sp.a (X 22, PROPN 3), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), les (X 2, NOUN 2), VVKSM (X 7, NOUN 1), de (DET 5884, PROPN 73, X 6), nr. (X 7, NOUN 2), la (PROPN 5, X 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4)

The 10 most frequent ambiguous types: sp.a (X 22, PROPN 1), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), VVKSM (X 7, NOUN 1), de (DET 4905, PROPN 73, X 6), nr. (X 7, NOUN 2), la (X 5, PROPN 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4, DET 4), MR (X 3, PROPN 2)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.179900).

The 1st highest number of forms (1) was observed with the lemma “–foto’s”: –foto’s.

The 2nd highest number of forms (1) was observed with the lemma “-Berchem”: -Berchem.

The 3rd highest number of forms (1) was observed with the lemma “-Congres”: -Congres.

X does not occur with any features.

Relations

X nodes are attached to their parents using 13 different relations: nmod (245; 38% instances), mwe (164; 26% instances), root (55; 9% instances), appos (54; 8% instances), conj (52; 8% instances), parataxis (20; 3% instances), nsubj (15; 2% instances), dobj (13; 2% instances), advcl (5; 1% instances), cc (5; 1% instances), mark (5; 1% instances), acl (4; 1% instances), amod (3; 0% instances)

Parents of X nodes belong to 13 different parts of speech: NOUN (158; 25% instances), PROPN (158; 25% instances), X (131; 20% instances), VERB (69; 11% instances), ROOT (55; 9% instances), ADJ (33; 5% instances), NUM (14; 2% instances), PUNCT (9; 1% instances), SYM (5; 1% instances), ADV (2; 0% instances), DET (2; 0% instances), PRON (2; 0% instances), SCONJ (2; 0% instances)

406 (63%) X nodes are leaves.

50 (8%) X nodes have one child.

58 (9%) X nodes have two children.

126 (20%) X nodes have three or more children.

The highest child degree of a X node is 14.

Children of X nodes are attached using 19 different relations: mwe (144; 19% instances), punct (114; 15% instances), conj (97; 13% instances), case (72; 9% instances), cc (70; 9% instances), nmod (59; 8% instances), det (58; 7% instances), name (51; 7% instances), appos (23; 3% instances), parataxis (23; 3% instances), amod (14; 2% instances), nummod (10; 1% instances), acl (9; 1% instances), advmod (8; 1% instances), mark (7; 1% instances), cop (6; 1% instances), nsubj (6; 1% instances), dobj (3; 0% instances), advcl (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (131; 17% instances), PUNCT (117; 15% instances), PROPN (116; 15% instances), NOUN (99; 13% instances), ADP (76; 10% instances), CONJ (69; 9% instances), DET (62; 8% instances), NUM (31; 4% instances), ADJ (25; 3% instances), VERB (14; 2% instances), ADV (9; 1% instances), SYM (7; 1% instances), AUX (6; 1% instances), PART (5; 1% instances), SCONJ (5; 1% instances), PRON (3; 0% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]