X

This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

home nl/pos issue tracker

`X`: other

This document is a placeholder for the language-specific documentation for X.

Treebank Statistics (UD_Dutch)

There are 1358 X lemmas (6%), 1356 X types (5%) and 4635 X tokens (2%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 11 in number of tokens.

The 10 most frequent X lemmas: van, het, op, flo, voor, met, ten, aan, een, onder

The 10 most frequent X types: van, het, op, flo, voor, met, ten, aan, een, onder

The 10 most frequent ambiguous lemmas: van (ADP 5616, X 384, PROPN 200, ADV 88), het (DET 4283, PRON 1155, X 222, PROPN 8), op (ADP 1586, ADV 196, X 154, PROPN 3, ADJ 1, SCONJ 1), voor (ADP 1429, ADV 122, X 102, PROPN 24, SCONJ 4, ADJ 2, NOUN 1, VERB 1), met (ADP 1403, X 86, ADV 4), ten (X 95, ADP 4), aan (ADP 842, ADV 174, X 72, PROPN 5), een (DET 4476, X 50, NUM 21, PROPN 3, CONJ 2), onder (ADP 159, X 47, ADV 7, NOUN 1), te (ADP 1878, ADV 117, X 46)

The 10 most frequent ambiguous types: van (ADP 5516, X 384, PROPN 199, ADV 87), het (DET 3802, PRON 793, X 222, PROPN 8), op (ADP 1444, ADV 196, X 152, PROPN 3), voor (ADP 1301, ADV 121, X 102, PROPN 24, SCONJ 4), met (ADP 1295, X 86), ten (X 95, ADP 2), aan (ADP 795, ADV 174, X 72, PROPN 5), een (DET 4196, X 50, NUM 21, PROPN 2), onder (ADP 131, X 47, ADV 7), te (ADP 1868, ADV 117, X 46)

van
- ADP 5516: Ono bereidde beide treffers van Yanasigawa voor .
- X 384: Daarom zet Brussel alles op de ‘ Verklaring van Laken ‘ .
- PROPN 199: Topman O. van der Straaten neemt zijn taken waar .
- ADV 87: die vroeg aan mij van is die dan getrouwd ?
het
- DET 3802: Voor het oefendrieluik met Pakistan zette Bellaart hoog in .
- PRON 793: Van Hanegem nam het in februari over van Dolf Roks .
- X 222: voor het geval er iemand wordt gepakt of doorslaat
- PROPN 8: Dr. Denker , Nieuwsblad van het Noorden , Groningen .
op
- ADP 1444: Hij rekende op drie overwinningen .
- ADV 196: Je schiet er ook niets mee op .
- X 152: En homohaters zijn nauwelijks op de been in Rotterdam .
- PROPN 3: In Zuid-West Nederland zijn er plannen om de A58 bij Wouw en Bergen op Zoom te blokkeren .
voor
- ADP 1301: Boris Vascovic hield de hoop voor Smederevo levend .
- ADV 121: Ono bereidde beide treffers van Yanasigawa voor .
- X 102: voor het geval er iemand wordt gepakt of doorslaat
- PROPN 24: Hoeveel heeft IBM voor Lotus betaald ?
- SCONJ 4: Caris heeft een lange aanloop gehad voor hij besloot schilder te worden .
met
- ADP 1295: Voor het oefendrieluik met Pakistan zette Bellaart hoog in .
- X 86: Het is , schrijft de premier , in overleg met haar vastgesteld .
ten
- X 95: Welke veerboot zonk ten zuidoosten van het eiland Utö ?
- ADP 2: En ten tweede kun je al je intervieuws zelf schrijven . “
aan
- ADP 795: Ze denkt aan het samenvoegen van scholen .
- ADV 174: Bij functies kan men ondermeer denken aan :
- X 72: aan de hand van deze plattegrond zul je je wel kunnen oriënteren
- PROPN 5: In het molenmuseum in Koog aan de Zaan exposeert Jan Kruyver tekeningen en schilderijen van molens .
een
- DET 4196: Heerenveen kende een goede start .
- X 50: wat voor een auto wil je ?
- NUM 21: Ze verraste met opnieuw een wereldrecord
- PROPN 2: Hoe heet de roverhoofdman uit ` Duizend en een nacht ‘ die de berg opent met de spreuk ‘ Sesam , open u ! ‘ ?
onder
- ADP 131: De meeste functies zijn onder meer dan één activiteit onder te brengen .
- X 47: Te denken valt onder meer aan :
- ADV 7: De meeste functies zijn onder meer dan één activiteit onder te brengen .
te
- ADP 1868: Bellaart weigerde zich te verschuilen achter allerlei excuses .
- ADV 117: Dat hebben we tot nu toe te weinig gedaan en dat moet beter . “
- X 46: in de Winkel van Sinkel is van alles te koop .

Morphology

The form / lemma ratio of X is 0.998527 (the average of all parts of speech is 1.258498).

The 1st highest number of forms (4) was observed with the lemma “of”: jaartje, keer, maand, of.

The 2nd highest number of forms (2) was observed with the lemma “Europees”: Europees, Europese.

The 3rd highest number of forms (1) was observed with the lemma “’n”: ‘n.

X occurs with 17 features: Number (3582; 77% instances), Degree (1188; 26% instances), Gender (613; 13% instances), Definite (522; 11% instances), PronType (406; 9% instances), Case (301; 6% instances), VerbForm (191; 4% instances), Person (128; 3% instances), Tense (127; 3% instances), Mood (102; 2% instances), Aspect (74; 2% instances), Subcat (57; 1% instances), Variant (40; 1% instances), VerbType (23; 0% instances), Foreign (16; 0% instances), Poss (15; 0% instances), Reflex (4; 0% instances)

X occurs with 37 feature-value pairs: Aspect=Imp, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, Reflex=Yes, Subcat=Intr, Subcat=Tran, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbType=Aux,Cop, VerbType=Mod

X occurs with 116 feature combinations. The most frequent feature combination is Number=Sing (1868 tokens). Examples: van, op, flo, met, ten, het, aan, ter, voor, een

Relations

X nodes are attached to their parents using 22 different relations: compound (2859; 62% instances), advmod (580; 13% instances), nmod (367; 8% instances), compound:prt (247; 5% instances), nsubj (120; 3% instances), dobj (105; 2% instances), root (101; 2% instances), mark (61; 1% instances), appos (60; 1% instances), conj (43; 1% instances), dep (28; 1% instances), cc (20; 0% instances), acl (10; 0% instances), aux (6; 0% instances), parataxis (6; 0% instances), advcl (5; 0% instances), ccomp (5; 0% instances), xcomp (5; 0% instances), cop (3; 0% instances), case (2; 0% instances), amod (1; 0% instances), name (1; 0% instances)

Parents of X nodes belong to 16 different parts of speech: X (2256; 49% instances), VERB (685; 15% instances), ADP (521; 11% instances), NOUN (472; 10% instances), AUX (276; 6% instances), ROOT (101; 2% instances), NUM (73; 2% instances), ADJ (66; 1% instances), PRON (60; 1% instances), PROPN (47; 1% instances), CONJ (26; 1% instances), ADV (25; 1% instances), PUNCT (11; 0% instances), SCONJ (9; 0% instances), DET (6; 0% instances), SYM (1; 0% instances)

2885 (62%) X nodes are leaves.

521 (11%) X nodes have one child.

456 (10%) X nodes have two children.

773 (17%) X nodes have three or more children.

The highest child degree of a X node is 30.

Children of X nodes are attached using 26 different relations: compound (2600; 57% instances), case (320; 7% instances), det (278; 6% instances), dobj (264; 6% instances), punct (261; 6% instances), advmod (163; 4% instances), nmod (159; 3% instances), mark (103; 2% instances), cop (87; 2% instances), nsubj (59; 1% instances), conj (49; 1% instances), dep (35; 1% instances), advcl (34; 1% instances), cc (34; 1% instances), appos (25; 1% instances), xcomp (22; 0% instances), ccomp (13; 0% instances), aux (10; 0% instances), parataxis (9; 0% instances), acl (6; 0% instances), csubj (5; 0% instances), neg (4; 0% instances), nummod (3; 0% instances), amod (2; 0% instances), compound:prt (1; 0% instances), det:nummod (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (2256; 50% instances), ADP (531; 12% instances), NOUN (355; 8% instances), DET (284; 6% instances), PUNCT (280; 6% instances), NUM (206; 5% instances), PROPN (139; 3% instances), VERB (104; 2% instances), AUX (97; 2% instances), ADV (92; 2% instances), PRON (76; 2% instances), ADJ (64; 1% instances), CONJ (30; 1% instances), SCONJ (28; 1% instances), SYM (5; 0% instances)

Treebank Statistics (UD_Dutch-LassySmall)

There are 384 X lemmas (3%), 384 X types (2%) and 640 X tokens (1%). Out of 17 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand

The 10 most frequent X types: Bel, sp.a, o.a., ca., les, VVKSM, de, nr., Vive, grand

The 10 most frequent ambiguous lemmas: sp.a (X 22, PROPN 3), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), les (X 2, NOUN 2), VVKSM (X 7, NOUN 1), de (DET 5884, PROPN 73, X 6), nr. (X 7, NOUN 2), la (PROPN 5, X 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4)

The 10 most frequent ambiguous types: sp.a (X 22, PROPN 1), o.a. (X 18, SYM 1), ca. (X 16, ADV 3), VVKSM (X 7, NOUN 1), de (DET 4905, PROPN 73, X 6), nr. (X 7, NOUN 2), la (X 5, PROPN 5), VGC (PROPN 6, X 4), des (PROPN 14, X 4, DET 4), MR (X 3, PROPN 2)

sp.a
- X 22: Tuur Van Wallendael ( sp.a )
- PROPN 1: sp.a : 17
o.a.
- X 18: Hij maakte er kennis met o.a. Fernand Khnopff .
- SYM 1: Vanaf 3500 tot 2000 v.Chr. leefden o.a. in de Kempen , de Leemstreek en de Maasvallei culturen van het midden-neolithicum .
ca.
- X 16: Nederlands ( ca. 60% )
- ADV 3: Er wonen ca. 71.300 inwoners , voor de overgrote meerderheid Duitstalig .
VVKSM
- X 7: VVKSM Sint-Martinus ( Nieuwkerken )
- NOUN 1: VVKSM Ename , Oudenaarde
de
- DET 4905: Plechtige aankondiging van de dood des Konings
- PROPN 73: Zo leerde de prins barones Sybille de Selys-Longchamps kennen .
- X 6: In het Frans is dit : « de Belgique » , en in het Duits : « von Belgien » .
nr.
- X 7: Hij eindigde op nr. 29 .
- NOUN 2: Kim Clijsters is een voormalig nr. 1 in het enkelspel en nr. 4 in het dubbelspel bij de juniores ( 1998 ) .
la
- X 5: « Leve de republiek , Vive la république européenne , Vive Lahaut !
- PROPN 5: Interrégionale Wallonne de la FGTB
VGC
- PROPN 6: De VGC vervult een belangrijke rol voor de Brusselse Vlamingen .
- X 4: De Franse Gemeenschapscommissie oefent vergelijkbare bevoegdheden uit als de Vlaamse Gemeenschapscommissie ( VGC ) .
des
- PROPN 14: 1913 - « Villa des Roses » , roman
- X 4: In 1840 werd zijn stoffelijk overschot naar Parijs overgebracht , en bijgezet in de Dôme des Invalides .
- DET 4: Plechtige aankondiging van de dood des Konings
MR
- X 3: Mouvement Réformateur ( MR ) : 25 zetels
- PROPN 2: N.B. hiervan zijn tevens zes Belgische Europarlementariërs lid : de Vlaamse Liberalen en Democraten ( VLD ) / Vivant ( 3 zetels ) en de Mouvement Réformateur ( MR ) ( 3 zetels ) .

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.179900).

The 1st highest number of forms (1) was observed with the lemma “–foto’s”: –foto’s.

The 2nd highest number of forms (1) was observed with the lemma “-Berchem”: -Berchem.

The 3rd highest number of forms (1) was observed with the lemma “-Congres”: -Congres.

X does not occur with any features.

Relations

X nodes are attached to their parents using 13 different relations: nmod (245; 38% instances), mwe (164; 26% instances), root (55; 9% instances), appos (54; 8% instances), conj (52; 8% instances), parataxis (20; 3% instances), nsubj (15; 2% instances), dobj (13; 2% instances), advcl (5; 1% instances), cc (5; 1% instances), mark (5; 1% instances), acl (4; 1% instances), amod (3; 0% instances)

Parents of X nodes belong to 13 different parts of speech: NOUN (158; 25% instances), PROPN (158; 25% instances), X (131; 20% instances), VERB (69; 11% instances), ROOT (55; 9% instances), ADJ (33; 5% instances), NUM (14; 2% instances), PUNCT (9; 1% instances), SYM (5; 1% instances), ADV (2; 0% instances), DET (2; 0% instances), PRON (2; 0% instances), SCONJ (2; 0% instances)

406 (63%) X nodes are leaves.

50 (8%) X nodes have one child.

58 (9%) X nodes have two children.

126 (20%) X nodes have three or more children.

The highest child degree of a X node is 14.

Children of X nodes are attached using 19 different relations: mwe (144; 19% instances), punct (114; 15% instances), conj (97; 13% instances), case (72; 9% instances), cc (70; 9% instances), nmod (59; 8% instances), det (58; 7% instances), name (51; 7% instances), appos (23; 3% instances), parataxis (23; 3% instances), amod (14; 2% instances), nummod (10; 1% instances), acl (9; 1% instances), advmod (8; 1% instances), mark (7; 1% instances), cop (6; 1% instances), nsubj (6; 1% instances), dobj (3; 0% instances), advcl (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (131; 17% instances), PUNCT (117; 15% instances), PROPN (116; 15% instances), NOUN (99; 13% instances), ADP (76; 10% instances), CONJ (69; 9% instances), DET (62; 8% instances), NUM (31; 4% instances), ADJ (25; 3% instances), VERB (14; 2% instances), ADV (9; 1% instances), SYM (7; 1% instances), AUX (6; 1% instances), PART (5; 1% instances), SCONJ (5; 1% instances), PRON (3; 0% instances)

X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]

X: other

Treebank Statistics (UD_Dutch)

Morphology

Relations

Treebank Statistics (UD_Dutch-LassySmall)

Morphology

Relations

`X`: other