
X: other

This document is a placeholder for the language-specific documentation for X.


Treebank Statistics (UD_Dutch)

There are 4034 X lemmas (17%), 4036 X types (14%) and 5663 X tokens (3%). Out of 16 observed tags, the rank of X is: 2 in number of lemmas, 2 in number of types and 11 in number of tokens.
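As a rough illustration of how the three counts differ, the sketch below tallies lemmas, types and tokens for one tag. The function name `tag_counts`, the `(form, lemma, upos)` triple layout and the toy tokens are assumptions made for this example; this is not the script that produced the figures above.

```python
def tag_counts(tokens, tag="X"):
    """Return (lemma count, type count, token count) for one tag.

    `tokens` is an iterable of (form, lemma, upos) triples (assumed layout).
    """
    forms = [form for form, _, upos in tokens if upos == tag]
    lemmas = {lemma for _, lemma, upos in tokens if upos == tag}
    # Tokens are all occurrences, types are distinct forms, lemmas are distinct lemmas.
    return len(lemmas), len(set(forms)), len(forms)


# Toy data invented for illustration; not a sample of UD_Dutch.
toy_tokens = [
    ("Den_Haag", "Den_Haag", "X"),
    ("Den_Haag", "Den_Haag", "X"),
    ("Dolle_Mina", "Dolle_Mina", "X"),
    ("Dolle_Mina's", "Dolle_Mina", "X"),
    ("plaats", "plaats", "NOUN"),
]

n_lemmas, n_types, n_tokens = tag_counts(toy_tokens)
```

In the toy data the four X tokens collapse to three distinct forms and two lemmas, while the NOUN token is ignored.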

The 10 most frequent X lemmas: Den_Haag, voor_het_eerst, Verenigde_Staten, Tweede_Kamer, met_name, aan_het, Integraal_Dossier_JGZ, onder_meer, in_plaats_van, in_verband_met

The 10 most frequent X types: Den_Haag, voor_het_eerst, Verenigde_Staten, Tweede_Kamer, met_name, aan_het, Integraal_Dossier_JGZ, onder_meer, in_plaats_van, in_verband_met

The 10 most frequent ambiguous lemmas: onder_meer (X 17, ADV 9), ten_aanzien_van (X 13, ADP 1), 's_avonds (X 9, ADV 1), tot_en_met (X 8, ADP 2), onder_andere (ADV 20, X 4), ten_opzichte_van (X 4, ADP 2), ten_gevolge_van (X 2, ADP 1), een_uur (NUM 2, X 2), nota_bene (X 2, ADV 1), Ebola_virus (PROPN 1, X 1)

The 10 most frequent ambiguous types: een_uur (NUM 2, X 2), Screeningen (NOUN 2, VERB 1, X 1), ` (PROPN 21, NOUN 12, X 1), plaats (NOUN 139, X 1), won (VERB 58, AUX 2, X 1)

Morphology

The form / lemma ratio of X is 1.000496 (the average of all parts of speech is 1.266833).
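The reported ratio can be reproduced from the counts in the treebank statistics, assuming it is the number of distinct X types divided by the number of distinct X lemmas. That reading is an assumption; the page does not state the formula, but it does match the printed value to six decimals.

```python
# Counts reported in the treebank statistics for X.
x_types = 4036   # distinct X word forms
x_lemmas = 4034  # distinct X lemmas

# Assumed definition: forms per lemma = types / lemmas.
ratio = x_types / x_lemmas
print(f"{ratio:.6f}")  # prints 1.000496
```

A value this close to 1 means that almost every X lemma appears in exactly one form, which fits a tag made up largely of multi-word names and fixed expressions.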

The 1st highest number of forms (3) was observed with the lemma “een_of”: een_jaartje_of, een_keer_of, een_maand_of.

The 2nd highest number of forms (2) was observed with the lemma “Dolle_Mina”: Dolle_Mina, Dolle_Mina’s.

The 3rd highest number of forms (1) was observed with the lemma “"Beer"_Wentink”: "Beer"_Wentink.

X occurs with 18 features: Number (5130; 91% instances), Degree (554; 10% instances), Definite (421; 7% instances), Gender (230; 4% instances), PronType (179; 3% instances), Case (167; 3% instances), VerbForm (82; 1% instances), Tense (48; 1% instances), Mood (43; 1% instances), Person (41; 1% instances), Aspect (26; 0% instances), nl-feat/Subcat (19; 0% instances), nl-feat/Variant (19; 0% instances), nl-feat/Foreign (16; 0% instances), nl-feat/VerbType (11; 0% instances), Poss (5; 0% instances), Reflex (2; 0% instances), nl-feat/PunctType (1; 0% instances)

X occurs with 38 feature-value pairs: Aspect=Imp, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PunctType=Comm, Reflex=Yes, Subcat=Intr, Subcat=Tran, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbType=Aux,Cop, VerbType=Mod

X occurs with 120 feature combinations. The most frequent feature combination is Number=Sing (4104 tokens). Examples: Den_Haag, Tweede_Kamer, in_plaats_van, in_verband_met, ten_aanzien_van, in_staat, op_basis_van, tot_stand, aan_de_hand_van, als_gevolg_van

Relations

X nodes are attached to their parents using 22 different relations: advmod (1138; 20% instances), nmod (1112; 20% instances), nsubj (994; 18% instances), appos (885; 16% instances), nl-dep/compound:prt (392; 7% instances), dobj (385; 7% instances), conj (332; 6% instances), root (177; 3% instances), dep (67; 1% instances), mark (67; 1% instances), aux (34; 1% instances), cc (16; 0% instances), acl (14; 0% instances), parataxis (14; 0% instances), name (12; 0% instances), advcl (7; 0% instances), cop (5; 0% instances), expl (5; 0% instances), case (2; 0% instances), iobj (2; 0% instances), mwe (2; 0% instances), amod (1; 0% instances)

Parents of X nodes belong to 15 different parts of speech: NOUN (2095; 37% instances), VERB (1412; 25% instances), AUX (1056; 19% instances), X (333; 6% instances), ADJ (177; 3% instances), ROOT (177; 3% instances), PRON (139; 2% instances), PROPN (116; 2% instances), NUM (68; 1% instances), ADV (52; 1% instances), SCONJ (22; 0% instances), CONJ (9; 0% instances), SYM (3; 0% instances), ADP (2; 0% instances), DET (2; 0% instances)

2899 (51%) X nodes are leaves.

1319 (23%) X nodes have one child.

781 (14%) X nodes have two children.

664 (12%) X nodes have three or more children.

The highest child degree of an X node is 29.
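The leaf and child counts above amount to a histogram of child degree over X nodes. A minimal sketch of that computation follows, assuming each sentence is available as `(id, head, upos)` triples with head 0 marking the root. The function name and the toy sentence are hypothetical, and the last lines only re-derive the percentages from the counts reported on this page.

```python
from collections import Counter


def child_degree_histogram(sentence, tag="X"):
    """Count nodes with the given tag that have 0, 1, 2, or 3+ children.

    `sentence` is a list of (id, head, upos) triples (assumed layout).
    """
    # Number of children per node id: one vote per token for its head.
    children = Counter(head for _, head, _ in sentence)
    hist = Counter()
    for node_id, _, upos in sentence:
        if upos == tag:
            degree = children[node_id]
            hist["3+" if degree >= 3 else str(degree)] += 1
    return hist


# Toy sentence invented for illustration; not taken from UD_Dutch.
toy = [
    (1, 2, "X"),      # leaf
    (2, 0, "VERB"),   # root
    (3, 2, "X"),      # parent of nodes 4 and 5
    (4, 3, "ADP"),
    (5, 3, "PUNCT"),
]
hist = child_degree_histogram(toy)

# Counts reported above for X nodes, keyed by child degree.
reported = {"0": 2899, "1": 1319, "2": 781, "3+": 664}
total = sum(reported.values())
shares = {k: round(100 * v / total) for k, v in reported.items()}
```

The four reported counts sum to 5663, the number of X tokens, and the rounded shares agree with the percentages listed above.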

Children of X nodes are attached using 23 different relations: case (1390; 24% instances), nmod (1338; 23% instances), punct (940; 16% instances), advmod (387; 7% instances), dobj (368; 6% instances), conj (357; 6% instances), cc (235; 4% instances), advcl (157; 3% instances), cop (145; 3% instances), appos (143; 3% instances), nsubj (106; 2% instances), mark (48; 1% instances), dep (27; 0% instances), parataxis (17; 0% instances), aux (13; 0% instances), name (11; 0% instances), neg (7; 0% instances), csubj (6; 0% instances), nummod (5; 0% instances), ccomp (4; 0% instances), xcomp (3; 0% instances), nl-dep/det:nummod (1; 0% instances), expl (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: ADP (1384; 24% instances), PRON (1020; 18% instances), PUNCT (940; 16% instances), NOUN (663; 12% instances), PROPN (347; 6% instances), X (333; 6% instances), CONJ (225; 4% instances), ADJ (179; 3% instances), VERB (171; 3% instances), AUX (155; 3% instances), ADV (137; 2% instances), NUM (100; 2% instances), SCONJ (45; 1% instances), SYM (9; 0% instances), DET (1; 0% instances)


X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]