X: other
This document is a placeholder for the language-specific documentation for X.
Treebank Statistics (UD_Dutch)
There are 4034 X lemmas (17%), 4036 X types (14%) and 5663 X tokens (3%).
Out of 16 observed tags, the rank of X is: 2 in number of lemmas, 2 in number of types and 11 in number of tokens.
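Counts and ranks of this kind can be reproduced from a treebank in CoNLL-U format. The following is a minimal sketch, not the script that generated this page: the function names `upos_counts` and `rank_of` are hypothetical, the sample sentence is made up, and the sketch assumes that types are distinct word forms compared case-sensitively.

```python
from collections import defaultdict

def upos_counts(conllu_text):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: case-sensitive word forms
        tokens[upos] += 1
    return {tag: (len(lemmas[tag]), len(types[tag]), tokens[tag]) for tag in tokens}

def rank_of(tag, counts, index):
    """1-based rank of `tag` by tuple index (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda t: counts[t][index], reverse=True)
    return ordered.index(tag) + 1

# Tiny made-up sample, not taken from UD_Dutch.
sample = "\n".join([
    "# text = toy",
    "1\tDen_Haag\tDen_Haag\tX\t_\t_\t2\tnsubj\t_\t_",
    "2\tligt\tliggen\tVERB\t_\t_\t0\troot\t_\t_",
    "3\tmet_name\tmet_name\tX\t_\t_\t2\tadvmod\t_\t_",
    "4\tDen_Haag\tDen_Haag\tX\t_\t_\t2\tdobj\t_\t_",
])
counts = upos_counts(sample)
print(counts["X"])              # (2, 2, 3)
print(rank_of("X", counts, 2))  # 1
```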
The 10 most frequent X lemmas: Den_Haag, voor_het_eerst, Verenigde_Staten, Tweede_Kamer, met_name, aan_het, Integraal_Dossier_JGZ, onder_meer, in_plaats_van, in_verband_met
The 10 most frequent X types: Den_Haag, voor_het_eerst, Verenigde_Staten, Tweede_Kamer, met_name, aan_het, Integraal_Dossier_JGZ, onder_meer, in_plaats_van, in_verband_met
The 10 most frequent ambiguous lemmas: onder_meer (X 17, ADV 9), ten_aanzien_van (X 13, ADP 1), ‘s_avonds (X 9, ADV 1), tot_en_met (X 8, ADP 2), onder_andere (ADV 20, X 4), ten_opzichte_van (X 4, ADP 2), ten_gevolge_van (X 2, ADP 1), een_uur (NUM 2, X 2), nota_bene (X 2, ADV 1), Ebola_virus (PROPN 1, X 1)
The 10 most frequent ambiguous types: een_uur (NUM 2, X 2), Screeningen (NOUN 2, VERB 1, X 1), ` (PROPN 21, NOUN 12, X 1), plaats (NOUN 139, X 1), won (VERB 58, AUX 2, X 1)
Morphology
The form / lemma ratio of X is 1.000496 (the average of all parts of speech is 1.266833).
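The reported ratio is consistent with dividing the number of X types by the number of X lemmas given under Treebank Statistics. That the generating script computes it exactly this way is an assumption; the arithmetic itself can be checked as follows:

```python
# Counts taken from the Treebank Statistics section above.
n_types = 4036
n_lemmas = 4034

# Assumption: the ratio is distinct forms (types) divided by distinct lemmas.
ratio = n_types / n_lemmas
print(f"{ratio:.6f}")  # 1.000496
```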
The 1st highest number of forms (3) was observed with the lemma “een_of”: een_jaartje_of, een_keer_of, een_maand_of.
The 2nd highest number of forms (2) was observed with the lemma “Dolle_Mina”: Dolle_Mina, Dolle_Mina’s.
The 3rd highest number of forms (1) was observed with the lemma “"Beer"_Wentink”: "Beer"_Wentink.
X occurs with 18 features: Number (5130; 91% instances), Degree (554; 10% instances), Definite (421; 7% instances), Gender (230; 4% instances), PronType (179; 3% instances), Case (167; 3% instances), VerbForm (82; 1% instances), Tense (48; 1% instances), Mood (43; 1% instances), Person (41; 1% instances), Aspect (26; 0% instances), nl-feat/Subcat (19; 0% instances), nl-feat/Variant (19; 0% instances), nl-feat/Foreign (16; 0% instances), nl-feat/VerbType (11; 0% instances), Poss (5; 0% instances), Reflex (2; 0% instances), nl-feat/PunctType (1; 0% instances)
X occurs with 38 feature-value pairs: Aspect=Imp, Case=Dat, Case=Gen, Case=Nom, Definite=Def, Degree=Cmp, Degree=Pos, Degree=Sup, Foreign=Foreign, Gender=Com, Gender=Neut, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Plur,Sing, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Prs, PronType=Rel, PunctType=Comm, Reflex=Yes, Subcat=Intr, Subcat=Tran, Tense=Past, Tense=Pres, Variant=Short, VerbForm=Fin, VerbForm=Inf, VerbForm=Part, VerbType=Aux,Cop, VerbType=Mod
X occurs with 120 feature combinations.
The most frequent feature combination is Number=Sing (4104 tokens).
Examples: Den_Haag, Tweede_Kamer, in_plaats_van, in_verband_met, ten_aanzien_van, in_staat, op_basis_van, tot_stand, aan_de_hand_van, als_gevolg_van
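The three feature statistics above (features, feature-value pairs, feature combinations) can all be derived from the FEATS column of a CoNLL-U file, where pairs are separated by `|`. The sketch below is illustrative only: `feature_stats` is a hypothetical helper, the input list is made up, and counting the empty value `_` as a combination of its own is an assumption.

```python
from collections import Counter

def feature_stats(feats_column):
    """Count features, feature-value pairs and whole combinations.

    feats_column: iterable of FEATS strings such as "Degree=Pos|Number=Sing".
    """
    feature_counts = Counter()
    pair_counts = Counter()
    combo_counts = Counter()
    for feats in feats_column:
        combo_counts[feats] += 1  # assumption: "_" counts as a combination too
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name = pair.split("=", 1)[0]
            feature_counts[name] += 1
            pair_counts[pair] += 1
    return feature_counts, pair_counts, combo_counts

# Made-up FEATS values for four tokens, not taken from UD_Dutch.
toy = ["Number=Sing", "Number=Sing", "Degree=Pos|Number=Sing", "_"]
features, pairs, combos = feature_stats(toy)
print(features["Number"])     # 3
print(len(pairs))             # 2
print(combos.most_common(1))  # [('Number=Sing', 2)]
```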
Relations
X nodes are attached to their parents using 22 different relations: advmod (1138; 20% instances), nmod (1112; 20% instances), nsubj (994; 18% instances), appos (885; 16% instances), nl-dep/compound:prt (392; 7% instances), dobj (385; 7% instances), conj (332; 6% instances), root (177; 3% instances), dep (67; 1% instances), mark (67; 1% instances), aux (34; 1% instances), cc (16; 0% instances), acl (14; 0% instances), parataxis (14; 0% instances), name (12; 0% instances), advcl (7; 0% instances), cop (5; 0% instances), expl (5; 0% instances), case (2; 0% instances), iobj (2; 0% instances), mwe (2; 0% instances), amod (1; 0% instances)
Parents of X nodes belong to 15 different parts of speech: NOUN (2095; 37% instances), VERB (1412; 25% instances), AUX (1056; 19% instances), X (333; 6% instances), ADJ (177; 3% instances), ROOT (177; 3% instances), PRON (139; 2% instances), PROPN (116; 2% instances), NUM (68; 1% instances), ADV (52; 1% instances), SCONJ (22; 0% instances), CONJ (9; 0% instances), SYM (3; 0% instances), ADP (2; 0% instances), DET (2; 0% instances)
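The two distributions above come from the DEPREL and HEAD columns: for each X node, one looks up its relation label and the part of speech of the node its HEAD points to. A minimal sketch under stated assumptions follows; `attachment_stats` and the tuple layout are hypothetical, the data is made up, and a HEAD of 0 is reported as ROOT, which matches how ROOT appears in the list of parent parts of speech.

```python
from collections import Counter

def attachment_stats(sentences, tag="X"):
    """sentences: list of sentences, each a list of (id, upos, head, deprel)."""
    relations = Counter()
    parent_pos = Counter()
    for sent in sentences:
        upos_by_id = {tok_id: upos for tok_id, upos, _, _ in sent}
        for _, upos, head, deprel in sent:
            if upos != tag:
                continue
            relations[deprel] += 1
            # HEAD 0 has no token in the sentence, so it is reported as ROOT.
            parent_pos[upos_by_id.get(head, "ROOT")] += 1
    return relations, parent_pos

# Made-up sentences, not taken from UD_Dutch.
toy = [
    [(1, "X", 2, "nsubj"), (2, "VERB", 0, "root"), (3, "X", 1, "appos")],
    [(1, "X", 0, "root")],
]
relations, parent_pos = attachment_stats(toy)
print(sorted(relations.items()))   # [('appos', 1), ('nsubj', 1), ('root', 1)]
print(sorted(parent_pos.items()))  # [('ROOT', 1), ('VERB', 1), ('X', 1)]
```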
2899 (51%) X nodes are leaves.
1319 (23%) X nodes have one child.
781 (14%) X nodes have two children.
664 (12%) X nodes have three or more children.
The highest child degree of an X node is 29.
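The child-degree figures above can be obtained by counting, for every X node, how many tokens in the same sentence name it as their HEAD. The sketch below is an illustration rather than the original tooling: `child_degree_buckets` and its tuple input are hypothetical and the toy sentence is made up. The final lines only recheck the percentages against the counts reported on this page.

```python
from collections import Counter

def child_degree_buckets(sentences, tag="X"):
    """sentences: list of sentences, each a list of (id, upos, head) tuples.

    Returns a Counter keyed by 0, 1, 2 and 3, where 3 means "three or more".
    """
    buckets = Counter()
    for sent in sentences:
        n_children = Counter(head for _, _, head in sent)
        for tok_id, upos, _ in sent:
            if upos == tag:
                buckets[min(n_children[tok_id], 3)] += 1
    return buckets

# Made-up sentence: token 3 (X) has two children, tokens 1 and 5 (X) have none.
toy = [[(1, "X", 2), (2, "VERB", 0), (3, "X", 2), (4, "ADP", 3), (5, "X", 3)]]
buckets = child_degree_buckets(toy)
print(dict(buckets))  # {0: 2, 2: 1}

# Recheck of the figures reported above.
reported = {0: 2899, 1: 1319, 2: 781, 3: 664}
total = sum(reported.values())
percentages = {k: round(100 * v / total) for k, v in reported.items()}
print(total)        # 5663
print(percentages)  # {0: 51, 1: 23, 2: 14, 3: 12}
```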
Children of X nodes are attached using 23 different relations: case (1390; 24% instances), nmod (1338; 23% instances), punct (940; 16% instances), advmod (387; 7% instances), dobj (368; 6% instances), conj (357; 6% instances), cc (235; 4% instances), advcl (157; 3% instances), cop (145; 3% instances), appos (143; 3% instances), nsubj (106; 2% instances), mark (48; 1% instances), dep (27; 0% instances), parataxis (17; 0% instances), aux (13; 0% instances), name (11; 0% instances), neg (7; 0% instances), csubj (6; 0% instances), nummod (5; 0% instances), ccomp (4; 0% instances), xcomp (3; 0% instances), nl-dep/det:nummod (1; 0% instances), expl (1; 0% instances)
Children of X nodes belong to 15 different parts of speech: ADP (1384; 24% instances), PRON (1020; 18% instances), PUNCT (940; 16% instances), NOUN (663; 12% instances), PROPN (347; 6% instances), X (333; 6% instances), CONJ (225; 4% instances), ADJ (179; 3% instances), VERB (171; 3% instances), AUX (155; 3% instances), ADV (137; 2% instances), NUM (100; 2% instances), SCONJ (45; 1% instances), SYM (9; 0% instances), DET (1; 0% instances)
X in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]