Treebank Statistics: UD_Dutch-LassySmall: POS Tags: X
There are 28 X
lemmas (0%), 38 X
types (0%) and 118 X
tokens (0%).
Out of 16 observed tags, the rank of X
is: 12 in number of lemmas, 11 in number of types and 15 in number of tokens.
The 10 most frequent X
lemmas: circa, onder ander, nummer, bijvoorbeeld, voor Christus, Nederland, liberaal, enzovoorts, katholiek, onder veel
The 10 most frequent X
types: ca., o.a., nr., v.Chr., Ned, lib, kath, o.m., blz., bv
The 10 most frequent ambiguous lemmas: circa (X 20, ADV 2), nummer (NOUN 16, X 12), bijvoorbeeld (ADV 20, X 9), Nederland (PROPN 78, X 6), liberaal (ADJ 18, NOUN 14, X 6), katholiek (NOUN 12, ADJ 11, X 4), bladzijde (NOUN 4, X 3), zogenaamd (ADJ 34, X 2), Onze-Lieve-Vrouw (PROPN 1, X 1), junior (NOUN 4, ADJ 1, X 1)
The 10 most frequent ambiguous types:
Morphology
The form / lemma ratio of X
is 1.357143 (the average of all parts of speech is 1.168496).
The 1st highest number of forms (5) was observed with the lemma “bijvoorbeeld”: b.v., bijv, bijv., bv, bv..
The 2nd highest number of forms (2) was observed with the lemma “circa”: ca, ca..
The 3rd highest number of forms (2) was observed with the lemma “dat wil zeggen”: d.w.z., dwz..
X
occurs with 1 features: Abbr (118; 100% instances)
X
occurs with 1 feature-value pairs: Abbr=Yes
X
occurs with 1 feature combinations.
The most frequent feature combination is Abbr=Yes
(118 tokens).
Examples: ca., o.a., nr., v.Chr., Ned, lib, kath, o.m., blz., bv
Relations
X
nodes are attached to their parents using 11 different relations: nmod (70; 59% instances), fixed (9; 8% instances), obl (9; 8% instances), case (6; 5% instances), mark (5; 4% instances), parataxis (5; 4% instances), amod (4; 3% instances), cc (4; 3% instances), acl (2; 2% instances), conj (2; 2% instances), root (2; 2% instances)
Parents of X
nodes belong to 10 different parts of speech: NOUN (37; 31% instances), PROPN (33; 28% instances), NUM (18; 15% instances), SYM (10; 8% instances), VERB (10; 8% instances), X (3; 3% instances), DET (2; 2% instances), PRON (2; 2% instances), (2; 2% instances), ADJ (1; 1% instances)
78 (66%) X
nodes are leaves.
3 (3%) X
nodes have one child.
16 (14%) X
nodes have two children.
21 (18%) X
nodes have three or more children.
The highest child degree of a X
node is 8.
Children of X
nodes are attached using 15 different relations: punct (62; 54% instances), appos (13; 11% instances), parataxis (12; 10% instances), case (8; 7% instances), nummod (5; 4% instances), nmod (4; 3% instances), flat (3; 3% instances), acl (1; 1% instances), amod (1; 1% instances), cc (1; 1% instances), conj (1; 1% instances), cop (1; 1% instances), det (1; 1% instances), nsubj (1; 1% instances), orphan (1; 1% instances)
Children of X
nodes belong to 13 different parts of speech: PUNCT (62; 54% instances), NUM (17; 15% instances), ADP (9; 8% instances), NOUN (5; 4% instances), SYM (5; 4% instances), PROPN (4; 3% instances), VERB (3; 3% instances), X (3; 3% instances), ADJ (2; 2% instances), DET (2; 2% instances), AUX (1; 1% instances), CCONJ (1; 1% instances), PRON (1; 1% instances)