home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: X

There are 1592 X lemmas (6%), 1609 X types (5%) and 3283 X tokens (1%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 5 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: the, of, de, Star, Trek, onder ander, Army, les, Nederland, circa

The 10 most frequent X types: the, of, de, Star, Trek, o.a., Army, les, la, in

The 10 most frequent ambiguous lemmas: the (X 72, PROPN 2), of (CCONJ 472, X 74, SCONJ 34, PROPN 22), de (DET 18977, PROPN 197, X 41), Star (X 40, PROPN 7), les (NOUN 6, X 6), Nederland (PROPN 153, X 30), circa (X 29, ADV 11, DET 3), la (PROPN 16, X 15), in (ADP 7144, X 15, PROPN 5), nummer (NOUN 80, X 25)

The 10 most frequent ambiguous types: the (X 72, PROPN 2), of (CCONJ 462, X 74, SCONJ 31, PROPN 22), de (DET 16356, PROPN 197, X 41), Star (X 40, PROPN 7), les (X 6, NOUN 3), la (PROPN 16, X 15), in (ADP 5997, X 15, PROPN 5), ca. (X 24, DET 2), Potomac (X 19, PROPN 7), and (X 11, PROPN 3)

Morphology

The form / lemma ratio of X is 1.010678 (the average of all parts of speech is 1.223407).

The 1st highest number of forms (5) was observed with the lemma “bijvoorbeeld”: b.v., bijv, bijv., bv, bv..

The 2nd highest number of forms (3) was observed with the lemma “nummer”: nr, nr., nrs.

The 3rd highest number of forms (3) was observed with the lemma “onder ander”: o.a., oa, oa..

X occurs with 2 features: Foreign (2849; 87% instances), Abbr (257; 8% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (2849 tokens). Examples: the, of, de, Star, Trek, Army, les, la, in, grand

Relations

X nodes are attached to their parents using 24 different relations: fixed (1926; 59% instances), nmod (373; 11% instances), appos (174; 5% instances), conj (149; 5% instances), root (121; 4% instances), nsubj (120; 4% instances), obl (105; 3% instances), obj (76; 2% instances), parataxis (64; 2% instances), obl:arg (35; 1% instances), acl (27; 1% instances), nsubj:pass (27; 1% instances), xcomp (18; 1% instances), mark (16; 0% instances), advcl (15; 0% instances), case (14; 0% instances), obl:agent (7; 0% instances), amod (6; 0% instances), acl:relcl (3; 0% instances), ccomp (2; 0% instances), flat (2; 0% instances), csubj (1; 0% instances), iobj (1; 0% instances), orphan (1; 0% instances)

Parents of X nodes belong to 12 different parts of speech: X (1914; 58% instances), NOUN (548; 17% instances), VERB (392; 12% instances), (121; 4% instances), PROPN (106; 3% instances), SYM (83; 3% instances), NUM (70; 2% instances), ADJ (31; 1% instances), DET (9; 0% instances), ADV (4; 0% instances), PRON (4; 0% instances), ADP (1; 0% instances)

2113 (64%) X nodes are leaves.

184 (6%) X nodes have one child.

232 (7%) X nodes have two children.

754 (23%) X nodes have three or more children.

The highest child degree of a X node is 59.

Children of X nodes are attached using 27 different relations: fixed (1905; 43% instances), punct (973; 22% instances), case (301; 7% instances), det (279; 6% instances), conj (244; 6% instances), nmod (133; 3% instances), amod (102; 2% instances), cc (81; 2% instances), parataxis (74; 2% instances), appos (62; 1% instances), acl (36; 1% instances), mark (32; 1% instances), cop (29; 1% instances), nsubj (28; 1% instances), nummod (28; 1% instances), acl:relcl (27; 1% instances), nmod:poss (24; 1% instances), advmod (10; 0% instances), flat (9; 0% instances), cc:preconj (6; 0% instances), advcl (4; 0% instances), obl (3; 0% instances), aux:pass (1; 0% instances), ccomp (1; 0% instances), nsubj:pass (1; 0% instances), obj (1; 0% instances), orphan (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: X (1914; 44% instances), PUNCT (973; 22% instances), ADP (304; 7% instances), DET (282; 6% instances), NOUN (242; 6% instances), SYM (129; 3% instances), ADJ (103; 2% instances), PROPN (98; 2% instances), CCONJ (96; 2% instances), VERB (74; 2% instances), NUM (68; 2% instances), AUX (30; 1% instances), ADV (29; 1% instances), SCONJ (29; 1% instances), PRON (24; 1% instances)