
This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-LassySmall: POS Tags: X

There are 342 X lemmas (3%), 342 X types (2%) and 467 X tokens (0%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: o.a., de, la, les, the, Vive, of, art, cordon, des

The 10 most frequent X types: o.a., de, la, les, the, Vive, of, art, cordon, des

The 10 most frequent ambiguous lemmas: de (DET 5869, PROPN 93, X 8), la (X 8, PROPN 2), les (NOUN 2, X 2), of (CCONJ 113, SCONJ 8, X 6, PROPN 4), des (PROPN 14, X 4), le (X 3, PROPN 2), Belgique (PROPN 4, X 3), française (X 2, NOUN 1, PROPN 1), pas (ADV 29, X 3), : (PUNCT 674, SYM 3, X 2)

The 10 most frequent ambiguous types: de (DET 4883, PROPN 93, X 8), la (X 8, PROPN 2), of (CCONJ 111, SCONJ 8, X 6, PROPN 4), des (PROPN 14, DET 4, X 4), le (X 3, PROPN 2), Belgique (PROPN 4, X 3), française (X 2, NOUN 1, PROPN 1), pas (ADV 23, X 3), : (PUNCT 674, SYM 3, X 2), Belges (X 2, PROPN 1)

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.174887).
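A ratio of this kind can be reproduced from a CoNLL-U file. The sketch below assumes one plausible definition, namely the number of distinct (lemma, form) pairs divided by the number of distinct lemmas for the given tag; the page does not say how the reported figure was computed, and details such as case folding may differ. The function name `form_lemma_ratio` and the toy rows are invented for illustration and are not taken from the treebank.

```python
from collections import defaultdict

# Toy CoNLL-U rows (10 tab-separated columns); invented, not treebank data.
TOY = "\n".join("\t".join(row) for row in [
    ("1", "de", "de", "X", "_", "Foreign=Yes", "2", "fixed", "_", "_"),
    ("2", "la", "la", "X", "_", "Foreign=Yes", "0", "root", "_", "_"),
    ("3", "la", "la", "X", "_", "Foreign=Yes", "2", "fixed", "_", "_"),
    ("4", "huis", "huis", "NOUN", "_", "_", "2", "conj", "_", "_"),
    ("5", "huizen", "huis", "NOUN", "_", "_", "2", "conj", "_", "_"),
])


def form_lemma_ratio(conllu_text, upos):
    """Average number of distinct forms per lemma for one UPOS tag."""
    forms_by_lemma = defaultdict(set)
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword ranges (e.g. 1-2) and empty nodes (e.g. 1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            forms_by_lemma[lemma].add(form)
    if not forms_by_lemma:
        return None  # the tag does not occur in the input
    n_pairs = sum(len(forms) for forms in forms_by_lemma.values())
    return n_pairs / len(forms_by_lemma)


print(form_lemma_ratio(TOY, "X"))
print(form_lemma_ratio(TOY, "NOUN"))
```

Under this definition a ratio of exactly 1 means that every lemma occurs with a single form, which is consistent with the identical lemma and type counts reported for X above.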

The 1st highest number of forms (1) was observed with the lemma “–foto’s”: –foto’s.

The 2nd highest number of forms (1) was observed with the lemma “-Berchem”: -Berchem.

The 3rd highest number of forms (1) was observed with the lemma “-Congres”: -Congres.

X occurs with 2 features: Foreign (340; 73% instances), Abbr (45; 10% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (340 tokens). Examples: de, la, les, the, Vive, of, art, cordon, des, design

Relations

X nodes are attached to their parents using 17 different relations: fixed (195; 42% instances), nmod (71; 15% instances), conj (48; 10% instances), root (34; 7% instances), appos (25; 5% instances), obl (25; 5% instances), parataxis (23; 5% instances), nsubj (8; 2% instances), obj (8; 2% instances), case (7; 1% instances), mark (5; 1% instances), cc (4; 1% instances), flat:name (4; 1% instances), advcl (3; 1% instances), amod (3; 1% instances), nsubj:pass (2; 0% instances), xcomp (2; 0% instances)
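As a quick consistency check, the percentages in the list above can be recomputed from the counts. The sketch below copies the reported counts and percentages and assumes the percentages are each count divided by the total, rounded to the nearest integer; the helper name `percentages` is invented for illustration.

```python
# (count, reported percentage) for each relation, copied from the list above.
REPORTED = {
    "fixed": (195, 42), "nmod": (71, 15), "conj": (48, 10), "root": (34, 7),
    "appos": (25, 5), "obl": (25, 5), "parataxis": (23, 5), "nsubj": (8, 2),
    "obj": (8, 2), "case": (7, 1), "mark": (5, 1), "cc": (4, 1),
    "flat:name": (4, 1), "advcl": (3, 1), "amod": (3, 1),
    "nsubj:pass": (2, 0), "xcomp": (2, 0),
}


def percentages(counts):
    """Share of each relation in percent, rounded to the nearest integer."""
    total = sum(counts.values())
    return {rel: round(100 * n / total) for rel, n in counts.items()}


counts = {rel: n for rel, (n, _) in REPORTED.items()}
total = sum(counts.values())
recomputed = percentages(counts)
# Relations whose recomputed percentage differs from the reported one.
mismatches = {rel for rel, (_, pct) in REPORTED.items() if recomputed[rel] != pct}

print(total, mismatches)
```

The counts sum to 467, the number of X tokens given at the top of the page, and under the rounding assumption no relation's percentage differs from the reported value.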

Parents of X nodes belong to 12 different parts of speech: X (198; 42% instances), NOUN (105; 22% instances), VERB (52; 11% instances), PROPN (42; 9% instances), none, i.e. the artificial root (34; 7% instances), ADJ (12; 3% instances), SYM (11; 2% instances), DET (5; 1% instances), NUM (3; 1% instances), ADV (2; 0% instances), SCONJ (2; 0% instances), PRON (1; 0% instances)

246 (53%) X nodes are leaves.

47 (10%) X nodes have one child.

42 (9%) X nodes have two children.

132 (28%) X nodes have three or more children.

The highest child degree of an X node is 12.
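The child-degree figures above (leaves, one child, two children, three or more) can be derived from the HEAD column of a CoNLL-U file. The following is a minimal sketch, assuming sentences are separated by blank lines and that only basic dependencies are counted; the function name `child_degree_buckets` and the toy sentence, including its annotation, are invented for illustration.

```python
from collections import Counter

# Toy sentence in CoNLL-U format; invented, not taken from the treebank.
TOY = "\n".join("\t".join(row) for row in [
    ("1", "Vive", "Vive", "X", "_", "Foreign=Yes", "0", "root", "_", "_"),
    ("2", "la", "la", "X", "_", "Foreign=Yes", "1", "fixed", "_", "_"),
    ("3", "Belgique", "Belgique", "X", "_", "Foreign=Yes", "1", "fixed", "_", "_"),
    ("4", "!", "!", "PUNCT", "_", "_", "1", "punct", "_", "_"),
])


def child_degree_buckets(conllu_text, upos):
    """Count nodes of one UPOS tag by number of children: 0, 1, 2, or 3 (= three or more)."""
    buckets = Counter()
    for sentence in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in sentence.splitlines()
                if line and not line.startswith("#")]
        # Keep ordinary word lines only (skip ranges like 1-2 and empty nodes like 1.1).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # HEAD is column 7 (index 6): count how often each ID is used as a head.
        children_of = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                buckets[min(children_of[r[0]], 3)] += 1
    return buckets


print(child_degree_buckets(TOY, "X"))
```

In the toy sentence, the first X node has three children and falls into the "three or more" bucket, while the other two X nodes are leaves.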

Children of X nodes are attached using 21 different relations: punct (262; 35% instances), fixed (191; 25% instances), conj (83; 11% instances), case (48; 6% instances), det (38; 5% instances), nmod (33; 4% instances), cc (20; 3% instances), amod (19; 3% instances), appos (11; 1% instances), parataxis (11; 1% instances), nummod (7; 1% instances), cop (6; 1% instances), nsubj (6; 1% instances), acl:relcl (4; 1% instances), mark (4; 1% instances), advmod (3; 0% instances), flat:name (3; 0% instances), acl (2; 0% instances), obl (2; 0% instances), advcl (1; 0% instances), nmod:poss (1; 0% instances)

Children of X nodes belong to 15 different parts of speech: PUNCT (262; 35% instances), X (198; 26% instances), NOUN (87; 12% instances), ADP (53; 7% instances), DET (41; 5% instances), PROPN (22; 3% instances), ADJ (21; 3% instances), CCONJ (19; 3% instances), VERB (12; 2% instances), ADV (11; 1% instances), NUM (11; 1% instances), SYM (7; 1% instances), AUX (6; 1% instances), PRON (3; 0% instances), SCONJ (2; 0% instances)