home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-Alpino: POS Tags: X

There are 73 X lemmas (0%), 83 X types (0%) and 260 X tokens (0%). Out of 16 observed tags, the rank of X is: 10 in number of lemmas, 11 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: procent, onder ander, jongstleden, zogenaamd, miljoen, onder veel, rooms-katholiek, nummer, aanstaand, bijvoorbeeld

The 10 most frequent X types: pct., o.a., jl., o.m., pct, mln., a.s., etc., rk, v.j.

The 10 most frequent ambiguous lemmas: procent (NOUN 89, X 40), zogenaamd (ADJ 26, X 14), miljoen (NOUN 90, X 11, NUM 2), nummer (NOUN 33, X 9), aanstaand (X 8, ADJ 3), bijvoorbeeld (ADV 45, X 7), circa (ADV 8, X 7), namelijk (ADV 29, X 6), seconde (NOUN 16, X 6), enzovoorts (X 5, ADV 1, CCONJ 1)

The 10 most frequent ambiguous types: m (X 3, SYM 1), B (PROPN 7, NOUN 2, SYM 1, X 1), C.U.R. (PROPN 1, X 1), D (PROPN 2, X 1), K (PROPN 5, SYM 1, X 1), O. (PROPN 2, X 1), P (PROPN 1, X 1), dr. (PROPN 49, X 1), durfde (VERB 3, X 1), soc. (ADV 2, X 1)

Morphology

The form / lemma ratio of X is 1.136986 (the average of all parts of speech is 1.220969).

The 1st highest number of forms (3) was observed with the lemma “miljoen”: min., mln, mln..

The 2nd highest number of forms (3) was observed with the lemma “procent”: p., pct, pct..

The 3rd highest number of forms (3) was observed with the lemma “zogenaamd”: z.g., zg., zgn..

X occurs with 1 features: Abbr (257; 99% instances)

X occurs with 1 feature-value pairs: Abbr=Yes

X occurs with 2 feature combinations. The most frequent feature combination is Abbr=Yes (257 tokens). Examples: pct., o.a., jl., o.m., pct, mln., a.s., etc., rk, v.j.

Relations

X nodes are attached to their parents using 15 different relations: nmod (110; 42% instances), obl (41; 16% instances), fixed (24; 9% instances), cc (21; 8% instances), conj (11; 4% instances), parataxis (11; 4% instances), amod (9; 3% instances), nsubj (7; 3% instances), obj (7; 3% instances), acl (6; 2% instances), case (5; 2% instances), cc:preconj (3; 1% instances), appos (2; 1% instances), mark (2; 1% instances), root (1; 0% instances)

Parents of X nodes belong to 11 different parts of speech: NOUN (91; 35% instances), VERB (53; 20% instances), PROPN (37; 14% instances), NUM (29; 11% instances), SYM (24; 9% instances), X (16; 6% instances), ADJ (6; 2% instances), ADP (1; 0% instances), DET (1; 0% instances), PRON (1; 0% instances), (1; 0% instances)

151 (58%) X nodes are leaves.

26 (10%) X nodes have one child.

46 (18%) X nodes have two children.

37 (14%) X nodes have three or more children.

The highest child degree of a X node is 10.

Children of X nodes are attached using 17 different relations: punct (70; 28% instances), nummod (46; 18% instances), case (35; 14% instances), det (17; 7% instances), nmod (17; 7% instances), parataxis (16; 6% instances), appos (15; 6% instances), conj (11; 4% instances), fixed (10; 4% instances), amod (6; 2% instances), acl (2; 1% instances), cc (2; 1% instances), cop (2; 1% instances), nsubj (2; 1% instances), advcl (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Children of X nodes belong to 13 different parts of speech: PUNCT (70; 28% instances), NUM (66; 26% instances), ADP (35; 14% instances), NOUN (26; 10% instances), X (16; 6% instances), SYM (14; 6% instances), DET (8; 3% instances), ADV (6; 2% instances), VERB (4; 2% instances), ADJ (3; 1% instances), AUX (2; 1% instances), CCONJ (2; 1% instances), PROPN (2; 1% instances)