home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Dutch-Alpino: POS Tags: X

There are 484 X lemmas (2%), 483 X types (2%) and 755 X tokens (0%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 14 in number of tokens.

The 10 most frequent X lemmas: a, fancy, o.a., the, jl., o.m., a.s., and, binnen-, etc.

The 10 most frequent X types: a, fancy, o.a., the, jl., o.m., a.s., and, binnen-, etc.

The 10 most frequent ambiguous lemmas: a (X 21, NOUN 2, SYM 1), o.a. (X 20, ADJ 1), the (X 15, PROPN 3), front (X 7, NOUN 3), National (X 6, PROPN 4), flo (NOUN 122, PROPN 12, X 6, ADJ 4), giro (X 3, NOUN 2), la (X 3, PROPN 1), met (ADP 1498, X 3), tot (ADP 586, X 3)

The 10 most frequent ambiguous types: a (X 19, SYM 1), o.a. (X 20, ADJ 1), the (X 15, PROPN 3), front (X 7, NOUN 1), National (X 6, PROPN 4), flo (NOUN 121, PROPN 12, X 6, ADJ 4), giro (X 3, NOUN 2), la (X 3, PROPN 1), m (X 3, NOUN 1), nl. (ADV 3, X 3)

Morphology

The form / lemma ratio of X is 0.997934 (the average of all parts of speech is 1.214322).

The 1st highest number of forms (1) was observed with the lemma “”proef”-”: “proef”-.

The 2nd highest number of forms (1) was observed with the lemma “’n”: ‘n.

The 3rd highest number of forms (1) was observed with the lemma “-avond”: -avond.

X occurs with 2 features: Foreign (519; 69% instances), Abbr (119; 16% instances)

X occurs with 2 feature-value pairs: Abbr=Yes, Foreign=Yes

X occurs with 3 feature combinations. The most frequent feature combination is Foreign=Yes (519 tokens). Examples: fancy, a, the, and, front, to, National, be, fiction, flo

Relations

X nodes are attached to their parents using 23 different relations: fixed (229; 30% instances), nmod (151; 20% instances), flat:name (66; 9% instances), obl (64; 8% instances), nsubj (37; 5% instances), cc (35; 5% instances), conj (28; 4% instances), appos (24; 3% instances), obj (24; 3% instances), root (22; 3% instances), parataxis (18; 2% instances), amod (9; 1% instances), case (9; 1% instances), nsubj:pass (9; 1% instances), acl (7; 1% instances), det (6; 1% instances), xcomp (6; 1% instances), advcl (3; 0% instances), obl:agent (3; 0% instances), iobj (2; 0% instances), advmod (1; 0% instances), ccomp (1; 0% instances), orphan (1; 0% instances)

Parents of X nodes belong to 13 different parts of speech: NOUN (216; 29% instances), X (206; 27% instances), VERB (139; 18% instances), PROPN (90; 12% instances), NUM (28; 4% instances), (22; 3% instances), ADJ (17; 2% instances), SYM (14; 2% instances), ADP (5; 1% instances), ADV (5; 1% instances), PRON (5; 1% instances), DET (4; 1% instances), INTJ (4; 1% instances)

432 (57%) X nodes are leaves.

65 (9%) X nodes have one child.

75 (10%) X nodes have two children.

183 (24%) X nodes have three or more children.

The highest child degree of a X node is 12.

Children of X nodes are attached using 23 different relations: fixed (230; 24% instances), punct (190; 20% instances), conj (127; 13% instances), det (110; 11% instances), case (102; 10% instances), nmod (60; 6% instances), amod (31; 3% instances), flat:name (18; 2% instances), cc (17; 2% instances), parataxis (13; 1% instances), nummod (11; 1% instances), mark (10; 1% instances), acl:relcl (9; 1% instances), nsubj (9; 1% instances), appos (8; 1% instances), cop (8; 1% instances), acl (5; 1% instances), nmod:poss (5; 1% instances), obl (5; 1% instances), advmod (3; 0% instances), advcl (1; 0% instances), aux (1; 0% instances), obj (1; 0% instances)

Children of X nodes belong to 16 different parts of speech: X (206; 21% instances), PUNCT (190; 20% instances), NOUN (143; 15% instances), DET (110; 11% instances), ADP (105; 11% instances), PROPN (51; 5% instances), ADJ (45; 5% instances), NUM (30; 3% instances), VERB (22; 2% instances), CCONJ (17; 2% instances), ADV (15; 2% instances), PRON (12; 1% instances), SYM (11; 1% instances), AUX (9; 1% instances), SCONJ (7; 1% instances), INTJ (1; 0% instances)