
This page pertains to UD version 2.

Treebank Statistics: UD_Afrikaans-AfriBooms: POS Tags: X

There are 251 X lemmas (4%), 250 X types (4%) and 437 X tokens (1%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: GCIS, IRP, SAID, Eskom, MIV, mnr, FIFA, SAPD, Mariene-, OKD

The 10 most frequent X types: GCIS, IRP, SAID, Eskom, MIV, mnr, FIFA, SAPD, Mariene-, OKD

The 10 most frequent ambiguous lemmas: of (CCONJ 375, SCONJ 16, PROPN 7, X 4), as (SCONJ 257, X 1), hoof (NOUN 7, X 3), and (X 2, PROPN 1), Affairs (PROPN 1, X 1), Authority (PROPN 1, X 1), Department (PROPN 1, X 1), Environmental (PROPN 1, X 1), National (PROPN 3, X 1), Tourism (PROPN 1, X 1)

The 10 most frequent ambiguous types: of (CCONJ 371, SCONJ 16, PROPN 7, X 4), As (SCONJ 42, X 3), hoof (NOUN 7, X 1), and (X 2, PROPN 1), Affairs (PROPN 1, X 1), Authority (PROPN 1, X 1), Department (PROPN 1, X 1), Environmental (PROPN 1, X 1), National (PROPN 3, X 1), Tourism (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 0.996016 (the average of all parts of speech is 1.120642).
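The ratio above is consistent with dividing the number of distinct X forms (250 types) by the number of distinct X lemmas (251). The following is a minimal sketch under that assumption; the function name is hypothetical and is not taken from the tool that generated this page.

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Return distinct forms divided by distinct lemmas (assumed definition)."""
    return n_forms / n_lemmas

# Counts reported on this page for the X tag: 250 types, 251 lemmas.
ratio = form_lemma_ratio(250, 251)
print(f"{ratio:.6f}")
```

A ratio below 1 means there are slightly fewer distinct forms than distinct lemmas, so on average an X lemma shows no inflectional variation in this treebank.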

The 1st highest number of forms (1) was observed with the lemma “(e)”: (e).

The 2nd highest number of forms (1) was observed with the lemma “-aanleg”: -aanlegte.

The 3rd highest number of forms (1) was observed with the lemma “-aktiwiteit”: -aktiwiteite.

X does not occur with any features.

Relations

X nodes are attached to their parents using 11 different relations: amod (79; 18% instances), conj (66; 15% instances), nmod (66; 15% instances), obl (58; 13% instances), appos (56; 13% instances), dep (48; 11% instances), nsubj (34; 8% instances), obj (24; 5% instances), flat (2; 0% instances), nsubj:pass (2; 0% instances), root (2; 0% instances)
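The percentages in this list can be reproduced from the raw counts. The sketch below assumes each percentage is the count divided by the total number of X tokens, rounded to the nearest integer; that rounding rule is an assumption, not something the page states.

```python
# Counts copied from the relation list above.
relation_counts = {
    "amod": 79, "conj": 66, "nmod": 66, "obl": 58, "appos": 56, "dep": 48,
    "nsubj": 34, "obj": 24, "flat": 2, "nsubj:pass": 2, "root": 2,
}

# The counts should add up to the number of X tokens (437).
total = sum(relation_counts.values())

# Assumed rule: share of all X tokens, rounded to a whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, pct in percentages.items():
    print(f"{rel}: {relation_counts[rel]} ({pct}%)")
```

Because every X token has exactly one incoming relation, the counts sum to the token total, which makes this a quick sanity check on the list.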

Parents of X nodes belong to 10 different parts of speech: NOUN (212; 49% instances), VERB (102; 23% instances), X (71; 16% instances), PROPN (19; 4% instances), ADJ (16; 4% instances), SYM (10; 2% instances), ADV (3; 1% instances), no parent, i.e. attached as root (2; 0% instances), ADP (1; 0% instances), AUX (1; 0% instances)

81 (19%) X nodes are leaves.

101 (23%) X nodes have one child.

148 (34%) X nodes have two children.

107 (24%) X nodes have three or more children.
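The child-count breakdown above (81 + 101 + 148 + 107 = 437 X nodes) can be derived from a CoNLL-U file by counting, for each node with a given UPOS tag, how many tokens name it as their head. The sketch below is an illustration only: the function name is hypothetical and the sample sentence is made up for the example, not taken from the treebank. It relies on the standard CoNLL-U column order (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).

```python
from collections import Counter

def child_degree_histogram(conllu_text: str, upos: str) -> Counter:
    """Map number of children -> number of nodes tagged `upos`."""
    histogram = Counter()
    for block in conllu_text.strip().split("\n\n"):
        tags = {}             # token ID -> UPOS tag
        children = Counter()  # head ID -> number of children
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            token_id, head = cols[0], cols[6]
            if not token_id.isdigit():
                continue  # skip multiword token ranges and empty nodes
            tags[token_id] = cols[3]
            children[head] += 1
        for token_id, tag in tags.items():
            if tag == upos:
                histogram[children[token_id]] += 1
    return histogram

# Hypothetical sentence, built row by row to keep the tab separators explicit.
rows = [
    ("1", "die", "die", "DET", "_", "_", "2", "det", "_", "_"),
    ("2", "GCIS", "GCIS", "X", "_", "_", "5", "nsubj", "_", "_"),
    ("3", "en", "en", "CCONJ", "_", "_", "4", "cc", "_", "_"),
    ("4", "Eskom", "Eskom", "X", "_", "_", "2", "conj", "_", "_"),
    ("5", "vergader", "vergader", "VERB", "_", "_", "0", "root", "_", "_"),
    ("6", ".", ".", "PUNCT", "_", "_", "5", "punct", "_", "_"),
]
sample = "\n".join("\t".join(row) for row in rows)

histogram = child_degree_histogram(sample, "X")
print(dict(histogram))
```

In the sample, one X node has two children and the other has one; on the full treebank, the buckets for zero, one, two, and three or more children would correspond to the four figures listed above.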

The highest child degree of an X node is 8.

Children of X nodes are attached using 13 different relations: punct (190; 24% instances), conj (132; 17% instances), case (130; 17% instances), cc (111; 14% instances), det (108; 14% instances), dep (65; 8% instances), amod (25; 3% instances), obl (7; 1% instances), appos (6; 1% instances), nmod (4; 1% instances), flat (2; 0% instances), cop (1; 0% instances), nsubj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (190; 24% instances), ADP (121; 15% instances), CCONJ (111; 14% instances), DET (106; 14% instances), NOUN (104; 13% instances), X (71; 9% instances), PROPN (25; 3% instances), ADJ (22; 3% instances), SYM (12; 2% instances), PART (9; 1% instances), PRON (6; 1% instances), VERB (3; 0% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)