This page pertains to UD version 2.

Treebank Statistics: UD_Afrikaans-AfriBooms: POS Tags: X

There are 251 X lemmas (4%), 250 X types (4%) and 437 X tokens (1%). Out of 16 observed tags, the rank of X is: 6 in number of lemmas, 6 in number of types and 15 in number of tokens.

The 10 most frequent X lemmas: GCIS, IRP, SAID, Eskom, MIV, mnr, FIFA, SAPD, Mariene-, OKD

The 10 most frequent X types: GCIS, IRP, SAID, Eskom, MIV, mnr, FIFA, SAPD, Mariene-, OKD

The 10 most frequent ambiguous lemmas: of (CCONJ 375, SCONJ 16, PROPN 7, X 4), as (SCONJ 257, X 1), hoof (NOUN 7, X 3), and (X 2, PROPN 1), Affairs (PROPN 1, X 1), Authority (PROPN 1, X 1), Department (PROPN 1, X 1), Environmental (PROPN 1, X 1), National (PROPN 3, X 1), Tourism (PROPN 1, X 1)

The 10 most frequent ambiguous types: of (CCONJ 371, SCONJ 16, PROPN 7, X 4), As (SCONJ 42, X 3), hoof (NOUN 7, X 1), and (X 2, PROPN 1), Affairs (PROPN 1, X 1), Authority (PROPN 1, X 1), Department (PROPN 1, X 1), Environmental (PROPN 1, X 1), National (PROPN 3, X 1), Tourism (PROPN 1, X 1)

Morphology

The form / lemma ratio of X is 0.996016 (the average of all parts of speech is 1.122045).
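The reported ratio appears consistent with dividing the number of distinct X forms (types) by the number of distinct X lemmas given above. The following is a minimal sketch under that assumption; the variable names are illustrative and not part of any UD tooling.

```python
# Counts taken from the summary at the top of this page.
x_lemmas = 251
x_types = 250

# Assumption: the form / lemma ratio is distinct forms divided by distinct lemmas.
ratio = x_types / x_lemmas

print(f"{ratio:.6f}")
```

A ratio just below 1 means that X has slightly fewer distinct forms than distinct lemmas, so X words in this treebank show almost no inflectional variation.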

The 1st highest number of forms (1) was observed with the lemma “(e)”: (e).

The 2nd highest number of forms (1) was observed with the lemma “-aanleg”: -aanlegte.

The 3rd highest number of forms (1) was observed with the lemma “-aktiwiteit”: -aktiwiteite.

X does not occur with any features.

Relations

X nodes are attached to their parents using 10 different relations: amod (79; 18% instances), nmod (66; 15% instances), conj (65; 15% instances), obl (59; 14% instances), appos (56; 13% instances), dep (50; 11% instances), nsubj (33; 8% instances), obj (25; 6% instances), nsubj:pass (2; 0% instances), root (2; 0% instances)
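The percentages in the list above can be reproduced from the raw counts, assuming each share is the count divided by the total number of X tokens and rounded to the nearest whole percent. A short sketch (the dictionary name is illustrative):

```python
# Incoming relation counts for X nodes, copied from the list above.
relation_counts = {
    "amod": 79, "nmod": 66, "conj": 65, "obl": 59, "appos": 56,
    "dep": 50, "nsubj": 33, "obj": 25, "nsubj:pass": 2, "root": 2,
}

# Every X token has exactly one incoming relation, so the sum
# should equal the number of X tokens.
total = sum(relation_counts.values())

# Assumption: shares are rounded to the nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel}: {n} ({shares[rel]}%)")
```

The counts sum to the 437 X tokens reported at the top of the page, which is why relations with only 2 instances appear as 0%.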

Parents of X nodes belong to 11 different parts of speech: NOUN (209; 48% instances), VERB (100; 23% instances), X (71; 16% instances), PROPN (19; 4% instances), ADJ (14; 3% instances), SYM (10; 2% instances), SCONJ (4; 1% instances), ADV (3; 1% instances), AUX (3; 1% instances), ADP (2; 0% instances), [no parent: root] (2; 0% instances)

83 (19%) X nodes are leaves.

98 (22%) X nodes have one child.

152 (35%) X nodes have two children.

104 (24%) X nodes have three or more children.

The highest child degree of an X node is 8.
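Child degree here is the number of dependents attached directly to a node. As an illustration only, the sketch below counts child degrees on a small made-up tree; the token tuples and the function name `child_degrees` are hypothetical and are not drawn from the treebank or from UD tooling.

```python
from collections import Counter

def child_degrees(tokens, upos):
    """Return the number of direct children of every token tagged `upos`.

    `tokens` is a list of (id, head, upos) tuples for one sentence,
    where head 0 marks the root.
    """
    # Count how many tokens point at each head id.
    children_per_head = Counter(head for _, head, _ in tokens)
    # A Counter yields 0 for ids that never occur as a head (leaves).
    return [children_per_head[tok_id] for tok_id, _, tag in tokens if tag == upos]

# Hypothetical four-token sentence: token 2 is the root and governs the rest.
toy_sentence = [
    (1, 2, "DET"),
    (2, 0, "X"),
    (3, 2, "PUNCT"),
    (4, 2, "X"),
]

degrees = child_degrees(toy_sentence, "X")
print(degrees)       # one entry per X token
print(max(degrees))  # highest child degree among X tokens
```

Applied to the whole treebank, the same counting would yield the leaf, one-child, two-children and three-or-more buckets listed above, which together account for all 437 X tokens (83 + 98 + 152 + 104).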

Children of X nodes are attached using 12 different relations: punct (186; 24% instances), case (130; 17% instances), conj (130; 17% instances), cc (109; 14% instances), det (108; 14% instances), dep (66; 9% instances), amod (25; 3% instances), appos (6; 1% instances), obl (6; 1% instances), nmod (4; 1% instances), cop (2; 0% instances), nsubj (1; 0% instances)

Children of X nodes belong to 14 different parts of speech: PUNCT (186; 24% instances), ADP (121; 16% instances), CCONJ (109; 14% instances), DET (106; 14% instances), NOUN (101; 13% instances), X (71; 9% instances), PROPN (25; 3% instances), ADJ (22; 3% instances), SYM (12; 2% instances), PART (9; 1% instances), PRON (6; 1% instances), VERB (3; 0% instances), AUX (1; 0% instances), SCONJ (1; 0% instances)