
This page pertains to UD version 2.

Treebank Statistics: UD_Bororo-BDT: POS Tags: X

There are 28 X lemmas (6%), 40 X types (5%) and 71 X tokens (4%). Out of 16 observed tags, the rank of X is: 5 in number of lemmas, 6 in number of types and 7 in number of tokens.

The 10 most frequent X lemmas: _, dy, du, br, jiboe, a, i, jity, mody, modyre

The 10 most frequent X types: du, dy, jiboe, br, nure, ure, emode, amode, dykeje, jity

The 10 most frequent ambiguous lemmas: _ (VERB 46, NOUN 41, ADV 24, PRON 18, PROPN 17, X 16, ADP 14, PUNCT 13, PART 3, DET 2), dy (X 11, PART 9), a (PRON 15, NOUN 2, X 2), i (PRON 27, ADP 4, NOUN 2, X 2, ADV 1), mody (VERB 2, X 2), barigu (VERB 4, X 1), ce (PRON 3, X 1), dykeje (SCONJ 1, X 1), keje (ADP 12, ADV 1, NOUN 1, X 1), maky (VERB 23, X 1)

The 10 most frequent ambiguous types: dy (PART 8, X 7), ure (PART 32, PRON 24, X 3, NOUN 1, VERB 1), emode (PRON 3, X 3), amode (X 2, PRON 1), dykeje (X 2, SCONJ 1), Are (PRON 3, X 1), Ire (PRON 6, X 1), barigu (VERB 1, X 1), bokwarewu (VERB 1, X 1), dyji (ADP 1, X 1)

Morphology

The form / lemma ratio of X is 1.428571 (the average of all parts of speech is 1.661638).
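The reported ratio is consistent with dividing the number of X types (40) by the number of X lemmas (28) given above. A minimal sketch of that arithmetic follows; the variable names are illustrative, and the assumption that the ratio is computed exactly this way comes only from the numbers matching.

```python
# Counts taken from the summary above; the formula is an assumption
# inferred from the fact that 40 / 28 reproduces the reported value.
x_types = 40   # distinct X word forms
x_lemmas = 28  # distinct X lemmas

form_lemma_ratio = x_types / x_lemmas
print(f"{form_lemma_ratio:.6f}")  # 1.428571
```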

The 1st highest number of forms (12) was observed with the lemma “_”: Boroge, Eceba, Kurireu, amode, bokwarewu, cedu, dyji, iie, jamedy, jiboe, nure, ure.

The 2nd highest number of forms (5) was observed with the lemma “dy”: duwugere, dy, dykeje, dyre, dywu.

The 3rd highest number of forms (2) was observed with the lemma “a”: Are, amode.

X occurs with 11 features: Mood (19; 27% instances), Number (18; 25% instances), Person (16; 23% instances), Tense (6; 8% instances), Aspect (3; 4% instances), Nomzr (3; 4% instances), Polarity (3; 4% instances), Clusivity (2; 3% instances), Evident (1; 1% instances), Poss (1; 1% instances), Pred (1; 1% instances)

X occurs with 15 feature-value pairs: Aspect=Prog, Clusivity=Ex, Clusivity=In, Evident=Rep, Mood=Ind, Nomzr=Rel, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, Pred=AtrEq, Tense=Fut

X occurs with 20 feature combinations. The most frequent feature combination is _ (44 tokens). Examples: du, dy, jiboe, br, dykeje, jity, Boroge, Eceba, Kurireu, barigu

Relations

X nodes are attached to their parents using 14 different relations: dep (25; 35% instances), mark (9; 13% instances), nsubj (9; 13% instances), advmod (8; 11% instances), parataxis (5; 7% instances), discourse (4; 6% instances), compound (3; 4% instances), obj (2; 3% instances), acl (1; 1% instances), ccomp (1; 1% instances), fixed (1; 1% instances), nmod (1; 1% instances), obl (1; 1% instances), root (1; 1% instances)
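The percentages in the relation list can be reproduced from the raw counts, which sum to the 71 X tokens. The sketch below assumes each percentage is the count divided by that total and rounded to the nearest integer; the dictionary simply restates the counts listed above.

```python
# Relation counts for X nodes, copied from the list above.
relation_counts = {
    "dep": 25, "mark": 9, "nsubj": 9, "advmod": 8, "parataxis": 5,
    "discourse": 4, "compound": 3, "obj": 2, "acl": 1, "ccomp": 1,
    "fixed": 1, "nmod": 1, "obl": 1, "root": 1,
}

total = sum(relation_counts.values())  # should equal the number of X tokens

# Assumed rounding rule: nearest whole percent.
percentages = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {percentages[rel]}% instances)")
```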

Parents of X nodes belong to 8 different parts of speech: VERB (55; 77% instances), NOUN (4; 6% instances), PROPN (4; 6% instances), X (3; 4% instances), PRON (2; 3% instances), ADV (1; 1% instances), INTJ (1; 1% instances), no tag (1; 1% instances; presumably the artificial root, matching the single root relation listed above)

54 (76%) X nodes are leaves.

16 (23%) X nodes have one child.

0 (0%) X nodes have two children.

1 (1%) X node has three or more children.

The highest child degree of an X node is 4.
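Child-degree figures like these can be derived from a CoNLL-U file by counting, for every node tagged X, how many other nodes name it as their head. The following is a rough sketch and not the script that generated this page; the function name `x_child_degrees` and the toy sentence (placeholder forms `w1` to `w4`) are invented for illustration and are not treebank data.

```python
from collections import Counter

def x_child_degrees(conllu_text):
    """Map child count -> number of X nodes having that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):  # one block per sentence
        upos = {}             # node id -> UPOS tag
        children = Counter()  # head id -> number of dependents
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue
            cols = line.split("\t")
            if not cols[0].isdigit():  # skip multiword ranges and empty nodes
                continue
            upos[cols[0]] = cols[3]
            children[cols[6]] += 1
        for node_id, tag in upos.items():
            if tag == "X":
                degrees[children[node_id]] += 1
    return degrees

# Hypothetical toy sentence: (id, form, upos, head, deprel).
rows = [
    ("1", "w1", "VERB", "0", "root"),
    ("2", "w2", "X", "1", "dep"),
    ("3", "w3", "ADP", "2", "fixed"),
    ("4", "w4", "X", "1", "dep"),
    ("5", ".", "PUNCT", "1", "punct"),
]
SAMPLE = "\n".join(
    "\t".join([i, form, form, tag, "_", "_", head, rel, "_", "_"])
    for i, form, tag, head, rel in rows
)

degrees = x_child_degrees(SAMPLE)
print(dict(degrees))  # one X leaf and one X node with a single child
```

Running the same function over the full treebank file would be expected to yield the leaf, one-child and higher-degree counts reported above, provided the page counts children the same way.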

Children of X nodes are attached using 5 different relations: fixed (9; 45% instances), punct (6; 30% instances), discourse (2; 10% instances), nsubj (2; 10% instances), dep (1; 5% instances)

Children of X nodes belong to 8 different parts of speech: ADP (6; 30% instances), PUNCT (6; 30% instances), X (3; 15% instances), ADV (1; 5% instances), NOUN (1; 5% instances), PART (1; 5% instances), PRON (1; 5% instances), PROPN (1; 5% instances)