
This page pertains to UD version 2.

Treebank Statistics: UD_Pesh-ChibErgIS: POS Tags: X

There are 138 X lemmas (20%), 138 X types (11%) and 180 X tokens (4%). Out of 15 observed tags, the rank of X is: 3 in number of lemmas, 3 in number of types and 7 in number of tokens.

The 10 most frequent X lemmas: y, laguna, prestar, ###, _, en, pero, tigre, a, joder

The 10 most frequent X types: y, laguna, ###, prestar, en, pero, tigre, a, joder, mangle

The 10 most frequent ambiguous lemmas: _ (PUNCT 681, PART 188, ADP 88, AUX 46, SCONJ 37, DET 6, PRON 4, X 3), a (INTJ 7, X 2), ya (NOUN 2, X 2), ʔãã (INTJ 8, X 2), cuento (NOUN 1, X 1), sirena (NOUN 2, X 1), su (NOUN 2, X 1), tuʔ (PART 1, X 1), ã (PRON 171, VERB 17, NOUN 2, X 1)

The 10 most frequent ambiguous types: a (INTJ 7, X 2), ʔãã (INTJ 8, X 2), cuento (NOUN 1, X 1), sirena (NOUN 2, X 1), tuʔ (PART 1, X 1), ã (PRON 171, NOUN 2, X 1)
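As an illustration of how such ambiguity lists can be derived, here is a minimal sketch that groups (form, tag) pairs and keeps the forms seen with X and at least one other tag. The function name `ambiguous_items` and the input layout are assumptions made for this example, not the actual script that generated this page. The counts for "a" are taken from the list above; the single "laguna" token is hypothetical.

```python
from collections import Counter, defaultdict


def ambiguous_items(pairs, tag="X"):
    """Return items tagged with `tag` and at least one other tag.

    `pairs` is an iterable of (item, upos) tuples, where item is a word
    form or a lemma. The result maps each ambiguous item to its
    (tag, count) list, most frequent tag first.
    """
    counts = defaultdict(Counter)
    for item, upos in pairs:
        counts[item][upos] += 1
    return {
        item: tags.most_common()
        for item, tags in counts.items()
        if tag in tags and len(tags) > 1
    }


# Toy input: "a" as INTJ 7 times and as X twice (as listed above),
# plus one hypothetical unambiguous X token.
pairs = [("a", "INTJ")] * 7 + [("a", "X")] * 2 + [("laguna", "X")]
result = ambiguous_items(pairs)
print(result)
```

With this toy input, only "a" is reported, because "laguna" occurs with a single tag.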

Morphology

The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.743590).
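The ratio can be sketched as the number of distinct word forms divided by the number of distinct lemmas for one tag, which is consistent with the figures reported here (138 types / 138 lemmas). The function `form_lemma_ratio`, the tuple layout, and the toy tokens below are assumptions for illustration. How the original statistics normalize case is not stated on this page, so this sketch compares forms exactly as given.

```python
def form_lemma_ratio(tokens, upos):
    """Distinct forms divided by distinct lemmas for one UPOS tag.

    `tokens` is an iterable of (form, lemma, upos) triples.
    Returns None when the tag does not occur at all.
    """
    forms = set()
    lemmas = set()
    for form, lemma, tag in tokens:
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
    if not lemmas:
        return None
    return len(forms) / len(lemmas)


# Hypothetical toy data: four X forms over three X lemmas, one PRON.
tokens = [
    ("y", "y", "X"),
    ("laguna", "laguna", "X"),
    ("###", "_", "X"),
    ("inaudible", "_", "X"),
    ("ã", "ã", "PRON"),
]
ratio_x = form_lemma_ratio(tokens, "X")
print(ratio_x)
```

A ratio of exactly 1 means that, on average, each lemma of the tag appears in a single form, as expected for a category without inflection.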

The highest number of forms (2) was observed with the lemma “_”: ###, inaudible.

The 2nd highest number of forms (1) was observed with the lemma “###”: ###.

The 3rd highest number of forms (1) was observed with the lemma “a”: a.

X does not occur with any features.

Relations

X nodes are attached to their parents using 21 different relations: nmod (46; 26% instances), discourse (24; 13% instances), root (20; 11% instances), obl (13; 7% instances), reparandum (13; 7% instances), obj (12; 7% instances), compound:lvc (11; 6% instances), obl:mod (7; 4% instances), nsubj (6; 3% instances), obl:lmod (5; 3% instances), dep (4; 2% instances), obl:tmod (4; 2% instances), compound (3; 2% instances), xcomp (3; 2% instances), dislocated (2; 1% instances), obl:arg (2; 1% instances), appos (1; 1% instances), compound:svc (1; 1% instances), conj (1; 1% instances), nsubj:pass (1; 1% instances), parataxis (1; 1% instances)

Parents of X nodes belong to 8 different parts of speech (counting the root, which has no parent, as one): VERB (77; 43% instances), X (57; 32% instances), root (20; 11% instances), NOUN (11; 6% instances), PRON (7; 4% instances), PART (6; 3% instances), ADV (1; 1% instances), AUX (1; 1% instances)

76 (42%) X nodes are leaves.

54 (30%) X nodes have one child.

30 (17%) X nodes have two children.

20 (11%) X nodes have three or more children.

The highest child degree of an X node is 8.
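The child-degree figures above can be reproduced in principle by counting, for every X node, how many other nodes name it as their head. The following is a minimal sketch under stated assumptions: it reads standard 10-column CoNLL-U text, skips multiword-token ranges and empty nodes, and groups degrees into 0, 1, 2, and 3 or more. The function name `child_degree_histogram` and the one-sentence sample with its annotation are hypothetical and are not taken from the treebank.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos="X"):
    """Count nodes of one UPOS tag by number of children.

    Returns (histogram, max_degree); histogram key 3 means "three or more".
    """
    hist = Counter()
    max_degree = 0
    for block in conllu_text.strip().split("\n\n"):
        rows = []
        for line in block.splitlines():
            if not line or line.startswith("#"):
                continue  # skip blank lines and sentence-level comments
            cols = line.split("\t")
            if "-" in cols[0] or "." in cols[0]:
                continue  # skip multiword-token ranges and empty nodes
            rows.append(cols)
        # Column 7 (index 6) is HEAD: count how often each ID is a head.
        children = Counter(cols[6] for cols in rows)
        for cols in rows:
            if cols[3] == upos:  # column 4 (index 3) is UPOS
                degree = children[cols[0]]
                hist[min(degree, 3)] += 1
                max_degree = max(max_degree, degree)
    return hist, max_degree


# Hypothetical sentence; the annotation is invented for the example.
sample = "\n".join("\t".join(row) for row in [
    ["1", "y", "y", "X", "_", "_", "2", "discourse", "_", "_"],
    ["2", "laguna", "laguna", "X", "_", "_", "0", "root", "_", "_"],
    ["3", "tigre", "tigre", "X", "_", "_", "2", "nmod", "_", "_"],
    ["4", ".", ".", "PUNCT", "_", "_", "2", "punct", "_", "_"],
])
hist, max_degree = child_degree_histogram(sample)
print(dict(hist), max_degree)
```

In this sample, two X nodes are leaves and one X node has three children, so the histogram has two entries under 0 and one under 3.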

Children of X nodes are attached using 20 different relations: reparandum (53; 28% instances), nmod (43; 22% instances), punct (38; 20% instances), discourse (11; 6% instances), case (10; 5% instances), det (9; 5% instances), cop (5; 3% instances), obj (5; 3% instances), advmod (3; 2% instances), compound (3; 2% instances), nsubj (2; 1% instances), parataxis (2; 1% instances), acl (1; 1% instances), advcl (1; 1% instances), aux (1; 1% instances), dislocated (1; 1% instances), mark (1; 1% instances), obl (1; 1% instances), obl:mod (1; 1% instances), obl:tmod (1; 1% instances)

Children of X nodes belong to 14 different parts of speech: X (57; 30% instances), PUNCT (38; 20% instances), NOUN (25; 13% instances), PRON (19; 10% instances), VERB (14; 7% instances), PART (11; 6% instances), ADV (7; 4% instances), AUX (6; 3% instances), ADP (4; 2% instances), INTJ (4; 2% instances), DET (2; 1% instances), NUM (2; 1% instances), SCONJ (2; 1% instances), PROPN (1; 1% instances)