home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Basque-BDT: POS Tags: PRON

There are 26 PRON lemmas (0%), 108 PRON types (0%) and 785 PRON tokens (1%). Out of 16 observed tags, the rank of PRON is: 10 in number of lemmas, 10 in number of types and 13 in number of tokens.

The 10 most frequent PRON lemmas: ni, gu, eurak, inor, zerbait, ezer, zu, elkar, nor, beraiek

The 10 most frequent PRON types: gure, nire, nik, euren, guk, zerbait, ezer, ni, inork, zure

The 10 most frequent ambiguous lemmas: ni (PRON 207, NOUN 1), geu (PRON 6, NOUN 1), deus (ADJ 3, PRON 2)

The 10 most frequent ambiguous types: zuen (AUX 1127, VERB 67, PRON 4), ene (PRON 2, INTJ 1), deus (ADJ 3, PRON 2), hiri (NOUN 24, PRON 1), zuenak (AUX 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 4.153846 (the average of all parts of speech is 2.172787).

The 1st highest number of forms (15) was observed with the lemma “ni”: Nere, Niregatik, ene, ni, nigan, nik, nire, nirea, niregan, niregana, nirekin, niretzako, niretzat, niri, nitaz.

The 2nd highest number of forms (9) was observed with the lemma “gu”: Gutako, gu, guk, gure, gurean, gurekin, guretzat, guri, gutaz.

The 3rd highest number of forms (7) was observed with the lemma “elkar”: elgarrekin, elgarri, elkar, elkarrekiko, elkarrekin, elkarren, elkarri.

PRON occurs with 6 features: PronType (554; 71% instances), Case (226; 29% instances), Definite (207; 26% instances), Number (27; 3% instances), Person (20; 3% instances), Polite (1; 0% instances)

PRON occurs with 21 feature-value pairs: Case=Abs, Case=All, Case=Ben, Case=Cau, Case=Com, Case=Dat, Case=Erg, Case=Gen, Case=Ine, Case=Ins, Case=Loc, Definite=Def, Definite=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Infm, PronType=Int, PronType=Prs

PRON occurs with 31 feature combinations. The most frequent feature combination is PronType=Prs (525 tokens). Examples: gure, nire, nik, euren, guk, ni, zure, gu, beraiek, niri

Relations

PRON nodes are attached to their parents using 14 different relations: nmod (279; 36% instances), nsubj (260; 33% instances), obl (115; 15% instances), obj (78; 10% instances), iobj (30; 4% instances), ccomp (5; 1% instances), conj (5; 1% instances), advmod (4; 1% instances), det (4; 1% instances), advcl (1; 0% instances), appos (1; 0% instances), aux (1; 0% instances), discourse (1; 0% instances), xcomp (1; 0% instances)

Parents of PRON nodes belong to 14 different parts of speech: VERB (427; 54% instances), NOUN (279; 36% instances), AUX (26; 3% instances), ADJ (15; 2% instances), ADP (8; 1% instances), ADV (8; 1% instances), DET (6; 1% instances), X (5; 1% instances), NUM (3; 0% instances), CCONJ (2; 0% instances), PRON (2; 0% instances), PROPN (2; 0% instances), PART (1; 0% instances), PUNCT (1; 0% instances)

696 (89%) PRON nodes are leaves.

79 (10%) PRON nodes have one child.

7 (1%) PRON nodes have two children.

3 (0%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 13 different relations: case (41; 39% instances), nmod (16; 15% instances), det (14; 13% instances), dep (8; 8% instances), cop (6; 6% instances), cc (4; 4% instances), conj (4; 4% instances), nsubj (4; 4% instances), punct (3; 3% instances), acl (1; 1% instances), advcl (1; 1% instances), advmod (1; 1% instances), aux (1; 1% instances)

Children of PRON nodes belong to 12 different parts of speech: ADP (41; 39% instances), DET (18; 17% instances), NOUN (10; 10% instances), CCONJ (7; 7% instances), AUX (6; 6% instances), ADJ (5; 5% instances), ADV (5; 5% instances), PUNCT (3; 3% instances), VERB (3; 3% instances), NUM (2; 2% instances), PRON (2; 2% instances), PROPN (2; 2% instances)