
This page pertains to UD version 2.

Treebank Statistics: UD_Basque-BDT: POS Tags: PRON

There are 26 PRON lemmas (0%), 108 PRON types (0%) and 785 PRON tokens (1%). Out of 17 observed tags, PRON ranks 10th in number of lemmas, 10th in number of types and 13th in number of tokens.
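The three counts above could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not the script that generated this page: the names `count_upos` and `row` and the toy input are hypothetical, and it assumes that a "type" is a distinct word form compared case-sensitively.

```python
def count_upos(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and an integer ID.
        # This skips comments, blank lines, multiword ranges (1-2)
        # and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM, compared case-sensitively (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos, head, deprel):
    """Build one 10-column CoNLL-U line; unused columns are left as '_'."""
    return "\t".join(
        [str(idx), form, lemma, upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Toy input, not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "nik", "ni", "PRON", 3, "nsubj"),
    row(2, "gure", "gu", "PRON", 3, "obj"),
    row(3, "_", "_", "VERB", 0, "root"),
    "",
    "# sent_id = toy-2",
    row(1, "ni", "ni", "PRON", 2, "nsubj"),
    row(2, "_", "_", "VERB", 0, "root"),
])

print(count_upos(SAMPLE, "PRON"))  # (2, 3, 3)
```

On the toy input there are two PRON lemmas (ni, gu), three PRON types (nik, gure, ni) and three PRON tokens.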

The 10 most frequent PRON lemmas: ni, gu, eurak, inor, zerbait, ezer, zu, elkar, nor, beraiek

The 10 most frequent PRON types: gure, nire, nik, euren, guk, zerbait, ezer, ni, inork, zure

The most frequent ambiguous lemmas (3 listed): ni (PRON 207, NOUN 1), geu (PRON 6, NOUN 1), deus (ADJ 3, PRON 2)

The most frequent ambiguous types (5 listed): zuen (AUX 1127, VERB 67, PRON 4), ene (PRON 2, INTJ 1), deus (ADJ 3, PRON 2), hiri (NOUN 24, PRON 1), zuenak (AUX 1, PRON 1)

Morphology

The form / lemma ratio of PRON is 4.153846 (the average of all parts of speech is 2.170132).
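The reported ratio matches the number of PRON types divided by the number of PRON lemmas given earlier on this page. A short check, assuming that "forms" here means the 108 types:

```python
pron_types = 108   # PRON types reported on this page
pron_lemmas = 26   # PRON lemmas reported on this page

ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 4.153846
```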

The highest number of forms (15) was observed with the lemma “ni”: Nere, Niregatik, ene, ni, nigan, nik, nire, nirea, niregan, niregana, nirekin, niretzako, niretzat, niri, nitaz.

The second-highest number of forms (9) was observed with the lemma “gu”: Gutako, gu, guk, gure, gurean, gurekin, guretzat, guri, gutaz.

The third-highest number of forms (7) was observed with the lemma “elkar”: elgarrekin, elgarri, elkar, elkarrekiko, elkarrekin, elkarren, elkarri.

PRON occurs with 6 features: PronType (554; 71% of instances), Case (226; 29% of instances), Definite (207; 26% of instances), Number (27; 3% of instances), Person (20; 3% of instances), Polite (1; 0% of instances)

PRON occurs with 21 feature-value pairs: Case=Abs, Case=All, Case=Ben, Case=Cau, Case=Com, Case=Dat, Case=Erg, Case=Gen, Case=Ine, Case=Ins, Case=Loc, Definite=Def, Definite=Ind, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polite=Infm, PronType=Int, PronType=Prs

PRON occurs with 31 feature combinations. The most frequent feature combination is PronType=Prs (525 tokens). Examples: gure, nire, nik, euren, guk, ni, zure, gu, beraiek, niri
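Feature names and feature combinations can both be counted from the FEATS column, where pairs are written as `Name=Value` and joined with `|`. The following is a sketch only: `parse_feats` is a hypothetical helper, and the four FEATS strings are made-up examples built from feature-value pairs listed above, not counts from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {name: value} dict."""
    if feats == "_":          # "_" means the word has no features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Made-up FEATS values for four PRON tokens (illustration only).
feats_column = ["PronType=Prs", "Case=Erg|PronType=Prs", "_", "PronType=Prs"]

# A feature combination is the whole FEATS string.
combos = Counter(feats_column)
# A feature is counted once for every token that carries it.
features = Counter(name for f in feats_column for name in parse_feats(f))

print(combos.most_common(1))   # [('PronType=Prs', 2)]
print(features["PronType"], features["Case"])  # 3 1
```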

Relations

PRON nodes are attached to their parents using 13 different relations: nmod (281; 36% of instances), nsubj (256; 33% of instances), obl (119; 15% of instances), obj (78; 10% of instances), iobj (30; 4% of instances), conj (7; 1% of instances), ccomp (5; 1% of instances), det (4; 1% of instances), advcl (1; 0% of instances), appos (1; 0% of instances), discourse (1; 0% of instances), parataxis (1; 0% of instances), xcomp (1; 0% of instances)

Parents of PRON nodes belong to 10 different parts of speech: VERB (430; 55% of instances), NOUN (296; 38% of instances), ADJ (33; 4% of instances), ADV (11; 1% of instances), DET (7; 1% of instances), PRON (2; 0% of instances), PROPN (2; 0% of instances), X (2; 0% of instances), AUX (1; 0% of instances), NUM (1; 0% of instances)
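The percentages in the relation list are each count divided by the total number of PRON tokens, rounded to a whole percent. A quick check using the counts from this page (the variable names are illustrative):

```python
# Counts of incoming relations for PRON nodes, copied from the list above.
pron_relations = {
    "nmod": 281, "nsubj": 256, "obl": 119, "obj": 78, "iobj": 30,
    "conj": 7, "ccomp": 5, "det": 4, "advcl": 1, "appos": 1,
    "discourse": 1, "parataxis": 1, "xcomp": 1,
}

total = sum(pron_relations.values())
shares = {rel: round(100 * n / total) for rel, n in pron_relations.items()}

print(total)  # 785
print(shares["nmod"], shares["nsubj"], shares["det"], shares["xcomp"])  # 36 33 1 0
```

The total equals the 785 PRON tokens, and relations that occur only once round down to 0%.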

684 (87%) PRON nodes are leaves.

82 (10%) PRON nodes have one child.

13 (2%) PRON nodes have two children.

6 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 6.
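The child degree of a node is the number of words whose HEAD column points to it, so leaves have degree 0. A minimal sketch of how such a distribution could be computed; `child_degrees` and `row` are hypothetical names and the two toy sentences are not from the treebank.

```python
from collections import Counter


def child_degrees(conllu_text, upos):
    """Map child degree -> number of nodes with that degree, for one UPOS tag."""
    degrees = Counter()
    # Sentences are separated by blank lines; IDs restart in each sentence.
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()]
        # Keep regular word lines only (10 columns, integer ID).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        n_children = Counter(r[6] for r in rows)  # HEAD id -> number of children
        for r in rows:
            if r[3] == upos:
                degrees[n_children[r[0]]] += 1
    return degrees


def row(idx, form, upos, head, deprel):
    """Build one 10-column CoNLL-U line; unused columns are left as '_'."""
    return "\t".join(
        [str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"]
    )


# Toy input: three PRON nodes, one leaf and two with a single child.
SAMPLE = "\n".join([
    row(1, "nik", "PRON", 2, "nsubj"),
    row(2, "_", "VERB", 0, "root"),
    "",
    row(1, "gu", "PRON", 0, "root"),
    row(2, "_", "CCONJ", 3, "cc"),
    row(3, "zu", "PRON", 1, "conj"),
])

result = child_degrees(SAMPLE, "PRON")
print(result[0], result[1], max(result))  # 1 2 1
```

For this page, the four reported groups (684 leaves, 82 with one child, 13 with two, 6 with three or more) add up to the 785 PRON tokens.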

Children of PRON nodes are attached using 15 different relations: case (41; 31% of instances), nmod (19; 15% of instances), punct (19; 15% of instances), det (11; 8% of instances), dep (8; 6% of instances), cc (7; 5% of instances), cop (6; 5% of instances), advmod (5; 4% of instances), conj (4; 3% of instances), nsubj (4; 3% of instances), advcl (3; 2% of instances), acl (1; 1% of instances), aux (1; 1% of instances), nummod (1; 1% of instances), orphan (1; 1% of instances)

Children of PRON nodes belong to 13 different parts of speech: ADP (41; 31% of instances), PUNCT (19; 15% of instances), DET (18; 14% of instances), NOUN (12; 9% of instances), CCONJ (9; 7% of instances), AUX (7; 5% of instances), ADV (6; 5% of instances), ADJ (5; 4% of instances), VERB (5; 4% of instances), PART (3; 2% of instances), NUM (2; 2% of instances), PRON (2; 2% of instances), PROPN (2; 2% of instances)