
This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-PUD: POS Tags: PRON

There are 33 PRON lemmas (1%), 86 PRON types (1%) and 306 PRON tokens (2%). Out of 16 observed tags, the rank of PRON is: 9 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent PRON lemmas: bu, kendi, o, biri, bura, ne, ben, ora, biz, şey

The 10 most frequent PRON types: bu, kendi, Bununla, biri, bunu, bunun, onu, bunlar, ne, o

The 10 most frequent ambiguous lemmas: bu (DET 126, PRON 99), kendi (PRON 42, NOUN 1), o (PRON 38, DET 11, NOUN 2), biri (PRON 17, NOUN 1), ne (PRON 10, NOUN 3, ADV 2, DET 1), ora (PRON 8, NOUN 1), şey (NOUN 14, PRON 6), _ (ADJ 133, NOUN 83, AUX 68, PUNCT 62, NUM 30, PROPN 26, VERB 19, ADV 15, ADP 7, PRON 5, X 5, SYM 4, DET 1), bir (DET 415, NUM 14, PRON 5, ADV 2, NOUN 1), bazı (DET 16, PRON 4)

The 10 most frequent ambiguous types: bu (DET 73, PRON 17), biri (PRON 12, NOUN 1), ne (PRON 7, DET 1), o (DET 7, PRON 6), şey (NOUN 7, PRON 4), birine (NOUN 1, PRON 1), çoğu (DET 3, NOUN 3, PRON 1), şeyler (NOUN 3, PRON 1)

Morphology

The form / lemma ratio of PRON is 2.606061 (the average of all parts of speech is 1.517471).
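The ratio above can be reproduced from the counts given at the top of the page (86 PRON types, 33 PRON lemmas). The following is a minimal sketch; the function name `form_lemma_ratio` is an illustrative assumption, not part of any UD tooling.

```python
def form_lemma_ratio(n_types: int, n_lemmas: int) -> float:
    """Average number of distinct word forms (types) per lemma."""
    return n_types / n_lemmas


# Counts for PRON taken from this page: 86 types, 33 lemmas.
ratio = form_lemma_ratio(86, 33)
print(f"{ratio:.6f}")  # 2.606061
```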

The highest number of forms (12) was observed with the lemma “bu”: Bununla, bu, buna, bunda, bundan, bunlar, bunlara, bunlardan, bunları, bunların, bunu, bunun.

The 2nd highest number of forms (10) was observed with the lemma “kendi”: Kendilerine, kendi, kendilerini, kendinden, kendini, kendisi, kendisine, kendisini, kendisinin, kendisiyle.

The 3rd highest number of forms (8) was observed with the lemma “o”: o, ona, onda, onlara, onların, onu, onun, onunla.

PRON occurs with 9 features: Number (306; 100% instances), Case (247; 81% instances), Polarity (227; 74% instances), Person (47; 15% instances), Definite (40; 13% instances), Reflex (38; 12% instances), Person[psor] (29; 9% instances), PronType (26; 8% instances), Number[psor] (22; 7% instances)

PRON occurs with 23 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Definite=Def, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=3, Polarity=Neg, Polarity=Pos, PronType=Ind, PronType=Int, PronType=Rcp, Reflex=Yes

PRON occurs with 73 feature combinations. The most frequent feature combination is Case=Nom|Definite=Def|Number=Sing|Polarity=Pos (32 tokens). Examples: bu, o

Relations

PRON nodes are attached to their parents using 15 different relations: nsubj (95; 31% instances), obj (62; 20% instances), obl (60; 20% instances), nmod:poss (45; 15% instances), amod (8; 3% instances), root (8; 3% instances), iobj (7; 2% instances), advcl (5; 2% instances), nmod (4; 1% instances), appos (3; 1% instances), conj (3; 1% instances), acl (2; 1% instances), compound:redup (2; 1% instances), ccomp (1; 0% instances), parataxis (1; 0% instances)
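The percentages in the list above appear to be each relation's share of the 306 PRON tokens, rounded to a whole number. The sketch below checks this for the four most frequent relations; the rounding rule is an assumption, since the page does not state how the figures were rounded, and the helper name `share` is hypothetical.

```python
# Counts copied from the list above (four most frequent relations only).
counts = {"nsubj": 95, "obj": 62, "obl": 60, "nmod:poss": 45}
total_pron_tokens = 306


def share(count: int, total: int) -> int:
    """Percentage of `total`, rounded to the nearest integer (assumed rule)."""
    return round(100 * count / total)


shares = {rel: share(n, total_pron_tokens) for rel, n in counts.items()}
print(shares)  # {'nsubj': 31, 'obj': 20, 'obl': 20, 'nmod:poss': 15}
```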

Parents of PRON nodes belong to 8 different parts of speech: VERB (118; 39% instances), NOUN (113; 37% instances), ADJ (48; 16% instances), PRON (8; 3% instances), no parent, i.e. root (8; 3% instances), ADV (7; 2% instances), PROPN (3; 1% instances), ADP (1; 0% instances)

183 (60%) PRON nodes are leaves.

82 (27%) PRON nodes have one child.

27 (9%) PRON nodes have two children.

14 (5%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.
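The child-degree figures above (leaves, one child, two children, and so on) can be derived by counting, for each PRON node, how many tokens name it as their head. Below is a minimal sketch over a toy CoNLL-U fragment. The sample sentence and its annotation are hypothetical and not taken from UD_Turkish-PUD, and the function name `child_degrees` is an assumption.

```python
from collections import Counter

# Toy CoNLL-U fragment (hypothetical annotation, for illustration only).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC
SAMPLE = """\
# text = Bu da onu gördü.
1\tBu\tbu\tPRON\t_\t_\t4\tnsubj\t_\t_
2\tda\tda\tADV\t_\t_\t1\tadvmod:emph\t_\t_
3\tonu\to\tPRON\t_\t_\t4\tobj\t_\t_
4\tgördü\tgör\tVERB\t_\t_\t0\troot\t_\t_
5\t.\t.\tPUNCT\t_\t_\t4\tpunct\t_\t_
"""


def child_degrees(conllu: str, upos: str = "PRON") -> Counter:
    """Map child count -> number of nodes with that UPOS having that many children."""
    degrees = Counter()
    for block in conllu.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep plain word lines; skip multiword-token ranges (1-2) and empty nodes (1.1).
        rows = [r for r in rows if r[0].isdigit()]
        # Number of dependents per head ID within this sentence.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                degrees[n_children[r[0]]] += 1
    return degrees


print(child_degrees(SAMPLE))
```

In this toy sentence, "Bu" has one child ("da") and "onu" is a leaf, so each degree is counted once.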

Children of PRON nodes are attached using 19 different relations: punct (55; 28% instances), case (33; 17% instances), nmod:poss (30; 15% instances), advmod:emph (15; 8% instances), nsubj (14; 7% instances), cop (12; 6% instances), amod (7; 4% instances), acl (5; 3% instances), conj (5; 3% instances), advcl (3; 2% instances), cc (3; 2% instances), advmod (2; 1% instances), compound:redup (2; 1% instances), det (2; 1% instances), dislocated (2; 1% instances), nmod (2; 1% instances), obl (2; 1% instances), aux (1; 1% instances), parataxis (1; 1% instances)

Children of PRON nodes belong to 11 different parts of speech: PUNCT (55; 28% instances), NOUN (44; 22% instances), ADP (32; 16% instances), ADV (18; 9% instances), AUX (13; 7% instances), ADJ (9; 5% instances), PRON (8; 4% instances), PROPN (7; 4% instances), VERB (5; 3% instances), CCONJ (3; 2% instances), DET (2; 1% instances)