Treebank Statistics: UD_Turkish-GB: POS Tags: PROPN
There are 219 PROPN
lemmas (10%), 359 PROPN
types (7%) and 914 PROPN
tokens (5%).
Out of 16 observed tags, the rank of PROPN
is: 3 in number of lemmas, 3 in number of types and 7 in number of tokens.
The 10 most frequent PROPN
lemmas: Ahmet, Ali, Ayşe, Semra, Mehmet, Erol, Ankara, Zeki, Necla, Fatma
The 10 most frequent PROPN
types: Ahmet, Ali, Ayşe, Semra, Mehmet, Ali’nin, Necla, Erol, Türk, Ankara’ya
The 10 most frequent ambiguous lemmas: Ali (PROPN 42, NOUN 1), Fatma (PROPN 15, PRON 1), sor (VERB 18, PROPN 4), Atatürk (PROPN 3, NOUN 1)
The 10 most frequent ambiguous types: Filiz (PROPN 4, NOUN 2), soralım (VERB 5, PROPN 4), Elif’e (PROPN 2, NOUN 1), Atatürk (NOUN 1, PROPN 1), Bodrum’da (NOUN 1, PROPN 1), Güneş (NOUN 1, PROPN 1)
- Filiz
- soralım
- Elif’e
- Atatürk
- Bodrum’da
- Güneş
Morphology
The form / lemma ratio of PROPN
is 1.639269 (the average of all parts of speech is 2.332157).
The 1st highest number of forms (10) was observed with the lemma “Ahmet”: Ahmet, Ahmete, Ahmetin, Ahmet’e, Ahmet’i, Ahmet’in, Ahmet’le, Ahmet’ler, Ahmet’te, Ahmet’ten.
The 2nd highest number of forms (7) was observed with the lemma “Ayşe”: Ayşe, Ayşe’nin, Ayşenin, Ayşeyle, Ayşe’nin, Ayşe’yi, Ayşe’yle.
The 3rd highest number of forms (7) was observed with the lemma “Semra”: Semra, Semranın, Semra’lar, Semra’nın, Semra’ya, Semra’yla, Semra’yı.
PROPN
occurs with 4 features: Number (914; 100% instances), Case (392; 43% instances), Number[psor] (1; 0% instances), Person[psor] (1; 0% instances)
PROPN
occurs with 11 feature-value pairs: Case=Abl
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Number=Plur
, Number=Sing
, Number[psor]=Sing
, Person[psor]=3
PROPN
occurs with 10 feature combinations.
The most frequent feature combination is Number=Sing
(512 tokens).
Examples: Ahmet, Ali, Ayşe, Semra, Mehmet, Necla, Erol, Türk, Fatma, İngiliz
Relations
PROPN
nodes are attached to their parents using 20 different relations: nsubj (430; 47% instances), obl (168; 18% instances), nmod (167; 18% instances), obj (53; 6% instances), conj (39; 4% instances), root (22; 2% instances), flat (7; 1% instances), advcl (4; 0% instances), nsubj:cop (4; 0% instances), orphan (4; 0% instances), vocative (3; 0% instances), xcomp (3; 0% instances), ccomp (2; 0% instances), obl:agent (2; 0% instances), acl (1; 0% instances), appos (1; 0% instances), compound (1; 0% instances), compound:redup (1; 0% instances), dislocated (1; 0% instances), nmod:comp (1; 0% instances)
Parents of PROPN
nodes belong to 8 different parts of speech: VERB (597; 65% instances), NOUN (193; 21% instances), PROPN (47; 5% instances), ADJ (38; 4% instances), (22; 2% instances), PRON (15; 2% instances), ADV (1; 0% instances), NUM (1; 0% instances)
744 (81%) PROPN
nodes are leaves.
114 (12%) PROPN
nodes have one child.
38 (4%) PROPN
nodes have two children.
18 (2%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 4.
Children of PROPN
nodes are attached using 23 different relations: conj (51; 21% instances), punct (51; 21% instances), cc (28; 11% instances), case (18; 7% instances), aux:q (17; 7% instances), cop (17; 7% instances), nsubj (11; 4% instances), nmod (10; 4% instances), flat (8; 3% instances), advmod:emph (6; 2% instances), acl (4; 2% instances), det (4; 2% instances), obl (4; 2% instances), mark (3; 1% instances), nmod:part (3; 1% instances), advmod (2; 1% instances), appos (2; 1% instances), nsubj:cop (2; 1% instances), amod (1; 0% instances), compound:redup (1; 0% instances), discourse (1; 0% instances), obl:tmod (1; 0% instances), parataxis (1; 0% instances)
Children of PROPN
nodes belong to 12 different parts of speech: PUNCT (51; 21% instances), PROPN (47; 19% instances), NOUN (36; 15% instances), AUX (34; 14% instances), CCONJ (27; 11% instances), ADP (17; 7% instances), PRON (14; 6% instances), ADV (9; 4% instances), DET (5; 2% instances), VERB (4; 2% instances), ADJ (1; 0% instances), SCONJ (1; 0% instances)