Treebank Statistics: UD_Ottoman_Turkish-DUDU: POS Tags: NOUN
There are 2565 NOUN lemmas (52%), 4691 NOUN types (53%) and 8303 NOUN tokens (38%).
Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
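The lemma, type and token counts above can be recomputed from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page. It assumes that a "type" is a distinct word form compared case-sensitively, and the function name `upos_counts` and the sample rows are made up for illustration.

```python
def upos_counts(conllu_text, upos="NOUN"):
    """Return (distinct lemmas, distinct forms, tokens) for one UPOS tag."""
    lemmas, types = set(), set()
    tokens = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            tokens += 1
            types.add(form)   # assumption: forms are compared case-sensitively
            lemmas.add(lemma)
    return len(lemmas), len(types), tokens


def make_row(idx, form, lemma, tag):
    """Build a 10-column CoNLL-U row; unused columns are left as '_'."""
    return "\t".join([idx, form, lemma, tag, "_", "_", "_", "_", "_", "_"])


# Made-up sample for illustration only, not a sentence from the treebank.
sample = "\n".join([
    "# text = illustration",
    make_row("1", "gün", "gün", "NOUN"),
    make_row("2", "başı", "baş", "NOUN"),
    make_row("3", "baş", "baş", "NOUN"),
    make_row("4", "var", "var", "VERB"),
])

print(upos_counts(sample))  # (2, 3, 3)
```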
The 10 most frequent NOUN lemmas: paşa, gün, var, beg, yir, el, üzer, yer, ḥāl, baş
The 10 most frequent NOUN types: var, gün, paşa, üzerine, bin, efendi, melik, gice, oġlı, beglerbegisi
The 10 most frequent ambiguous lemmas: var (VERB 123, NOUN 74, ADJ 3), yir (NOUN 61, VERB 1), yer (NOUN 53, VERB 2), bin (NOUN 32, VERB 10), at (NOUN 27, VERB 6), yüz (NUM 54, NOUN 27, VERB 1), iç (NOUN 25, VERB 16, ADJ 6), yan (NOUN 24, ADJ 2, ADP 2, VERB 2), yoḳ (NOUN 22, ADJ 1), öñ (NOUN 17, ADP 1, VERB 1)
The 10 most frequent ambiguous types: var (NOUN 73, ADJ 3, VERB 3), yoḳ (NOUN 19, ADJ 1), cemʿ (NOUN 16, ADJ 1), fevt (NOUN 15, ADJ 1), içinde (NOUN 13, ADJ 5), mevlānā (NOUN 11, PROPN 2), ḳapudan (NOUN 8, PROPN 1), ṭaşra (NOUN 8, ADP 1), meşġūl (NOUN 7, ADJ 3), nesne (NOUN 7, ADV 1)
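An ambiguous type is a word form that occurs with more than one part-of-speech tag. A hedged sketch of how such a list could be derived from (form, UPOS) pairs is shown here. The helper name `ambiguous_types` and the sample data are hypothetical, and the counts in the sample are not those of the treebank.

```python
from collections import Counter, defaultdict


def ambiguous_types(tagged_tokens):
    """Map each form seen with more than one UPOS tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in tagged_tokens:
        by_form[form][upos] += 1
    # Keep only forms with at least two different tags, most frequent tag first.
    return {
        form: tags.most_common()
        for form, tags in by_form.items()
        if len(tags) > 1
    }


# Made-up sample for illustration only.
sample_tokens = [
    ("var", "NOUN"), ("var", "NOUN"), ("var", "VERB"),
    ("gün", "NOUN"),
    ("yoḳ", "NOUN"), ("yoḳ", "ADJ"), ("yoḳ", "NOUN"),
]

result = ambiguous_types(sample_tokens)
for form, tags in result.items():
    print(form, tags)
```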
Morphology
The form / lemma ratio of NOUN is 1.828850 (the average of all parts of speech is 1.775605).
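The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this page. The check below assumes that this is how the ratio is defined.

```python
# Figures taken from this page: 4691 NOUN types and 2565 NOUN lemmas.
noun_types = 4691
noun_lemmas = 2565

# Assumption: form / lemma ratio = distinct forms / distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.828850
```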
The highest number of forms (20) was observed with the lemma “baş”: baş, başdan, başlar, başları, başların, başlarına, başlarını, başuma, başumuza, başumuzuñ, başuñuza, başuñı, başı, başımuz, başın, başına, başında, başından, başınuñ, başını.
The 2nd highest number of forms (18) was observed with the lemma “el”: el, ele, eli, elile, elin, elinde, elinden, eline, elleri, elleri-y-ile, ellerin, ellerine, ellerüñ, elümden, elüme, elüñ, elüñde, elüñden.
The 3rd highest number of forms (16) was observed with the lemma “memleket”: memleket, memleketi, memleketile, memleketin, memleketinden, memleketine, memleketini, memleketinüñ, memleketlerinde, memleketlerüñ, memlekette, memleketüme, memleketümüzi, memleketüñe, memālik, memālikine.
NOUN occurs with 10 features: Number (8302; 100% instances), Person (8301; 100% instances), Case (8299; 100% instances), Number[psor] (2235; 27% instances), Person[psor] (2235; 27% instances), Gender (1175; 14% instances), Polarity (98; 1% instances), Typo (4; 0% instances), NameType (3; 0% instances), PronType (1; 0% instances)
NOUN occurs with 27 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Equ, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NameType=Geo, NameType=Prs, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos, PronType=Int, Typo=Yes
NOUN occurs with 143 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (3363 tokens).
Examples: gün, paşa, bin, efendi, melik, gice, yıl, ḫaber, kişi, ādem
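Feature combinations correspond to whole FEATS strings, while individual features are the names on the left of each `=`. A minimal sketch of both tallies follows. The helper `parse_feats` and the sample strings are assumptions for illustration; only the feature names and values themselves come from the lists above.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}  # "_" means the token has no features
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Made-up sample of FEATS values for four NOUN tokens.
sample_feats = [
    "Case=Nom|Number=Sing|Person=3",
    "Case=Dat|Number=Sing|Number[psor]=Sing|Person=3|Person[psor]=3",
    "Case=Nom|Number=Sing|Person=3",
    "_",
]

# Each distinct FEATS string counts as one feature combination.
combinations = Counter(sample_feats)
top_combination, top_count = combinations.most_common(1)[0]

# Count how many tokens carry each individual feature.
feature_counts = Counter(
    name for feats in sample_feats for name in parse_feats(feats)
)

print(top_combination, top_count)
print(feature_counts["Case"], feature_counts["Number[psor]"])
```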
Relations
NOUN nodes are attached to their parents using 25 different relations: obl (1968; 24% instances), obj (1130; 14% instances), nsubj (1086; 13% instances), nmod (816; 10% instances), nmod:poss (766; 9% instances), root (749; 9% instances), conj (713; 9% instances), obl:tmod (282; 3% instances), flat (279; 3% instances), advcl (246; 3% instances), ccomp (124; 1% instances), acl (31; 0% instances), amod (26; 0% instances), vocative (25; 0% instances), compound:redup (17; 0% instances), xcomp (13; 0% instances), orphan (10; 0% instances), appos (7; 0% instances), parataxis (4; 0% instances), compound:lvc (3; 0% instances), case (2; 0% instances), csubj (2; 0% instances), flat:name (2; 0% instances), discourse (1; 0% instances), nummod (1; 0% instances)
Parents of NOUN nodes belong to 12 different parts of speech: NOUN (3267; 39% instances), VERB (3217; 39% instances), [root, no parent] (749; 9% instances), PROPN (503; 6% instances), ADJ (443; 5% instances), PRON (85; 1% instances), NUM (15; 0% instances), ADV (13; 0% instances), AUX (7; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances)
2607 (31%) NOUN nodes are leaves.
3501 (42%) NOUN nodes have one child.
1104 (13%) NOUN nodes have two children.
1091 (13%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 12.
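The child degree of a node is the number of tokens whose HEAD points to it. The sketch below computes degrees for one sentence from its HEAD column and then checks that the four buckets reported above add up to the NOUN token count. The function `child_degrees` and the sample heads are hypothetical, and rounding the percentages to the nearest integer is an assumption that happens to match the figures on this page.

```python
def child_degrees(heads):
    """Given 1-based HEAD values for one sentence (0 = root), return children per token."""
    degrees = [0] * len(heads)
    for head in heads:
        if head > 0:
            degrees[head - 1] += 1
    return degrees


# Made-up sentence: token 2 is the root, tokens 1 and 3 attach to 2, token 4 to 3.
print(child_degrees([2, 0, 2, 3]))  # [0, 2, 1, 0]

# Bucket sizes as reported on this page.
buckets = {
    "leaves": 2607,
    "one child": 3501,
    "two children": 1104,
    "three or more children": 1091,
}
total = sum(buckets.values())
percentages = {name: round(100 * n / total) for name, n in buckets.items()}

print(total)        # 8303
print(percentages)
```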
Children of NOUN nodes are attached using 35 different relations: nmod:poss (1183; 12% instances), compound:lvc (891; 9% instances), nmod (886; 9% instances), det (855; 9% instances), obl (781; 8% instances), cc (708; 7% instances), amod (687; 7% instances), conj (679; 7% instances), nsubj (522; 5% instances), case (328; 3% instances), advcl (320; 3% instances), cop (293; 3% instances), obj (277; 3% instances), nummod (274; 3% instances), advmod (224; 2% instances), acl (158; 2% instances), compound (121; 1% instances), obl:tmod (113; 1% instances), ccomp (105; 1% instances), advmod:emph (101; 1% instances), mark (85; 1% instances), discourse (40; 0% instances), flat (23; 0% instances), flat:name (17; 0% instances), compound:redup (16; 0% instances), orphan (15; 0% instances), cc:preconj (14; 0% instances), punct (14; 0% instances), aux:q (9; 0% instances), csubj (8; 0% instances), appos (5; 0% instances), parataxis (5; 0% instances), vocative (5; 0% instances), xcomp (5; 0% instances), aux (4; 0% instances)
Children of NOUN nodes belong to 15 different parts of speech: NOUN (3267; 33% instances), VERB (1525; 16% instances), DET (858; 9% instances), CCONJ (850; 9% instances), PROPN (800; 8% instances), ADJ (753; 8% instances), PRON (442; 5% instances), AUX (309; 3% instances), NUM (298; 3% instances), ADV (239; 2% instances), ADP (202; 2% instances), PART (94; 1% instances), SCONJ (84; 1% instances), INTJ (36; 0% instances), PUNCT (14; 0% instances)