Statistics of NOUN in UD_Ottoman

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Ottoman_Turkish-DUDU: POS Tags: `NOUN`

There are 2565 NOUN lemmas (52%), 4691 NOUN types (53%) and 8303 NOUN tokens (38%). Out of 15 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: paşa, gün, var, beg, yir, el, üzer, yer, ḥāl, baş

The 10 most frequent NOUN types: var, gün, paşa, üzerine, bin, efendi, melik, gice, oġlı, beglerbegisi

The 10 most frequent ambiguous lemmas: var (VERB 123, NOUN 74, ADJ 3), yir (NOUN 61, VERB 1), yer (NOUN 53, VERB 2), bin (NOUN 32, VERB 10), at (NOUN 27, VERB 6), yüz (NUM 54, NOUN 27, VERB 1), iç (NOUN 25, VERB 16, ADJ 6), yan (NOUN 24, ADJ 2, ADP 2, VERB 2), yoḳ (NOUN 22, ADJ 1), öñ (NOUN 17, ADP 1, VERB 1)

The 10 most frequent ambiguous types: var (NOUN 73, ADJ 3, VERB 3), yoḳ (NOUN 19, ADJ 1), cemʿ (NOUN 16, ADJ 1), fevt (NOUN 15, ADJ 1), içinde (NOUN 13, ADJ 5), mevlānā (NOUN 11, PROPN 2), ḳapudan (NOUN 8, PROPN 1), ṭaşra (NOUN 8, ADP 1), meşġūl (NOUN 7, ADJ 3), nesne (NOUN 7, ADV 1)

var
- NOUN 73: evde kim var diyü çaġırdı
- ADJ 3: el-ān ḳabr-i şerīfleri ḳurbinde bir cāmiʿ-i şerīf var dur
- VERB 3: gördi ki selḫānuñ ḥamlı var
yoḳ
- NOUN 19: kemāl-i vużūḥından eks̱er-i mevāżiʿunuñ şerḥe iḥtiyācı yoḳ dur
- ADJ 1: kimesneden ḫavfum yoḳ ve hīç bir pehlevāndan üşenmezin
cemʿ
- NOUN 16: leşker cemʿ idüp üzerine vardı
- ADJ 1: andan ṣoñra cemʿ olup ceẕīmeye naṣīḥat itdiler
fevt
- NOUN 15: ṭoḳuz yüz yetmiş dört senesinde fevt olmışdur
- ADJ 1: ʿabdü’l-kerīm efendi 1227de fevt olup ḥiṣṣe-i meşīḫatı birāderi ʿabdü’l-ḥalīm efendiye intiḳāl etti
içinde
- NOUN 13: bunlaruñ içinde bir pīr kişi var ıdı
- ADJ 5: ol ṭobı bataḳdan çıḳarup gice içinde orduya iletdi
mevlānā
- NOUN 11: mevlānā ḳara muḥyi’d-dīn ḳocailinden dür
- PROPN 2: ve çivi-zāde mevlānā meḥemmed çelebi maʿzūl olup yerine naḳībü’l-eşrāf maʿlūl-zāde mevlānā meḥemmed efendi rūmili ḳāḍī-ʿaskeri oldı
ḳapudan
- NOUN 8: ve ṭonanma-i hümāyūn ile ḳapudan paşa istanbula gelüp dāḫil oldı
- PROPN 1: ve ḳapudan paşa deryādan gelüp el öpüp yerine geçdi
ṭaşra
- NOUN 8: iskender tīz müsterāḥdan ṭaşra çıḳdı
- ADP 1: züheyr bu ḳızuñ atasına muḥabbet-nāmeler göndürmegile ve iştiyāḳ iẓhār itmegile ve ḥadden ṭaşra inʿām ve iḥsān itmegile göñlini temām avladı
meşġūl
- NOUN 7: saʿādetle varup vuṣūl bulduḳlarında yine ṣayd u şikāra meşġūl oldılar
- ADJ 3: ʿilme meşġūl olup çoḳ ʿulūmda müşāreketi var idi
nesne
- NOUN 7: çü işrāḳ eyledi āfāḳı ol nūr cihānda ḳalmadı bir nesne mestūr
- ADV 1: atası bunuñ bu işini görüp müteḥayyir oldı ve bundan ḳatı vehm aldı ve eyitdi bu ʿaceb nesne olacaḳ dur

Morphology

The form / lemma ratio of NOUN is 1.828850 (the average of all parts of speech is 1.775605).

The 1st highest number of forms (20) was observed with the lemma “baş”: baş, başdan, başlar, başları, başların, başlarına, başlarını, başuma, başumuza, başumuzuñ, başuñuza, başuñı, başı, başımuz, başın, başına, başında, başından, başınuñ, başını.

The 2nd highest number of forms (18) was observed with the lemma “el”: el, ele, eli, elile, elin, elinde, elinden, eline, elleri, elleri-y-ile, ellerin, ellerine, ellerüñ, elümden, elüme, elüñ, elüñde, elüñden.

The 3rd highest number of forms (16) was observed with the lemma “memleket”: memleket, memleketi, memleketile, memleketin, memleketinden, memleketine, memleketini, memleketinüñ, memleketlerinde, memleketlerüñ, memlekette, memleketüme, memleketümüzi, memleketüñe, memālik, memālikine.

NOUN occurs with 10 features: Number (8302; 100% instances), Person (8301; 100% instances), Case (8299; 100% instances), Number[psor] (2235; 27% instances), Person[psor] (2235; 27% instances), Gender (1175; 14% instances), Polarity (98; 1% instances), Typo (4; 0% instances), NameType (3; 0% instances), PronType (1; 0% instances)

NOUN occurs with 27 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Equ, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, NameType=Geo, NameType=Prs, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos, PronType=Int, Typo=Yes

NOUN occurs with 143 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=3 (3363 tokens). Examples: gün, paşa, bin, efendi, melik, gice, yıl, ḫaber, kişi, ādem

Relations

NOUN nodes are attached to their parents using 25 different relations: obl (1968; 24% instances), obj (1130; 14% instances), nsubj (1086; 13% instances), nmod (816; 10% instances), nmod:poss (766; 9% instances), root (749; 9% instances), conj (713; 9% instances), obl:tmod (282; 3% instances), flat (279; 3% instances), advcl (246; 3% instances), ccomp (124; 1% instances), acl (31; 0% instances), amod (26; 0% instances), vocative (25; 0% instances), compound:redup (17; 0% instances), xcomp (13; 0% instances), orphan (10; 0% instances), appos (7; 0% instances), parataxis (4; 0% instances), compound:lvc (3; 0% instances), case (2; 0% instances), csubj (2; 0% instances), flat:name (2; 0% instances), discourse (1; 0% instances), nummod (1; 0% instances)

Parents of NOUN nodes belong to 12 different parts of speech: NOUN (3267; 39% instances), VERB (3217; 39% instances), (749; 9% instances), PROPN (503; 6% instances), ADJ (443; 5% instances), PRON (85; 1% instances), NUM (15; 0% instances), ADV (13; 0% instances), AUX (7; 0% instances), ADP (2; 0% instances), CCONJ (1; 0% instances), DET (1; 0% instances)

2607 (31%) NOUN nodes are leaves.

3501 (42%) NOUN nodes have one child.

1104 (13%) NOUN nodes have two children.

1091 (13%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 12.

Children of NOUN nodes are attached using 35 different relations: nmod:poss (1183; 12% instances), compound:lvc (891; 9% instances), nmod (886; 9% instances), det (855; 9% instances), obl (781; 8% instances), cc (708; 7% instances), amod (687; 7% instances), conj (679; 7% instances), nsubj (522; 5% instances), case (328; 3% instances), advcl (320; 3% instances), cop (293; 3% instances), obj (277; 3% instances), nummod (274; 3% instances), advmod (224; 2% instances), acl (158; 2% instances), compound (121; 1% instances), obl:tmod (113; 1% instances), ccomp (105; 1% instances), advmod:emph (101; 1% instances), mark (85; 1% instances), discourse (40; 0% instances), flat (23; 0% instances), flat:name (17; 0% instances), compound:redup (16; 0% instances), orphan (15; 0% instances), cc:preconj (14; 0% instances), punct (14; 0% instances), aux:q (9; 0% instances), csubj (8; 0% instances), appos (5; 0% instances), parataxis (5; 0% instances), vocative (5; 0% instances), xcomp (5; 0% instances), aux (4; 0% instances)

Children of NOUN nodes belong to 15 different parts of speech: NOUN (3267; 33% instances), VERB (1525; 16% instances), DET (858; 9% instances), CCONJ (850; 9% instances), PROPN (800; 8% instances), ADJ (753; 8% instances), PRON (442; 5% instances), AUX (309; 3% instances), NUM (298; 3% instances), ADV (239; 2% instances), ADP (202; 2% instances), PART (94; 1% instances), SCONJ (84; 1% instances), INTJ (36; 0% instances), PUNCT (14; 0% instances)

Treebank Statistics: UD_Ottoman_Turkish-DUDU: POS Tags: NOUN

Morphology

Relations

Treebank Statistics: UD_Ottoman_Turkish-DUDU: POS Tags: `NOUN`