Treebank Statistics: UD_Ottoman_Turkish-TueCL: POS Tags: NOUN
There are 125 NOUN lemmas (43%), 155 NOUN types (36%) and 249 NOUN tokens (27%).
Out of 14 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
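The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of code. The following is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the toy sample are hypothetical, types are assumed to be case-sensitive word forms, and the placeholder verb row is not treebank data.

```python
def pos_counts(conllu_text, upos):
    """Count distinct lemmas, distinct word forms (types) and tokens for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip malformed rows, multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM column (assumption: case-sensitive)
            lemmas.add(cols[2])  # LEMMA column
    return len(lemmas), len(types), tokens


# Toy input: noun forms are taken from the lists above, the verb row is a placeholder.
SAMPLE = "\n".join([
    "# text = (hypothetical toy sentence)",
    "1\tevde\tev\tNOUN\t_\tCase=Loc|Number=Sing\t3\tobl\t_\t_",
    "2\tkitāb\tkitāb\tNOUN\t_\tCase=Nom|Number=Sing\t3\tobj\t_\t_",
    "3\tverb1\tverb1\tVERB\t_\t_\t0\troot\t_\t_",
    "4\teve\tev\tNOUN\t_\tCase=Dat|Number=Sing\t3\tobl\t_\t_",
])

print(pos_counts(SAMPLE, "NOUN"))  # (2, 3, 3): 2 lemmas, 3 types, 3 tokens
```

Dividing each count by the corresponding total over all tags would give the percentages reported above.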
The 10 most frequent NOUN lemmas: kitāb, ev, mekteb, çocuḳ, ʿaraba, ṣabāḥ, ki, anne, birāder, dōst
The 10 most frequent NOUN types: evde, kitāb, kitābı, mektebe, ṣabāḥ, eve, çocuḳ, Muʿallim, ekmek, taḥrīr
The most frequent ambiguous lemmas (only 2 observed): ki (NOUN 4, PRON 3, SCONJ 2), sāʿāt (ADV 1, NOUN 1)
The most frequent ambiguous types (only 2 observed): ki (NOUN 2, SCONJ 2), sāʿāt (ADV 1, NOUN 1)
Morphology
The form / lemma ratio of NOUN is 1.240000 (the average of all parts of speech is 1.488055).
The highest number of forms (5) was observed with the lemma “ev”: Evin, ev, evde, evdeki, eve.
The second-highest number of forms (4) was observed with the lemma “ʿaraba”: ʿaraba, ʿarabalıḳ, ʿarabam, ʿarabayı.
The third-highest number of forms (3) was observed with the lemma “dōst”: dōstlarundan, dōstına, dōstınıla.
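The form / lemma ratio reported above is consistent with dividing the number of NOUN types by the number of NOUN lemmas (155 / 125). The sketch below checks that arithmetic and ranks lemmas by their number of distinct forms, using only the forms listed for “ev”, “ʿaraba” and “dōst”. The helper names are hypothetical, and the assumption that the ratio is computed as types over lemmas is inferred from the numbers, not confirmed by the source.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Map each lemma to the set of distinct word forms observed with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(n_types, n_lemmas):
    """Assumed definition: distinct forms divided by distinct lemmas."""
    return n_types / n_lemmas


# (form, lemma) pairs for the three lemmas with the most forms.
pairs = (
    [(f, "ev") for f in ["Evin", "ev", "evde", "evdeki", "eve"]]
    + [(f, "ʿaraba") for f in ["ʿaraba", "ʿarabalıḳ", "ʿarabam", "ʿarabayı"]]
    + [(f, "dōst") for f in ["dōstlarundan", "dōstına", "dōstınıla"]]
)

forms = forms_by_lemma(pairs)
# Rank by number of forms, descending; break ties alphabetically by lemma.
ranked = sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))
for lemma, lemma_forms in ranked:
    print(lemma, len(lemma_forms))

print(f"{form_lemma_ratio(155, 125):.6f}")  # 1.240000
```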
NOUN occurs with 7 features: Case (247; 99% instances), Number (246; 99% instances), Person[psor] (48; 19% instances), Number[psor] (47; 19% instances), Gender (8; 3% instances), Polarity (3; 1% instances), Person (2; 1% instances)
NOUN occurs with 18 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Person[psor]=1, Person[psor]=2, Person[psor]=3, Polarity=Neg, Polarity=Pos
NOUN occurs with 40 feature combinations.
The most frequent feature combination is Case=Nom|Number=Sing (115 tokens).
Examples: kitāb, ṣabāḥ, çocuḳ, Muʿallim, ekmek, taḥrīr, terk, zann, alḳol, bisiklet
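The three feature statistics above (features, feature-value pairs, feature combinations) are different groupings of the same CoNLL-U FEATS column. A minimal sketch of how they could be tallied follows; the function names and the toy input are hypothetical, and the input contains only feature values that appear in the list above.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into (feature, value) pairs; "_" means no features."""
    if feats == "_":
        return []
    return [tuple(fv.split("=", 1)) for fv in feats.split("|")]


def feature_stats(feats_column):
    """Tally features, feature-value pairs and whole feature combinations."""
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the full FEATS string is one combination
        for name, value in parse_feats(feats):
            features[name] += 1
            pairs[f"{name}={value}"] += 1
    return features, pairs, combos


# Toy FEATS values for four NOUN tokens (not actual treebank counts).
toy = [
    "Case=Nom|Number=Sing",
    "Case=Nom|Number=Sing",
    "Case=Loc|Number=Sing",
    "Case=Acc|Number=Sing|Number[psor]=Sing|Person[psor]=3",
]

features, pairs, combos = feature_stats(toy)
print(len(features), len(pairs), len(combos))  # 4 6 3
print(combos.most_common(1))  # [('Case=Nom|Number=Sing', 2)]
```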
Relations
NOUN nodes are attached to their parents using 17 different relations: obj (56; 22% instances), obl (49; 20% instances), nsubj (40; 16% instances), compound:lvc (30; 12% instances), root (17; 7% instances), nmod (13; 5% instances), obl:tmod (13; 5% instances), nmod:poss (10; 4% instances), conj (4; 2% instances), orphan (4; 2% instances), nsubj:pass (3; 1% instances), amod (2; 1% instances), ccomp (2; 1% instances), nsubj:outer (2; 1% instances), obl:agent (2; 1% instances), compound (1; 0% instances), parataxis (1; 0% instances)
Parents of NOUN nodes fall into 7 categories, namely 6 parts of speech plus the artificial root, which has no part of speech: VERB (169; 68% instances), NOUN (29; 12% instances), ADJ (26; 10% instances), root, i.e. no parent (17; 7% instances), PROPN (4; 2% instances), AUX (2; 1% instances), PRON (2; 1% instances)
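The relation and parent statistics can be derived from the HEAD and DEPREL columns of each sentence. The sketch below is illustrative only: `attachment_stats` is a hypothetical helper, the toy tree is not taken from the treebank, and a head of 0 (the artificial root, which carries no part of speech) is given the assumed label `ROOT`.

```python
from collections import Counter


def attachment_stats(sentence, upos):
    """sentence: list of (id, head, upos, deprel) tuples for one dependency tree."""
    tag_of = {tok_id: tag for tok_id, _, tag, _ in sentence}
    relations, parent_tags = Counter(), Counter()
    for _, head, tag, deprel in sentence:
        if tag != upos:
            continue
        relations[deprel] += 1
        # Head 0 is the artificial root and has no tag (assumed label: "ROOT").
        parent_tags[tag_of.get(head, "ROOT")] += 1
    return relations, parent_tags


# Toy tree: a VERB root with two NOUN dependents, plus a one-word NOUN sentence.
tree_a = [(1, 3, "NOUN", "nsubj"), (2, 3, "NOUN", "obj"), (3, 0, "VERB", "root")]
tree_b = [(1, 0, "NOUN", "root")]

relations, parent_tags = Counter(), Counter()
for tree in (tree_a, tree_b):
    rel, par = attachment_stats(tree, "NOUN")
    relations += rel
    parent_tags += par

print(dict(relations))    # {'nsubj': 1, 'obj': 1, 'root': 1}
print(dict(parent_tags))  # {'VERB': 2, 'ROOT': 1}
```

In this scheme the number of NOUN nodes whose parent is `ROOT` always equals the number attached with the `root` relation, which matches the 17 instances reported for both above.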
156 (63%) NOUN nodes are leaves.
66 (27%) NOUN nodes have one child.
11 (4%) NOUN nodes have two children.
16 (6%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 6.
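The four child-degree buckets above partition all NOUN tokens, so they should sum to the token count (156 + 66 + 11 + 16 = 249). The following sketch shows one way to compute such a histogram from HEAD values and checks that sum; the function name and the toy tree are hypothetical.

```python
from collections import Counter


def child_degree_histogram(sentence, upos):
    """sentence: list of (id, head, upos) tuples; returns {number of children: node count}."""
    # Count how many tokens point at each head id.
    n_children = Counter(head for _, head, _ in sentence)
    # A Counter returns 0 for ids that never occur as a head, i.e. leaves.
    return Counter(n_children[tok_id] for tok_id, _, tag in sentence if tag == upos)


# Toy tree: token 3 is the root; token 5 (NOUN) has one child, token 4.
toy = [(1, 3, "NOUN"), (2, 3, "NOUN"), (3, 0, "VERB"), (4, 5, "ADJ"), (5, 3, "NOUN")]
print(dict(child_degree_histogram(toy, "NOUN")))  # {0: 2, 1: 1}

# Consistency check on the figures reported above.
reported = {"leaves": 156, "one child": 66, "two children": 11, "three or more": 16}
print(sum(reported.values()))  # 249
```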
Children of NOUN nodes are attached using 19 different relations: nsubj (19; 13% instances), punct (19; 13% instances), nmod (15; 10% instances), amod (14; 10% instances), det (14; 10% instances), nmod:poss (13; 9% instances), aux:q (10; 7% instances), cop (8; 5% instances), advmod (7; 5% instances), aux (6; 4% instances), acl (5; 3% instances), nummod (4; 3% instances), conj (3; 2% instances), advcl (2; 1% instances), advmod:emph (2; 1% instances), cc (2; 1% instances), orphan (2; 1% instances), case (1; 1% instances), compound (1; 1% instances)
Children of NOUN nodes belong to 12 different parts of speech: NOUN (29; 20% instances), AUX (24; 16% instances), PROPN (20; 14% instances), PUNCT (19; 13% instances), ADJ (14; 10% instances), DET (14; 10% instances), ADV (9; 6% instances), VERB (6; 4% instances), PRON (5; 3% instances), NUM (4; 3% instances), CCONJ (2; 1% instances), ADP (1; 1% instances)