Treebank Statistics: UD_Ottoman_Turkish-BOUN: POS Tags: NUM
There are 20 NUM lemmas (1%), 26 NUM types (1%) and 93 NUM tokens (1%).
Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 12 in number of tokens.
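As a rough illustration of how the three counts above relate, the following sketch tallies lemmas, types and tokens for one tag from CoNLL-U text. The sample fragment and the helper names `read_tokens` and `pos_counts` are invented for illustration; this is not the script that generated the page. Types are counted case-sensitively here, which is an assumption.

```python
# Hypothetical three-token CoNLL-U fragment, invented for illustration only.
SAMPLE = """\
# text = iki kitap İki
1\tiki\tiki\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
2\tkitap\tkitap\tNOUN\t_\tCase=Nom\t0\troot\t_\t_
3\tİki\tiki\tNUM\t_\tNumType=Card\t2\tnummod\t_\t_
"""


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for regular word lines.

    Comment lines, blank lines, multiword ranges (IDs like 1-2) and
    empty nodes (IDs like 1.1) are skipped.
    """
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_counts(conllu_text, upos):
    """Count distinct lemmas, distinct word forms (types) and tokens for one tag."""
    forms, lemmas = [], set()
    for form, lemma, tag in read_tokens(conllu_text):
        if tag == upos:
            forms.append(form)
            lemmas.add(lemma)
    return {"lemmas": len(lemmas), "types": len(set(forms)), "tokens": len(forms)}


counts = pos_counts(SAMPLE, "NUM")
```

In the invented fragment, iki and İki share one lemma but count as two types, so the result is one lemma, two types and two tokens.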
The 10 most frequent NUM lemmas: iki, on, bir, üç, beş, dört, dokuzuncu, kırk, sekiz, yedinci
The 10 most frequent NUM types: iki, on, üç, beş, bir, birer, dört, İki, birinci, dokuzuncu
The 10 most frequent ambiguous lemmas: bir (DET 263, NUM 10, ADV 7), beş (NUM 7, NOUN 1), bin (VERB 2, NUM 1), kaç (NUM 1, VERB 1)
The 10 most frequent ambiguous types: bir (DET 246, ADV 4, NUM 3), birinci (ADV 1, NUM 1)
- bir
- birinci
  - ADV 1: Dört beş sene evvel lisanına rekâket gelmiş olan şu zavallı adamın o zamandan beri ma’nâlı olarak birinci tefevvüh eylediği bu “ Jân “ kelimesi ağzından çıkar çıkmaz gözünden de bir damla yaş kopmuş idi .
  - NUM 1: Bir zamanlar Arapları mertebe-i aliâ-ı medeniyete ıs’âd eden evvela şiire , sonra felsefe daha sonra da erbab-ı fünûn değil miydi ? Bu gün garplılardan mesela Fransa’yı tarih-i tekâmül itibariyle tedkik edecek olsak görüyoruz ki on birinci asırda Cornei , Milavir , Racine gibi dühât-ı şiir eğilmiş ; sonra on sekizinci asırda yaşayan ve Voltaire , Rousseau , Montesquieu gibi mütefekkirler yetişmiş …
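The ambiguity listings above can be thought of as grouping each lemma (or word form) by the tags it was observed with. A minimal sketch under that reading follows. The function name `ambiguous` and the tiny input list are hypothetical, and ordering ties by tag name is an assumption, not a documented rule of the page generator.

```python
from collections import Counter, defaultdict


def ambiguous(pairs, upos):
    """Return items seen with `upos` and at least one other tag.

    `pairs` is an iterable of (item, tag), where item is a lemma or a form.
    Tags are ordered by descending count, then alphabetically (assumed tie-break).
    """
    tags = defaultdict(Counter)
    for item, tag in pairs:
        tags[item][tag] += 1
    return {
        item: sorted(c.items(), key=lambda kv: (-kv[1], kv[0]))
        for item, c in tags.items()
        if upos in c and len(c) > 1
    }


# Invented toy observations; the real counts are those listed on this page.
toy = [("bir", "DET"), ("bir", "DET"), ("bir", "NUM"),
       ("iki", "NUM"), ("kaç", "VERB"), ("kaç", "NUM")]
result = ambiguous(toy, "NUM")
```

Here iki is left out because it only ever appears as NUM, while bir and kaç are reported with their per-tag counts.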
Morphology
The form / lemma ratio of NUM is 1.300000 (the average of all parts of speech is 1.583924).
The 1st highest number of forms (4) was observed with the lemma “iki”: iki, ikimiz, ikinci, İki.
The 2nd highest number of forms (3) was observed with the lemma “bir”: bir, birer, birinci.
The 3rd highest number of forms (2) was observed with the lemma “üç”: üç, üçünden.
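The ratio reported above is consistent with dividing the 26 NUM types by the 20 NUM lemmas. The short sketch below redoes that arithmetic and ranks the three lemmas by their listed forms. Treating the ratio as types divided by lemmas is an inference from the figures on this page, and the variable names are illustrative.

```python
# Forms per lemma exactly as listed on this page.
forms_by_lemma = {
    "iki": {"iki", "ikimiz", "ikinci", "İki"},
    "bir": {"bir", "birer", "birinci"},
    "üç": {"üç", "üçünden"},
}

# Rank lemmas by number of distinct forms (ties broken by lemma, an assumption).
ranked = sorted(forms_by_lemma.items(), key=lambda kv: (-len(kv[1]), kv[0]))

# Assumed definition: distinct forms (types) divided by distinct lemmas.
num_types, num_lemmas = 26, 20
ratio = num_types / num_lemmas
formatted = f"{ratio:.6f}"
```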
NUM occurs with 6 features: NumType (88; 95% instances), Case (8; 9% instances), Number (8; 9% instances), Person (8; 9% instances), Number[psor] (2; 2% instances), Person[psor] (2; 2% instances)
NUM occurs with 12 feature-value pairs: Case=Abl, Case=Loc, Case=Nom, NumType=Card, NumType=Dist, NumType=Ord, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Person[psor]=1, Person[psor]=3
NUM occurs with 10 feature combinations.
The most frequent feature combination is NumType=Card (75 tokens).
Examples: iki, on, üç, beş, bir, dört, İki, kırk, yirmi, 12
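The three feature tallies above (features, feature-value pairs, full combinations) can all be derived from the FEATS column of the NUM tokens. The sketch below shows one way to do that. The function name `feature_stats` and the sample values are invented, and counting the empty value `_` as its own combination is an assumption.

```python
from collections import Counter


def feature_stats(feats_column):
    """Tally features, feature=value pairs and whole combinations.

    `feats_column` holds one CoNLL-U FEATS string per token, e.g.
    "Case=Nom|NumType=Card", or "_" for a token with no features.
    """
    features, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # assumption: "_" counts as a combination too
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name = pair.partition("=")[0]
            features[name] += 1
            pairs[pair] += 1
    return features, pairs, combos


# Invented FEATS values for four hypothetical NUM tokens.
sample = [
    "NumType=Card",
    "NumType=Card",
    "Case=Nom|Number=Sing|NumType=Card|Person=3",
    "_",
]
features, pairs, combos = feature_stats(sample)
top_combo = combos.most_common(1)[0]
```

A feature count such as NumType (88) then corresponds to `features["NumType"]`, and the most frequent combination to `top_combo`.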
Relations
NUM nodes are attached to their parents using 10 different relations: nummod (57; 61% instances), flat (14; 15% instances), amod (10; 11% instances), compound (3; 3% instances), nmod:poss (3; 3% instances), obl (2; 2% instances), conj (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), root (1; 1% instances)
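As a quick consistency check on the relation counts above, the sketch below sums them and recomputes the rounded percentages. The counts are copied from this page; rounding to the nearest whole percent is an assumption about how the percentages were produced.

```python
# Relation counts for NUM nodes, copied from the listing above.
relations = {
    "nummod": 57, "flat": 14, "amod": 10, "compound": 3, "nmod:poss": 3,
    "obl": 2, "conj": 1, "nsubj": 1, "obj": 1, "root": 1,
}

total = sum(relations.values())  # should equal the number of NUM tokens

# Assumed rounding: nearest whole percent.
percent = {rel: round(100 * n / total) for rel, n in relations.items()}
```

The total comes to 93, matching the NUM token count, and the recomputed percentages agree with the ones shown.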
Parents of NUM nodes belong to 6 different parts of speech: NOUN (68; 73% instances), NUM (18; 19% instances), ADJ (4; 4% instances), ADV (1; 1% instances), (1; 1% instances), VERB (1; 1% instances)
67 (72%) NUM nodes are leaves.
21 (23%) NUM nodes have one child.
2 (2%) NUM nodes have two children.
3 (3%) NUM nodes have three or more children.
The highest child degree of a NUM node is 5.
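The child-degree figures above count how many dependents each NUM node has. A minimal sketch of that computation for a single tree follows. The function names and the three-word example tree are hypothetical and not taken from the treebank.

```python
from collections import Counter


def child_degrees(sentence, upos):
    """Return the number of children of every node tagged `upos`.

    `sentence` is a list of (id, head, upos) tuples for one dependency tree,
    with head 0 marking the root.
    """
    n_children = Counter(head for _, head, _ in sentence)
    return [n_children[node_id] for node_id, _, tag in sentence if tag == upos]


def bucket(degrees):
    """Group child counts into the four classes used in the listing above."""
    labels = {0: "leaf", 1: "one child", 2: "two children"}
    return Counter(labels.get(d, "three or more children") for d in degrees)


# Invented tree: node 1 (NUM) depends on node 2 (NUM), which depends on node 3 (NOUN).
tree = [(1, 2, "NUM"), (2, 3, "NUM"), (3, 0, "NOUN")]
degrees = child_degrees(tree, "NUM")
histogram = bucket(degrees)
highest = max(degrees)
```

Over a whole treebank the per-sentence degree lists would be concatenated before bucketing, and the highest child degree is simply the maximum of that combined list.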
Children of NUM nodes are attached using 13 different relations: flat (15; 42% instances), punct (5; 14% instances), compound (4; 11% instances), amod (2; 6% instances), aux (2; 6% instances), advmod (1; 3% instances), advmod:emph (1; 3% instances), case (1; 3% instances), conj (1; 3% instances), nmod (1; 3% instances), nmod:poss (1; 3% instances), obl (1; 3% instances), orphan (1; 3% instances)
Children of NUM nodes belong to 9 different parts of speech: NUM (18; 50% instances), PUNCT (5; 14% instances), NOUN (4; 11% instances), ADJ (2; 6% instances), ADV (2; 6% instances), AUX (2; 6% instances), ADP (1; 3% instances), PART (1; 3% instances), PROPN (1; 3% instances)