home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Ottoman_Turkish-BOUN: POS Tags: NUM

There are 20 NUM lemmas (1%), 26 NUM types (1%) and 93 NUM tokens (1%). Out of 16 observed tags, the rank of NUM is: 8 in number of lemmas, 8 in number of types and 12 in number of tokens.

The 10 most frequent NUM lemmas: iki, on, bir, üç, beş, dört, dokuzuncu, kırk, sekiz, yedinci

The 10 most frequent NUM types: iki, on, üç, beş, bir, birer, dört, İki, birinci, dokuzuncu

The 10 most frequent ambiguous lemmas: bir (DET 263, NUM 10, ADV 7), beş (NUM 7, NOUN 1), bin (VERB 2, NUM 1), kaç (NUM 1, VERB 1)

The 10 most frequent ambiguous types: bir (DET 246, ADV 4, NUM 3), birinci (ADV 1, NUM 1)

Morphology

The form / lemma ratio of NUM is 1.300000 (the average of all parts of speech is 1.583924).

The 1st highest number of forms (4) was observed with the lemma “iki”: iki, ikimiz, ikinci, İki.

The 2nd highest number of forms (3) was observed with the lemma “bir”: bir, birer, birinci.

The 3rd highest number of forms (2) was observed with the lemma “üç”: üç, üçünden.

NUM occurs with 6 features: NumType (88; 95% instances), Case (8; 9% instances), Number (8; 9% instances), Person (8; 9% instances), Number[psor] (2; 2% instances), Person[psor] (2; 2% instances)

NUM occurs with 12 feature-value pairs: Case=Abl, Case=Loc, Case=Nom, NumType=Card, NumType=Dist, NumType=Ord, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Person[psor]=1, Person[psor]=3

NUM occurs with 10 feature combinations. The most frequent feature combination is NumType=Card (75 tokens). Examples: iki, on, üç, beş, bir, dört, İki, kırk, yirmi, 12

Relations

NUM nodes are attached to their parents using 10 different relations: nummod (57; 61% instances), flat (14; 15% instances), amod (10; 11% instances), compound (3; 3% instances), nmod:poss (3; 3% instances), obl (2; 2% instances), conj (1; 1% instances), nsubj (1; 1% instances), obj (1; 1% instances), root (1; 1% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (68; 73% instances), NUM (18; 19% instances), ADJ (4; 4% instances), ADV (1; 1% instances), (1; 1% instances), VERB (1; 1% instances)

67 (72%) NUM nodes are leaves.

21 (23%) NUM nodes have one child.

2 (2%) NUM nodes have two children.

3 (3%) NUM nodes have three or more children.

The highest child degree of a NUM node is 5.

Children of NUM nodes are attached using 13 different relations: flat (15; 42% instances), punct (5; 14% instances), compound (4; 11% instances), amod (2; 6% instances), aux (2; 6% instances), advmod (1; 3% instances), advmod:emph (1; 3% instances), case (1; 3% instances), conj (1; 3% instances), nmod (1; 3% instances), nmod:poss (1; 3% instances), obl (1; 3% instances), orphan (1; 3% instances)

Children of NUM nodes belong to 9 different parts of speech: NUM (18; 50% instances), PUNCT (5; 14% instances), NOUN (4; 11% instances), ADJ (2; 6% instances), ADV (2; 6% instances), AUX (2; 6% instances), ADP (1; 3% instances), PART (1; 3% instances), PROPN (1; 3% instances)