home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-FrameNet: POS Tags: NUM

There are 37 NUM lemmas (1%), 54 NUM types (1%) and 171 NUM tokens (1%). Out of 15 observed tags, the rank of NUM is: 6 in number of lemmas, 7 in number of types and 11 in number of tokens.

The 10 most frequent NUM lemmas: iki, bir, beş, bin, on, üç, 2, altı, sekiz, 1

The 10 most frequent NUM types: iki, bir, beş, bin, üç, İki, on, 2, altı, sekiz

The 10 most frequent ambiguous lemmas: bir (DET 297, NUM 21, ADV 7), bin (NUM 12, VERB 3), yüz (NOUN 22, VERB 3, NUM 2)

The 10 most frequent ambiguous types: bir (DET 269, NUM 10, ADV 7), altı (NUM 4, NOUN 2), yüz (NOUN 8, NUM 2), 20 (NUM 1, X 1), birine (NUM 1, PRON 1)

Morphology

The form / lemma ratio of NUM is 1.459459 (the average of all parts of speech is 1.977290).

The 1st highest number of forms (7) was observed with the lemma “iki”: iki, ikide, ikisi, ikisini, ikiye, İki, İkimiz.

The 2nd highest number of forms (5) was observed with the lemma “bir”: bir, bire, birer, birincisi, birine.

The 3rd highest number of forms (2) was observed with the lemma “2”: 2, 2’yi.

NUM occurs with 5 features: NumType (171; 100% instances), Case (23; 13% instances), Number (23; 13% instances), Number[psor] (10; 6% instances), Person[psor] (10; 6% instances)

NUM occurs with 14 feature-value pairs: Case=Acc, Case=Dat, Case=Loc, Case=Nom, NumType=Card, NumType=Dist, NumType=Ord, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3

NUM occurs with 14 feature combinations. The most frequent feature combination is NumType=Card (142 tokens). Examples: iki, bir, beş, bin, üç, İki, on, 2, altı, sekiz

Relations

NUM nodes are attached to their parents using 7 different relations: nummod (116; 68% instances), compound (28; 16% instances), obj (6; 4% instances), obl (6; 4% instances), amod (5; 3% instances), nmod (5; 3% instances), nsubj (5; 3% instances)

Parents of NUM nodes belong to 6 different parts of speech: NOUN (118; 69% instances), VERB (24; 14% instances), NUM (23; 13% instances), ADJ (4; 2% instances), ADV (1; 1% instances), X (1; 1% instances)

136 (80%) NUM nodes are leaves.

30 (18%) NUM nodes have one child.

4 (2%) NUM nodes have two children.

1 (1%) NUM nodes have three or more children.

The highest child degree of a NUM node is 3.

Children of NUM nodes are attached using 7 different relations: compound (20; 49% instances), nmod (6; 15% instances), punct (5; 12% instances), advmod (4; 10% instances), nummod (3; 7% instances), case (2; 5% instances), det (1; 2% instances)

Children of NUM nodes belong to 7 different parts of speech: NUM (23; 56% instances), NOUN (6; 15% instances), PUNCT (5; 12% instances), ADV (4; 10% instances), ADP (1; 2% instances), CCONJ (1; 2% instances), DET (1; 2% instances)