home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Penn: POS Tags: ADJ

There are 2389 ADJ lemmas (13%), 3242 ADJ types (9%) and 19643 ADJ tokens (11%). Out of 15 observed tags, the rank of ADJ is: 3 in number of lemmas, 4 in number of types and 3 in number of tokens.

The 10 most frequent ADJ lemmas: ol, büyük, yeni, diğer, son, et, var, yaklaşık, iyi, yüksek

The 10 most frequent ADJ types: büyük, olan, yeni, diğer, son, var, yaklaşık, yüksek, iyi, çok

The 10 most frequent ambiguous lemmas: ol (NOUN 945, VERB 930, ADV 879, ADJ 617), büyük (ADJ 504, NOUN 9, VERB 5), yeni (ADJ 376, VERB 2, NOUN 1), diğer (ADJ 311, NOUN 61), son (ADJ 291, NOUN 172, VERB 1), et (VERB 1089, NOUN 416, ADJ 285, ADV 37), var (ADJ 265, VERB 140, NOUN 17, ADV 4), iyi (ADJ 224, NOUN 19, VERB 17, ADV 4), yüksek (ADJ 213, VERB 19, NOUN 11, ADV 1), çok (ADV 276, ADJ 204, NOUN 54, DET 26, ADP 4)

The 10 most frequent ambiguous types: olan (ADJ 423, ADP 1, ADV 1), yeni (ADJ 291, NOUN 1), son (ADJ 214, NOUN 11), var (ADJ 264, NOUN 1), yüksek (ADJ 191, VERB 1), çok (ADV 262, ADJ 193, DET 24, ADP 3, X 3, NOUN 1), geçen (ADJ 138, NOUN 1), fazla (ADJ 159, ADV 123, ADP 3, NOUN 1), yıllık (ADJ 137, NOUN 18, PRON 1), aynı (ADJ 102, NOUN 2)

Morphology

The form / lemma ratio of ADJ is 1.357053 (the average of all parts of speech is 2.012465).

The 1st highest number of forms (28) was observed with the lemma “et”: EDİLEBİLİR, edebileceği, edebileceğimiz, edebilen, edecek, edeceği, edemediği, edemeyen, eden, edici, edildiği, edilebilir, edilecek, edileceği, edilemeyecek, edilemez, edilen, edilmemiş, edilmesi, edilmiş, etmedeki, etmediği, etmesi, etmeyecek, etmeyen, etmiş, ettiği, olan.

The 2nd highest number of forms (24) was observed with the lemma “ol”: başka, ekonomisinin, olabilecek, olabileceğiniz, olacak, olacağı, olamayan, olamaz, olan, oldukları, olduğu, olduğum, olduğumuz, olmadığı, olmadığım, olmadığımız, olmaması, olmamış, olması, olmayacak, olmayan, olmuş, olunan, olur.

The 3rd highest number of forms (24) was observed with the lemma “yap”: YAPILAN, olduğu, yapabildiğiniz, yapabilecek, yapabilecekleri, yapabileceği, yapabilmesi, yapacak, yapacağı, yapacağınız, yapan, yapar, yapması, yapmış, yaptıkları, yaptığı, yaptığım, yaptığımız, yapılacak, yapılacağı, yapılan, yapıldığı, yapılması, yapılmış.

ADJ occurs with 1 features: NumType (134; 1% instances)

ADJ occurs with 3 feature-value pairs: NumType=Card, NumType=Dist, NumType=Ord

ADJ occurs with 4 feature combinations. The most frequent feature combination is _ (19509 tokens). Examples: büyük, olan, yeni, diğer, son, var, yaklaşık, yüksek, iyi, çok

Relations

ADJ nodes are attached to their parents using 22 different relations: amod (11006; 56% instances), acl (2843; 14% instances), advmod (1979; 10% instances), nmod (889; 5% instances), root (732; 4% instances), compound (731; 4% instances), conj (349; 2% instances), advcl (202; 1% instances), nsubj (192; 1% instances), ccomp (143; 1% instances), xcomp (137; 1% instances), obl (113; 1% instances), parataxis (80; 0% instances), obj (73; 0% instances), csubj (63; 0% instances), flat (29; 0% instances), discourse (25; 0% instances), appos (19; 0% instances), dep (14; 0% instances), fixed (14; 0% instances), list (7; 0% instances), clf (3; 0% instances)

Parents of ADJ nodes belong to 14 different parts of speech: NOUN (13995; 71% instances), VERB (1800; 9% instances), ADJ (1604; 8% instances), (732; 4% instances), PROPN (590; 3% instances), ADV (320; 2% instances), NUM (285; 1% instances), DET (251; 1% instances), PRON (27; 0% instances), X (18; 0% instances), ADP (11; 0% instances), AUX (5; 0% instances), INTJ (3; 0% instances), SCONJ (2; 0% instances)

10634 (54%) ADJ nodes are leaves.

6438 (33%) ADJ nodes have one child.

1478 (8%) ADJ nodes have two children.

1093 (6%) ADJ nodes have three or more children.

The highest child degree of a ADJ node is 10.

Children of ADJ nodes are attached using 30 different relations: advmod (2184; 16% instances), nmod (1993; 15% instances), punct (1490; 11% instances), obl (1453; 11% instances), nsubj (1356; 10% instances), compound (864; 6% instances), obj (727; 5% instances), amod (681; 5% instances), nummod (594; 4% instances), cc (414; 3% instances), conj (367; 3% instances), det (273; 2% instances), case (228; 2% instances), advcl (216; 2% instances), aux (193; 1% instances), discourse (105; 1% instances), xcomp (101; 1% instances), mark (91; 1% instances), ccomp (81; 1% instances), acl (79; 1% instances), csubj (66; 0% instances), flat (55; 0% instances), fixed (47; 0% instances), parataxis (34; 0% instances), appos (20; 0% instances), clf (7; 0% instances), list (4; 0% instances), vocative (3; 0% instances), iobj (2; 0% instances), dep (1; 0% instances)

Children of ADJ nodes belong to 15 different parts of speech: NOUN (5586; 41% instances), ADV (2051; 15% instances), ADJ (1604; 12% instances), PUNCT (1490; 11% instances), PROPN (727; 5% instances), NUM (680; 5% instances), CCONJ (492; 4% instances), DET (333; 2% instances), PRON (206; 2% instances), AUX (195; 1% instances), ADP (191; 1% instances), VERB (133; 1% instances), SCONJ (20; 0% instances), X (20; 0% instances), INTJ (1; 0% instances)