This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Penn: POS Tags: ADV

There are 596 ADV lemmas (3%), 790 ADV types (2%) and 9969 ADV tokens (5%). Out of 15 observed tags, the rank of ADV is: 6 in number of lemmas, 6 in number of types and 6 in number of tokens.
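As a rough illustration of how counts like these could be derived, the sketch below tallies distinct lemmas, distinct word forms (types) and tokens for one UPOS tag in CoNLL-U text. It assumes that a "type" is a case-sensitive distinct word form, which the page does not state. The function name `count_upos`, the helper `_to_conllu` and the toy rows are hypothetical and are not taken from the treebank.

```python
def count_upos(conllu_text, upos="ADV"):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, forms, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue  # skip blank sentence separators and comment lines
        cols = line.split("\t")
        # Keep only ordinary words: 10 columns and a plain integer ID.
        # Multiword ranges ("1-2") and empty nodes ("1.1") are skipped.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            forms.add(form)
            lemmas.add(lemma)
            tokens += 1
    return len(lemmas), len(forms), tokens


def _to_conllu(rows):
    """Expand (ID, FORM, LEMMA, UPOS, HEAD, DEPREL) rows to 10-column CoNLL-U."""
    return "\n".join(
        "\t".join([i, form, lemma, tag, "_", "_", head, rel, "_", "_"])
        for i, form, lemma, tag, head, rel in rows
    )


# Made-up toy rows for illustration only (not a real treebank sentence).
SAMPLE = _to_conllu([
    ("1", "daha", "daha", "ADV", "2", "advmod"),
    ("2", "çok", "çok", "ADV", "4", "advmod"),
    ("3", "olarak", "ol", "ADV", "4", "advcl"),
    ("4", "geldi", "gel", "VERB", "0", "root"),
    ("5", "daha", "daha", "ADV", "6", "advmod"),
    ("6", "olup", "ol", "ADV", "4", "advcl"),
    ("7", ".", ".", "PUNCT", "4", "punct"),
])

ADV_LEMMAS, ADV_TYPES, ADV_TOKENS = count_upos(SAMPLE, "ADV")
```

On the toy sample, "daha" occurs twice (two tokens, one type) and "olarak" and "olup" share the lemma "ol" (two types, one lemma).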

The 10 most frequent ADV lemmas: daha, ol, en, çok, ancak, ayrıca, sonra, tarafından, sadece, art

The 10 most frequent ADV types: daha, olarak, en, çok, ancak, ayrıca, sonra, tarafından, sadece, yeniden

The 10 most frequent ambiguous lemmas: ol (NOUN 945, VERB 930, ADV 879, ADJ 617), en (ADV 511, NOUN 6), çok (ADV 276, ADJ 204, NOUN 54, DET 26, ADP 4), ancak (CCONJ 352, ADV 274), sonra (ADV 188, ADP 105, NOUN 58, ADJ 1), tarafından (ADV 186, NOUN 1), art (NOUN 193, ADV 154, VERB 111, ADJ 50), düş (VERB 250, ADV 149, NOUN 132, ADJ 30), önce (ADV 145, ADP 66, NOUN 39, ADJ 18, VERB 1), şimdi (ADV 145, NOUN 22)

The 10 most frequent ambiguous types: olarak (ADV 814, NOUN 1, PRON 1), en (ADV 428, NOUN 1), çok (ADV 262, ADJ 193, DET 24, ADP 3, X 3, NOUN 1), ancak (ADV 82, CCONJ 64), sonra (ADV 171, ADP 105, NOUN 4), tarafından (ADV 186, NOUN 128), yeniden (ADV 145, NOUN 1), önce (ADV 144, ADP 66, NOUN 1), geri (ADV 140, ADJ 28, NOUN 11), bile (ADV 143, VERB 1)

Morphology

The form / lemma ratio of ADV is 1.325503 (the average of all parts of speech is 2.012465).
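The reported ADV ratio is consistent with dividing the number of ADV types (790) by the number of ADV lemmas (596), both given above. The short check below reproduces that arithmetic; treating the ratio as types divided by lemmas is an assumption inferred from the numbers, and the names used are hypothetical.

```python
# Counts reported on this page for ADV.
ADV_LEMMAS = 596
ADV_TYPES = 790


def form_lemma_ratio(types, lemmas):
    """Average number of distinct forms per lemma (assumed definition)."""
    return types / lemmas


RATIO = form_lemma_ratio(ADV_TYPES, ADV_LEMMAS)
print(f"{RATIO:.6f}")
```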

The highest number of forms (13) was observed with the lemma “et”: edemeyince, ederek, ederken, edilip, edilmeden, edince, edip, ediyor, etmeden, etmedikçe, etmeksizin, ettikçe, ettirerek.

The 2nd highest number of forms (11) was observed with the lemma “ol”: düşerek, olan, olarak, olmadan, olmadıkça, olmaksızın, olmayınca, olunca, olup, olurken, yakın.

The 3rd highest number of forms (8) was observed with the lemma “al”: alamadan, alarak, almadan, alınca, alınıp, alıp, alırken, alırlarken.
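The ranking above can be reproduced by grouping (form, lemma) pairs by lemma and counting the distinct forms in each group. The sketch below does this for the three lemmas listed; the form lists are copied from this page, while the function name `rank_lemmas_by_forms` and the tie-breaking by lemma are assumptions made for illustration.

```python
from collections import defaultdict

# Forms listed on this page for the three lemmas with the most ADV forms.
LISTED_FORMS = {
    "al": "alamadan alarak almadan alınca alınıp alıp alırken alırlarken",
    "et": ("edemeyince ederek ederken edilip edilmeden edince edip ediyor "
           "etmeden etmedikçe etmeksizin ettikçe ettirerek"),
    "ol": ("düşerek olan olarak olmadan olmadıkça olmaksızın olmayınca "
           "olunca olup olurken yakın"),
}

# Flatten to (form, lemma) pairs, as they would come from a treebank scan.
PAIRS = [(form, lemma)
         for lemma, forms in LISTED_FORMS.items()
         for form in forms.split()]


def rank_lemmas_by_forms(pairs):
    """Return [(lemma, number of distinct forms)] sorted by descending count."""
    forms_by_lemma = defaultdict(set)
    for form, lemma in pairs:
        forms_by_lemma[lemma].add(form)
    # Ties are broken alphabetically by lemma (an arbitrary choice here).
    return sorted(
        ((lemma, len(forms)) for lemma, forms in forms_by_lemma.items()),
        key=lambda item: (-item[1], item[0]),
    )


RANKING = rank_lemmas_by_forms(PAIRS)
```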

ADV occurs with 3 features: Degree (1548; 16% instances), PronType (99; 1% instances), Typo (2; 0% instances)

ADV occurs with 5 feature-value pairs: Degree=Cmp, Degree=Sup, PronType=Ind, PronType=Int, Typo=Yes

ADV occurs with 6 feature combinations. The most frequent feature combination is _ (8320 tokens). Examples: olarak, çok, ancak, ayrıca, sonra, tarafından, sadece, yeniden, önce, artarak

Relations

ADV nodes are attached to their parents using 22 different relations: advmod (5585; 56% instances), advcl (1539; 15% instances), case (753; 8% instances), obl (388; 4% instances), discourse (365; 4% instances), amod (362; 4% instances), cc (303; 3% instances), compound (274; 3% instances), nmod (126; 1% instances), root (75; 1% instances), mark (48; 0% instances), fixed (38; 0% instances), conj (32; 0% instances), obj (23; 0% instances), ccomp (12; 0% instances), nsubj (12; 0% instances), acl (11; 0% instances), parataxis (9; 0% instances), xcomp (7; 0% instances), appos (3; 0% instances), flat (3; 0% instances), iobj (1; 0% instances)

Parents of ADV nodes belong to 15 different parts of speech: VERB (4446; 45% instances), NOUN (2460; 25% instances), ADJ (2051; 21% instances), ADV (623; 6% instances), PROPN (122; 1% instances), NUM (91; 1% instances), no part of speech, i.e. the artificial root (75; 1% instances), PRON (36; 0% instances), DET (29; 0% instances), AUX (9; 0% instances), ADP (8; 0% instances), CCONJ (8; 0% instances), X (6; 0% instances), SCONJ (3; 0% instances), INTJ (2; 0% instances)

6529 (65%) ADV nodes are leaves.

2839 (28%) ADV nodes have one child.

436 (4%) ADV nodes have two children.

165 (2%) ADV nodes have three or more children.

The highest child degree of an ADV node is 8.
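The four child-degree counts above add up to the 9969 ADV tokens, and the percentages match each count divided by that total and rounded to the nearest integer. The check below reproduces this; that the page rounds to the nearest integer is an assumption inferred from the figures, and the names are hypothetical.

```python
# Child-degree counts reported on this page for ADV nodes.
CHILD_DEGREE_COUNTS = {"0": 6529, "1": 2839, "2": 436, "3+": 165}

# Total number of ADV nodes covered by the four buckets.
TOTAL = sum(CHILD_DEGREE_COUNTS.values())

# Share of each bucket as a whole-number percentage (assumed rounding).
SHARES = {
    degree: round(100 * count / TOTAL)
    for degree, count in CHILD_DEGREE_COUNTS.items()
}

for degree, share in SHARES.items():
    print(f"{degree} children: {CHILD_DEGREE_COUNTS[degree]} ({share}%)")
```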

Children of ADV nodes are attached using 27 different relations: nmod (1022; 24% instances), advmod (768; 18% instances), nsubj (480; 11% instances), obl (382; 9% instances), nummod (305; 7% instances), obj (294; 7% instances), compound (219; 5% instances), punct (203; 5% instances), case (150; 3% instances), amod (82; 2% instances), xcomp (79; 2% instances), fixed (72; 2% instances), cc (52; 1% instances), det (40; 1% instances), advcl (35; 1% instances), conj (28; 1% instances), aux (23; 1% instances), ccomp (21; 0% instances), mark (19; 0% instances), discourse (16; 0% instances), parataxis (15; 0% instances), csubj (7; 0% instances), acl (6; 0% instances), appos (4; 0% instances), goeswith (2; 0% instances), flat (1; 0% instances), list (1; 0% instances)

Children of ADV nodes belong to 15 different parts of speech: NOUN (2104; 49% instances), ADV (623; 14% instances), NUM (333; 8% instances), ADJ (320; 7% instances), CCONJ (302; 7% instances), PUNCT (203; 5% instances), PROPN (183; 4% instances), PRON (84; 2% instances), DET (56; 1% instances), ADP (43; 1% instances), AUX (23; 1% instances), X (23; 1% instances), VERB (20; 0% instances), SCONJ (7; 0% instances), INTJ (2; 0% instances)