home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Turkish-Penn: POS Tags: DET

There are 11 DET lemmas (0%), 13 DET types (0%) and 6832 DET tokens (4%). Out of 15 observed tags, the rank of DET is: 13 in number of lemmas, 14 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: bir, bu, bazı, her, birçok, tüm, o, birkaç, hiçbir, şu

The 10 most frequent DET types: bir, bu, bazı, her, birçok, tüm, o, birkaç, hiçbir, şu

The 10 most frequent ambiguous lemmas: bir (DET 3780, NUM 263, ADJ 38, ADV 3), bu (DET 1505, PRON 791, X 6), bazı (DET 326, PRON 65, ADJ 33), tüm (DET 187, NOUN 17), o (PRON 572, DET 154, PROPN 2, X 1), şu (DET 108, PRON 17, X 4), çok (ADV 299, ADJ 218, DET 18, ADP 1, NOUN 1)

The 10 most frequent ambiguous types: bir (DET 3455, NUM 191, ADJ 27, ADV 1), bu (DET 857, PRON 79), bazı (DET 212, ADJ 15), tüm (DET 140, NOUN 3), o (DET 87, PRON 26), şu (DET 75, PRON 3), çok (ADV 283, ADJ 182, DET 16, X 3, ADP 1, NOUN 1)

Morphology

The form / lemma ratio of DET is 1.181818 (the average of all parts of speech is 2.343544).

The 1st highest number of forms (3) was observed with the lemma “bir”: BİR, Bİr, bir.

The 2nd highest number of forms (2) was observed with the lemma “birkaç”: birkaç, birçok.

The 3rd highest number of forms (2) was observed with the lemma “bu”: bu, o.

DET occurs with 4 features: PronType (6814; 100% instances), Definite (6702; 98% instances), ExtPos (15; 0% instances), Typo (4; 0% instances)

DET occurs with 9 feature-value pairs: Definite=Def, Definite=Ind, ExtPos=ADV, ExtPos=CCONJ, PronType=Art, PronType=Dem, PronType=Ind, PronType=Neg, Typo=Yes

DET occurs with 11 feature combinations. The most frequent feature combination is Definite=Ind|PronType=Art (3774 tokens). Examples: bir, BİR, Bİr

Relations

DET nodes are attached to their parents using 17 different relations: det (6293; 92% instances), nsubj (218; 3% instances), amod (111; 2% instances), compound (77; 1% instances), advmod (39; 1% instances), nmod (26; 0% instances), dep (20; 0% instances), discourse (13; 0% instances), fixed (11; 0% instances), root (7; 0% instances), obj (6; 0% instances), csubj (3; 0% instances), obl (3; 0% instances), ccomp (2; 0% instances), advcl (1; 0% instances), conj (1; 0% instances), parataxis (1; 0% instances)

Parents of DET nodes belong to 13 different parts of speech: NOUN (6015; 88% instances), ADJ (269; 4% instances), VERB (267; 4% instances), PROPN (86; 1% instances), NUM (84; 1% instances), PRON (34; 0% instances), ADV (33; 0% instances), DET (17; 0% instances), ADP (13; 0% instances), (7; 0% instances), X (3; 0% instances), AUX (2; 0% instances), SCONJ (2; 0% instances)

6629 (97%) DET nodes are leaves.

188 (3%) DET nodes have one child.

9 (0%) DET nodes have two children.

6 (0%) DET nodes have three or more children.

The highest child degree of a DET node is 6.

Children of DET nodes are attached using 21 different relations: compound (53; 23% instances), case (41; 18% instances), fixed (35; 15% instances), advmod (24; 10% instances), punct (17; 7% instances), det (11; 5% instances), nmod (10; 4% instances), nsubj (8; 3% instances), cc (5; 2% instances), aux (4; 2% instances), conj (4; 2% instances), goeswith (4; 2% instances), amod (3; 1% instances), appos (2; 1% instances), discourse (2; 1% instances), flat (2; 1% instances), mark (2; 1% instances), acl (1; 0% instances), advcl (1; 0% instances), csubj (1; 0% instances), obl (1; 0% instances)

Children of DET nodes belong to 14 different parts of speech: ADJ (57; 25% instances), ADP (42; 18% instances), NOUN (34; 15% instances), CCONJ (23; 10% instances), ADV (18; 8% instances), DET (17; 7% instances), PUNCT (17; 7% instances), PRON (5; 2% instances), AUX (4; 2% instances), VERB (4; 2% instances), X (4; 2% instances), PROPN (3; 1% instances), NUM (2; 1% instances), SCONJ (1; 0% instances)