home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-SynTagRus: POS Tags: DET

There are 40 DET lemmas (0%), 343 DET types (0%) and 41139 DET tokens (3%). Out of 17 observed tags, the rank of DET is: 12 in number of lemmas, 8 in number of types and 11 in number of tokens.

The 10 most frequent DET lemmas: этот, свой, весь, тот, такой, его, наш, их, мой, какой

The 10 most frequent DET types: его, все, их, этот, эти, ее, этой, такой, всех, своей

The 10 most frequent ambiguous lemmas: такой (DET 3535, ADJ 1), какой (DET 1038, ADJ 1), один (NUM 2706, DET 984, NOUN 3), другой (ADJ 2063, DET 658), самый (ADJ 1600, DET 528), сам (ADJ 1186, DET 514), никакой (DET 508, ADJ 1), иной (ADJ 385, DET 90), прочий (ADJ 103, DET 58), кой (DET 34, ADJ 9)

The 10 most frequent ambiguous types: его (DET 2183, PRON 2063), все (DET 1667, PRON 1487, PART 677), их (PRON 1530, DET 1191), ее (PRON 1109, DET 897), этой (DET 923, PRON 1), всех (DET 715, PRON 231), это (PRON 4381, DET 568, PART 317), этом (PRON 1062, DET 656), этого (PRON 737, DET 644), том (PRON 1206, DET 546, NOUN 13)

Morphology

The form / lemma ratio of DET is 8.575000 (the average of all parts of speech is 2.654430).

The 1st highest number of forms (15) was observed with the lemma “свой”: свое, своего, своей, своем, своему, своею, свои, своим, своими, своих, свой, свою, своя, своё, своём.

The 2nd highest number of forms (14) was observed with the lemma “весь”: весь, все, всего, всей, всем, всеми, всему, всех, всею, всея, всю, вся, всё, всём.

The 3rd highest number of forms (14) was observed with the lemma “мой”: мое, моего, моей, моем, моему, моею, мои, моим, моими, моих, мой, мою, моя, моём.

DET occurs with 10 features: Number (36546; 89% instances), Case (36545; 89% instances), PronType (27807; 68% instances), Gender (24512; 60% instances), Poss (8758; 21% instances), Reflex (3223; 8% instances), Animacy (1767; 4% instances), Abbr (17; 0% instances), Typo (2; 0% instances), Variant (1; 0% instances)

DET occurs with 24 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 203 feature combinations. The most frequent feature combination is Poss=Yes|PronType=Prs (3073 tokens). Examples: его, их, ее, её

Relations

DET nodes are attached to their parents using 24 different relations: det (37906; 92% instances), nsubj (936; 2% instances), obl (474; 1% instances), amod (448; 1% instances), fixed (430; 1% instances), root (266; 1% instances), conj (232; 1% instances), obj (104; 0% instances), parataxis (50; 0% instances), nmod (48; 0% instances), xcomp (45; 0% instances), iobj (41; 0% instances), nsubj:pass (39; 0% instances), orphan (29; 0% instances), ccomp (28; 0% instances), advmod (19; 0% instances), acl (15; 0% instances), appos (14; 0% instances), advcl (5; 0% instances), csubj (5; 0% instances), acl:relcl (2; 0% instances), dislocated (1; 0% instances), obl:agent (1; 0% instances), obl:tmod (1; 0% instances)

Parents of DET nodes belong to 14 different parts of speech: NOUN (35147; 85% instances), VERB (2084; 5% instances), ADJ (1088; 3% instances), PRON (1047; 3% instances), PROPN (476; 1% instances), ADP (384; 1% instances), DET (325; 1% instances), (266; 1% instances), NUM (170; 0% instances), ADV (87; 0% instances), PART (45; 0% instances), X (11; 0% instances), SYM (8; 0% instances), CCONJ (1; 0% instances)

35762 (87%) DET nodes are leaves.

3546 (9%) DET nodes have one child.

1094 (3%) DET nodes have two children.

737 (2%) DET nodes have three or more children.

The highest child degree of a DET node is 20.

Children of DET nodes are attached using 31 different relations: advmod (2422; 28% instances), punct (1370; 16% instances), acl:relcl (794; 9% instances), obl (651; 8% instances), case (564; 7% instances), conj (498; 6% instances), cc (438; 5% instances), nmod (328; 4% instances), nsubj (318; 4% instances), det (196; 2% instances), parataxis (177; 2% instances), fixed (146; 2% instances), amod (142; 2% instances), cop (130; 2% instances), ccomp (109; 1% instances), mark (88; 1% instances), advcl (65; 1% instances), acl (50; 1% instances), orphan (34; 0% instances), flat:foreign (20; 0% instances), appos (19; 0% instances), expl (13; 0% instances), discourse (11; 0% instances), nummod:gov (7; 0% instances), aux (6; 0% instances), csubj (6; 0% instances), iobj (3; 0% instances), nummod (3; 0% instances), dislocated (1; 0% instances), nsubj:outer (1; 0% instances), obl:tmod (1; 0% instances)

Children of DET nodes belong to 15 different parts of speech: PART (2278; 26% instances), PUNCT (1370; 16% instances), VERB (983; 11% instances), NOUN (795; 9% instances), ADJ (585; 7% instances), ADP (568; 7% instances), ADV (544; 6% instances), CCONJ (414; 5% instances), PRON (370; 4% instances), DET (325; 4% instances), AUX (138; 2% instances), PROPN (113; 1% instances), SCONJ (105; 1% instances), NUM (15; 0% instances), X (8; 0% instances)