Treebank Statistics: UD_Russian-SynTagRus: POS Tags: DET
There are 57 DET lemmas (0%), 445 DET types (0%) and 54748 DET tokens (4%).
Out of 17 observed tags, DET ranks 11th in number of lemmas, 8th in number of types and 9th in number of tokens.
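The lemma, type and token counts above can be reproduced from a CoNLL-U file with a few lines of Python. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts`, the helper `row` and the toy sample are hypothetical, and case-folding word forms before counting types is an assumption.

```python
def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular word lines: 10 columns and an integer ID.
        # This skips comments, blank lines, ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1].lower())  # assumption: types are case-folded
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens


def row(idx, form, lemma, upos):
    """Build one toy CoNLL-U line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", "_", "0", "dep", "_", "_"])


# Made-up sample, for illustration only.
sample = "\n".join([
    "# text = toy example",
    row(1, "этот", "этот", "DET"),
    row(2, "дом", "дом", "NOUN"),
    row(3, "Этот", "этот", "DET"),
    row(4, "эти", "этот", "DET"),
])

print(upos_counts(sample, "DET"))  # (1, 2, 3)
```

On the toy sample the three DET tokens share one lemma and, after case-folding, two types.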
The 10 most frequent DET lemmas: этот, который, свой, весь, тот, такой, другой, его, наш, самый
The 10 most frequent DET types: его, все, которые, их, который, этот, эти, ее, этой, которых
The most frequent ambiguous lemmas (only 3 are listed): один (NUM 2706, DET 984, NOUN 3), капиталистический (ADJ 49, DET 1), тем (SCONJ 110, DET 1)
The 10 most frequent ambiguous types: его (DET 2184, PRON 2062), все (DET 1669, PRON 1481, PART 676, ADV 5), их (PRON 1531, DET 1190), ее (PRON 1108, DET 898), этой (DET 923, PRON 1), всех (DET 715, PRON 231), это (PRON 3737, PART 961, DET 568), этом (PRON 1061, DET 656, PART 1), этого (PRON 736, DET 644, PART 1), том (PRON 1206, DET 546, NOUN 13)
Morphology
The form / lemma ratio of DET is 7.807018 (the average of all parts of speech is 2.668831).
The highest number of forms (15) was observed with the lemma “свой”: свое, своего, своей, своем, своему, своею, свои, своим, своими, своих, свой, свою, своя, своё, своём.
The second highest number of forms (14) was observed with the lemma “весь”: весь, все, всего, всей, всем, всеми, всему, всех, всею, всея, всю, вся, всё, всём.
The lemma “мой” ties with “весь” at 14 forms: мое, моего, моей, моем, моему, моею, мои, моим, моими, моих, мой, мою, моя, моём.
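The form / lemma ratio quoted above is the number of DET types divided by the number of DET lemmas (445 / 57). A short sketch of that arithmetic, and of counting distinct forms per lemma, follows; the function names are hypothetical and the input is limited to figures and forms quoted in this section.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Map each lemma to the set of distinct word forms seen with it."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(n_types, n_lemmas):
    """Average number of distinct forms per lemma."""
    return n_types / n_lemmas


# Counts quoted in this section: 445 DET types, 57 DET lemmas.
ratio = form_lemma_ratio(445, 57)
print(f"{ratio:.6f}")  # 7.807018

# The 15 forms listed above for the lemma "свой".
svoj_forms = (
    "свое своего своей своем своему своею свои своим "
    "своими своих свой свою своя своё своём"
).split()
by_lemma = forms_per_lemma((form, "свой") for form in svoj_forms)
print(len(by_lemma["свой"]))  # 15
```

Forms spelled with "е" and "ё" (for example свое and своё) count as distinct strings here, which matches the way they are listed separately above.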
DET occurs with 11 features: PronType (54748; 100% instances), Number (50153; 92% instances), Case (49961; 91% instances), Gender (32357; 59% instances), Poss (12985; 24% instances), Reflex (4510; 8% instances), Animacy (4063; 7% instances), Variant (192; 0% instances), ExtPos (139; 0% instances), Abbr (18; 0% instances), Typo (1; 0% instances)
DET occurs with 33 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=ADP, ExtPos=ADV, ExtPos=DET, ExtPos=NOUN, ExtPos=PRON, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short
DET occurs with 346 feature combinations.
The most frequent feature combination is Poss=Yes|PronType=Prs (4577 tokens).
Examples: его, их, ее, её, Ей, ея
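Feature combinations like the one above can be tallied by counting the FEATS column for every token with the tag of interest. This is an illustrative sketch only: `feature_combinations`, `row` and the toy rows are hypothetical, and the feature strings in the sample are made up for the example rather than copied from the treebank.

```python
from collections import Counter


def feature_combinations(conllu_text, upos):
    """Count FEATS strings (column 6) over all tokens with the given UPOS."""
    combos = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Skip comments, blank lines, multiword ranges and empty nodes.
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            combos[cols[5]] += 1
    return combos


def row(idx, form, lemma, upos, feats):
    """Build one toy CoNLL-U line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, lemma, upos, "_", feats, "0", "dep", "_", "_"])


# Made-up sample, for illustration only.
sample = "\n".join([
    row(1, "его", "его", "DET", "Poss=Yes|PronType=Prs"),
    row(2, "их", "их", "DET", "Poss=Yes|PronType=Prs"),
    row(3, "этот", "этот", "DET", "Case=Nom|Gender=Masc|Number=Sing|PronType=Dem"),
    row(4, "дом", "дом", "NOUN", "Case=Nom|Gender=Masc|Number=Sing"),
])

combos = feature_combinations(sample, "DET")
print(len(combos))            # 2
print(combos.most_common(1))  # [('Poss=Yes|PronType=Prs', 2)]
```

The number of distinct keys corresponds to the "feature combinations" count, and `most_common(1)` gives the most frequent combination.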
Relations
DET nodes are attached to their parents using 30 different relations: det (42169; 77% instances), nsubj (4128; 8% instances), obl (2037; 4% instances), obj (1283; 2% instances), nmod (886; 2% instances), conj (687; 1% instances), obl:float (658; 1% instances), acl (612; 1% instances), fixed (459; 1% instances), nsubj:pass (454; 1% instances), root (438; 1% instances), iobj (311; 1% instances), xcomp (127; 0% instances), parataxis (81; 0% instances), advmod (76; 0% instances), ccomp (58; 0% instances), acl:relcl (55; 0% instances), orphan (51; 0% instances), obl:tmod (48; 0% instances), parataxis:discourse (47; 0% instances), advcl (32; 0% instances), appos (21; 0% instances), csubj (9; 0% instances), obl:depict (7; 0% instances), flat (4; 0% instances), obl:agent (4; 0% instances), case (2; 0% instances), csubj:pass (2; 0% instances), dislocated (1; 0% instances), flat:name (1; 0% instances)
Parents of DET nodes belong to 14 different parts of speech: NOUN (39342; 72% instances), VERB (8825; 16% instances), ADJ (2540; 5% instances), PRON (1428; 3% instances), DET (807; 1% instances), PROPN (540; 1% instances), no parent, i.e. root (438; 1% instances), ADP (388; 1% instances), NUM (204; 0% instances), ADV (132; 0% instances), PART (71; 0% instances), X (22; 0% instances), SYM (9; 0% instances), CCONJ (2; 0% instances)
45826 (84%) DET nodes are leaves.
6367 (12%) DET nodes have one child.
1475 (3%) DET nodes have two children.
1080 (2%) DET nodes have three or more children.
The highest child degree of a DET node is 9.
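The leaf and child-degree figures above come from counting, for each DET node, how many other nodes in the same sentence name it as their HEAD. A minimal sketch under stated assumptions: `child_degree_histogram`, `row` and the toy sentence are hypothetical, and sentences are assumed to be separated by a single blank line with Unix line endings.

```python
from collections import Counter


def child_degree_histogram(conllu_text, upos):
    """Map number of children -> number of nodes with the given UPOS."""
    hist = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()]
        # Keep regular word lines only (10 columns, integer ID).
        rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
        # Column 7 (HEAD) holds the parent's ID, so counting HEAD values
        # gives the number of children of each node in this sentence.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == upos:
                hist[n_children[r[0]]] += 1
    return hist


def row(idx, form, upos, head, deprel):
    """Build one toy CoNLL-U line; unused columns are left as '_'."""
    return "\t".join([str(idx), form, "_", upos, "_", "_", str(head), deprel, "_", "_"])


# Made-up sentence, for illustration only.
sample = "\n".join([
    row(1, "не", "PART", 2, "advmod"),
    row(2, "все", "DET", 3, "nsubj"),
    row(3, "пришли", "VERB", 0, "root"),
    row(4, "эти", "DET", 5, "det"),
    row(5, "люди", "NOUN", 3, "parataxis"),
])

hist = child_degree_histogram(sample, "DET")
print(sorted(hist.items()))  # [(0, 1), (1, 1)]
```

Here `hist[0]` is the number of leaf nodes and `max(hist)` is the highest child degree; dividing each count by the total gives the percentages reported above (for example 45826 / 54748, which rounds to 84%).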
Children of DET nodes are attached using 35 different relations: advmod (3161; 23% instances), case (2598; 19% instances), punct (2234; 17% instances), cc (845; 6% instances), acl:relcl (801; 6% instances), nsubj (615; 5% instances), conj (593; 4% instances), nmod (469; 3% instances), obl (420; 3% instances), det (379; 3% instances), fixed (214; 2% instances), cop (213; 2% instances), parataxis (206; 2% instances), mark (145; 1% instances), ccomp (133; 1% instances), advcl (94; 1% instances), acl (85; 1% instances), orphan (83; 1% instances), parataxis:discourse (75; 1% instances), obl:tmod (27; 0% instances), appos (24; 0% instances), amod (16; 0% instances), expl (14; 0% instances), csubj (13; 0% instances), nummod:gov (10; 0% instances), aux (9; 0% instances), obl:pronmod (9; 0% instances), iobj (8; 0% instances), nummod (5; 0% instances), discourse (4; 0% instances), flat (4; 0% instances), vocative (4; 0% instances), dislocated (2; 0% instances), nsubj:outer (1; 0% instances), nsubj:pass (1; 0% instances)
Children of DET nodes belong to 15 different parts of speech: PART (2593; 19% instances), ADP (2588; 19% instances), PUNCT (2234; 17% instances), NOUN (1203; 9% instances), VERB (1141; 8% instances), CCONJ (811; 6% instances), DET (807; 6% instances), ADV (727; 5% instances), PRON (531; 4% instances), ADJ (325; 2% instances), AUX (224; 2% instances), SCONJ (190; 1% instances), PROPN (104; 1% instances), NUM (23; 0% instances), X (13; 0% instances)