home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: DET

There are 83 DET lemmas (0%), 522 DET types (0%) and 63833 DET tokens (4%). Out of 17 observed tags, the rank of DET is: 13 in number of lemmas, 8 in number of types and 10 in number of tokens.

The 10 most frequent DET lemmas: этот, свой, который, весь, такой, его, тот, другой, один, мой

The 10 most frequent DET types: его, все, ее, их, этот, которые, эти, это, своей, такой

The 10 most frequent ambiguous lemmas: этот (DET 7730, PRON 6), который (DET 5473, X 1), весь (DET 5269, PRON 6, NOUN 3, X 1), его (DET 3625, X 2), тот (DET 3617, PRON 2), другой (DET 3505, ADJ 1), один (DET 2678, NUM 1901), их (DET 1612, CCONJ 1), сей (DET 282, PRON 1), все (PRON 1004, PART 10, DET 2)

The 10 most frequent ambiguous types: его (DET 3371, PRON 2174, X 2), все (DET 1775, PRON 1387, ADV 521, PART 12), ее (DET 1505, PRON 1290), их (DET 1526, PRON 1376, X 2, CCONJ 1), это (PRON 3287, PART 1038, DET 866), всех (DET 815, PRON 280), который (DET 812, X 1), этого (DET 753, PRON 518), один (DET 488, NUM 421), эта (DET 355, NOUN 2)

Morphology

The form / lemma ratio of DET is 6.289157 (the average of all parts of speech is 2.706111).

The 1st highest number of forms (23) was observed with the lemma “какой-то”: какая, какая-то, какаято, какие, какие-то, каким, каким-то, какими, какими-то, каких-то, какого, какого-то, какое, какое-то, какой, какой-то, какойто, каком-то, какому-то, какою-то, какую, какую-то, кого.

The 2nd highest number of forms (20) was observed with the lemma “никакой”: какие, каким, какими, каких, какое, какой, каком, какую, ни, никакаго, никакая, никакие, никаким, никакими, никаких, никакого, никакое, никакой, никакому, никакую.

The 3rd highest number of forms (18) was observed with the lemma “весь”: Всёе, вeсь, весь, всëм, все, всего, всей, всем, всеми, всему, всех, всею, всея, всю, вся, всё, всём, свей.

DET occurs with 11 features: PronType (63833; 100% instances), Number (55996; 88% instances), Case (55708; 87% instances), Gender (38369; 60% instances), Poss (18900; 30% instances), Animacy (7095; 11% instances), Reflex (5886; 9% instances), ExtPos (1187; 2% instances), Abbr (587; 1% instances), Variant (302; 0% instances), Typo (68; 0% instances)

DET occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=ADJ, ExtPos=ADV, ExtPos=DET, ExtPos=NOUN, ExtPos=PRON, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Exc, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes, Variant=Short

DET occurs with 426 feature combinations. The most frequent feature combination is Poss=Yes|PronType=Prs (7250 tokens). Examples: его, ее, их, её, eго, ея

Relations

DET nodes are attached to their parents using 31 different relations: det (49806; 78% instances), nsubj (3623; 6% instances), obl (2454; 4% instances), conj (1527; 2% instances), obj (1452; 2% instances), obl:float (970; 2% instances), nmod (901; 1% instances), root (795; 1% instances), iobj (533; 1% instances), fixed (426; 1% instances), nsubj:pass (259; 0% instances), xcomp (236; 0% instances), acl (172; 0% instances), parataxis (153; 0% instances), advmod (122; 0% instances), appos (79; 0% instances), ccomp (71; 0% instances), orphan (61; 0% instances), acl:relcl (60; 0% instances), parataxis:discourse (36; 0% instances), advcl (34; 0% instances), amod (14; 0% instances), obl:agent (12; 0% instances), csubj (11; 0% instances), obl:tmod (8; 0% instances), list (6; 0% instances), obl:depict (6; 0% instances), vocative (3; 0% instances), dep (1; 0% instances), expl (1; 0% instances), flat (1; 0% instances)

Parents of DET nodes belong to 17 different parts of speech: NOUN (46764; 73% instances), VERB (9070; 14% instances), ADJ (2908; 5% instances), PRON (1604; 3% instances), DET (1034; 2% instances), PROPN (925; 1% instances), (795; 1% instances), ADP (329; 1% instances), NUM (192; 0% instances), ADV (118; 0% instances), PART (36; 0% instances), X (32; 0% instances), INTJ (11; 0% instances), AUX (7; 0% instances), CCONJ (5; 0% instances), SYM (2; 0% instances), SCONJ (1; 0% instances)

53504 (84%) DET nodes are leaves.

6908 (11%) DET nodes have one child.

1928 (3%) DET nodes have two children.

1493 (2%) DET nodes have three or more children.

The highest child degree of a DET node is 11.

Children of DET nodes are attached using 41 different relations: case (2866; 17% instances), punct (2671; 16% instances), advmod (2270; 14% instances), cc (1415; 9% instances), nmod (1326; 8% instances), fixed (1259; 8% instances), conj (1049; 6% instances), nsubj (929; 6% instances), obl (681; 4% instances), acl:relcl (556; 3% instances), det (282; 2% instances), parataxis (260; 2% instances), cop (250; 2% instances), acl (119; 1% instances), appos (114; 1% instances), orphan (102; 1% instances), parataxis:discourse (102; 1% instances), mark (86; 1% instances), goeswith (41; 0% instances), advcl (36; 0% instances), amod (24; 0% instances), discourse (22; 0% instances), ccomp (18; 0% instances), obl:tmod (17; 0% instances), expl (16; 0% instances), nummod:gov (14; 0% instances), dislocated (13; 0% instances), aux (12; 0% instances), iobj (8; 0% instances), obl:pronmod (8; 0% instances), vocative (8; 0% instances), csubj (6; 0% instances), list (6; 0% instances), nummod (4; 0% instances), obl:float (4; 0% instances), compound (1; 0% instances), flat (1; 0% instances), flat:goeswith (1; 0% instances), nsubj:outer (1; 0% instances), obl:depict (1; 0% instances), reparandum (1; 0% instances)

Children of DET nodes belong to 17 different parts of speech: ADP (2815; 17% instances), PART (2694; 16% instances), PUNCT (2671; 16% instances), NOUN (2436; 15% instances), CCONJ (1402; 8% instances), DET (1034; 6% instances), VERB (910; 5% instances), PRON (807; 5% instances), ADV (638; 4% instances), ADJ (421; 3% instances), AUX (263; 2% instances), PROPN (229; 1% instances), SCONJ (146; 1% instances), X (60; 0% instances), NUM (55; 0% instances), INTJ (10; 0% instances), SYM (9; 0% instances)