Treebank Statistics: UD_Russian-Taiga: POS Tags: DET
There are 83 DET
lemmas (0%), 522 DET
types (0%) and 63833 DET
tokens (4%).
Out of 17 observed tags, the rank of DET
is: 13 in number of lemmas, 8 in number of types and 10 in number of tokens.
The 10 most frequent DET
lemmas: этот, свой, который, весь, такой, его, тот, другой, один, мой
The 10 most frequent DET
types: его, все, ее, их, этот, которые, эти, это, своей, такой
The 10 most frequent ambiguous lemmas: этот (DET 7730, PRON 6), который (DET 5473, X 1), весь (DET 5269, PRON 6, NOUN 3, X 1), его (DET 3625, X 2), тот (DET 3617, PRON 2), другой (DET 3505, ADJ 1), один (DET 2678, NUM 1901), их (DET 1612, CCONJ 1), сей (DET 282, PRON 1), все (PRON 1004, PART 10, DET 2)
The 10 most frequent ambiguous types: его (DET 3371, PRON 2174, X 2), все (DET 1775, PRON 1387, ADV 521, PART 12), ее (DET 1505, PRON 1290), их (DET 1526, PRON 1376, X 2, CCONJ 1), это (PRON 3287, PART 1038, DET 866), всех (DET 815, PRON 280), который (DET 812, X 1), этого (DET 753, PRON 518), один (DET 488, NUM 421), эта (DET 355, NOUN 2)
- его
- все
- ее
- их
- это
- всех
- который
- этого
- один
- эта
Morphology
The form / lemma ratio of DET
is 6.289157 (the average of all parts of speech is 2.706111).
The 1st highest number of forms (23) was observed with the lemma “какой-то”: какая, какая-то, какаято, какие, какие-то, каким, каким-то, какими, какими-то, каких-то, какого, какого-то, какое, какое-то, какой, какой-то, какойто, каком-то, какому-то, какою-то, какую, какую-то, кого.
The 2nd highest number of forms (20) was observed with the lemma “никакой”: какие, каким, какими, каких, какое, какой, каком, какую, ни, никакаго, никакая, никакие, никаким, никакими, никаких, никакого, никакое, никакой, никакому, никакую.
The 3rd highest number of forms (18) was observed with the lemma “весь”: Всёе, вeсь, весь, всëм, все, всего, всей, всем, всеми, всему, всех, всею, всея, всю, вся, всё, всём, свей.
DET
occurs with 11 features: PronType (63833; 100% instances), Number (55996; 88% instances), Case (55708; 87% instances), Gender (38369; 60% instances), Poss (18900; 30% instances), Animacy (7095; 11% instances), Reflex (5886; 9% instances), ExtPos (1187; 2% instances), Abbr (587; 1% instances), Variant (302; 0% instances), Typo (68; 0% instances)
DET
occurs with 32 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Animacy=Inan
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, ExtPos=ADJ
, ExtPos=ADV
, ExtPos=DET
, ExtPos=NOUN
, ExtPos=PRON
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Poss=Yes
, PronType=Dem
, PronType=Emp
, PronType=Exc
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Prs
, PronType=Rel
, PronType=Tot
, Reflex=Yes
, Typo=Yes
, Variant=Short
DET
occurs with 426 feature combinations.
The most frequent feature combination is Poss=Yes|PronType=Prs
(7250 tokens).
Examples: его, ее, их, её, eго, ея
Relations
DET
nodes are attached to their parents using 31 different relations: det (49806; 78% instances), nsubj (3623; 6% instances), obl (2454; 4% instances), conj (1527; 2% instances), obj (1452; 2% instances), obl:float (970; 2% instances), nmod (901; 1% instances), root (795; 1% instances), iobj (533; 1% instances), fixed (426; 1% instances), nsubj:pass (259; 0% instances), xcomp (236; 0% instances), acl (172; 0% instances), parataxis (153; 0% instances), advmod (122; 0% instances), appos (79; 0% instances), ccomp (71; 0% instances), orphan (61; 0% instances), acl:relcl (60; 0% instances), parataxis:discourse (36; 0% instances), advcl (34; 0% instances), amod (14; 0% instances), obl:agent (12; 0% instances), csubj (11; 0% instances), obl:tmod (8; 0% instances), list (6; 0% instances), obl:depict (6; 0% instances), vocative (3; 0% instances), dep (1; 0% instances), expl (1; 0% instances), flat (1; 0% instances)
Parents of DET
nodes belong to 17 different parts of speech: NOUN (46764; 73% instances), VERB (9070; 14% instances), ADJ (2908; 5% instances), PRON (1604; 3% instances), DET (1034; 2% instances), PROPN (925; 1% instances), (795; 1% instances), ADP (329; 1% instances), NUM (192; 0% instances), ADV (118; 0% instances), PART (36; 0% instances), X (32; 0% instances), INTJ (11; 0% instances), AUX (7; 0% instances), CCONJ (5; 0% instances), SYM (2; 0% instances), SCONJ (1; 0% instances)
53504 (84%) DET
nodes are leaves.
6908 (11%) DET
nodes have one child.
1928 (3%) DET
nodes have two children.
1493 (2%) DET
nodes have three or more children.
The highest child degree of a DET
node is 11.
Children of DET
nodes are attached using 41 different relations: case (2866; 17% instances), punct (2671; 16% instances), advmod (2270; 14% instances), cc (1415; 9% instances), nmod (1326; 8% instances), fixed (1259; 8% instances), conj (1049; 6% instances), nsubj (929; 6% instances), obl (681; 4% instances), acl:relcl (556; 3% instances), det (282; 2% instances), parataxis (260; 2% instances), cop (250; 2% instances), acl (119; 1% instances), appos (114; 1% instances), orphan (102; 1% instances), parataxis:discourse (102; 1% instances), mark (86; 1% instances), goeswith (41; 0% instances), advcl (36; 0% instances), amod (24; 0% instances), discourse (22; 0% instances), ccomp (18; 0% instances), obl:tmod (17; 0% instances), expl (16; 0% instances), nummod:gov (14; 0% instances), dislocated (13; 0% instances), aux (12; 0% instances), iobj (8; 0% instances), obl:pronmod (8; 0% instances), vocative (8; 0% instances), csubj (6; 0% instances), list (6; 0% instances), nummod (4; 0% instances), obl:float (4; 0% instances), compound (1; 0% instances), flat (1; 0% instances), flat:goeswith (1; 0% instances), nsubj:outer (1; 0% instances), obl:depict (1; 0% instances), reparandum (1; 0% instances)
Children of DET
nodes belong to 17 different parts of speech: ADP (2815; 17% instances), PART (2694; 16% instances), PUNCT (2671; 16% instances), NOUN (2436; 15% instances), CCONJ (1402; 8% instances), DET (1034; 6% instances), VERB (910; 5% instances), PRON (807; 5% instances), ADV (638; 4% instances), ADJ (421; 3% instances), AUX (263; 2% instances), PROPN (229; 1% instances), SCONJ (146; 1% instances), X (60; 0% instances), NUM (55; 0% instances), INTJ (10; 0% instances), SYM (9; 0% instances)