Treebank Statistics: UD_Old_East_Slavic-RNC: POS Tags: DET
There are 55 DET
lemmas (1%), 650 DET
types (3%) and 4530 DET
tokens (5%).
Out of 17 observed tags, the rank of DET
is: 10 in number of lemmas, 5 in number of types and 8 in number of tokens.
The 10 most frequent DET
lemmas: тотъ, весь, свой, твой, нашъ, сей, мой, который, всякий, иной
The 10 most frequent DET
types: того, всеа, своего, сей, твой, те, все, которые, тех, тѣхъ
The 10 most frequent ambiguous lemmas: сей (DET 289, PRON 1), другой (DET 82, ADJ 2), самъ (DET 72, NOUN 1), одинъ (NUM 118, DET 6, ADJ 2), единъ (NUM 28, DET 4), премногий (ADJ 2, DET 2), злодѣй (NOUN 2, DET 1), смѣкальный (ADJ 8, DET 1)
The 10 most frequent ambiguous types: того (DET 126, PRON 70), все (DET 64, ADV 3), то (PRON 93, DET 54, SCONJ 14, PART 4), тое (DET 46, PRON 1), вся (DET 40, PRON 2), сего (DET 33, PRON 20), тово (DET 27, PRON 16), тому (PRON 37, DET 33), та (DET 31, PART 1), т. (DET 28, PRON 3)
- того
- все
- то
- PRON 93: [Начало оторвано] … про то де онъ Захарко не вѣдаетъ .
- DET 54: Да 6 бочек вина , и то вино не мерено .
- SCONJ 14: И аще ли , господине , тако пребудет в недузе том , то воистинну , господине , вѣждь , яко нѣчто добродѣтели ея хощетъ Богъ упокоити ю отъ маловременныя сея и болѣзненыя жизни ко оному нестареющемуся блаженству .
- PART 4: А пожнѣ , что твои и твоихъ мужь пошло , то твое и твоихъ мужь ; а новгородьское Новугороду .
- тое
- вся
- сего
- тово
- тому
- та
- т.
- DET 28: А мы , с. т. , с той земли станем платить аброк .
- PRON 3: Въ нынѣшнемъ государь , въ 206 году , генваря въ 5 день , билъ челомъ тебѣ великому государю , полная [ т. е. титулъ ] на Кунгурѣ въ приказной избѣ подалъ челобитную мнѣ холопу твоему кунгурской стрѣлецъ Ивашко Моржегоревъ на отставного подъячего Ивана Шавкунова да на человѣка ево на Петрушку Нижегородова въ угрозѣ ему Ивашку къ смертному убійству и въ иномъ ево Ивановѣ воровствѣ .
Morphology
The form / lemma ratio of DET
is 11.818182 (the average of all parts of speech is 2.250521).
The 1st highest number of forms (46) was observed with the lemma “всякий”: ВСЯКОЯ, всякiя, всяка, всякаго, всякая, всякие, всякии, всякий, всяким, всякими, всякимъ, всяких, всякихъ, всякия, всякова, всяково, всяког(о), всякого, всякое, всякой, всякому, всякомъ, всякою, всяку, всякую, всякъ, всякіе, всякія, всяцей, всꙗкаго, всꙗки(м), всꙗкими, всꙗкихъ, всꙗкиꙗ, всꙗко, всꙗкого, всꙗкой, всꙗкомꙋ, всꙗкъ, всꙗкіꙗ, въся(ко)мъ, въсякимъ, въсякое, въсякой, въсякомъ, въсякую.
The 2nd highest number of forms (46) was observed with the lemma “тотъ”: т[а], та, таво, те, тем, теми, темъ, тех, тии, тимъ, тихъ, то, то(й), то(м), тово, тог[о], того, тое, тоей, тои, той, том, тому, томъ, томꙋ, тот, тотъ, тою, тоя, тоі, тоѣ, ту, тую, тые, тыи, тыми, тымъ, тыхъ, тѣ, тѣи, тѣм, тѣми, тѣмъ, тѣни, тѣх, тѣхъ.
The 3rd highest number of forms (44) was observed with the lemma “весь”: ве(с), вес, вес[ь], весь, все, все(м), всеа, всег[о], всего, всее, всеи, всей, всем, всеми, всему, всемъ, всемꙋ, всех, всехъ, всею, всея, вси, всимъ, всихъ, всомъ, всь, всю, вся, всяго, всѣ, всѣи, всѣм, всѣми, всѣмъ, всѣх, всѣхъ, всꙗ, въвесь, въсе, въсемъ, въсехъ, въсю, вьсе, вьси.
DET
occurs with 10 features: PronType (4523; 100% instances), Case (4494; 99% instances), Number (4492; 99% instances), Gender (4491; 99% instances), Poss (1572; 35% instances), Reflex (522; 12% instances), Animacy (190; 4% instances), Variant (53; 1% instances), Abbr (38; 1% instances), Typo (1; 0% instances)
DET
occurs with 27 feature-value pairs: Abbr=Yes
, Animacy=Anim
, Case=Acc
, Case=Dat
, Case=Gen
, Case=Ins
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Dual
, Number=Plur
, Number=Sing
, Poss=Yes
, PronType=Dem
, PronType=Emp
, PronType=Exc
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Prs
, PronType=Rel
, PronType=Tot
, Reflex=Yes
, Typo=Yes
, Variant=Short
DET
occurs with 301 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing|PronType=Tot
(170 tokens).
Examples: всеа, всея, всеи, всей, всее, всякия, всꙗкіꙗ, другой, ВСЯКОЯ, всякiя
Relations
DET
nodes are attached to their parents using 23 different relations: det (3955; 87% instances), nsubj (139; 3% instances), obl (123; 3% instances), obj (91; 2% instances), obl:float (49; 1% instances), nmod (40; 1% instances), conj (34; 1% instances), iobj (33; 1% instances), nsubj:pass (21; 0% instances), acl (12; 0% instances), fixed (10; 0% instances), orphan (4; 0% instances), ccomp (3; 0% instances), parataxis (3; 0% instances), advcl (2; 0% instances), dep (2; 0% instances), obl:agent (2; 0% instances), root (2; 0% instances), acl:relcl (1; 0% instances), dislocated (1; 0% instances), expl:pv (1; 0% instances), obl:depict (1; 0% instances), xcomp (1; 0% instances)
Parents of DET
nodes belong to 11 different parts of speech: NOUN (3704; 82% instances), VERB (427; 9% instances), PROPN (199; 4% instances), ADJ (103; 2% instances), PRON (49; 1% instances), DET (25; 1% instances), ADP (9; 0% instances), ADV (6; 0% instances), NUM (5; 0% instances), (2; 0% instances), PART (1; 0% instances)
4013 (89%) DET
nodes are leaves.
431 (10%) DET
nodes have one child.
51 (1%) DET
nodes have two children.
35 (1%) DET
nodes have three or more children.
The highest child degree of a DET
node is 15.
Children of DET
nodes are attached using 26 different relations: advmod (183; 27% instances), case (141; 21% instances), punct (46; 7% instances), appos (42; 6% instances), acl:relcl (41; 6% instances), cc (39; 6% instances), conj (37; 5% instances), acl (27; 4% instances), discourse (20; 3% instances), vocative (20; 3% instances), det (18; 3% instances), nsubj (14; 2% instances), nmod (10; 1% instances), orphan (10; 1% instances), cop (7; 1% instances), mark (5; 1% instances), dislocated (4; 1% instances), dep (3; 0% instances), obl (3; 0% instances), parataxis (3; 0% instances), advcl (2; 0% instances), amod (2; 0% instances), expl (2; 0% instances), aux (1; 0% instances), fixed (1; 0% instances), obl:pronmod (1; 0% instances)
Children of DET
nodes belong to 15 different parts of speech: PART (201; 29% instances), ADP (140; 21% instances), NOUN (105; 15% instances), VERB (62; 9% instances), PUNCT (46; 7% instances), CCONJ (41; 6% instances), DET (25; 4% instances), ADJ (17; 2% instances), PRON (15; 2% instances), AUX (9; 1% instances), PROPN (6; 1% instances), SCONJ (6; 1% instances), ADV (3; 0% instances), NUM (3; 0% instances), X (3; 0% instances)