Treebank Statistics: UD_Bulgarian-BTB: POS Tags: DET
There are 23 DET
lemmas (0%), 141 DET
types (1%) and 2433 DET
tokens (2%).
Out of 16 observed tags, the rank of DET
is: 11 in number of lemmas, 7 in number of types and 11 in number of tokens.
The 10 most frequent DET
lemmas: този, всеки, един, какъв, наш, мой, свой, такъв, някой, никакъв
The 10 most frequent DET
types: тази, този, тези, това, всички, един, какво, една, всеки, всяка
The 10 most frequent ambiguous lemmas: този (DET 793, PRON 540), всеки (DET 292, PRON 112), един (DET 250, NUM 225, PRON 6), наш (DET 184, PRON 164), мой (PRON 378, DET 162), свой (PRON 623, DET 123), някой (PRON 92, DET 90), ваш (DET 35, PRON 33), какъвто (DET 22, PRON 10), кой (PRON 96, DET 20, PROPN 1)
The 10 most frequent ambiguous types: това (PRON 288, DET 131), всички (DET 129, PRON 30), един (DET 88, NUM 59, PRON 1), една (DET 80, NUM 53), всеки (DET 47, PRON 6), едно (DET 42, NUM 40, PRON 2), някои (DET 36, PRON 6), някой (PRON 17, DET 12), каквото (PRON 9, DET 8), кой (PRON 31, DET 8)
- това
- всички
- един
- една
- всеки
- едно
- някои
- някой
- каквото
- кой
Morphology
The form / lemma ratio of DET
is 6.130435 (the average of all parts of speech is 1.727244).
The 1st highest number of forms (27) was observed with the lemma “мой”: Моят, мое, моето, мои, моите, мой, моя, моята, негов, негова, неговата, негови, неговите, неговия, неговият, негово, неговото, неин, нейна, нейната, нейни, нейните, нейния, нейният, нейното, твое, твоите.
The 2nd highest number of forms (18) was observed with the lemma “наш”: наш, наша, нашата, наше, нашето, наши, нашите, нашия, нашият, техен, техни, техните, техния, техният, тяхна, тяхната, тяхно, тяхното.
The 3rd highest number of forms (15) was observed with the lemma “този”: онази, онези, онзи, ония, онова, оня, тeзи, тази, тая, тези, тия, това, този, тоя, туй.
DET
occurs with 9 features: Number (2433; 100% instances), PronType (2433; 100% instances), Gender (1718; 71% instances), Case (899; 37% instances), Definite (793; 33% instances), Poss (512; 21% instances), Person (389; 16% instances), Reflex (123; 5% instances), Animacy (1; 0% instances)
DET
occurs with 22 feature-value pairs: Animacy=Anim
, Case=Acc
, Case=Nom
, Definite=Def
, Definite=Ind
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Number=Plur
, Number=Sing
, Person=1
, Person=2
, Person=3
, Poss=Yes
, PronType=Dem
, PronType=Ind
, PronType=Int
, PronType=Neg
, PronType=Prs
, PronType=Rel
, PronType=Tot
, Reflex=Yes
DET
occurs with 81 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing|PronType=Dem
(235 tokens).
Examples: този, такъв, тоя, оня, онзи
Relations
DET
nodes are attached to their parents using 18 different relations: det (2014; 83% instances), nsubj (109; 4% instances), obj (108; 4% instances), root (68; 3% instances), iobj (37; 2% instances), nmod (25; 1% instances), conj (18; 1% instances), obl (18; 1% instances), nsubj:pass (14; 1% instances), ccomp (10; 0% instances), advcl (3; 0% instances), appos (3; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Parents of DET
nodes belong to 10 different parts of speech: NOUN (2047; 84% instances), VERB (246; 10% instances), (68; 3% instances), AUX (36; 1% instances), PROPN (15; 1% instances), PRON (7; 0% instances), ADJ (6; 0% instances), ADV (5; 0% instances), DET (2; 0% instances), NUM (1; 0% instances)
2071 (85%) DET
nodes are leaves.
179 (7%) DET
nodes have one child.
82 (3%) DET
nodes have two children.
101 (4%) DET
nodes have three or more children.
The highest child degree of a DET
node is 7.
Children of DET
nodes are attached using 21 different relations: punct (100; 14% instances), nmod (94; 13% instances), case (93; 13% instances), cop (84; 12% instances), acl:relcl (78; 11% instances), nsubj (75; 11% instances), advmod (58; 8% instances), acl (41; 6% instances), fixed (17; 2% instances), cc (16; 2% instances), discourse (14; 2% instances), obl (11; 2% instances), conj (10; 1% instances), aux (7; 1% instances), det (6; 1% instances), expl (2; 0% instances), iobj (2; 0% instances), mark (2; 0% instances), advcl (1; 0% instances), amod (1; 0% instances), csubj (1; 0% instances)
Children of DET
nodes belong to 14 different parts of speech: NOUN (149; 21% instances), VERB (109; 15% instances), AUX (103; 14% instances), PUNCT (100; 14% instances), ADP (93; 13% instances), ADV (48; 7% instances), PRON (30; 4% instances), PART (27; 4% instances), CCONJ (22; 3% instances), ADJ (19; 3% instances), PROPN (8; 1% instances), DET (2; 0% instances), SCONJ (2; 0% instances), NUM (1; 0% instances)