Treebank Statistics: UD_Czech-PUD: POS Tags: DET
There are 27 DET lemmas (1%), 126 DET types (2%) and 841 DET tokens (5%).
Out of 15 observed tags, the rank of DET is: 8 in number of lemmas, 7 in number of types and 7 in number of tokens.
The 10 most frequent DET lemmas: který, ten, jeho, tento, svůj, mnoho, můj, každý, všechen, některý
The 10 most frequent DET types: to, který, jeho, které, která, jejich, své, mnoho, toho, její
The 10 most frequent ambiguous lemmas: ten (DET 177, PRON 1), hodně (ADV 15, DET 13), jenž (PRON 19, DET 8), málo (ADV 5, DET 3)
The 10 most frequent ambiguous types: to (DET 75, PART 6), více (ADV 16, DET 9), víc (ADV 6, DET 4), jehož (DET 3, PRON 1), málo (ADV 1, DET 1), méně (ADV 3, DET 1)
- to
- více
- víc
- jehož
- málo
- méně
Morphology
The form / lemma ratio of DET is 4.666667 (the average of all parts of speech is 1.426064).
The 1st highest number of forms (13) was observed with the lemma “ten”: ta, ten, to, toho, tom, tomu, tou, ty, té, tím, těch, těm, těmi.
The 2nd highest number of forms (12) was observed with the lemma “tento”: tato, tento, tohoto, tomto, toto, tuto, tyto, této, tímto, těchto, těmito, těmto.
The 3rd highest number of forms (11) was observed with the lemma “který”: kterou, která, které, kterého, kterém, kterému, který, kterých, kterým, kterými, kteří.
DET occurs with 15 features: PronType (841; 100% instances), Case (735; 87% instances), Number (697; 83% instances), Gender (641; 76% instances), Poss (226; 27% instances), Number[psor] (139; 17% instances), Person (139; 17% instances), Animacy (95; 11% instances), Reflex (87; 10% instances), Gender[psor] (82; 10% instances), NumType (46; 5% instances), Polarity (15; 2% instances), Degree (14; 2% instances), Abbr (13; 2% instances), Variant (7; 1% instances)
DET occurs with 37 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Degree=Cmp, Gender=Fem, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc,Neut, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Polarity=Pos, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Variant=Short
DET occurs with 154 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Neut|Number=Sing|PronType=Dem (86 tokens).
Examples: to, toto, tohle
Relations
DET nodes are attached to their parents using 17 different relations: det (419; 50% instances), nsubj (218; 26% instances), det:numgov (43; 5% instances), obl (39; 5% instances), obj (36; 4% instances), obl:arg (27; 3% instances), nsubj:pass (17; 2% instances), nmod (14; 2% instances), det:nummod (10; 1% instances), conj (6; 1% instances), advcl (3; 0% instances), root (3; 0% instances), acl (2; 0% instances), acl:relcl (1; 0% instances), amod (1; 0% instances), ccomp (1; 0% instances), csubj (1; 0% instances)
Parents of DET nodes belong to 10 different parts of speech: NOUN (497; 59% instances), VERB (245; 29% instances), ADJ (66; 8% instances), ADV (7; 1% instances), DET (7; 1% instances), PRON (6; 1% instances), PROPN (5; 1% instances), NUM (3; 0% instances), (3; 0% instances), AUX (2; 0% instances)
716 (85%) DET nodes are leaves.
85 (10%) DET nodes have one child.
31 (4%) DET nodes have two children.
9 (1%) DET nodes have three or more children.
The highest child degree of a DET node is 5.
Children of DET nodes are attached using 18 different relations: case (58; 32% instances), punct (27; 15% instances), acl (24; 13% instances), acl:relcl (15; 8% instances), advmod (10; 5% instances), nmod (8; 4% instances), obl (8; 4% instances), cop (7; 4% instances), advmod:emph (6; 3% instances), nsubj (6; 3% instances), conj (4; 2% instances), cc (3; 2% instances), advcl (1; 1% instances), amod (1; 1% instances), det (1; 1% instances), mark (1; 1% instances), orphan (1; 1% instances), vocative (1; 1% instances)
Children of DET nodes belong to 14 different parts of speech: ADP (58; 32% instances), PUNCT (27; 15% instances), VERB (25; 14% instances), NOUN (19; 10% instances), ADV (12; 7% instances), ADJ (9; 5% instances), AUX (8; 4% instances), DET (7; 4% instances), PART (5; 3% instances), PROPN (4; 2% instances), CCONJ (3; 2% instances), PRON (3; 2% instances), NUM (1; 1% instances), SCONJ (1; 1% instances)