Treebank Statistics: UD_Czech-PDTC: POS Tags: DET
There are 81 DET lemmas (0%), 486 DET types (0%) and 154606 DET tokens (4%).
Out of 17 observed tags, the rank of DET is: 11 in number of lemmas, 8 in number of types and 7 in number of tokens.
The 10 most frequent DET lemmas: ten, který, tento, jeho, svůj, všechen, můj, náš, takový, nějaký
The 10 most frequent DET types: to, které, který, která, jeho, své, jejich, tím, toho, této
The 10 most frequent ambiguous lemmas: svůj (DET 10402, ADJ 5), mnoho (DET 1329, ADV 24), to (DET 1151, PART 602, X 12), tolik (DET 338, ADV 201), málo (DET 299, ADV 242, NOUN 39), pár (DET 273, NOUN 221), malinko (ADV 12, NOUN 10, DET 4)
The 10 most frequent ambiguous types: to (DET 24346, PART 602, X 12), jeho (DET 4085, PRON 20), své (DET 3023, ADJ 1), tom (DET 2517, PROPN 5), té (DET 1801, ADJ 1), ta (DET 877, INTJ 2, NOUN 1), ty (DET 900, PRON 46), svůj (DET 1069, ADJ 1), mnoho (DET 753, ADV 24), ti (DET 443, PRON 99, ADJ 2)
- to
- jeho
- své
- DET 3023: Podle toho ke své profesi nejednou přistupují - nic je k ní netáhne .
- ADJ 1: Jinak samozřejmě nevíme , co všechno vzalo za své v plamenech žároviště , a nemůžeme proto jednoznačně odpovědět na otázku , zda šlo skutečně o natolik chudé obyvatelstvo , že jedinou výbavou jejich hrobů byly rozbité nádoby .
- tom
- té
- ta
- DET 877: Teď ještě ta druhá - peníze .
- INTJ 2: Ra - ta - ta .
- NOUN 1: Peter Honak , profesor historie na Maďarské akademii věd , k 50 . výročí osvobození Maďarska sovětskou armádou * Užívej si všeho , ale po padesátce dávej vale třem “ ta “ - wanita ( ženy ) , harta ( bohatství ) a tahta ( postavení ) .
- ty
- svůj
- mnoho
- ti
Morphology
The form / lemma ratio of DET is 6.000000 (the average of all parts of speech is 2.169184).
The 1st highest number of forms (17) was observed with the lemma “můj”: moje, moji, mojí, mou, muj, má, mé, mého, mém, mému, mí, mých, mýho, mým, mýma, mými, můj.
The 2nd highest number of forms (17) was observed with the lemma “samý”: sama, sami, samo, samou, samu, samy, samá, samé, samého, samém, samému, samí, samý, samých, samým, samými, sám.
The 3rd highest number of forms (17) was observed with the lemma “tenhle”: tahle, tenhle, tihle, tohle, tohohle, tomhle, tomuhle, touhle, tuhle, tyhle, téhle, tímhle, těchhle, těhle, těmahle, těmhle, těmihle.
DET occurs with 16 features: PronType (154606; 100% instances), Case (143891; 93% instances), Number (139165; 90% instances), Gender (129582; 84% instances), Poss (31074; 20% instances), Number[psor] (20632; 13% instances), Person (20632; 13% instances), Animacy (18066; 12% instances), Reflex (10402; 7% instances), Gender[psor] (8316; 5% instances), NumType (5026; 3% instances), Variant (1514; 1% instances), ExtPos (147; 0% instances), Style (140; 0% instances), Abbr (14; 0% instances), Typo (2; 0% instances)
DET occurs with 45 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, ExtPos=ADJ, ExtPos=ADV, ExtPos=CCONJ, Gender=Fem, Gender=Fem,Neut, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc,Neut, NumType=Card, NumType=Ord, NumType=Sets, Number=Dual, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Dem,Ind, PronType=Emp, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Coll, Typo=Yes, Variant=Short
DET occurs with 468 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Neut|Number=Sing|PronType=Dem (22666 tokens).
Examples: to, toto, tohle, takové, tohleto, totéž, ono, tadyto, takovéto, tamto
Relations
DET nodes are attached to their parents using 29 different relations: det (72974; 47% instances), nsubj (40986; 27% instances), obj (11699; 8% instances), obl:arg (6403; 4% instances), obl (6176; 4% instances), det:numgov (2936; 2% instances), nsubj:pass (2468; 2% instances), nmod (1804; 1% instances), root (1479; 1% instances), advcl:pred (1376; 1% instances), conj (1326; 1% instances), det:nummod (1295; 1% instances), discourse (986; 1% instances), dep (631; 0% instances), appos (460; 0% instances), amod (307; 0% instances), acl:relcl (301; 0% instances), advcl (286; 0% instances), ccomp (201; 0% instances), cc (114; 0% instances), advmod:emph (80; 0% instances), orphan (76; 0% instances), iobj (72; 0% instances), xcomp (62; 0% instances), acl (44; 0% instances), parataxis (30; 0% instances), csubj (23; 0% instances), advmod (7; 0% instances), csubj:pass (4; 0% instances)
Parents of DET nodes belong to 17 different parts of speech: NOUN (88657; 57% instances), VERB (48620; 31% instances), ADJ (7993; 5% instances), ADV (2071; 1% instances), (1479; 1% instances), PRON (1389; 1% instances), PROPN (1359; 1% instances), DET (1350; 1% instances), NUM (694; 0% instances), AUX (540; 0% instances), PART (299; 0% instances), X (70; 0% instances), CCONJ (43; 0% instances), SYM (26; 0% instances), ADP (11; 0% instances), INTJ (4; 0% instances), SCONJ (1; 0% instances)
130449 (84%) DET nodes are leaves.
14740 (10%) DET nodes have one child.
5874 (4%) DET nodes have two children.
3543 (2%) DET nodes have three or more children.
The highest child degree of a DET node is 11.
Children of DET nodes are attached using 35 different relations: case (11517; 28% instances), acl (5292; 13% instances), punct (5105; 12% instances), advmod:emph (3671; 9% instances), acl:relcl (2533; 6% instances), cop (2290; 6% instances), nsubj (1719; 4% instances), advmod (1333; 3% instances), nmod (1273; 3% instances), amod (1165; 3% instances), cc (1041; 3% instances), conj (759; 2% instances), mark (566; 1% instances), det (481; 1% instances), obl (467; 1% instances), appos (370; 1% instances), dep (337; 1% instances), advcl (277; 1% instances), orphan (211; 1% instances), aux (175; 0% instances), fixed (150; 0% instances), nummod (68; 0% instances), parataxis (61; 0% instances), ccomp (52; 0% instances), csubj (44; 0% instances), obl:arg (43; 0% instances), obj (37; 0% instances), discourse (27; 0% instances), xcomp (24; 0% instances), advcl:pred (13; 0% instances), det:nummod (6; 0% instances), vocative (4; 0% instances), expl:pass (3; 0% instances), expl:pv (1; 0% instances), nsubj:pass (1; 0% instances)
Children of DET nodes belong to 16 different parts of speech: ADP (11470; 28% instances), VERB (6985; 17% instances), PUNCT (5105; 12% instances), NOUN (3155; 8% instances), PART (3099; 8% instances), AUX (2595; 6% instances), ADJ (2298; 6% instances), ADV (2150; 5% instances), DET (1350; 3% instances), CCONJ (1299; 3% instances), PRON (607; 1% instances), SCONJ (553; 1% instances), PROPN (256; 1% instances), NUM (174; 0% instances), X (19; 0% instances), SYM (1; 0% instances)