
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: DET

There are 20 DET lemmas (1%), 90 DET types (2%) and 1161 DET tokens (3%). Out of 15 observed tags, the rank of DET is: 9 in number of lemmas, 7 in number of types and 8 in number of tokens.

The 10 most frequent DET lemmas: který, tento, jeho, ten, všechen, svůj, takový, každý, jejichž, některý

The 10 most frequent DET types: které, jejich, která, jeho, této, tohoto, který, těchto, tyto, tato

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: jejich (DET 93, X 6), to (PART 33, DET 19), tuto (DET 11, ADV 7), jehož (DET 6, PRON 5), t (DET 1, NOUN 1)
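A type counts as ambiguous here when the same word form appears with DET and with at least one other tag. The following is a minimal sketch of that idea, not the script that generated this page: the function name `ambiguous_types` and the toy token list are hypothetical, and whether the real statistics fold case before comparing forms is not stated above, so the sketch compares forms exactly as given.

```python
from collections import Counter, defaultdict

def ambiguous_types(tokens, upos="DET"):
    """Map each form seen with `upos` and another tag to its per-tag counts.

    `tokens` is an iterable of (form, tag) pairs. Forms are compared
    exactly as given (no case folding), which is an assumption.
    """
    tags_by_form = defaultdict(Counter)
    for form, tag in tokens:
        tags_by_form[form][tag] += 1
    return {
        form: tags
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }

# Toy input (made up for illustration, not the treebank counts).
tokens = [("jejich", "DET")] * 3 + [("jejich", "X")] + [("které", "DET")] * 2
result = ambiguous_types(tokens)
print(result)
```

In the toy input only "jejich" is reported, because "které" never occurs with a tag other than DET.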

Morphology

The form / lemma ratio of DET is 4.500000 (the average of all parts of speech is 1.723629).
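The reported value is consistent with dividing the number of distinct DET types (90) by the number of distinct DET lemmas (20). The sketch below assumes that definition; the function name `form_lemma_ratio` and the toy pairs are hypothetical, and the generating script may count forms per lemma differently.

```python
def form_lemma_ratio(pairs):
    """Distinct forms divided by distinct lemmas, for (form, lemma) pairs."""
    forms = {form for form, _ in pairs}
    lemmas = {lemma for _, lemma in pairs}
    return len(forms) / len(lemmas)

# Toy input (made up for illustration): 4 distinct forms over 2 lemmas.
pairs = [
    ("tato", "tento"),
    ("tohoto", "tento"),
    ("tento", "tento"),
    ("tato", "tento"),   # repeated token, counted once as a type
    ("které", "který"),
]
toy_ratio = form_lemma_ratio(pairs)

# The counts reported on this page: 90 types and 20 lemmas.
page_ratio = 90 / 20
print(f"{toy_ratio:.6f} {page_ratio:.6f}")
```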

The 1st highest number of forms (15) was observed with the lemma “tento”: t, tato, tento, tohoto, tomto, tomuto, toto, touto, tuto, tyto, této, tímto, těchto, těmito, těmto.

The 2nd highest number of forms (10) was observed with the lemma “který”: kterou, která, které, kterého, kterém, kterému, který, kterých, kterým, kterými.

The 3rd highest number of forms (8) was observed with the lemma “jeho”: jeho, jejich, její, jejích, jejího, jejím, jejími, jejímu.

DET occurs with 13 features: PronType (1161; 100% of instances), Number (978; 84% of instances), Case (971; 84% of instances), Gender (859; 74% of instances), Poss (240; 21% of instances), Number[psor] (216; 19% of instances), Person (216; 19% of instances), Gender[psor] (111; 10% of instances), Animacy (91; 8% of instances), Reflex (24; 2% of instances), Variant (4; 0% of instances), NumType (2; 0% of instances), Abbr (1; 0% of instances)

DET occurs with 32 feature-value pairs: Abbr=Yes, Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Gender[psor]=Fem, Gender[psor]=Masc,Neut, NumType=Card, Number=Plur, Number=Sing, Number[psor]=Plur, Number[psor]=Sing, Person=3, Poss=Yes, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Variant=Short

DET occurs with 102 feature combinations. The most frequent feature combination is Case=Nom|Gender=Fem|Number=Plur|PronType=Int,Rel (100 tokens). Examples: které, jaké
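Feature counts like the ones above can be derived from the FEATS column of a CoNLL-U file, where pairs are written as `Name=Value`, joined by `|`, and an empty feature set is `_`. The following is a rough sketch under that format; the helper names `parse_feats` and `feature_stats` and the toy column are hypothetical, not part of any UD tool.

```python
from collections import Counter

def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

def feature_stats(feats_column):
    """Count tokens per feature name and per full feature combination."""
    features = Counter()
    combos = Counter()
    for feats in feats_column:
        combos[feats] += 1
        features.update(parse_feats(feats).keys())
    return features, combos

# Toy FEATS values (made up for illustration, not the treebank counts).
feats_column = [
    "Case=Nom|Gender=Fem|Number=Plur|PronType=Int,Rel",
    "Case=Nom|Gender=Fem|Number=Plur|PronType=Int,Rel",
    "Case=Gen|Number=Sing|PronType=Dem",
]
features, combos = feature_stats(feats_column)
print(features.most_common())
print(combos.most_common(1))
```

Multi-valued features such as `PronType=Int,Rel` are kept as a single value string, which matches how they are listed in the feature-value pairs above.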

Relations

DET nodes are attached to their parents using 14 different relations: det (647; 56% of instances), nsubj (262; 23% of instances), obl (124; 11% of instances), obj (39; 3% of instances), obl:arg (35; 3% of instances), nsubj:pass (32; 3% of instances), conj (7; 1% of instances), amod (5; 0% of instances), xcomp (4; 0% of instances), cc (2; 0% of instances), acl:relcl (1; 0% of instances), dep (1; 0% of instances), det:nummod (1; 0% of instances), orphan (1; 0% of instances)
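The percentages are each relation's count divided by the 1161 DET tokens, rounded to a whole number, which is why rare relations show as 0%. A small sketch that recomputes them from the counts listed above (the variable names are illustrative; the exact rounding rule used by the generating script is assumed to be ordinary rounding to the nearest integer):

```python
# Relation counts for DET nodes, copied from the list above.
relation_counts = {
    "det": 647, "nsubj": 262, "obl": 124, "obj": 39, "obl:arg": 35,
    "nsubj:pass": 32, "conj": 7, "amod": 5, "xcomp": 4, "cc": 2,
    "acl:relcl": 1, "dep": 1, "det:nummod": 1, "orphan": 1,
}

total = sum(relation_counts.values())

# Share of each relation as a whole-number percentage.
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

for rel, n in relation_counts.items():
    print(f"{rel} ({n}; {shares[rel]}% of instances)")
```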

Parents of DET nodes belong to 8 different parts of speech: NOUN (712; 61% of instances), VERB (343; 30% of instances), ADJ (96; 8% of instances), DET (4; 0% of instances), ADV (3; 0% of instances), AUX (1; 0% of instances), NUM (1; 0% of instances), X (1; 0% of instances)

1012 (87%) DET nodes are leaves.

123 (11%) DET nodes have one child.

20 (2%) DET nodes have two children.

6 (1%) DET nodes have three or more children.

The highest child degree of a DET node is 4.
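The child degree of a node is the number of tokens whose HEAD points to it. The sketch below shows one way to build the leaf / one child / two children breakdown for a single sentence; the function name `child_degree_histogram`, the `(id, upos, head)` tuple layout, and the toy sentence are assumptions made for illustration.

```python
from collections import Counter

def child_degree_histogram(sentence, upos="DET"):
    """Histogram of child counts for nodes tagged `upos`.

    `sentence` is a list of (id, upos, head) tuples; head 0 is the root.
    Returns a Counter mapping number of children to number of nodes.
    """
    # How many tokens name each id as their head.
    children = Counter(head for _, _, head in sentence)
    return Counter(
        children[tok_id] for tok_id, tag, _ in sentence if tag == upos
    )

# Toy sentence (made up): token 2 is a DET with one ADP child,
# token 4 is a DET leaf.
sentence = [
    (1, "ADP", 2),
    (2, "DET", 3),
    (3, "VERB", 0),
    (4, "DET", 3),
]
hist = child_degree_histogram(sentence)
print(hist)
```

Summing such histograms over all sentences of a treebank would give totals of the kind reported above, and the largest key would be the highest child degree.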

Children of DET nodes are attached using 15 different relations: case (117; 64% of instances), nmod (12; 7% of instances), acl (10; 5% of instances), cc (10; 5% of instances), conj (7; 4% of instances), acl:relcl (6; 3% of instances), xcomp (6; 3% of instances), punct (5; 3% of instances), amod (2; 1% of instances), fixed (2; 1% of instances), orphan (2; 1% of instances), advmod:emph (1; 1% of instances), ccomp (1; 1% of instances), cop (1; 1% of instances), nsubj (1; 1% of instances)

Children of DET nodes belong to 10 different parts of speech: ADP (117; 64% of instances), NOUN (24; 13% of instances), VERB (15; 8% of instances), CCONJ (10; 5% of instances), PUNCT (5; 3% of instances), DET (4; 2% of instances), ADJ (3; 2% of instances), AUX (3; 2% of instances), ADV (1; 1% of instances), PRON (1; 1% of instances)