
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-CLTT: POS Tags: PRON

There are 4 PRON lemmas (0%), 37 PRON types (1%) and 622 PRON tokens (2%). Out of 15 observed tags, the rank of PRON is: 13 in number of lemmas, 9 in number of types and 10 in number of tokens.

The most frequent PRON lemmas (only 4 are observed): se, jenž, on, veškerý

The 10 most frequent PRON types: se, nichž, němž, jej, němuž, je, jim, jí, jimiž, veškeré

The 10 most frequent ambiguous lemmas:

The 10 most frequent ambiguous types: se (PRON 462, ADP 29), je (AUX 207, PRON 11), jehož (DET 6, PRON 5)

Morphology

The form / lemma ratio of PRON is 9.250000 (the average of all parts of speech is 1.713272).
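The ratio above agrees with the counts reported earlier on this page: 37 PRON types divided by 4 PRON lemmas. The page does not state the formula, so the following is only a minimal sketch under the assumption that the ratio is the number of distinct forms over the number of distinct lemmas; the function name `form_lemma_ratio` is hypothetical.

```python
def form_lemma_ratio(n_forms: int, n_lemmas: int) -> float:
    """Distinct word forms per distinct lemma (assumed definition)."""
    if n_lemmas == 0:
        raise ValueError("a ratio needs at least one lemma")
    return n_forms / n_lemmas


# Counts taken from this page: 37 PRON types, 4 PRON lemmas.
pron_ratio = form_lemma_ratio(37, 4)
print(f"{pron_ratio:.6f}")  # 9.250000
```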

The highest number of forms (15) was observed with the lemma “on”: ho, je, jej, jemu, ji, jim, jimi, jí, nich, nim, nimi, ní, ním, ně, něj.

The second-highest number of forms (14) was observed with the lemma “jenž”: jehož, jenž, jež, jimiž, jímž, nichž, nimž, niž, nímž, níž, nějž, němuž, němž, něž.

The third-highest number of forms (4) was observed with the lemma “se”: se, sebou, si, sobě.

PRON occurs with 9 features: Case (622; 100% of instances), PronType (622; 100% of instances), Reflex (470; 76% of instances), Variant (464; 75% of instances), Number (152; 24% of instances), PrepCase (139; 22% of instances), Gender (90; 14% of instances), Person (69; 11% of instances), Animacy (2; 0% of instances)

PRON occurs with 21 feature-value pairs: Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Gender=Masc,Neut, Gender=Neut, Number=Plur, Number=Sing, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Variant=Short

PRON occurs with 49 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (462 tokens). Examples: se
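A feature combination such as the one above is written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`, with `_` standing for an empty feature set. As an illustration only, a small sketch that splits such a string into feature-value pairs; the helper name `parse_feats` is hypothetical and not part of any UD tool.

```python
from typing import Dict


def parse_feats(feats: str) -> Dict[str, str]:
    """Split a CoNLL-U FEATS string into a {feature: value} mapping."""
    if not feats or feats == "_":
        return {}
    # Multi-valued features such as Gender=Masc,Neut keep their
    # comma-separated value as a single string.
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short"
parsed = parse_feats(combo)
print(parsed["Case"], parsed["Reflex"])  # Acc Yes
```

Counting the distinct keys and the distinct key-value pairs over all PRON tokens would give the kind of feature and feature-value totals listed above.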

Relations

PRON nodes are attached to their parents using 9 different relations: expl:pass (348; 56% of instances), expl:pv (113; 18% of instances), obl (66; 11% of instances), obl:arg (32; 5% of instances), obj (29; 5% of instances), nmod (22; 4% of instances), nsubj (6; 1% of instances), acl:relcl (4; 1% of instances), conj (2; 0% of instances)

Parents of PRON nodes belong to 7 different parts of speech: VERB (517; 83% of instances), ADJ (59; 9% of instances), NOUN (35; 6% of instances), X (8; 1% of instances), ADV (1; 0% of instances), AUX (1; 0% of instances), DET (1; 0% of instances)

538 (86%) PRON nodes are leaves.

76 (12%) PRON nodes have one child.

4 (1%) PRON nodes have two children.

4 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 3.
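The child-degree figures above can be cross-checked against the token count: the four groups add up to the 622 PRON tokens, and since the highest degree is 3, they imply 96 children in total. The sketch below does that arithmetic and then shows, on a hypothetical toy sentence (not taken from the treebank), one way such a distribution could be counted from ID, UPOS and HEAD values; the function name `child_degrees` is likewise hypothetical.

```python
from collections import Counter

# Figures reported on this page: child degree -> number of PRON nodes.
reported = {0: 538, 1: 76, 2: 4, 3: 4}
total_nodes = sum(reported.values())
total_children = sum(degree * count for degree, count in reported.items())
print(total_nodes, total_children)  # 622 96

# Hypothetical toy sentence as (ID, UPOS, HEAD); HEAD 0 marks the root.
tokens = [
    (1, "ADP", 2),
    (2, "PRON", 3),
    (3, "VERB", 0),
    (4, "PRON", 3),
    (5, "PUNCT", 3),
]


def child_degrees(tokens, upos):
    """Map child degree -> number of nodes with the given UPOS tag."""
    # How many tokens name each ID as their head.
    n_children = Counter(head for _, _, head in tokens)
    return Counter(n_children[tok_id] for tok_id, tag, _ in tokens if tag == upos)


degrees = child_degrees(tokens, "PRON")
print(degrees[0], degrees[1])  # 1 1
```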

Children of PRON nodes are attached using 7 different relations: case (79; 82% of instances), cop (4; 4% of instances), nsubj (4; 4% of instances), punct (4; 4% of instances), cc (2; 2% of instances), xcomp (2; 2% of instances), advmod (1; 1% of instances)

Children of PRON nodes belong to 7 different parts of speech: ADP (79; 82% of instances), NOUN (5; 5% of instances), AUX (4; 4% of instances), PUNCT (4; 4% of instances), CCONJ (2; 2% of instances), ADJ (1; 1% of instances), ADV (1; 1% of instances)