Statistics of PRON in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Czech-FicTree: POS Tags: `PRON`

There are 31 PRON lemmas (0%), 137 PRON types (1%) and 14087 PRON tokens (8%). Out of 16 observed tags, the rank of PRON is: 14 in number of lemmas, 8 in number of types and 4 in number of tokens.

The 10 most frequent PRON lemmas: se, on, já, co, ty, nic, něco, kdo, nikdo, někdo

The 10 most frequent PRON types: se, si, mi, co, mě, ho, já, mu, ji, nic

The 10 most frequent ambiguous lemmas: se (PRON 6110, ADP 1), co (PRON 789, ADV 48, PART 34, SCONJ 21), jenž (PRON 123, DET 23), což (PRON 42, PART 1), my (PRON 8, DET 1), copak (PART 11, PRON 6), být (AUX 7488, PRON 1), cože (INTJ 8, PRON 1)

The 10 most frequent ambiguous types: se (PRON 4467, ADP 185), si (PRON 1351, AUX 4), co (PRON 539, ADV 46, SCONJ 21, PART 17), je (AUX 863, PRON 228), ti (PRON 129, DET 31), ty (DET 66, PRON 65), ona (PRON 70, DET 2), my (PRON 38, DET 1), něčím (PRON 11, DET 1), copak (PART 1, PRON 1)

se
- PRON 4467: Daly jsme se do řeči .
- ADP 185: Ale tohle setkání se zachráněným kotětem mi dodalo sílu .
si
- PRON 1351: Lekci , kterou mi šéf dal , jsem si odnesl do života .
- AUX 4: Já zapomněla , že ty si chytřejší a vzdělanější , “ odfrkla znechuceně .
co
- PRON 539: Jednak měla hezké rysy , ovšem to , co jí dodávalo krásu , byl pohled .
- ADV 46: Tu bys měl co nejrychleji najít . “
- SCONJ 21: Jenže od chvíle , co jsem to měl doma , byl jsem rozčilený .
- PART 17: ” A proč chtěla umřít nahá , co ?
je
- AUX 863: ” Kolik jí je ? “
- PRON 228: Každé čtyři roky je musíš sázet znovu .
ti
- PRON 129: ” Ještě je ti zle ? “ zeptala se Ilona .
- DET 31: ” Jsou ti osli ještě k něčemu ? “ zeptal se jeden .
ty
- DET 66: ” Ne , myslím jen ty , které se podobají vám . “
- PRON 65: ” Jiní by se vytahovali , a ty mlčíš .
ona
- PRON 70: Věděla jsem , že ona by z toho měla radost .
- DET 2: On ji měl stále rád , ovšem přišla léta , kdy se u žen dostaví hormonální změny , myslím tím ona kritická léta po čtyřicítce .
my
- PRON 38: Evidentně na tom kole zdrhal pryč a my zůstali stát s otevřenou pusou .
- DET 1: Každý večer vyplouvají naši blízcí na svá moře , do vln svých starostí a trýzní , každý večer čekají na světlo , které jim můžeme rozsvítit jen my .
něčím
- PRON 11: Možná by nebylo od věci něčím se zasytit .
- DET 1: Jenže dělat na něčem dlouho , dávat do toho všecko , co v člověku je , okouzlit , dokopat , zmanipulovat lidi , aby by do toho šli taky , a dokázat , že tomu taky uvěří a že se tomu taky upíšou , a pak najednou vidět , jak to zdechne na něčím psacím stole a provinilý hlas po telefonu , samozvaný cenzor a lhář , prostřednictvím překladatele či prostřednictvím úsměvu vám sdělí šmytec - tak tohle mě uvrhne do zimního spánku .
copak
- PART 1: Copak by dokázal , copak by vůbec mohl všechno opravit ?
- PRON 1: ” A copak tam budete dělat ? “

Morphology

The form / lemma ratio of PRON is 4.419355 (the average of all parts of speech is 1.970842).

The 1st highest number of forms (28) was observed with the lemma “on”: ho, je, jeho, jej, jemu, ji, jich, jim, jimi, jí, jím, mu, ni, nich, nim, nimi, ní, ním, ně, něho, něj, něm, němu, on, ona, oni, ono, ony.

The 2nd highest number of forms (20) was observed with the lemma “jenž”: jehož, jejž, jemuž, jenž, jež, jichž, jimiž, jimž, již, jímž, jíž, nichž, nimiž, niž, nímž, níž, něhož, němuž, němž, něž.

The 3rd highest number of forms (10) was observed with the lemma “já”: já, mi, mne, mnou, mně, my, mě, nám, námi, nás.

PRON occurs with 10 features: PronType (14087; 100% instances), Case (14055; 100% instances), Variant (8362; 59% instances), Number (6174; 44% instances), Reflex (6111; 43% instances), Person (5957; 42% instances), Gender (3591; 25% instances), Animacy (3121; 22% instances), PrepCase (916; 7% instances), Style (8; 0% instances)

PRON occurs with 28 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PrepCase=Npr, PrepCase=Pre, PronType=Ind, PronType=Int,Rel, PronType=Neg, PronType=Prs, PronType=Rel, PronType=Tot, Reflex=Yes, Style=Coll, Variant=Short

PRON occurs with 225 feature combinations. The most frequent feature combination is Case=Acc|PronType=Prs|Reflex=Yes|Variant=Short (4451 tokens). Examples: se

Relations

PRON nodes are attached to their parents using 22 different relations: expl:pv (4577; 32% instances), obj (2617; 19% instances), obl:arg (2299; 16% instances), obl (1793; 13% instances), nsubj (1507; 11% instances), expl:pass (350; 2% instances), nmod (205; 1% instances), root (154; 1% instances), discourse (121; 1% instances), conj (110; 1% instances), dep (83; 1% instances), advcl (62; 0% instances), iobj (61; 0% instances), ccomp (51; 0% instances), nsubj:pass (27; 0% instances), orphan (24; 0% instances), xcomp (14; 0% instances), acl:relcl (12; 0% instances), appos (11; 0% instances), csubj (6; 0% instances), vocative (2; 0% instances), parataxis (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (12958; 92% instances), ADJ (309; 2% instances), NOUN (294; 2% instances), (154; 1% instances), ADV (131; 1% instances), PRON (80; 1% instances), DET (68; 0% instances), AUX (29; 0% instances), NUM (28; 0% instances), PART (26; 0% instances), PROPN (7; 0% instances), INTJ (2; 0% instances), ADP (1; 0% instances)

11450 (81%) PRON nodes are leaves.

2150 (15%) PRON nodes have one child.

187 (1%) PRON nodes have two children.

300 (2%) PRON nodes have three or more children.

The highest child degree of a PRON node is 12.

Children of PRON nodes are attached using 31 different relations: case (1819; 47% instances), punct (529; 14% instances), cop (202; 5% instances), amod (159; 4% instances), xcomp (155; 4% instances), nsubj (138; 4% instances), conj (128; 3% instances), advmod:emph (124; 3% instances), cc (100; 3% instances), mark (78; 2% instances), nmod (75; 2% instances), advmod (74; 2% instances), acl:relcl (50; 1% instances), dep (50; 1% instances), appos (37; 1% instances), orphan (34; 1% instances), advcl (25; 1% instances), det (21; 1% instances), obl (17; 0% instances), det:numgov (11; 0% instances), discourse (10; 0% instances), nummod (10; 0% instances), nummod:gov (8; 0% instances), parataxis (8; 0% instances), aux (7; 0% instances), obl:arg (5; 0% instances), vocative (4; 0% instances), acl (2; 0% instances), ccomp (2; 0% instances), csubj (2; 0% instances), det:nummod (2; 0% instances)

Children of PRON nodes belong to 15 different parts of speech: ADP (1816; 47% instances), PUNCT (529; 14% instances), NOUN (243; 6% instances), AUX (210; 5% instances), ADJ (207; 5% instances), DET (157; 4% instances), VERB (154; 4% instances), ADV (125; 3% instances), CCONJ (125; 3% instances), PART (89; 2% instances), PRON (80; 2% instances), SCONJ (78; 2% instances), NUM (43; 1% instances), PROPN (24; 1% instances), INTJ (6; 0% instances)

Treebank Statistics: UD_Czech-FicTree: POS Tags: PRON

Morphology

Relations

Treebank Statistics: UD_Czech-FicTree: POS Tags: `PRON`