Statistics of PRON in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Estonian-EWT: POS Tags: `PRON`

There are 62 PRON lemmas (1%), 332 PRON types (2%) and 6589 PRON tokens (7%). Out of 17 observed tags, the rank of PRON is: 11 in number of lemmas, 7 in number of types and 5 in number of tokens.

The 10 most frequent PRON lemmas: mina, see, mis, tema, sina, kes, oma, ise, miski, keegi

The 10 most frequent PRON types: ma, see, mis, seda, oma, kes, ta, sa, midagi, mida

The 10 most frequent ambiguous lemmas: mina (PRON 1436, NOUN 12), see (PRON 1399, DET 746), mis (PRON 729, DET 30), oma (PRON 335, ADJ 1, NOUN 1), ise (PRON 254, ADV 149, PROPN 1), miski (PRON 235, DET 14, NOUN 5), keegi (PRON 184, DET 12), kõik (PRON 179, DET 126, ADV 1), teine (DET 92, PRON 89, ADJ 33), muu (PRON 50, DET 29)

The 10 most frequent ambiguous types: see (PRON 422, DET 214), mis (PRON 366, DET 21), seda (PRON 279, DET 81), oma (PRON 284, VERB 3), sa (PRON 182, VERB 1), keegi (PRON 121, DET 11), mina (PRON 88, NOUN 9), kõik (PRON 98, DET 76, X 1), selle (DET 101, PRON 101, ADJ 1), neid (PRON 100, DET 41)

see
- PRON 422: Kurb aga nii see on ….
- DET 214: Minupärast võib see teema lukku minna .
mis
- PRON 366: Füüsilist keha , mis on sulle eluks antud , tuleks ikka austada .
- DET 21: rds567 : Njaa , päris hull , mis aines üldse .
seda
- PRON 279: paljudel tesitel pead seda menüüst tegema .
- DET 81: gudher : Kõige targem oleks seda asja oma enda õpetaja käest küsida .
oma
- PRON 284: Ei tasu koonriga suhet-peret luua kui oma tiivad ei kanna .
- VERB 3: ei ole päris pädev ja ei oma ka volitusi , kuid vahendajaks on siiski Pühakiri ( VT ja VT ) .
sa
- PRON 182: Kas sa pead sellest kirjutama .
- VERB 1: Taruvaigu : Mina ka ei sa aru , miks ei ole täisnime , isikukoodi , aadressi , või vähemalt aadress kuhu auto on pidevalt pargitud ..
keegi
- PRON 121: Et kui keegi juhtub olema , siis andke teada ;D
- DET 11: Oleksin tänulik , kui keegi “ nartsissitark “ natuke nõu annaks .
mina
- PRON 88: Mulle meeldib , kui tema on mees ja mina naine .
- NOUN 9: järtelikult ei saaks sinu mina edasi arvutisse kanda …
kõik
- PRON 98: Ma arvan et selle mõttega on kõik nõus
- DET 76: KAs sul on need kõik materiaalid arvutis või lihtsalt paberil ?
- X 1: Üks kõik mida ta ka ei teinud , sai naine ikka surma ( erinevatel viisidel , kui tulemus oli olemas ) .
selle
- DET 101: onia fotopoe peded korrutavad nagunii selle hinna 2X .
- PRON 101: Muumak : minu meelest ka , selle nimi on pigem mudel !
- ADJ 1: kirusin sl õhtulehte põhiliselt , muuhulgas arendasin edasi ühe siit foorumist bännitu , rate lisainfot ( selle aegaset ) ..
neid
- PRON 100: Klaasist akende taga neid on rohkem veel
- DET 41: spiiker : KAS neid kõiki ühe teema alla ei saa ?

Morphology

The form / lemma ratio of PRON is 5.354839 (the average of all parts of speech is 1.733800).

The 1st highest number of forms (34) was observed with the lemma “see”: asee, need, neeed, neid, neidki, neidt, neil, neile, neist, nende, nendega, nendegagi, nendelt, nendest, se, seda, sedagi, see, seed, seegi, sel, selle, sellega, selleks, sellel, sellele, sellelt, selleni, selles, sellesse, sellest, sellestki, selleta, sest.

The 2nd highest number of forms (33) was observed with the lemma “mina”: Mede, ma, me, meid, meie, meiega, meieni, meil, meile, meis, meisse, meist, mina, minagi, mind, minu, minuga, minugi, minul, minule, minult, minuna, minuni, minus, minusse, minust, mlle, mu, muga, mul, mulle, mult, must.

The 3rd highest number of forms (28) was observed with the lemma “tema”: nad, neid, neil, neile, neilt, neist, nemad, nende, nendega, nendegi, nendel, nendele, nendelt, ta, taga, tal, talle, tast, teda, tema, temaga, temal, temale, temalt, temas, temasse, temast, ts.

PRON occurs with 11 features: PronType (6589; 100% instances), Number (6582; 100% instances), Case (6581; 100% instances), Person (2601; 39% instances), Poss (329; 5% instances), Reflex (279; 4% instances), Typo (33; 1% instances), ExtPos (16; 0% instances), Abbr (4; 0% instances), Polarity (2; 0% instances), Gender (1; 0% instances)

PRON occurs with 38 feature-value pairs: Abbr=Yes, Case=Abe, Case=Abl, Case=Acc, Case=Add, Case=Ade, Case=All, Case=Com, Case=Ela, Case=Ess, Case=Gen, Case=Ill, Case=Ine, Case=Nom, Case=Par, Case=Ter, Case=Tra, ExtPos=ADV, ExtPos=PRON, ExtPos=SCONJ, Gender=Neut, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Polarity=Neg, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Int,Rel, PronType=Prs, PronType=Rcp, PronType=Rel, PronType=Tot, Reflex=Yes, Typo=Yes

PRON occurs with 209 feature combinations. The most frequent feature combination is Case=Nom|Number=Sing|Person=1|PronType=Prs (680 tokens). Examples: ma, mina, minagi, I, vot

Relations

PRON nodes are attached to their parents using 25 different relations: nsubj (2115; 32% instances), obl (1052; 16% instances), obj (1045; 16% instances), nmod (993; 15% instances), nsubj:cop (727; 11% instances), root (256; 4% instances), conj (172; 3% instances), advcl (54; 1% instances), ccomp (30; 0% instances), parataxis (21; 0% instances), acl:relcl (20; 0% instances), acl (19; 0% instances), obl:agent (19; 0% instances), dep (16; 0% instances), det (14; 0% instances), amod (7; 0% instances), xcomp (7; 0% instances), advmod (6; 0% instances), orphan (6; 0% instances), csubj (4; 0% instances), appos (2; 0% instances), fixed (1; 0% instances), mark (1; 0% instances), nmod:poss (1; 0% instances), reparandum (1; 0% instances)

Parents of PRON nodes belong to 12 different parts of speech: VERB (4160; 63% instances), NOUN (1383; 21% instances), ADJ (339; 5% instances), (256; 4% instances), PRON (201; 3% instances), ADV (166; 3% instances), PROPN (58; 1% instances), NUM (19; 0% instances), AUX (2; 0% instances), DET (2; 0% instances), SYM (2; 0% instances), INTJ (1; 0% instances)

5281 (80%) PRON nodes are leaves.

699 (11%) PRON nodes have one child.

173 (3%) PRON nodes have two children.

436 (7%) PRON nodes have three or more children.

The highest child degree of a PRON node is 18.

Children of PRON nodes are attached using 31 different relations: punct (483; 16% instances), nsubj:cop (345; 11% instances), advmod (319; 11% instances), cop (313; 10% instances), case (243; 8% instances), acl (209; 7% instances), conj (142; 5% instances), cc (139; 5% instances), acl:relcl (129; 4% instances), nmod (117; 4% instances), mark (90; 3% instances), obl (84; 3% instances), parataxis (64; 2% instances), amod (63; 2% instances), det (53; 2% instances), advcl (47; 2% instances), aux (47; 2% instances), orphan (36; 1% instances), discourse (30; 1% instances), fixed (17; 1% instances), csubj:cop (16; 1% instances), appos (9; 0% instances), vocative (6; 0% instances), dep (4; 0% instances), nsubj (4; 0% instances), cc:preconj (3; 0% instances), ccomp (3; 0% instances), goeswith (3; 0% instances), nummod (2; 0% instances), reparandum (1; 0% instances), xcomp (1; 0% instances)

Children of PRON nodes belong to 16 different parts of speech: PUNCT (483; 16% instances), NOUN (449; 15% instances), ADV (397; 13% instances), VERB (372; 12% instances), AUX (361; 12% instances), ADP (243; 8% instances), PRON (201; 7% instances), CCONJ (139; 5% instances), ADJ (106; 4% instances), PROPN (95; 3% instances), SCONJ (84; 3% instances), DET (52; 2% instances), INTJ (25; 1% instances), NUM (8; 0% instances), X (4; 0% instances), SYM (3; 0% instances)

Treebank Statistics: UD_Estonian-EWT: POS Tags: PRON

Morphology

Relations

Treebank Statistics: UD_Estonian-EWT: POS Tags: `PRON`