Statistics of PRON in UD

home edit page issue tracker

This page pertains to UD version 2.

It appears that you have Javascript disabled. Please consider enabling Javascript for this page to see the visualizations.

Treebank Statistics: UD_Beja-NSC: POS Tags: `PRON`

There are 1 PRON lemmas (6%), 60 PRON types (5%) and 395 PRON tokens (7%). Out of 16 observed tags, the rank of PRON is: 11 in number of lemmas, 3 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: =heːb, =i, =oː, ani, =eː, kna, =hoːk, =oːk, =joː, =ji

The 10 most frequent ambiguous lemmas: _ (PUNCT 1126, VERB 1097, DET 933, NOUN 894, ADP 408, PRON 395, SCONJ 298, PART 167, CCONJ 160, AUX 125, ADV 104, ADJ 77, PROPN 32, INTJ 28, NUM 26, X 18)

The 10 most frequent ambiguous types: =i (ADP 61, PRON 48, AUX 12, SCONJ 10), ani (PRON 27, VERB 9, AUX 2), =eː (PRON 23, ADP 13, SCONJ 12, DET 1), =ji (PRON 9, ADP 7, SCONJ 3, AUX 1), =jeː (SCONJ 13, PRON 6, ADP 3), hoː (PRON 6, NOUN 1), =eːk (SCONJ 10, PRON 2), i= (DET 145, PRON 2, SCONJ 2), jhaː (PRON 2, INTJ 1), nafs (NOUN 2, PRON 2)

=i
- ADP 61: jhakseːtiːt / i= mbaːba =i dhaːj haːj ɖaːbiːni eːn //
- PRON 48: tak / kaːm =i / hoː kʷiɖja ini //
- AUX 12: daːjiː =t =i diːtiːt / j= ʔar han hus ikatina /
- SCONJ 10: oːn ajhan =i gabal =eː =ka harʔiː sa~sakja /
ani
- PRON 27: ʃamattan =i =ji ʃamat =eː =ka ani i= mhiːn =i naːjeː mhan /
- VERB 9: tak rhita tini =oː =hoːb ? oː= tak rhan ani /
- AUX 2: uːn i= bissa / tikʷ / t= ʔarabijaːj =t =iː ani =hoːb / bass whiː ingad /
=eː
- PRON 23: winneːt ʔareːji eːn / ʔakra reːr // mhaj koː =jeː j= ʔar =eː //
- ADP 13: ʃamattan =i =ji ʃamat =eː =ka ani i= mhiːn =i naːjeː mhan /
- SCONJ 12: oː= kna hoːj bi= ibarin =eː =na ki= thaːj eːn /
- DET 1: uː= tak areː / ti= ndeː =t =i =da jʔeːtiːt / w= ʔoːr =oːk rhan / ti= karaːma =t =eː firar# / ti= tifirʔi =jeːt iktimna / afirha =b akajeː =wa / i= dhaj =iːb / hawaːjeː =wa rhan indi eːn //
=ji
- PRON 9: uːn uː= tak / doːr han kanaː =ji ki= iki / ti= takat hiːs =heːb =ajt //
- ADP 7: diweː / eːn i= miːmaʃa =ji / dhaj itfirʔin =eːb hiːsi ini //
- SCONJ 3: ʃamattan =i =ji ʃamat =eː =ka ani i= mhiːn =i naːjeː mhan /
- AUX 1: haːl =oːkna =ji eːdn =hoːb / uː= tak ʔakra mhijeː / duːr =uːn duːr =uːn bass w= ʔoːr hiːsi =hoːk iːd =eːt toː= na nuːn /
=jeː
- SCONJ 13: manniima ini =jeːb / i= manniimti =iː imri =jeː =na tikati //
- PRON 6: winneːt ʔareːji eːn / ʔakra reːr // mhaj koː =jeː j= ʔar =eː //
- ADP 3: ʃakʷiːn =t dhaːj dannʔi eːn // eːn i= taktʔi =jeː =da //
hoː
- PRON 6: tak / kaːm =i / hoː kʷiɖja ini //
- NOUN 1: hoː =b hoːs =oː ʃʔagaː =b =u uː= tak // ʔasalaː =b iːkti =jeːb /
=eːk
- SCONJ 10: ɖa~ɖibti =eːk wari =t hariwa naː =t teːtgiːm =eːk /
- PRON 2: ɖa~ɖibti =eːk wari =t hariwa naː =t teːtgiːm =eːk /
i=
- DET 145: agar jʔan =t i= gaw =i /
- PRON 2: i= baːgi oːn / i= girma =wwa / i= ragada i= suːriː =b =wa / bak ifdin igid / ʃʔiː uː= jhaːm ajaː =b =u =it /
- SCONJ 2: eːn i= gabal i= tʔa / hadiːdti =jeː =naː =iːb // riba eːfeːn /
jhaː
- PRON 2: jaː iraːni / bak tʔiit =eː =na ini =hoːb jhaː / oːn oː= gaw suːriː / dhaj hanka =oːk // hoːj akaː =b =a /
- INTJ 1: uːn w= ʔeːga / daːjeːb bak tageːgeːtiːt / tak ikati =hoːb // jhaː ʔaːw =wa indi eːn baruːk /
nafs
- NOUN 2: i= tak =iː =ka ʔaɖami =ka =b akajeː i= nafs =i rhi /
- PRON 2: ti= ʃartija / kaːm =i meːs =iːt / i= nafs =i hasama =b akajeː amri /

Morphology

The form / lemma ratio of PRON is 60.000000 (the average of all parts of speech is 76.500000).

The 1st highest number of forms (60) was observed with the lemma “_”: =aː, =aːk, =b, =eː, =eːk, =heːb, =hi, =hoːk, =hoːn, =i, =iheː, =ihi, =iji, =ijoː, =ijoːk, =iːsi, =iːsiː, =iːsoː, =jaː, =jeː, =ji, =joː, =joːk, =juːk, =oː, =oːk, =oːkna, =oːn, =saj, =siːsi, =t, =uː, =uːn, aneːb, ani, aniː, barijoːk, barjoː, baroːk, baruː, baruːk, beːn, hinin, hoː, i=, imbareː, jhaː, ji=, kina, kna, nafs, naː, naːn, oːn, ti=, umbaruː, umbaruːk, wi=, ʔani, ʔaːw.

PRON occurs with 11 features: Number (360; 91% instances), Person (353; 89% instances), Case (286; 72% instances), Poss (193; 49% instances), Gender (28; 7% instances), Reflex (19; 5% instances), PronType (10; 3% instances), Polite (9; 2% instances), PartType (6; 2% instances), Definite (3; 1% instances), Deixis (2; 1% instances)

PRON occurs with 22 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Definite=Def, Deixis=Prox, Deixis=Remt, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, PartType=Int, Person=1, Person=2, Person=3, Polite=Form, Poss=Yes, PronType=Dem, PronType=Rel, Reflex=Yes

PRON occurs with 51 feature combinations. The most frequent feature combination is Number=Sing|Person=1 (73 tokens). Examples: =heːb, =oː, hoː, =joː, aneːb

Relations

PRON nodes are attached to their parents using 19 different relations: nmod:poss (142; 36% instances), obj (112; 28% instances), obl:mod (36; 9% instances), nsubj (35; 9% instances), nmod (14; 4% instances), dep:comp (13; 3% instances), discourse (9; 2% instances), dislocated:subj (9; 2% instances), iobj (7; 2% instances), obl:arg (4; 1% instances), acl:relcl (2; 1% instances), dislocated (2; 1% instances), dislocated:obj (2; 1% instances), reparandum (2; 1% instances), vocative (2; 1% instances), dep (1; 0% instances), dep:conj (1; 0% instances), parataxis (1; 0% instances), root (1; 0% instances)

Parents of PRON nodes belong to 13 different parts of speech: VERB (176; 45% instances), NOUN (150; 38% instances), ADP (32; 8% instances), ADJ (15; 4% instances), PRON (11; 3% instances), PROPN (3; 1% instances), PART (2; 1% instances), ADV (1; 0% instances), AUX (1; 0% instances), DET (1; 0% instances), NUM (1; 0% instances), (1; 0% instances), X (1; 0% instances)

326 (83%) PRON nodes are leaves.

43 (11%) PRON nodes have one child.

21 (5%) PRON nodes have two children.

5 (1%) PRON nodes have three or more children.

The highest child degree of a PRON node is 4.

Children of PRON nodes are attached using 13 different relations: punct (37; 36% instances), det (30; 29% instances), discourse (10; 10% instances), nmod:poss (7; 7% instances), case (5; 5% instances), cc (4; 4% instances), dep (2; 2% instances), dep:comp (2; 2% instances), nmod (2; 2% instances), cop (1; 1% instances), dep:conj (1; 1% instances), dislocated:subj (1; 1% instances), vocative (1; 1% instances)

Children of PRON nodes belong to 11 different parts of speech: PUNCT (37; 36% instances), DET (35; 34% instances), PRON (11; 11% instances), ADP (7; 7% instances), CCONJ (4; 4% instances), PART (3; 3% instances), INTJ (2; 2% instances), AUX (1; 1% instances), NOUN (1; 1% instances), SCONJ (1; 1% instances), X (1; 1% instances)

Treebank Statistics: UD_Beja-NSC: POS Tags: PRON

Morphology

Relations

Treebank Statistics: UD_Beja-NSC: POS Tags: `PRON`