home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Old_Occitan-CorAG: POS Tags: PRON

There are 1 PRON lemmas (7%), 260 PRON types (4%) and 3493 PRON tokens (8%). Out of 14 observed tags, the rank of PRON is: 10 in number of lemmas, 6 in number of types and 6 in number of tokens.

The 10 most frequent PRON lemmas: _

The 10 most frequent PRON types: qui, que, lo, se, nos, y, so, los, autre, s’

The 10 most frequent ambiguous lemmas: _ (NOUN 8359, ADP 6278, VERB 5468, DET 5372, PUNCT 4269, PRON 3493, CCONJ 3134, SCONJ 2046, ADV 1984, PROPN 1865, ADJ 1418, AUX 1213, NUM 436, PART 54)

The 10 most frequent ambiguous types: qui (PRON 374, ADV 10, SCONJ 4), que (SCONJ 1100, PRON 190, PART 15, CCONJ 1), lo (DET 782, PRON 167), se (PRON 157, SCONJ 3, ADP 1, DET 1), so (PRON 81, DET 6, ADV 1, AUX 1), los (DET 260, PRON 84, ADJ 1), autre (PRON 79, DET 60, ADJ 26), -los (DET 291, PRON 73), en (ADP 686, PRON 70, NOUN 44, ADV 2), l’ (DET 199, PRON 70)

Morphology

The form / lemma ratio of PRON is 260.000000 (the average of all parts of speech is 457.357143).

The 1st highest number of forms (260) was observed with the lemma “_”: ‘n, -aqueg, -aquetz, -cui, -en, -i, -l, -l’, -la, -lh, -lo, -loquau, -loquoau, -lor, -los, -losquals, -losquaus, -losquoaus, -ls, -lui, -m, -mi, -n, -nos, -que, -queg, -quegs, -quere, -quetz, -qui, -s, -se, -so, -sso, -u, -us, -y, .i.-, 1, 1., Aquestz, Aso, Ques, Quest, a, a-, ac, aceg, ag, aiso, aqued, aquedz, aqueg, aquegs, aquera, aqueras, aquere, aqueres, aquero, aques, aquest, aquestes, aquet, aquets, aquetz, aquez, aquo, arre, arres, asso, atal, atau, ataus, aucun, aucunas, aucuns, augu, augun, auguns, aute, autes, autras, autre, autres, autruy, aço, bos, cadan, cadaun, cadaune, cadeunes, cant, cascun, cascune, cascuns, ceys, coey, cui, cuy, degun, degune, don, dont, dos, ed, edz, eg, egs, en, entramps, eras, ere, etz, guere, hac, hi, hoc, hom, home, homi, i, id, ii, io, jo, jo-, jui, l’, la, la-, laqual, laquau, laquoal, laquoau, las, las(quas), lascals, lasquas, lasquaus, lasquoaus, le, li, liquas, liquaus, lo, lo-, loos, loqual, loquau, loquoal, loquoau, loquoaus, lor, lors, los, losquals, losquas, losquaus, losquoals, losquoaus, lui, lur, luy, l’, m’, me, medeys, medix, medixe, menhs, mes, mi, molt, molts, n’, ne, neg, negu, negun, no, nos, nostre, nostres, nulh, nulhs, o, om, on, ont, or, ou, paucs, plus, plusors, propria, qe, qual, qual-, quant, quant-, quascun, quatre, quau, quaus, quaus-, que, que-, queg, quegs, quere, queres, questa, questas, quetz, qui, qui-, quo, quoal, quoant, quoau, quoaus, re, ren, res, s’, se, sengles, si, si-, sii, so, so-, soe, soes, son, sons, sont, sonx, sos, ss’, sso, t’, tantes, tau, tot, totes, totz, trop, tropes, trops, u, un, unas, une, ung, uns, vos, vostres, y, yo.

PRON occurs with 7 features: PronType (2839; 81% instances), Person (1605; 46% instances), Number (1591; 46% instances), Gender (1265; 36% instances), Reflex (267; 8% instances), ExtPos (27; 1% instances), Poss (6; 0% instances)

PRON occurs with 16 feature-value pairs: ExtPos=ADV, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Dem, PronType=Ind, PronType=Int, PronType=Neg, PronType=Prs, PronType=Rel, Reflex=Yes

PRON occurs with 44 feature combinations. The most frequent feature combination is _ (633 tokens). Examples: nos, que, qui, vos, luy, edz, se, ed, los, lor

Relations

PRON nodes are attached to their parents using 19 different relations: nsubj (979; 28% instances), obl (874; 25% instances), obj (711; 20% instances), expl (334; 10% instances), iobj (291; 8% instances), conj (116; 3% instances), nmod (90; 3% instances), advmod (27; 1% instances), root (17; 0% instances), dislocated (15; 0% instances), fixed (11; 0% instances), appos (7; 0% instances), acl:relcl (6; 0% instances), orphan (5; 0% instances), ccomp (4; 0% instances), acl (2; 0% instances), advcl (2; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of PRON nodes belong to 8 different parts of speech: VERB (3133; 90% instances), NOUN (200; 6% instances), PRON (67; 2% instances), ADJ (40; 1% instances), (17; 0% instances), PROPN (15; 0% instances), ADV (12; 0% instances), ADP (9; 0% instances)

2465 (71%) PRON nodes are leaves.

604 (17%) PRON nodes have one child.

269 (8%) PRON nodes have two children.

155 (4%) PRON nodes have three or more children.

The highest child degree of a PRON node is 8.

Children of PRON nodes are attached using 19 different relations: case (623; 37% instances), acl:relcl (212; 13% instances), det (146; 9% instances), punct (122; 7% instances), cc (118; 7% instances), conj (115; 7% instances), nmod (71; 4% instances), fixed (58; 3% instances), amod (51; 3% instances), appos (39; 2% instances), acl (26; 2% instances), cop (26; 2% instances), advmod (22; 1% instances), nsubj (18; 1% instances), orphan (18; 1% instances), obl (7; 0% instances), mark (5; 0% instances), advcl (2; 0% instances), dislocated (1; 0% instances)

Children of PRON nodes belong to 12 different parts of speech: ADP (620; 37% instances), VERB (272; 16% instances), NOUN (175; 10% instances), DET (146; 9% instances), PUNCT (122; 7% instances), CCONJ (117; 7% instances), PRON (67; 4% instances), ADJ (60; 4% instances), AUX (32; 2% instances), PROPN (28; 2% instances), ADV (26; 2% instances), SCONJ (15; 1% instances)