Treebank Statistics: UD_Occitan-TTB: POS Tags: PRON
There are 88 PRON lemmas (2%), 141 PRON types (2%) and 1915 PRON tokens (7%).
Out of 16 observed tags, the rank of PRON is: 7 in number of lemmas, 6 in number of types and 6 in number of tokens.
The 10 most frequent PRON lemmas: se, que, lo, me, te, i, ne, li, aquò, çò
The 10 most frequent PRON types: que, se, s’, i, te, li, qu’, me, n’, lo
The 10 most frequent ambiguous lemmas: se (PRON 339, SCONJ 51, ADP 1, ADV 1, PART 1), que (PRON 294, SCONJ 248, PART 132, ADV 34, CCONJ 11, ADP 2), lo (DET 2283, PRON 184), i (PRON 116, INTJ 8), ne (PRON 93, ADV 19, PART 1), aquò (PRON 48, VERB 1), çò (PRON 41, NOUN 1), el (PRON 34, NOUN 2), nos (PRON 33, NOUN 1), un (DET 535, PRON 30, ADV 1)
The 10 most frequent ambiguous types: que (PRON 224, SCONJ 206, PART 70, ADV 16, CCONJ 9, ADP 2), se (PRON 182, SCONJ 30, ADP 1, ADV 1, PART 1), s’ (PRON 108, SCONJ 7), i (PRON 86, INTJ 7), qu’ (PRON 67, SCONJ 42, PART 14, ADV 11, CCONJ 2), n’ (PRON 47, ADV 8, PART 2), lo (DET 738, PRON 48), aquò (PRON 36, VERB 1), l’ (DET 295, PRON 45), la (DET 564, PRON 34, ADV 4)
- que
- PRON 224: - Digas , que vòls ?
- SCONJ 206: Las estelas dins lo cèl beluguejavan mai que pus .
- PART 70: Rénder l’ amna que va , rénder l’ amna non vòu pas .
- ADV 16: Sénher Dieu , que de malurs e que de misèrias !
- CCONJ 9: ça que la èra un òme fòrt e hardit .
- ADP 2: - Mossur , ça diguèt la domaisèla , que sètz vengut far ací ?
- se
- PRON 182: Los Dracons se tornèron sarrar e baissavan lo cap .
- SCONJ 30: Mon Dieu , se me desliuratz d’ aquí me farai sòr de convent .
- ADP 1: Puèi l’ avèm daissat amargenar a l’ ombra , mentre que nautres anàvem salcissar sus un rol qu’ èra aquí a posita , coma se nos esperava de tota eternitat .
- ADV 1: Qu’ ei estat filmat en Varossa , aquera vath de eras Hautas Pireneas vesia de eth Comenge , dab monde de eth país qu’ explican se quin aquera societat montanhòla e virava tota a eth torn de era fabricacion de eth hormatge .
- PART 1: Aqueth documentari navèth que vos harà tanben pujar en eths cortaus de era Varossa tà véder se quin trebalh èra de hèr e de conservar eth hormatge aciu-haut .
- s’
- i
- qu’
- PRON 67: - Femna , ça diguèt lo marit , qu’ es aquel ostalet que vesi là-bas ?
- SCONJ 42: N’ aviái ja un de vtt , mas totcòp decidiguèri qu’ aviá fach son temps .
- PART 14: E fin finala , Casimir qu’ ei lhèu bèthlèu a lo cap … “
- ADV 11: E avèm dich : pòt pas èstre qu’ el !
- CCONJ 2: Sa decision , l’ aviá presa dempuèi lo matin , enrantelada qu’ èra d’ aquela dolor contunha que li pegava a l’ anma .
- n’
- lo
- aquò
- l’
- la
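The ambiguity counts above can be approximated with a short script. This is a minimal sketch, not the tool that produced this page: it assumes the treebank is available as standard 10-column CoNLL-U text, and it assumes that forms are compared case-insensitively. The names `read_tokens`, `ambiguous_types` and `SAMPLE` are hypothetical, and the sample rows are hand-made toy data, not lines from the treebank.

```python
from collections import Counter, defaultdict


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each regular token line of CoNLL-U text."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. "1-2") and empty nodes (e.g. "1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def ambiguous_types(conllu_text, upos="PRON"):
    """Return tag counts for every form seen with `upos` and at least one other tag."""
    tags_by_form = defaultdict(Counter)
    for form, _lemma, tag in read_tokens(conllu_text):
        # Case folding is an assumption of this sketch.
        tags_by_form[form.lower()][tag] += 1
    return {
        form: dict(tags)
        for form, tags in tags_by_form.items()
        if upos in tags and len(tags) > 1
    }


# Toy rows (ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC).
SAMPLE = "\n".join("\t".join(row) for row in [
    ("1", "que", "que", "PRON", "_", "_", "2", "obj", "_", "_"),
    ("2", "vòls", "voler", "VERB", "_", "_", "0", "root", "_", "_"),
    ("3", "Que", "que", "SCONJ", "_", "_", "2", "mark", "_", "_"),
    ("4", "lo", "lo", "DET", "_", "_", "2", "det", "_", "_"),
])

print(ambiguous_types(SAMPLE))
```

Passing the full text of a treebank file instead of `SAMPLE` would yield one entry per ambiguous type, which can then be sorted by the PRON count to reproduce a ranking like the one above.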
Morphology
The form / lemma ratio of PRON is 1.602273 (the average of all parts of speech is 1.368971).
The highest number of forms (9) was observed with the lemma “lo”: -la, -lo, l’, la, las, lei, leis, lo, los.
The 2nd highest number of forms (6) was observed with the lemma “ne”: ‘n, en, n, n’, n’en, ne.
The 3rd highest number of forms (5) was observed with the lemma “el”: el, ela, eles, elis, eu.
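The form / lemma ratio reported above equals the number of PRON types divided by the number of PRON lemmas (141 / 88). The sketch below checks that arithmetic and shows how forms can be grouped by lemma. The function names are hypothetical; the form lists are the three given above, so the ratio computed on them covers only those three lemmas.

```python
from collections import defaultdict


def forms_per_lemma(pairs):
    """Group distinct forms under each lemma from (form, lemma) pairs."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


def form_lemma_ratio(pairs):
    """Number of distinct forms divided by number of distinct lemmas."""
    pairs = list(pairs)
    types = {form for form, _lemma in pairs}
    lemmas = {lemma for _form, lemma in pairs}
    return len(types) / len(lemmas)


# Forms listed above for the three lemmas with the most forms.
LISTED = {
    "lo": ["-la", "-lo", "l’", "la", "las", "lei", "leis", "lo", "los"],
    "ne": ["‘n", "en", "n", "n’", "n’en", "ne"],
    "el": ["el", "ela", "eles", "elis", "eu"],
}
PAIRS = [(form, lemma) for lemma, forms in LISTED.items() for form in forms]

grouped = forms_per_lemma(PAIRS)
print({lemma: len(forms) for lemma, forms in grouped.items()})

# Whole-treebank figure: 141 PRON types over 88 PRON lemmas.
print(round(141 / 88, 6))
```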
PRON occurs with 1 feature: ExtPos (3; 0% instances)
PRON occurs with 1 feature-value pair: ExtPos=ADV
PRON occurs with 2 feature combinations.
The most frequent feature combination is _ (1912 tokens).
Examples: que, se, s’, i, te, li, qu’, me, n’, lo
Relations
PRON nodes are attached to their parents using 21 different relations: obj (547; 29% instances), expl (378; 20% instances), nsubj (374; 20% instances), iobj (270; 14% instances), obl (180; 9% instances), nmod (32; 2% instances), root (32; 2% instances), dislocated (24; 1% instances), fixed (22; 1% instances), conj (11; 1% instances), advcl (9; 0% instances), parataxis (8; 0% instances), vocative (7; 0% instances), acl (5; 0% instances), xcomp (4; 0% instances), appos (3; 0% instances), ccomp (3; 0% instances), advmod (2; 0% instances), dep (2; 0% instances), discourse (1; 0% instances), orphan (1; 0% instances)
Parents of PRON nodes fall into 12 categories, counting root nodes that have no parent: VERB (1724; 90% instances), NOUN (70; 4% instances), ADJ (33; 2% instances), [root, no parent] (32; 2% instances), PRON (16; 1% instances), ADV (15; 1% instances), ADP (13; 1% instances), DET (5; 0% instances), X (3; 0% instances), INTJ (2; 0% instances), NUM (1; 0% instances), PROPN (1; 0% instances)
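As a consistency check, the relation counts and the parent part-of-speech counts should each sum to the 1915 PRON tokens, and the percentages should follow from rounding each share. The sketch below hard-codes the figures from this page. The key `"(root)"` is a label chosen here for the entry without a part-of-speech name, on the assumption that it covers the 32 root-attached nodes; `shares` is a hypothetical helper.

```python
# Incoming relations of PRON nodes, as listed on this page.
RELATIONS = {
    "obj": 547, "expl": 378, "nsubj": 374, "iobj": 270, "obl": 180,
    "nmod": 32, "root": 32, "dislocated": 24, "fixed": 22, "conj": 11,
    "advcl": 9, "parataxis": 8, "vocative": 7, "acl": 5, "xcomp": 4,
    "appos": 3, "ccomp": 3, "advmod": 2, "dep": 2, "discourse": 1,
    "orphan": 1,
}

# Parts of speech of the parents; "(root)" is an assumed label for the
# unnamed entry (nodes with no parent).
PARENT_POS = {
    "VERB": 1724, "NOUN": 70, "ADJ": 33, "(root)": 32, "PRON": 16,
    "ADV": 15, "ADP": 13, "DET": 5, "X": 3, "INTJ": 2, "NUM": 1,
    "PROPN": 1,
}


def shares(counts):
    """Return each count as a whole-number percentage of the total."""
    total = sum(counts.values())
    return {key: round(100 * n / total) for key, n in counts.items()}


print(sum(RELATIONS.values()), sum(PARENT_POS.values()))
print(shares(RELATIONS)["obj"], shares(PARENT_POS)["VERB"])
```

Both totals come out equal, and the equal counts for the `root` relation and the unnamed parent category support reading that category as root nodes.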
1582 (83%) PRON nodes are leaves.
187 (10%) PRON nodes have one child.
92 (5%) PRON nodes have two children.
54 (3%) PRON nodes have three or more children.
The highest child degree of a PRON node is 6.
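The child-degree figures above can be derived from the HEAD column of a dependency treebank by counting, for each PRON node, how many tokens name it as their head. The following is a minimal sketch under that assumption: `child_degree_histogram` is a hypothetical name, and the input is a toy sentence given as (id, upos, head) triples, not data from the treebank.

```python
from collections import Counter


def child_degree_histogram(sentences, upos="PRON"):
    """Count how many `upos` nodes have 0, 1, 2, ... children.

    Each sentence is a list of (token_id, upos_tag, head_id) triples,
    with head_id 0 marking the root.
    """
    hist = Counter()
    for sent in sentences:
        # Number of tokens that point at each head within this sentence.
        n_children = Counter(head for _tok_id, _tag, head in sent)
        for tok_id, tag, _head in sent:
            if tag == upos:
                hist[n_children[tok_id]] += 1
    return hist


# Toy sentence: token 1 (PRON, root) governs tokens 3 and 4;
# token 2 (PRON) is a leaf attached to token 3.
TOY = [[(1, "PRON", 0), (2, "PRON", 3), (3, "VERB", 1), (4, "PUNCT", 1)]]

hist = child_degree_histogram(TOY)
print(dict(hist))

# The four buckets reported on this page cover every PRON token.
print(1582 + 187 + 92 + 54)
```

The largest key of the histogram corresponds to the highest child degree, and the count stored under key 0 corresponds to the number of leaves.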
Children of PRON nodes are attached using 24 different relations: punct (173; 31% instances), case (93; 17% instances), acl (63; 11% instances), det (39; 7% instances), nmod (39; 7% instances), cop (31; 6% instances), advmod (24; 4% instances), mark (18; 3% instances), amod (14; 3% instances), parataxis (11; 2% instances), cc (10; 2% instances), nsubj (8; 1% instances), obl (7; 1% instances), conj (6; 1% instances), advcl (5; 1% instances), fixed (5; 1% instances), appos (4; 1% instances), orphan (3; 1% instances), ccomp (1; 0% instances), csubj (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), vocative (1; 0% instances), xcomp (1; 0% instances)
Children of PRON nodes belong to 13 different parts of speech: PUNCT (173; 31% instances), ADP (97; 17% instances), VERB (71; 13% instances), NOUN (53; 9% instances), DET (39; 7% instances), AUX (32; 6% instances), ADV (25; 4% instances), ADJ (17; 3% instances), PRON (16; 3% instances), SCONJ (16; 3% instances), CCONJ (11; 2% instances), PROPN (8; 1% instances), NUM (1; 0% instances)