Treebank Statistics: UD_Irish-IDT: POS Tags: PRON
There are 36 PRON lemmas (0%), 61 PRON types (0%) and 3621 PRON tokens (3%).
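Counts like these can be reproduced from a CoNLL-U file by collecting distinct lemmas, distinct word forms (types) and running tokens per UPOS tag. The following is a minimal sketch, not the script that generated this page: the sample sentences are hypothetical and not taken from UD_Irish-IDT, and lowercasing forms before counting types is an assumption.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment for illustration only (not from UD_Irish-IDT).
SAMPLE = """\
1\tChonaic\tfeic\tVERB\t_\t_\t0\troot\t_\t_
2\tsé\tsé\tPRON\t_\t_\t1\tnsubj\t_\t_
3\tí\tí\tPRON\t_\t_\t1\tobj\t_\t_
4\t.\t.\tPUNCT\t_\t_\t1\tpunct\t_\t_

1\tChonaic\tfeic\tVERB\t_\t_\t0\troot\t_\t_
2\tSé\tsé\tPRON\t_\t_\t1\tnsubj\t_\t_
3\tiad\tiad\tPRON\t_\t_\t1\tobj\t_\t_
"""

def read_tokens(text):
    """Yield (form, lemma, upos) for every basic token line."""
    for line in text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence boundary or comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if "-" in cols[0] or "." in cols[0]:
            continue
        yield cols[1], cols[2], cols[3]

def pos_counts(text):
    """Return {upos: (n_lemmas, n_types, n_tokens)}."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for form, lemma, upos in read_tokens(text):
        lemmas[upos].add(lemma)
        types[upos].add(form.lower())  # assumption: types are case-insensitive
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

print(pos_counts(SAMPLE)["PRON"])  # (3, 3, 4)
```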
Out of 17 observed tags, the rank of PRON is: 9 in number of lemmas, 9 in number of types and 10 in number of tokens.
The 10 most frequent PRON lemmas: sé, é, sin, féin, iad, siad, sí, mé, í, seo
The 10 most frequent PRON types: sé, é, sin, féin, iad, siad, sí, mé, í, seo
The 10 most frequent ambiguous lemmas: sé (PRON 749, NUM 24, PROPN 3), sin (DET 425, PRON 414, INTJ 1), féin (PRON 255, ADV 1), sí (PRON 193, PROPN 1), seo (DET 564, PRON 145), cé (PRON 92, SCONJ 45, NOUN 2), siúd (PRON 41, DET 9), a (PART 4074, DET 736, PRON 23, ADV 7, X 5, NUM 2, NOUN 1), ceachtar (PRON 7, NOUN 1), ar (ADP 3229, PART 42, ADV 35, VERB 17, PRON 6, SCONJ 4)
The 10 most frequent ambiguous types: sé (PRON 747, NUM 19, AUX 6), sin (DET 403, PRON 346, AUX 1), féin (PRON 247, ADV 1), seo (DET 547, PRON 125, AUX 1), siúd (PRON 39, DET 8), cé (SCONJ 30, PRON 12, NOUN 1), a (PART 4050, DET 745, PRON 23, ADV 7, ADP 2, X 2, NOUN 1), san (ADP 212, DET 22, PRON 13), ar (ADP 2733, AUX 43, PART 35, ADV 33, VERB 15, PRON 6), c (PRON 2, NOUN 1)
- sé
- sin
- DET 403: Thug na páistí ruaig amháin ar an siopa sin .
- PRON 346: Cé nár dhúirt tú é bhí ‘ fhios agam gur thuig tú sin i do chroí .
- AUX 1: ’ Tógann sé deich mbliana ar an spéis léitheoireachta theacht in inmhe agus caithfear díriú ar pháistí atá ag sroichint na léitheoireachta ar bhun neamhspleách , mar sin an áit a gcailltear iad faoi láthair .
- féin
- PRON 247: Mo léan go mbeidh ar na Gaeil luí síos arís ina dtír féin .
- ADV 1: ’ Fostaíodh Comhairleoirí Gaeilge ar chonradh dhá bhliain i gcuid de na Bordcheantair cheana féin ach ceapann Comhar go bhfuil géarghá le Comhairleoir Gaeilge lánaimseartha in achan Bhordcheantar sa Tuaisceart sa dóigh is go mbeidh leanúnachas san obair .
- seo
- DET 547: An dtuigtear fós sa tír seo cé chomh mór de athrú is a bhí ansin ?
- PRON 125: Duradh gur seo ceann dena fadhbanna is mó atá sa cheantar .
- AUX 1: Cothú sa Duine Tá cúig chéim i gceist le cothú sa duine : Daoibhse atá ar bheagán Shakespeare , nó daoibhse a d’ fhág cúrsaí staidéir níos mó ná cúpla lá ó shin , seo é , go gonta , scéal traigéideach marfach Phrionsa na Danmhairge .
- siúd
- cé
- SCONJ 30: An dtuigtear fós sa tír seo cé chomh mór de athrú is a bhí ansin ?
- PRON 12: Níl ‘s ag Mícheál cé acu ag cur i gcéill atá sé nó aineolach .
- NOUN 1: Ar fhágaint slán agus beannacht age cé na Coise an tráthnóna aoibhinn caithiseach Domhnaigh seo dhúinn , agus agena a raibh do dhaoine ina seasamh ann , cé go raibh cuid mhaith ann san am gcéanna , ní raibh cuma na hainnise ná na bochtanacht ar aon duine acu , rud ná beadh im chumas do rá leo anois dá mbeinn ann , comh fada lem thuairim .
- a
- PART 4050: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
- DET 745: Meatachán scanraithe agus a lámha cáidheach le roidealach an bhóthair .
- PRON 23: Níl de dhíth ach aon fhocal amháin , sin a bhfuil .
- ADV 7: Tá an teicníocht inste a roghnaíonn sé thar a bheith éifeachtach .
- ADP 2: ’ Séard atá De Róiste a rá anois ná gur cuireadh ina choinne go raibh baint aige le Saor Éire agus gur thug sin leithscéal dóibh é a bhriseadh .
- X 2: ’ Cad a bhí ann ach go raibh sé ‘ cut off with a shilling ‘ , agus tugadh an scilling dó !
- NOUN 1: Sa Deibhí bíonn seacht siolla sa líne freisin , ach bíonn siolla sa bhreis i bhfocal deireanach b ar a , agus in d ar c .
- san
- ar
- ADP 2733: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
- AUX 43: Tá cúpla rud eile sa leabhar seo ar mhaith liom tagairt dóibh .
- PART 35: De ribeoga olla ar chuir mé casadh iontu a rinne mé na buaiceacha .
- ADV 33: Scairt mé ar ais air .
- VERB 15: ’ Tá téagar sna páirteanna a fhaigheann tú i scannán , ‘ ar sise .
- PRON 6: Bhí cloch mhór chuimhne , dúradh leis , ar na mbóthar go Páras a raibh ainm gach ar thit is nach dtángthas ar an gcorpán greanta uirthi .
- c
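The ambiguity lists above can be sketched as follows: group tag counts by lemma (or by type), keep the entries that occur with PRON and at least one other tag, and order them by their PRON frequency, which is the ordering the lists above appear to follow. The helper names `ambiguous` and `render` and the input pairs are hypothetical.

```python
from collections import Counter, defaultdict

def ambiguous(pairs, tag):
    """pairs: iterable of (key, upos), where key is a lemma or a word form.

    Returns [(key, Counter)] for keys seen with `tag` and at least one
    other tag, ordered by descending frequency with `tag`.
    """
    by_key = defaultdict(Counter)
    for key, upos in pairs:
        by_key[key][upos] += 1
    hits = [(k, c) for k, c in by_key.items() if tag in c and len(c) > 1]
    hits.sort(key=lambda kc: -kc[1][tag])
    return hits

def render(hits):
    """Format as 'key (TAG n, TAG n), ...' with tags by descending count."""
    return ", ".join(
        f"{key} (" + ", ".join(f"{t} {n}" for t, n in c.most_common()) + ")"
        for key, c in hits
    )

# Hypothetical toy input, not real treebank counts.
PAIRS = [("sin", "DET"), ("sin", "PRON"), ("sin", "DET"),
         ("sé", "PRON"), ("sé", "PRON"), ("sé", "NUM"), ("mé", "PRON")]
print(render(ambiguous(PAIRS, "PRON")))
# sé (PRON 2, NUM 1), sin (DET 2, PRON 1)
```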
Morphology
The form / lemma ratio of PRON is 1.694444 (the average of all parts of speech is 1.648496).
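The reported ratio is consistent with dividing the number of PRON types (61) by the number of PRON lemmas (36) given at the top of this page. A quick check, assuming that is how the figure is derived:

```python
pron_types = 61   # distinct PRON word forms reported on this page
pron_lemmas = 36  # distinct PRON lemmas reported on this page

ratio = pron_types / pron_lemmas
print(f"{ratio:.6f}")  # 1.694444
```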
The 1st highest number of forms (6) was observed with the lemma “cé”: c, cé, céard, cén, cér, cérbh.
The 2nd highest number of forms (6) was observed with the lemma “sin”: hin, in, san, shin, shoin, sin.
The 3rd highest number of forms (4) was observed with the lemma “tú”: thusa, thú, tusa, tú.
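A ranking of this kind can be sketched by collecting the set of forms per lemma and sorting by set size. The form/lemma pairs below are the ones listed above for "cé" and "tú"; the function name, the lowercasing of forms and the alphabetical tie-break are assumptions.

```python
from collections import defaultdict

def forms_per_lemma(pairs):
    """pairs: iterable of (form, lemma). Returns [(lemma, forms)] sorted by
    descending number of distinct forms, then alphabetically by lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form.lower())  # assumption: case-insensitive forms
    return sorted(forms.items(), key=lambda kv: (-len(kv[1]), kv[0]))

PAIRS = [(f, "cé") for f in ["c", "cé", "céard", "cén", "cér", "cérbh"]]
PAIRS += [(f, "tú") for f in ["thusa", "thú", "tusa", "tú"]]

for lemma, forms in forms_per_lemma(PAIRS):
    print(lemma, len(forms), ", ".join(sorted(forms)))
```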
PRON occurs with 10 features: Number (2619; 72% instances), Person (2577; 71% instances), Gender (1736; 48% instances), PronType (862; 24% instances), Reflex (255; 7% instances), Form (66; 2% instances), Dialect (17; 0% instances), Typo (5; 0% instances), VerbForm (2; 0% instances), Foreign (1; 0% instances)
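The percentages are shares of the 3621 PRON tokens, rounded to whole numbers, which is why the rare features show as 0%. A short check using the counts listed above (the rounding mode of the original tool is not stated; Python's `round` happens to reproduce every figure here):

```python
PRON_TOKENS = 3621  # total PRON tokens reported on this page

feature_counts = {
    "Number": 2619, "Person": 2577, "Gender": 1736, "PronType": 862,
    "Reflex": 255, "Form": 66, "Dialect": 17, "Typo": 5,
    "VerbForm": 2, "Foreign": 1,
}

# Share of PRON tokens carrying each feature, as a whole-number percentage.
shares = {f: round(100 * n / PRON_TOKENS) for f, n in feature_counts.items()}
for feature, pct in shares.items():
    print(f"{feature} ({feature_counts[feature]}; {pct}% instances)")
```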
PRON occurs with 22 feature-value pairs: Dialect=Connaught, Dialect=Munster, Dialect=Ulster, Foreign=Yes, Form=HPref, Form=Len, Form=VF, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, PronType=Dem, PronType=Emp, PronType=Ind, PronType=Int, PronType=Rel, Reflex=Yes, Typo=Yes, VerbForm=Cop
PRON occurs with 38 feature combinations.
The most frequent feature combination is Gender=Masc|Number=Sing|Person=3 (1351 tokens).
Examples: sé, é, éard
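A feature combination is the full FEATS string of a token. In CoNLL-U the pairs are separated by `|`, and an underscore marks a token with no features. A minimal parser, with `parse_feats` as a hypothetical helper name:

```python
def parse_feats(feats):
    """Turn a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":  # underscore means the token has no features
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))

print(parse_feats("Gender=Masc|Number=Sing|Person=3"))
# {'Gender': 'Masc', 'Number': 'Sing', 'Person': '3'}
```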
Relations
PRON nodes are attached to their parents using 21 different relations: nsubj (1715; 47% instances), nmod (645; 18% instances), obj (401; 11% instances), det (269; 7% instances), root (151; 4% instances), advcl (110; 3% instances), obl (80; 2% instances), fixed (67; 2% instances), conj (59; 2% instances), ccomp (53; 1% instances), parataxis (39; 1% instances), appos (6; 0% instances), csubj:cop (6; 0% instances), obl:tmod (6; 0% instances), acl:relcl (5; 0% instances), nsubj:outer (4; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), flat:name (1; 0% instances), orphan (1; 0% instances), xcomp:pred (1; 0% instances)
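A relation summary in this layout can be produced by tallying the DEPREL column for the tokens of interest. The sketch below assumes ties are broken alphabetically, which matches the ordering seen above; `relation_report` is a hypothetical name and the input is a toy list, not treebank data.

```python
from collections import Counter

def relation_report(deprels):
    """deprels: iterable of relation labels, one per token.

    Returns 'rel (count; pct% instances), ...' ordered by descending
    count, then alphabetically by label.
    """
    counts = Counter(deprels)
    total = sum(counts.values())
    ordered = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return ", ".join(
        f"{rel} ({n}; {round(100 * n / total)}% instances)" for rel, n in ordered
    )

print(relation_report(["nsubj", "nsubj", "obj", "nsubj"]))
# nsubj (3; 75% instances), obj (1; 25% instances)
```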
Parents of PRON nodes belong to 14 different parts of speech: VERB (2005; 55% instances), NOUN (854; 24% instances), PRON (236; 7% instances), ADP (199; 5% instances), (151; 4% instances), PROPN (85; 2% instances), ADJ (60; 2% instances), ADV (20; 1% instances), NUM (3; 0% instances), DET (2; 0% instances), SCONJ (2; 0% instances), X (2; 0% instances), AUX (1; 0% instances), SYM (1; 0% instances)
2745 (76%) PRON nodes are leaves.
417 (12%) PRON nodes have one child.
245 (7%) PRON nodes have two children.
214 (6%) PRON nodes have three or more children.
The highest child degree of a PRON node is 7.
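The four counts above sum to the 3621 PRON tokens (2745 + 417 + 245 + 214). A distribution like this can be sketched by counting, for each node of interest, how many tokens in the same sentence name it as their head. The function name and the toy sentence below are hypothetical.

```python
from collections import Counter

def degree_buckets(heads, is_target):
    """heads[i] is the 1-based head id of token i+1 (0 means root);
    is_target[i] marks the tokens to report on (e.g. PRON nodes).

    Returns a Counter over the buckets '0', '1', '2' and '3+'.
    """
    children = Counter(h for h in heads if h != 0)
    buckets = Counter()
    for token_id, target in enumerate(is_target, start=1):
        if target:
            degree = children[token_id]  # 0 if no token points here
            buckets["3+" if degree >= 3 else str(degree)] += 1
    return buckets

# Toy sentence of five tokens: token 2 is the root with three children,
# token 5 has one child, tokens 1, 3 and 4 are leaves.
heads = [2, 0, 2, 5, 2]
is_target = [True, True, False, False, True]
print(dict(degree_buckets(heads, is_target)))
# {'0': 1, '3+': 1, '1': 1}
```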
Children of PRON nodes are attached using 31 different relations: punct (286; 17% instances), nsubj (184; 11% instances), nmod (158; 9% instances), case (134; 8% instances), det (128; 8% instances), cop (118; 7% instances), mark (117; 7% instances), acl:relcl (111; 7% instances), xcomp (71; 4% instances), conj (70; 4% instances), cc (54; 3% instances), xcomp:pred (49; 3% instances), csubj:cleft (38; 2% instances), amod (27; 2% instances), advmod (24; 1% instances), advcl (23; 1% instances), obl:prep (18; 1% instances), parataxis (15; 1% instances), csubj:cop (14; 1% instances), obl (14; 1% instances), ccomp (12; 1% instances), appos (6; 0% instances), obj (6; 0% instances), vocative (5; 0% instances), fixed (3; 0% instances), dislocated (2; 0% instances), flat:foreign (2; 0% instances), list (2; 0% instances), obl:tmod (2; 0% instances), discourse (1; 0% instances), mark:prt (1; 0% instances)
Children of PRON nodes belong to 15 different parts of speech: NOUN (384; 23% instances), PUNCT (286; 17% instances), PRON (236; 14% instances), VERB (182; 11% instances), ADP (157; 9% instances), AUX (119; 7% instances), SCONJ (88; 5% instances), CCONJ (84; 5% instances), ADJ (69; 4% instances), PROPN (36; 2% instances), DET (24; 1% instances), ADV (18; 1% instances), NUM (8; 0% instances), X (3; 0% instances), PART (1; 0% instances)