
This page pertains to UD version 2.

Treebank Statistics: UD_Irish-IDT: POS Tags: VERB

There are 445 VERB lemmas (5%), 1709 VERB types (11%) and 8775 VERB tokens (8%). Out of 17 observed tags, the rank of VERB is: 5 in number of lemmas, 4 in number of types and 5 in number of tokens.
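As an illustration of how lemma, type and token counts of this kind could be derived, here is a minimal sketch that reads CoNLL-U text and counts them for one UPOS tag. The function name `pos_counts`, the helper `row` and the toy rows are hypothetical, and the sketch assumes that types are case-sensitive word forms; the exact counting rules behind the figures above are not stated on this page.

```python
def pos_counts(conllu_text, upos="VERB"):
    """Return (n_lemmas, n_types, n_tokens) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only regular token lines: 10 columns and an integer ID.
        # This skips comments, blank lines, multiword ranges (1-2)
        # and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1])   # FORM, case-sensitive (assumption)
            lemmas.add(cols[2])  # LEMMA
    return len(lemmas), len(types), tokens


def row(tok_id, form, lemma, upos):
    """Build one toy CoNLL-U token line (hypothetical helper)."""
    return "\t".join([str(tok_id), form, lemma, upos,
                      "_", "_", "0", "root", "_", "_"])


# Toy data for illustration only, not taken from the treebank.
SAMPLE = "\n".join([
    "# sent_id = toy-1",
    row(1, "Tá", "bí", "VERB"),
    row(2, "sé", "sé", "PRON"),
    "",
    "# sent_id = toy-2",
    row(1, "Bhí", "bí", "VERB"),
    row(2, "tá", "bí", "VERB"),
])

print(pos_counts(SAMPLE))
```

On the toy sample the three VERB tokens share one lemma but have three distinct forms, because `Tá` and `tá` are counted separately under the case-sensitivity assumption.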

The 10 most frequent VERB lemmas: bí, cuir, déan, tabhair, tar, bain, abair, féad, faigh, téigh

The 10 most frequent VERB types: tá, bhí, atá, bhfuil, raibh, beidh, bheidh, mbeadh, níl, mbeidh

The 10 most frequent ambiguous lemmas: bí (VERB 3697, INTJ 1), déan (VERB 381, NOUN 1), tar (VERB 191, ADP 96), meas (VERB 40, NOUN 15), ceap (VERB 32, NOUN 1), úsáid (NOUN 65, VERB 29), scríobh (VERB 26, NOUN 12), dar (VERB 25, ADP 6, SCONJ 1), ar (ADP 3229, PART 42, ADV 35, VERB 17, PRON 6, SCONJ 4), inis (VERB 16, PROPN 7, NOUN 1)

The 10 most frequent ambiguous types: bhfuil (VERB 381, NOUN 1), cuireadh (VERB 43, NOUN 13), dar (VERB 13, AUX 5, ADP 2, SCONJ 1), scríobh (VERB 16, NOUN 11), ar (ADP 2733, AUX 43, PART 35, ADV 33, VERB 15, PRON 6), thosaigh (VERB 11, NOUN 1), bronnadh (VERB 6, NOUN 3), cailleadh (VERB 6, NOUN 1), rith (NOUN 43, VERB 6), ceapadh (VERB 6, NOUN 4)

Morphology

The form / lemma ratio of VERB is 3.840449 (the average of all parts of speech is 1.648496).
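The ratio above is consistent with dividing the number of VERB types by the number of VERB lemmas reported in the overview. A short check, assuming that "forms" here means the 1709 types counted earlier:

```python
# Figures taken from the overview paragraph of this page.
verb_types = 1709
verb_lemmas = 445

ratio = verb_types / verb_lemmas
print(f"{ratio:.6f}")  # 3.840449
```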

The highest number of forms (71) was observed with the lemma “bí”: Bheinn, Bhíomar, Bím, Bítear, Nílim, Táimse, ata, atá, atáid, atáim, atáimse, atáthar, beadh, beidh, beifear, beimid, bheadh, bheas, bheidh, bheidís, bheifeá, bheifí, bheimid, bheimís, bheitheá, bhfuil, bhfuilid, bhfuilimid, bhfuilimíd, bhfuiltear, bhéas, bhéidh, bhí, bhídís, bhímid, bhíodar, bhíodh, bhíonn, bhíos, bhíothas, bígí, bímid, bíodh, bíonn, fuil, mbeadh, mbeidh, mbeidís, mbeifear, mbeifeá, mbeimid, mbeimis, mbeinn, mbímid, mbímis, mbínn, mbínnse, mbíodh, mbíonn, níl, nílirse, rabhadar, rabhamar, rabhas, rabhthas, raibh, tá, táid, táim, táimid, táthar.

The 2nd highest number of forms (43) was observed with the lemma “déan”: Déanaimid, Rinneadar, deineadh, dhearna, dhein, dheánann, dhineann, dhéanadh, dhéanann, dhéananna, dhéanfadh, dhéanfaidh, dhéanfaidís, dhéanfainn, dhéanfar, dhéanfaí, dhéanfá, dhéantar, dineadh, dintar, déan, déanadh, déanaim, déanann, déanfaidh, déanfaimid, déanfar, déanfidh, déantar, ndearna, ndearnadh, ndintar, ndéanadh, ndéanann, ndéanfadh, ndéanfaidh, ndéanfaidís, ndéanfar, ndéanfaí, ndéantar, rinne, rinneadh, rinneamar.

The 3rd highest number of forms (35) was observed with the lemma “tabhair”: ‘thug, Tabharfadsa, Thugamar, Tugaim, dtabharfadh, dtabharfaidh, dtabharfar, dtabharfaí, dtug, dtugadh, dtugann, dtugtar, dtugtaí, tabhair, tabharfaidh, tabharfar, thabharfadh, thabharfaidh, thabharfar, thabharfas, thabharfaídh, thabharfá, thug, thugadar, thugadh, thugaidís, thugaimid, thugainn, thugann, thugtar, thugtaí, tugadh, tugaimid, tugann, tugtar.

VERB occurs with 10 features: Mood (8650; 99% instances), Tense (8122; 93% instances), Form (4511; 51% instances), Person (1918; 22% instances), Polarity (659; 8% instances), PronType (598; 7% instances), Number (518; 6% instances), Aspect (299; 3% instances), Typo (22; 0% instances), Dialect (20; 0% instances)

VERB occurs with 28 feature-value pairs: Aspect=Hab, Aspect=Imp, Dialect=Munster, Form=Direct, Form=Direct,Emp, Form=Ecl, Form=Ecl,Emp, Form=Emp, Form=Emp,Len, Form=HPref, Form=Len, Mood=Cnd, Mood=Cnd,Int, Mood=Imp, Mood=Ind, Mood=Sub, Number=Plur, Number=Sing, Person=0, Person=1, Person=2, Person=3, Polarity=Neg, PronType=Rel, Tense=Fut, Tense=Past, Tense=Pres, Typo=Yes

VERB occurs with 191 feature combinations. The most frequent feature combination is Form=Len|Mood=Ind|Tense=Past (1496 tokens). Examples: bhí, tháinig, thug, chuir, chuaigh, bhain, tharla, chaith, chonaic, fhág
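The three feature statistics above (features, feature-value pairs, and full combinations) can all be tallied from the FEATS column of the VERB tokens. The following is a minimal sketch; the function name `feature_stats` and the toy input are hypothetical, and treating an empty FEATS value (`_`) as its own combination is an assumption rather than a documented rule of this page.

```python
from collections import Counter


def feature_stats(feats_column):
    """Tally feature names, feature=value pairs and full combinations.

    feats_column: iterable of FEATS strings as found in CoNLL-U,
    e.g. "Form=Len|Mood=Ind|Tense=Past", or "_" when empty.
    """
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # "_" counted as a combination (assumption)
        if feats == "_":
            continue
        for pair in feats.split("|"):
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos


# Toy input for illustration only, not taken from the treebank.
toy_feats = [
    "Form=Len|Mood=Ind|Tense=Past",
    "Form=Len|Mood=Ind|Tense=Past",
    "Mood=Ind|Tense=Pres",
    "_",
]
names, pairs, combos = feature_stats(toy_feats)
print(names.most_common())
print(combos.most_common(1))
```

Multi-valued entries such as `Form=Emp,Len` stay intact as one pair in this sketch, which matches how they are listed above.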

Relations

VERB nodes are attached to their parents using 17 different relations: root (3480; 40% instances), acl:relcl (2051; 23% instances), advcl (917; 10% instances), conj (767; 9% instances), ccomp (630; 7% instances), csubj:cleft (342; 4% instances), parataxis (340; 4% instances), csubj:cop (162; 2% instances), acl (44; 1% instances), xcomp (23; 0% instances), dislocated (6; 0% instances), obl (5; 0% instances), nmod (3; 0% instances), appos (2; 0% instances), nsubj (1; 0% instances), orphan (1; 0% instances), xcomp:pred (1; 0% instances)
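The percentages above appear to be each relation's share of all VERB tokens, rounded to a whole percent. A short check using the counts listed on this page (the variable names are illustrative):

```python
# Counts copied from the list of parent relations above.
relation_counts = {
    "root": 3480, "acl:relcl": 2051, "advcl": 917, "conj": 767,
    "ccomp": 630, "csubj:cleft": 342, "parataxis": 340,
    "csubj:cop": 162, "acl": 44, "xcomp": 23, "dislocated": 6,
    "obl": 5, "nmod": 3, "appos": 2, "nsubj": 1, "orphan": 1,
    "xcomp:pred": 1,
}

total = sum(relation_counts.values())
shares = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)                                              # 8775
print(shares["root"], shares["acl:relcl"], shares["acl"])  # 40 23 1
```

The total matches the 8775 VERB tokens reported in the overview, so every VERB node is covered by exactly one of the 17 relations.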

Parents of VERB nodes belong to 15 different parts of speech: no part of speech, i.e. the artificial root node (3480; 40% instances), NOUN (2553; 29% instances), VERB (2117; 24% instances), ADJ (224; 3% instances), PRON (182; 2% instances), PROPN (120; 1% instances), ADP (39; 0% instances), ADV (23; 0% instances), NUM (12; 0% instances), SCONJ (9; 0% instances), PART (5; 0% instances), AUX (4; 0% instances), X (4; 0% instances), DET (2; 0% instances), SYM (1; 0% instances)

11 (0%) VERB nodes are leaves.

401 (5%) VERB nodes have one child.

1202 (14%) VERB nodes have two children.

7161 (82%) VERB nodes have three or more children.

The highest child degree of a VERB node is 14.
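The child-degree figures above can be reproduced in principle by counting, for each VERB node, how many tokens in the same sentence name it as their head. Below is a minimal sketch; the function name `child_degree_buckets` and the `(id, head, upos)` tuple layout are assumptions made for illustration, and the toy sentences are not from the treebank.

```python
from collections import Counter


def child_degree_buckets(sentences, upos="VERB"):
    """Bucket nodes of one UPOS by number of children (0, 1, 2, 3+).

    sentences: list of sentences, each a list of (id, head, upos)
    tuples with integer ids and heads (head 0 = artificial root).
    Returns (buckets, max_degree); bucket key 3 means "three or more".
    """
    buckets = Counter()
    max_degree = 0
    for sent in sentences:
        # Number of children per head id within this sentence.
        n_children = Counter(head for _id, head, _upos in sent)
        for tok_id, _head, tok_upos in sent:
            if tok_upos != upos:
                continue
            degree = n_children[tok_id]
            max_degree = max(max_degree, degree)
            buckets[min(degree, 3)] += 1
    return buckets, max_degree


# Toy sentences for illustration only.
toy = [
    [(1, 0, "VERB"), (2, 1, "PRON"), (3, 1, "ADV"), (4, 1, "PUNCT")],
    [(1, 2, "NOUN"), (2, 0, "VERB")],
    [(1, 0, "NOUN"), (2, 1, "VERB")],
]
buckets, max_degree = child_degree_buckets(toy)
print(dict(buckets), max_degree)
```

On the page's own figures, the four buckets (11, 401, 1202 and 7161) add up to the 8775 VERB tokens.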

Children of VERB nodes are attached using 35 different relations: nsubj (5877; 17% instances), obl (5391; 16% instances), punct (4948; 15% instances), obj (2718; 8% instances), mark:prt (2301; 7% instances), xcomp (2231; 7% instances), xcomp:pred (1749; 5% instances), advmod (1582; 5% instances), obl:prep (1140; 3% instances), mark (1050; 3% instances), advcl (1007; 3% instances), conj (863; 3% instances), cc (757; 2% instances), ccomp (668; 2% instances), obl:tmod (600; 2% instances), parataxis (439; 1% instances), case (125; 0% instances), nmod (66; 0% instances), amod (36; 0% instances), dislocated (35; 0% instances), list (34; 0% instances), vocative (34; 0% instances), discourse (23; 0% instances), acl:relcl (22; 0% instances), compound:prt (12; 0% instances), cop (9; 0% instances), csubj:cop (5; 0% instances), orphan (5; 0% instances), appos (3; 0% instances), det (3; 0% instances), nsubj:outer (3; 0% instances), acl (2; 0% instances), csubj:cleft (1; 0% instances), flat:name (1; 0% instances), nmod:poss (1; 0% instances)

Children of VERB nodes belong to 17 different parts of speech: NOUN (13369; 40% instances), PUNCT (4948; 15% instances), PART (3974; 12% instances), VERB (2117; 6% instances), PRON (2005; 6% instances), ADP (1601; 5% instances), ADJ (1414; 4% instances), PROPN (1106; 3% instances), SCONJ (1023; 3% instances), ADV (1004; 3% instances), CCONJ (830; 2% instances), NUM (273; 1% instances), X (31; 0% instances), INTJ (17; 0% instances), AUX (12; 0% instances), DET (10; 0% instances), SYM (7; 0% instances)