Treebank Statistics: UD_Irish-IDT: POS Tags: NOUN
There are 4680 NOUN lemmas (49%), 8442 NOUN types (54%) and 33642 NOUN tokens (29%).
Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
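Counts of this kind can be reproduced from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the sample fragment and the helper names `pos_counts` and `rank_by_tokens` are hypothetical, and "types" are assumed to be distinct surface forms as written.

```python
from collections import defaultdict

# Hypothetical CoNLL-U fragment for illustration (not taken from UD_Irish-IDT).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = hypothetical example",
    "1\tduine\tduine\tNOUN\t_\t_\t0\troot\t_\t_",
    "2\tdaoine\tduine\tNOUN\t_\t_\t1\tconj\t_\t_",
    "3\tmór\tmór\tADJ\t_\t_\t2\tamod\t_\t_",
    "4\tduine\tduine\tNOUN\t_\t_\t1\tconj\t_\t_",
])

def pos_counts(conllu_text):
    """Return {UPOS: (lemma count, type count, token count)}."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for line in conllu_text.splitlines():
        # Skip blank lines (sentence boundaries) and comment lines.
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}

def rank_by_tokens(counts, upos):
    """1-based rank of a tag when tags are sorted by token count, descending."""
    ordered = sorted(counts, key=lambda u: counts[u][2], reverse=True)
    return ordered.index(upos) + 1

counts = pos_counts(SAMPLE)
print(counts["NOUN"])                   # (lemmas, types, tokens) for NOUN
print(rank_by_tokens(counts, "NOUN"))   # rank of NOUN by token count
```

In the sample, the lemma duine appears with two forms (duine, daoine) across three tokens, so NOUN has one lemma, two types and three tokens.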
The 10 most frequent NOUN lemmas: duine, cur, bliain, cuid, fáil, déanamh, ceann, féidir, pobal, bheith
The 10 most frequent NOUN types: chur, dhéanamh, fáil, bheith, duine, féidir, chuid, chéile, réir, daoine
The 10 most frequent ambiguous lemmas: ceann (NOUN 280, PROPN 4), bheith (NOUN 220, ADV 6), céile (NOUN 185, PROPN 1), cathair (NOUN 162, PROPN 18), lá (NOUN 126, PROPN 1), réir (NOUN 125, SCONJ 2), leath (NOUN 116, ADJ 2, VERB 2), áit (NOUN 116, SCONJ 1), teanga (NOUN 110, PROPN 1), tír (NOUN 109, PROPN 4)
The 10 most frequent ambiguous types: chur (NOUN 286, VERB 1), bheith (NOUN 220, ADV 6), réir (NOUN 123, SCONJ 2), dtí (NOUN 91, ADP 1), áit (NOUN 77, SCONJ 1), linn (NOUN 71, ADP 22), láthair (NOUN 62, ADP 1), deireadh (NOUN 57, VERB 1), forbartha (NOUN 28, ADJ 5), rith (NOUN 43, VERB 6)
- réir
  - NOUN 123: Meascadh agus díol cógas de réir nós na Meánaoise .
  - SCONJ 2: (4) D’ fhonn amhras a sheachaint , aon achtachán a ndéantar leasú air le halt den Acht seo a scoireann de bheith i ngníomh amhail ar an agus ón lá dá dtagraítear i bhfo-alt (1) nó , de réir mar a bheidh , amhail ar agus ó dháta éagtha na tréimhse ar lena linn a choimeádfar i ngníomh é faoi fho-alt (2) ( ‘ an t-éag ‘ ) , beidh feidhm aige agus beidh éifeacht leis amhail ar an agus ón lá sin nó , de réir mar a bheidh , amhail ar an agus ón éag , mar a bhí feidhm aige agus éifeacht leis díreach roimh dháta an Achta seo a rith ach sin faoi réir aon leasuithe a dhéanfar le haon Acht eile den Oireachtas tar éis an dáta rite sin .
- deireadh
  - NOUN 57: ’ Tá deireadh an domhain ag teacht !
  - VERB 1: Bím ag obair , mar sin , leis na coistí sin ina gceann agus ina gceann , agus i gcomhar chomh maith faoi bhrat an chomhchoiste , sé sin Chomhchoiste Ghaeltachtaí Chiarraí Theas agus Comharchumann Naomh Fhionáin Teo. B’ in é an t-aon lá sa mbliain a bhféadfá do chuid siúil a thaispeáint , mar a deireadh sé .
Morphology
The form / lemma ratio of NOUN is 1.803846 (the average of all parts of speech is 1.651212).
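The ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas reported at the top of this page. A short check, assuming that this is how the figure is defined:

```python
# Figures taken from the summary line at the top of this page.
noun_types = 8442
noun_lemmas = 4680

# Assumption: form / lemma ratio = distinct forms (types) / distinct lemmas.
ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.803846
```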
The highest number of forms (13) was observed with the lemma “teach”: TI, Tighe, Títhe, dteach, dtigh, dtithe, dtí, teach, theach, thigh, tigh, tithe, tí.
The 2nd highest number of forms (12) was observed with the lemma “údarás”: hÚdarás, húdaráis, nÚdaráis, nÚdarás, t-údarás, tArd-Údarás, tÚdaras, tÚdarás, Údaras, údarais, údaráis, údarás.
The 3rd highest number of forms (10) was observed with the lemma “bás”: b(h)ás, bas, bháis, bhás, bhásanna, báis, bás, básanna, mbáis, mbás.
NOUN occurs with 12 features: Number (29285; 87% instances), Case (29177; 87% instances), Gender (28549; 85% instances), Definite (13699; 41% instances), Form (10099; 30% instances), VerbForm (4066; 12% instances), NounType (1217; 4% instances), PrepForm (977; 3% instances), Typo (161; 0% instances), Abbr (101; 0% instances), Foreign (62; 0% instances), Dialect (13; 0% instances)
NOUN occurs with 25 feature-value pairs: Abbr=Yes, Case=Dat, Case=Gen, Case=Nom, Case=Voc, Definite=Def, Dialect=Connaught, Dialect=Munster, Dialect=Ulster, Foreign=Yes, Form=Ecl, Form=Emp, Form=Emp,Len, Form=HPref, Form=Len, Gender=Fem, Gender=Masc, NounType=Strong, NounType=Weak, Number=Plur, Number=Sing, PrepForm=Cmpd, Typo=Yes, VerbForm=Inf, VerbForm=Vnoun
NOUN occurs with 247 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing (3939 tokens).
Examples: duine, rud, bith, gá, fad, ábhar, bás, iarratas, lá, ceann
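Feature statistics like those above can be derived from the FEATS column, where feature-value pairs are joined with `|`. The sketch below is illustrative only: the input list is hypothetical rather than treebank data, and `parse_feats` is a helper name introduced here.

```python
from collections import Counter

def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":
        return {}
    # maxsplit=1 keeps multi-valued entries such as Form=Emp,Len intact.
    return dict(pair.split("=", 1) for pair in feats.split("|"))

# Hypothetical FEATS values for a handful of NOUN tokens.
noun_feats = [
    "Case=Nom|Gender=Masc|Number=Sing",
    "Case=Gen|Definite=Def|Gender=Fem|Number=Sing",
    "Case=Nom|Gender=Masc|Number=Sing",
    "_",
]

# How many tokens carry each feature.
feature_counts = Counter(name for f in noun_feats for name in parse_feats(f))
# How often each complete feature combination occurs.
combo_counts = Counter(noun_feats)

print(feature_counts["Case"], feature_counts["Definite"])
print(combo_counts.most_common(1)[0])
```

Counting whole FEATS strings gives the feature combinations, while counting the parsed keys gives the per-feature totals.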
Relations
NOUN nodes are attached to their parents using 32 different relations: nmod (8574; 25% instances), obl (7190; 21% instances), obj (3627; 11% instances), nsubj (3608; 11% instances), xcomp (2934; 9% instances), conj (2865; 9% instances), fixed (1169; 3% instances), root (823; 2% instances), xcomp:pred (587; 2% instances), obl:tmod (504; 1% instances), csubj:cop (424; 1% instances), parataxis (277; 1% instances), advcl (270; 1% instances), appos (247; 1% instances), compound (157; 0% instances), ccomp (109; 0% instances), acl:relcl (91; 0% instances), acl (39; 0% instances), dislocated (39; 0% instances), vocative (28; 0% instances), list (25; 0% instances), csubj:cleft (10; 0% instances), flat (10; 0% instances), advmod (7; 0% instances), nsubj:outer (7; 0% instances), discourse (6; 0% instances), orphan (6; 0% instances), mark (3; 0% instances), nummod (3; 0% instances), case (1; 0% instances), cc (1; 0% instances), flat:name (1; 0% instances)
Parents of NOUN nodes belong to 17 different parts of speech: NOUN (16199; 48% instances), VERB (13463; 40% instances), ADP (1197; 4% instances), ADJ (900; 3% instances), [no parent: sentence root] (823; 2% instances), PROPN (425; 1% instances), PRON (385; 1% instances), NUM (119; 0% instances), ADV (67; 0% instances), X (16; 0% instances), PART (13; 0% instances), DET (12; 0% instances), AUX (10; 0% instances), SCONJ (7; 0% instances), SYM (4; 0% instances), CCONJ (1; 0% instances), INTJ (1; 0% instances)
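The parent distribution can be computed by following each node's HEAD to its parent and reading the parent's tag. Nodes whose HEAD is 0 have no parent, which would account for the 823 instances that match the count of the root relation. A minimal sketch over a hypothetical sentence (the triples and the name `parent_pos_counts` are illustrative assumptions):

```python
from collections import Counter

# Hypothetical sentence as (ID, UPOS, HEAD) triples; HEAD 0 marks the root.
tokens = [
    (1, "NOUN", 0),
    (2, "NOUN", 1),
    (3, "ADJ", 2),
    (4, "VERB", 1),
    (5, "NOUN", 4),
]

def parent_pos_counts(tokens, target="NOUN"):
    """Count the UPOS of the parent of every node tagged `target`."""
    upos_by_id = {tid: upos for tid, upos, _ in tokens}
    counts = Counter()
    for _, upos, head in tokens:
        if upos != target:
            continue
        # HEAD 0 is not a token ID, so root nodes fall into "(no parent)".
        counts[upos_by_id.get(head, "(no parent)")] += 1
    return counts

parents = parent_pos_counts(tokens)
print(parents)
```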
5734 (17%) NOUN nodes are leaves.
9248 (27%) NOUN nodes have one child.
9841 (29%) NOUN nodes have two children.
8819 (26%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 19.
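The four child-count groups partition all NOUN tokens, so their counts should add up to the 33642 NOUN tokens reported at the top of this page. A quick consistency check using only the figures stated here:

```python
# Figures taken from this page.
total_noun_tokens = 33642
groups = {
    "leaves": 5734,
    "one child": 9248,
    "two children": 9841,
    "three or more children": 8819,
}

# The groups are mutually exclusive and cover every NOUN node.
assert sum(groups.values()) == total_noun_tokens

# Share of each group, rounded to whole percent.
percentages = {name: round(100 * n / total_noun_tokens) for name, n in groups.items()}
for name, n in groups.items():
    print(f"{name}: {n} ({percentages[name]}%)")
```

Because each share is rounded independently, the rounded percentages sum to 99 rather than 100.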
Children of NOUN nodes are attached using 41 different relations: case (13637; 22% instances), nmod (10766; 18% instances), det (8295; 14% instances), amod (3951; 6% instances), punct (3614; 6% instances), conj (2768; 5% instances), obl (2257; 4% instances), cc (2213; 4% instances), mark (2189; 4% instances), acl:relcl (1950; 3% instances), obj (1829; 3% instances), nmod:poss (1069; 2% instances), cop (838; 1% instances), advmod (660; 1% instances), obl:prep (616; 1% instances), xcomp (612; 1% instances), nummod (611; 1% instances), nsubj (354; 1% instances), csubj:cop (339; 1% instances), appos (337; 1% instances), parataxis (220; 0% instances), csubj:cleft (215; 0% instances), advcl (203; 0% instances), xcomp:pred (202; 0% instances), mark:prt (179; 0% instances), compound (157; 0% instances), ccomp (145; 0% instances), obl:tmod (124; 0% instances), flat (96; 0% instances), acl (86; 0% instances), flat:name (64; 0% instances), list (56; 0% instances), fixed (49; 0% instances), case:voc (30; 0% instances), compound:prt (21; 0% instances), dislocated (16; 0% instances), vocative (11; 0% instances), discourse (5; 0% instances), nsubj:outer (4; 0% instances), orphan (4; 0% instances), goeswith (3; 0% instances)
Children of NOUN nodes belong to 17 different parts of speech: NOUN (16199; 27% instances), ADP (14478; 24% instances), DET (9273; 15% instances), ADJ (4153; 7% instances), PUNCT (3614; 6% instances), VERB (2566; 4% instances), CCONJ (2264; 4% instances), PROPN (2152; 4% instances), PART (2019; 3% instances), NUM (1365; 2% instances), PRON (862; 1% instances), AUX (842; 1% instances), ADV (541; 1% instances), SCONJ (384; 1% instances), X (67; 0% instances), SYM (14; 0% instances), INTJ (2; 0% instances)