Treebank Statistics: UD_Irish-IDT: Features: Form
This feature is language-specific.
It occurs with 7 different values: Direct, Ecl, Emp, HPref, Indirect, Len, VF.
Some words have combined values of the feature; 5 combinations have been observed: Direct|Emp, Ecl|Emp, Ecl|Indirect, Ecl|VF, Emp|Len.
19802 tokens (17%) have a non-empty value of Form.
4857 types (32%) occur at least once with a non-empty value of Form.
2811 lemmas (32%) occur at least once with a non-empty value of Form.
The feature is used with 12 part-of-speech tags (count; share of all tokens in the treebank): NOUN (10099; 9%), VERB (4511; 4%), PART (2334; 2%), PROPN (1318; 1%), ADJ (926; 1%), NUM (261; 0%), AUX (180; 0%), PRON (66; 0%), DET (61; 0%), ADP (33; 0%), SCONJ (7; 0%), ADV (6; 0%).
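The counts above can be reproduced in outline from a CoNLL-U file. The following is a minimal sketch, not the script that generated this page: the function names `parse_feats` and `feature_coverage` are hypothetical, word types are assumed to be case-sensitive surface forms, and the four-token sample is a made-up illustration rather than a sentence taken from the treebank.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Form=Len' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def feature_coverage(conllu_text, feature="Form"):
    """Count tokens, word types and lemmas that carry `feature` at least once."""
    n_tokens = n_with = 0
    types, types_with = set(), set()
    lemmas, lemmas_with = set(), set()
    per_upos = Counter()
    for line in conllu_text.splitlines():
        cols = line.split("\t")
        # Keep only basic word lines: this skips comments, blank lines,
        # multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        word, lemma, upos = cols[1], cols[2], cols[3]
        feats = parse_feats(cols[5])
        n_tokens += 1
        types.add(word)      # assumption: types are case-sensitive
        lemmas.add(lemma)
        if feature in feats:
            n_with += 1
            types_with.add(word)
            lemmas_with.add(lemma)
            per_upos[upos] += 1
    return {
        "tokens": (n_with, n_tokens),
        "types": (len(types_with), len(types)),
        "lemmas": (len(lemmas_with), len(lemmas)),
        "per_upos": dict(per_upos),
    }


# Toy sample (illustrative only), built with explicit tab separators.
rows = [
    ("1", "Tá", "bí", "VERB", "_", "Mood=Ind|Tense=Pres", "0", "root", "_", "_"),
    ("2", "an", "an", "DET", "_", "Definite=Def|Number=Sing", "1", "det", "_", "_"),
    ("3", "bhean", "bean", "NOUN", "_", "Case=Nom|Form=Len|Number=Sing", "1", "nsubj", "_", "_"),
    ("4", "anseo", "anseo", "ADV", "_", "_", "1", "advmod", "_", "_"),
]
sample = "# text = Tá an bhean anseo\n" + "\n".join("\t".join(r) for r in rows)

stats = feature_coverage(sample)
with_form, total = stats["tokens"]
print(f"{with_form} of {total} tokens ({round(100 * with_form / total)}%) have Form")
```

A combined value such as `Ecl,Emp` is stored as a single string in this sketch; splitting it with `feats["Form"].split(",")` would give the individual values if they need to be counted separately.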
NOUN
10099 NOUN tokens (30% of all NOUN tokens) have a non-empty value of Form.
The most frequent other feature values with which NOUN and Form co-occurred: VerbForm=EMPTY (8199; 81%), Case=Nom (6670; 66%), Number=Sing (6588; 65%), Definite=EMPTY (5174; 51%).
NOUN tokens may have the following values of Form:
Ecl (2569; 25% of non-empty Form): bhfeidhm, dtí, gcuid, gceist, gcás, gcomhairle, mbliana, gcónaí, ndiaidh, gcúirt
Emp (4; 0% of non-empty Form): Roinnse, achomharcsa, leithscéalsa, liostasa
Emp,Len (5; 0% of non-empty Form): thuairimse, chroíse, ghrúpa-san, mháthairse
HPref (517; 5% of non-empty Form): haghaidh, haois, heagraíochtaí, hathruithe, hinstitiúidí, húdaráis, healaíona, hoíche, háite, hAirteagal
Len (7004; 69% of non-empty Form): chur, dhéanamh, bheith, chuid, chéile, thabhairt, bhliain, chomhairle, fhorbairt, fháil
| Paradigm tuairim | Ecl | Emp,Len | Len |
|---|---|---|---|
| Definite=Def\|Number=Sing | dtuairim | thuairimse | thuairim |
| Definite=Def\|Number=Plur | dtuairimí | | thuairimí |
| Number=Sing | | | thuairim |
| Number=Plur | | | thuairimí |
VERB
4511 VERB tokens (51% of all VERB tokens) have a non-empty value of Form.
The most frequent other feature values with which VERB and Form co-occurred: Mood=Ind (3932; 87%), Person=EMPTY (3876; 86%).
VERB tokens may have the following values of Form:
Direct (519; 12% of non-empty Form): atá, atáim, atáthar, ata, atáid, tá
Direct,Emp (1; 0% of non-empty Form): atáimse
Ecl (1234; 27% of non-empty Form): bhfuil, mbeadh, mbeidh, mbíonn, n-áirítear, mbaineann, ndéantar, dtagraítear, dtiocfadh, mbíodh
Ecl,Emp (5; 0% of non-empty Form): bhfeicimidne, bhféadfainnse, gcaithfeadsa, gceapaimse, mbínnse
Emp (10; 0% of non-empty Form): deirimse, Creidimidne, Feicimse, Tabharfadsa, Táimse, adeirimse, cloisimse, déarfainnse, nílirse
Emp,Len (4; 0% of non-empty Form): bhainfinnse, fheadarsa, thuigeadarsan, thángas-sa
HPref (7; 0% of non-empty Form): habair, haithneodh, hiarradh, héilítear, híocaigí, húsáideadh
Len (2731; 61% of non-empty Form): bhí, bheidh, thug, tháinig, chuir, bhaineann, bheadh, bhíonn, bhíodh, chuaigh
| Paradigm bí | Direct | Direct,Emp | Ecl | Ecl,Emp | Emp | Len |
|---|---|---|---|---|---|---|
| Aspect=Hab\|Mood=Ind\|Number=Plur\|Person=1\|Tense=Pres | | | mbímid | | | bhímid |
| Aspect=Hab\|Mood=Ind\|Polarity=Neg\|Tense=Pres | | | mbíonn | | | bhíonn |
| Aspect=Hab\|Mood=Ind\|PronType=Rel\|Tense=Pres | | | | | | bhíos |
| Aspect=Hab\|Mood=Ind\|Tense=Pres | | | mbíonn | | | bhíonn, bhíos |
| Aspect=Imp\|Number=Sing\|Person=1\|Tense=Past | | | mbínn | mbínnse | | |
| Aspect=Imp\|Number=Plur\|Person=1\|Tense=Past | | | mbímis | | | |
| Aspect=Imp\|Number=Plur\|Person=3\|Polarity=Neg\|Tense=Past | | | | | | bhídís |
| Aspect=Imp\|Number=Plur\|Person=3\|Tense=Past | | | | | | bhídís |
| Aspect=Imp\|Polarity=Neg\|Tense=Past | | | mbíodh | | | bhíodh |
| Aspect=Imp\|Tense=Past | | | mbíodh | | | bhíodh |
| Dialect=Munster\|Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres | | | bhfuilid | | | |
| Mood=Cnd | | | mbeadh | | | bheadh |
| Mood=Cnd,Int | | | mbeadh | | | |
| Mood=Cnd\|Number=Sing\|Person=1 | | | mbeinn | | | Bheinn |
| Mood=Cnd\|Number=Sing\|Person=2 | | | | | | bheifeá, bheitheá |
| Mood=Cnd\|Number=Sing\|Person=2\|Polarity=Neg | | | mbeifeá | | | |
| Mood=Cnd\|Number=Plur\|Person=1 | | | mbeimis | | | bheimís |
| Mood=Cnd\|Number=Plur\|Person=3 | | | mbeidís | | | bheidís |
| Mood=Cnd\|Number=Plur\|Person=3\|Polarity=Neg | | | mbeidís | | | |
| Mood=Cnd\|Person=0 | | | | | | bheifí |
| Mood=Cnd\|Polarity=Neg | | | mbeadh | | | bheadh |
| Mood=Ind\|Number=Sing\|Person=1\|PronType=Rel\|Tense=Pres | atáim | atáimse | | | | |
| Mood=Ind\|Number=Sing\|Person=1\|Tense=Past | | | | | | bhíos |
| Mood=Ind\|Number=Sing\|Person=1\|Tense=Pres | | | | | Táimse | |
| Mood=Ind\|Number=Sing\|Person=2\|Polarity=Neg\|Tense=Pres | | | | | nílirse | |
| Mood=Ind\|Number=Plur\|Person=1\|Polarity=Neg\|Tense=Pres | | | bhfuilimíd | | | |
| Mood=Ind\|Number=Plur\|Person=1\|Tense=Fut | | | mbeimid | | | bheimid |
| Mood=Ind\|Number=Plur\|Person=1\|Tense=Past | | | | | | Bhíomar |
| Mood=Ind\|Number=Plur\|Person=1\|Tense=Pres | | | bhfuilimid | | | |
| Mood=Ind\|Number=Plur\|Person=3\|PronType=Rel\|Tense=Pres | atáid | | | | | |
| Mood=Ind\|Number=Plur\|Person=3\|Tense=Past | | | | | | bhíodar |
| Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres | | | bhfuilid | | | |
| Mood=Ind\|Person=0\|Polarity=Neg\|Tense=Fut | | | mbeifear | | | |
| Mood=Ind\|Person=0\|PronType=Rel\|Tense=Pres | atáthar | | | | | |
| Mood=Ind\|Person=0\|Tense=Past | | | | | | bhíothas |
| Mood=Ind\|Person=0\|Tense=Pres | | | bhfuiltear | | | |
| Mood=Ind\|Polarity=Neg\|Tense=Fut | | | mbeidh | | | bheidh |
| Mood=Ind\|Polarity=Neg\|Tense=Pres | | | bhfuil | | | |
| Mood=Ind\|PronType=Rel\|Tense=Fut | | | | | | bheas, bhéas |
| Mood=Ind\|PronType=Rel\|Tense=Pres | atá, tá | | | | | |
| Mood=Ind\|PronType=Rel\|Tense=Pres\|Typo=Yes | ata | | | | | |
| Mood=Ind\|Tense=Fut | | | mbeidh | | | bheidh, bhéidh |
| Mood=Ind\|Tense=Past | | | | | | bhí |
| Mood=Ind\|Tense=Pres | | | bhfuil | | | |
PART
2334 PART tokens (33% of all PART tokens) have a non-empty value of Form.
The most frequent other feature values with which PART and Form co-occurred: PronType=Rel (2315; 99%), PartType=Vb (2161; 93%).
PART tokens may have the following values of Form:
Direct (1840; 79% of non-empty Form): a, nach, nár, do, ná
Ecl,Indirect (2; 0% of non-empty Form): n-a
Indirect (473; 20% of non-empty Form): a, ina, lena, ar, nach, dá, inar, faoina, DA, go
Len (11; 0% of non-empty Form): Mhic, Mhac
VF (8; 0% of non-empty Form): ab, b’
| Paradigm a | Direct | Ecl,Indirect | Indirect |
|---|---|---|---|
| | a, do | n-a | a, go |
PROPN
1318 PROPN tokens (24% of all PROPN tokens) have a non-empty value of Form.
The most frequent other feature values with which PROPN and Form co-occurred: Definite=Def (1282; 97%), Number=Sing (1266; 96%), Gender=Masc (666; 51%).
PROPN tokens may have the following values of Form:
Ecl (201; 15% of non-empty Form): mBaile, nGall, nGaeilge, gConamara, nDún, nGaillimh, bhFrainc, gCeathrú, nGaeltacht, bParlaimint
HPref (180; 14% of non-empty Form): hÉireann, hEorpa, hÉirinn, hEaglaise, hAlban, hAoine, h-Íde, hOstaire, hAlbain, hAthbheochana
Len (937; 71% of non-empty Form): Bhaile, Ghaeltacht, Ghaeilge, Ghaeltachta, Chathair, Mháire, Chiarraí, Dhún, Shráid, Bhéal
| Paradigm Gaeltacht | Ecl | Len |
|---|---|---|
| Case=Gen\|Definite=Def\|NounType=Strong\|Number=Plur | nGaeltachtaí | Ghaeltachtaí |
| Case=Gen\|Number=Sing | | Ghaeltachta |
| Case=Nom\|Definite=Def\|Number=Sing | nGaeltacht | Ghaeltacht |
| Definite=Def\|Number=Sing | | Ghaeltacht |
Form seems to be a lexical feature of PROPN: 91% of lemmas (359) occur with only one value of Form.
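The "lexical feature" observation rests on a simple count: for each lemma of a given part of speech, collect the distinct values of Form it occurs with, then report the share of lemmas that have exactly one. The sketch below illustrates that arithmetic under stated assumptions. The helper name `single_value_share` is hypothetical, only non-empty values are counted (the page does not say how empty values are treated), and the lemma/value pairs are illustrative rather than extracted from the treebank.

```python
from collections import defaultdict


def single_value_share(observations):
    """Return (lemmas with one value, all lemmas, percentage).

    `observations` is an iterable of (lemma, form_value) pairs for one
    part-of-speech tag, restricted to tokens with a non-empty Form.
    """
    values_by_lemma = defaultdict(set)
    for lemma, value in observations:
        values_by_lemma[lemma].add(value)
    total = len(values_by_lemma)
    single = sum(1 for values in values_by_lemma.values() if len(values) == 1)
    percent = 100.0 * single / total if total else 0.0
    return single, total, percent


# Illustrative pairs only: one lemma seen with two values, two with one.
pairs = [
    ("Gaeltacht", "Ecl"),
    ("Gaeltacht", "Len"),
    ("Éire", "HPref"),
    ("Éire", "HPref"),
    ("Máire", "Len"),
]

single, total, percent = single_value_share(pairs)
print(f"{percent:.0f}% of lemmas ({single} of {total}) occur with only one value")
```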
ADJ
926 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Form.
The most frequent other feature values with which ADJ and Form co-occurred: VerbForm=EMPTY (921; 99%), NounType=EMPTY (832; 90%), Degree=EMPTY (613; 66%), Case=Nom (541; 58%), Number=Sing (511; 55%).
ADJ tokens may have the following values of Form:
Ecl (3; 0% of non-empty Form): bhfurast, dtréan, gcéanna
HPref (118; 13% of non-empty Form): háirithe, hiomlán, hamháin, hidirnáisiúnta, hiondúil, huathoibríoch, hálainn, héifeachtach, hiontach, han-mhaith
Len (805; 87% of non-empty Form): mhór, mhaith, chóir, cheart, phoiblí, chéanna, bheag, chultúrtha, fhearr, shóisialta
| Paradigm céanna | Ecl | Len |
|---|---|---|
| Gender=Masc\|NounType=Slender\|Number=Plur | | chéanna |
| Gender=Masc\|Number=Sing | gcéanna | chéanna |
| Gender=Fem\|Number=Sing | | chéanna |
Form seems to be a lexical feature of ADJ: 99% of lemmas (293) occur with only one value of Form.
NUM
261 NUM tokens (13% of all NUM tokens) have a non-empty value of Form.
The most frequent other feature values with which NUM and Form co-occurred: NumType=Card (149; 57%).
NUM tokens may have the following values of Form:
Ecl (31; 12% of non-empty Form): gcéad, dtríú, gceithre, gcúig, gcúigiú, n-aon
HPref (10; 4% of non-empty Form): haon, hocht
Len (220; 84% of non-empty Form): dhá, chéad, cheithre, dhó, thrí, cheathrú, dheich, mhíle, chúig, dhara
| Paradigm céad | Ecl | Len |
|---|---|---|
| NumType=Card | gcéad | chéad |
| NumType=Ord | gcéad | chéad |
AUX
180 AUX tokens (12% of all AUX tokens) have a non-empty value of Form.
The most frequent other feature values with which AUX and Form co-occurred: VerbForm=Cop (180; 100%), Polarity=EMPTY (162; 90%), Tense=Past (145; 81%).
AUX tokens may have the following values of Form:
Ecl (12; 7% of non-empty Form): mba
Ecl,VF (5; 3% of non-empty Form): mb’
Len (2; 1% of non-empty Form): chan
VF (161; 89% of non-empty Form): b’, gurb, gurbh, níorbh, ab, arbh, b’, nárbh, arb, darbh
| Paradigm is | Ecl | Ecl,VF | Len | VF |
|---|---|---|---|---|
| Dialect=Ulster\|Polarity=Neg\|Tense=Pres | | | chan | |
| Mood=Cnd | mba | | | B' |
| Mood=Int\|Tense=Past | | | | arbh |
| Polarity=Neg\|PronType=Rel\|Tense=Past | | | | nárbh |
| Polarity=Neg\|Tense=Past | | | | níorbh, nárbh |
| Polarity=Neg\|Tense=Pres | | | Chan | |
| PronType=Rel\|Tense=Past | | | | ab |
| Tense=Past | mba | mb' | | b', gurbh, b’, arb, darbh |
| Tense=Pres | | | | gurb, darb |
PRON
66 PRON tokens (2% of all PRON tokens) have a non-empty value of Form.
The most frequent other feature values with which PRON and Form co-occurred: Gender=EMPTY (54; 82%), Number=EMPTY (43; 65%), Person=EMPTY (43; 65%), PronType=Dem (34; 52%).
PRON tokens may have the following values of Form:
HPref (16; 24% of non-empty Form): hé, hiad, hí
Len (49; 74% of non-empty Form): shin, fhéin, thú, cheachtar, shoin, thusa
VF (1; 2% of non-empty Form): cérbh
DET
61 DET tokens (1% of all DET tokens) have a non-empty value of Form.
The most frequent other feature values with which DET and Form co-occurred: Case=EMPTY (61; 100%), Gender=EMPTY (60; 98%), Number=EMPTY (59; 97%), Person=EMPTY (59; 97%), Poss=EMPTY (59; 97%), PronType=EMPTY (50; 82%), Definite=Def (48; 79%).
DET tokens may have the following values of Form:
Ecl (31; 51% of non-empty Form): ngach, n-a, n-uile
HPref (10; 16% of non-empty Form): haon
Len (20; 33% of non-empty Form): chuile, chaon, dh’
ADP
33 ADP tokens (0% of all ADP tokens) have a non-empty value of Form.
The most frequent other feature values with which ADP and Form co-occurred: PronType=EMPTY (28; 85%), Person=3 (22; 67%), Number=Sing (17; 52%).
ADP tokens may have the following values of Form:
Ecl (1; 3% of non-empty Form): dtí
HPref (2; 6% of non-empty Form): hair
Len (30; 91% of non-empty Form): dhá, thríd, dhíobh, dhó, dhóibh, dhe, dho, dhom, dhuit, thrí
SCONJ
7 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Form.
SCONJ tokens may have the following values of Form:
Len (1; 14% of non-empty Form): dhá
VF (6; 86% of non-empty Form): arb, murab
ADV
6 ADV tokens (0% of all ADV tokens) have a non-empty value of Form.
ADV tokens may have the following values of Form:
Len (6; 100% of non-empty Form): bheith
Relations with Agreement in Form
The 10 most frequent relations where parent and child node agree in Form:
ADJ –[conj]–> ADJ (17; 55%),
ADJ –[advcl]–> ADJ (4; 100%),
NOUN –[ccomp]–> ADJ (3; 60%),
ADJ –[obl]–> NUM (2; 100%),
PRON –[vocative]–> NOUN (2; 67%),
VERB –[csubj:cop]–> VERB (2; 67%),
ADJ –[csubj:cleft]–> NOUN (1; 100%),
ADJ –[parataxis]–> ADJ (1; 100%),
ADJ –[vocative]–> NOUN (1; 100%),
NOUN –[xcomp:pred]–> PROPN (1; 100%).
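An agreement table of this kind can be approximated by walking each dependency edge and comparing the Form value of the child with that of its parent. The sketch below is one possible reading, not the page's actual procedure: it assumes the percentage is the share of agreeing edges among edges where both nodes have a non-empty Form. The function name `agreement_by_relation`, the token dictionaries and the toy sentence are all hypothetical.

```python
from collections import Counter


def agreement_by_relation(sentence):
    """Map (parent UPOS, deprel, child UPOS) to (agreeing edges, comparable edges).

    `sentence` is a list of dicts with keys id, head, upos, deprel and form,
    where form is None when the token has no Form feature.
    """
    by_id = {tok["id"]: tok for tok in sentence}
    agree, comparable = Counter(), Counter()
    for child in sentence:
        parent = by_id.get(child["head"])  # None for the root (head 0)
        if parent is None or child["form"] is None or parent["form"] is None:
            continue
        key = (parent["upos"], child["deprel"], child["upos"])
        comparable[key] += 1
        if parent["form"] == child["form"]:
            agree[key] += 1
    return {key: (agree[key], comparable[key]) for key in comparable}


# Toy coordination of three adjectives (illustrative only).
toy = [
    {"id": 1, "head": 0, "upos": "ADJ", "deprel": "root", "form": "Len"},
    {"id": 2, "head": 3, "upos": "CCONJ", "deprel": "cc", "form": None},
    {"id": 3, "head": 1, "upos": "ADJ", "deprel": "conj", "form": "Len"},
    {"id": 4, "head": 1, "upos": "ADJ", "deprel": "conj", "form": "HPref"},
]

for (parent, rel, child), (n_agree, n_all) in agreement_by_relation(toy).items():
    print(f"{parent} -[{rel}]-> {child} ({n_agree}; {round(100 * n_agree / n_all)}%)")
```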