Treebank Statistics: UD_Irish-Cadhan: Features: Form
This feature is language-specific.
It occurs with 7 different values: Direct, Ecl, Emp, HPref, Indirect, Len, VF.
Some words have combined values of the feature; 3 combinations have been observed: Direct|Len, Ecl|Emp, Emp|Len.
854 tokens (18%) have a non-empty value of Form.
567 types (31%) occur at least once with a non-empty value of Form.
364 lemmas (33%) occur at least once with a non-empty value of Form.
The feature is used with 11 part-of-speech tags: NOUN (378; 8% instances), VERB (241; 5% instances), PART (91; 2% instances), PROPN (38; 1% instances), ADJ (31; 1% instances), ADP (22; 0% instances), NUM (21; 0% instances), AUX (18; 0% instances), PRON (12; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances).
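Counts like the ones above can be approximated directly from a CoNLL-U file. The following is a minimal sketch, assuming that `Form` is stored in the FEATS column (column 6) and that multiword-token ranges and empty nodes are excluded from the token count. The function names and the two-token sample are hypothetical and are not taken from the treebank or from the script that generated this page.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a CoNLL-U FEATS string such as 'Case=Nom|Form=Len' into a dict."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


def form_counts_by_upos(conllu_text):
    """Count, per UPOS tag, all tokens and the tokens with a non-empty Form."""
    total, with_form = Counter(), Counter()
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # blank separator or sentence-level comment
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        upos = cols[3]
        total[upos] += 1
        if "Form" in parse_feats(cols[5]):
            with_form[upos] += 1
    return total, with_form


# Hypothetical sample; the annotations are illustrative only.
SAMPLE = "\n".join([
    "\t".join(["1", "bhí", "bí", "VERB", "_",
               "Form=Len|Mood=Ind|Tense=Past", "0", "root", "_", "_"]),
    "\t".join(["2", "fear", "fear", "NOUN", "_",
               "Case=Nom|Number=Sing", "1", "nsubj", "_", "_"]),
])

total, with_form = form_counts_by_upos(SAMPLE)
for upos in total:
    share = 100 * with_form[upos] / total[upos]
    print(f"{upos}: {with_form[upos]} of {total[upos]} tokens ({share:.0f}%)")
```

Combined values such as `Form=Ecl,Emp` are kept as a single string here; splitting them on commas would be needed to count each value separately.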
NOUN
378 NOUN tokens (33% of all NOUN tokens) have a non-empty value of Form.
The most frequent other feature values with which NOUN and Form co-occurred: VerbForm=EMPTY (329; 87%), Number=Sing (275; 73%), Definite=Def (242; 64%), Case=Nom (240; 63%), Gender=Masc (214; 57%).
NOUN tokens may have the following values of Form:
Ecl (117; 31% of non-empty Form): bhfearann, bhflaitheas, ndeireadh, bhfear, ccoir, dtaobh, gcomhnaidhe, gcédna, gcéill, n-áit
Ecl,Emp (3; 1% of non-empty Form): Ndíasa, mbreithirsean, natharsa
Emp (4; 1% of non-empty Form): ainmsean, monairc-si, sonsan, tsáoghailsi
Emp,Len (1; 0% of non-empty Form): chroidhe-se
HPref (21; 6% of non-empty Form): hoidhche, Hiudaighe, Híudaidhe, h-ala, h-anfa, h-éin, haicmeadha, haimsir, haimsire, haithrighe
Len (232; 61% of non-empty Form): bheith, fhios, chur, dhéanamh, shaoghail, thoil, thabhairt, thighearna, thús, bhocsa
| Paradigm duine | Ecl | Len |
|---|---|---|
| Case=Dat\|Number=Plur | | dhaoinibh |
| Case=Gen\|NounType=Strong\|Number=Plur | ndáoine | |
| Case=Nom\|Number=Sing | nduine | dhuine |
VERB
241 VERB tokens (56% of all VERB tokens) have a non-empty value of Form.
The most frequent other feature values with which VERB and Form co-occurred: Polarity=EMPTY (219; 91%), Mood=Ind (197; 82%), Number=EMPTY (184; 76%), Person=EMPTY (172; 71%), Tense=Past (159; 66%).
VERB tokens may have the following values of Form:
Direct (8; 3% of non-empty Form): atá, tá, a-ta, áta
Ecl (56; 23% of non-empty Form): bhfuil, mbeadh, ngairthear, ngoirthear, ttugadh, ttugais, bhfacadar, bhfhuilim, bhfuair, bhfuairsiod
Emp,Len (1; 0% of non-empty Form): ghlacadarsan
HPref (1; 0% of non-empty Form): háitigheadh
Len (175; 73% of non-empty Form): bhí, thug, bheadh, bhíodh, chuir, Dhearc, bhi, chualas, fhuil, fhág
| Paradigm bí | Direct | Ecl | Len |
|---|---|---|---|
| Aspect=Hab\|Mood=Ind\|Number=Plur\|Person=3\|Tense=Pres | | | bhíd |
| Aspect=Hab\|Mood=Ind\|Polarity=Neg\|Tense=Pres | | | bhíonn |
| Aspect=Imp\|Number=Plur\|Person=3\|Tense=Past | | | bhídís |
| Aspect=Imp\|Tense=Past | | mbiodh | bhíodh |
| Mood=Cnd | | mbeadh, mbeath | bheadh |
| Mood=Cnd\|Number=Sing\|Person=1 | | mbéinn | |
| Mood=Cnd\|Number=Plur\|Person=3 | | mbéidís | |
| Mood=Cnd\|Polarity=Neg | | | bheadh, bhiadh |
| Mood=Ind\|Number=Sing\|Person=1\|Tense=Past | | | Bhíos |
| Mood=Ind\|Number=Plur\|Person=1\|Tense=Past | | | Bhamar |
| Mood=Ind\|Polarity=Neg\|Tense=Pres | | | fhuil |
| Mood=Ind\|PronType=Rel\|Tense=Fut | | | bhias |
| Mood=Ind\|PronType=Rel\|Tense=Pres | atá, tá, a-ta, áta | | |
| Mood=Ind\|Tense=Fut | | | bheidh, bhéas |
| Mood=Ind\|Tense=Past | | | bhí, bhi |
| Mood=Ind\|Tense=Pres | | bhfuil, bhfhuilim | |
| Mood=Sub\|Tense=Pres | | mbeith | |
PART
91 PART tokens (29% of all PART tokens) have a non-empty value of Form.
The most frequent other feature values with which PART and Form co-occurred: Polarity=EMPTY (90; 99%), PronType=Rel (88; 97%), PartType=Vb (81; 89%).
PART tokens may have the following values of Form:
Direct (62; 68% of non-empty Form): a, do, noch, ro
Direct,Len (3; 3% of non-empty Form): dho
Indirect (23; 25% of non-empty Form): a, d’á, da, dá, ‘nar, ar, dar, fa’r, far, lér’
Len (2; 2% of non-empty Form): dho
VF (1; 1% of non-empty Form): dob
| Paradigm a | Direct | Direct,Len | Indirect | Len |
|---|---|---|---|---|
| PartType=Inf | | | | dho |
| PartType=Vb\|PronType=Rel | a, do, noch, ro | dho | a | |
PROPN
38 PROPN tokens (17% of all PROPN tokens) have a non-empty value of Form.
The most frequent other feature values with which PROPN and Form co-occurred: Definite=Def (38; 100%), Foreign=EMPTY (36; 95%), Number=Sing (34; 89%), Gender=Masc (26; 68%).
PROPN tokens may have the following values of Form:
Ecl (10; 26% of non-empty Form): n-Éirinn, bhFailghe, bhFréamhainn, gConnachtaibh, mBaile, n-Áird, nAodh, nAssardha, neabhra
HPref (2; 5% of non-empty Form): hAodh, hÉireann
Len (26; 68% of non-empty Form): Bheannchair, Bhuck, Dhia, Dhía, Chairbre, Chesar, Chill, Chomhghaill, Chomhghall, Chriosd
| Paradigm Éire | Ecl | HPref |
|---|---|---|
| Case=Dat | n-Éirinn | |
| Case=Gen | | hÉireann |
Form seems to be a lexical feature of PROPN: 93% of lemmas (25) occur with only one value of Form.
ADJ
31 ADJ tokens (15% of all ADJ tokens) have a non-empty value of Form.
The most frequent other feature values with which ADJ and Form co-occurred: Degree=EMPTY (21; 68%), Number=Sing (21; 68%), Case=Nom (16; 52%).
ADJ tokens may have the following values of Form:
HPref (3; 10% of non-empty Form): haereach, haireach, holc
Len (28; 90% of non-empty Form): mhór, ghloin, mhaith, shuthoin, bheg, bhig, bhuidhe, bhán, chatharmaigh, cheart
Form seems to be a lexical feature of ADJ: 100% of lemmas (23) occur with only one value of Form.
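The "lexical feature" observation for PROPN and ADJ amounts to counting lemmas that appear with exactly one value of Form. The sketch below shows one way to compute such a share. It assumes that only tokens with a non-empty Form are considered, which may differ from how the page's figures were actually produced. The function name and the sample pairs are hypothetical.

```python
from collections import defaultdict


def single_value_share(pairs):
    """Summarise how many lemmas occur with exactly one Form value.

    pairs: iterable of (lemma, form_value) for tokens of one UPOS tag
    that carry a non-empty Form (an assumption of this sketch).
    Returns (lemmas_with_one_value, all_lemmas, share).
    """
    values = defaultdict(set)
    for lemma, value in pairs:
        values[lemma].add(value)
    single = sum(1 for seen in values.values() if len(seen) == 1)
    n_lemmas = len(values)
    share = single / n_lemmas if n_lemmas else 0.0
    return single, n_lemmas, share


# Hypothetical (lemma, Form) pairs; illustrative only.
pairs = [
    ("mór", "Len"),
    ("mór", "Len"),
    ("olc", "HPref"),
    ("trí", "Ecl"),
    ("trí", "Len"),
]
single, n_lemmas, share = single_value_share(pairs)
print(f"{single} of {n_lemmas} lemmas ({share:.0%}) occur with only one value")
```

A high share suggests that the value is tied to the lemma rather than to the syntactic context, although with small counts per lemma this is weak evidence.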
ADP
22 ADP tokens (3% of all ADP tokens) have a non-empty value of Form.
The most frequent other feature values with which ADP and Form co-occurred: Number=Sing (15; 68%), Gender=EMPTY (14; 64%).
ADP tokens may have the following values of Form:
Len (22; 100% of non-empty Form): dhe, dho, dhochum, dhom, dhá, dhíom, dhó, dhamh, dhamhsa, dhi
NUM
21 NUM tokens (35% of all NUM tokens) have a non-empty value of Form.
The most frequent other feature values with which NUM and Form co-occurred: NumType=Card (18; 86%).
NUM tokens may have the following values of Form:
Ecl (3; 14% of non-empty Form): naon, náon, ttrí
Len (18; 86% of non-empty Form): dhá, chéad, mhíle, thrí, cheithre, chúig, dhó, fhichid, sheachtmhoghad, tri
| Paradigm trí | Ecl | Len |
|---|---|---|
| | ttrí | thrí, tri |
AUX
18 AUX tokens (22% of all AUX tokens) have a non-empty value of Form.
The most frequent other feature values with which AUX and Form co-occurred: Polarity=EMPTY (18; 100%), VerbForm=Cop (18; 100%), PronType=EMPTY (17; 94%).
AUX tokens may have the following values of Form:
Ecl (1; 6% of non-empty Form): mba
VF (17; 94% of non-empty Form): gurab, darab, dob, dárab
| Paradigm is | Ecl | VF |
|---|---|---|
| PronType=Rel\|Tense=Past | | dob |
| Tense=Past | mba | gurab, dárab |
| Tense=Pres | | gurab, darab |
PRON
12 PRON tokens (6% of all PRON tokens) have a non-empty value of Form.
The most frequent other feature values with which PRON and Form co-occurred: PronType=EMPTY (11; 92%), Number=Sing (9; 75%), Person=3 (8; 67%), Gender=Masc (7; 58%).
PRON tokens may have the following values of Form:
HPref (8; 67% of non-empty Form): hé, hiád
Len (4; 33% of non-empty Form): fhéin, mhé, shoin, thú
DET
1 DET token (0% of all DET tokens) has a non-empty value of Form.
The most frequent other feature values with which DET and Form co-occurred: Case=EMPTY (1; 100%), Definite=Def (1; 100%), Gender=EMPTY (1; 100%), Number=EMPTY (1; 100%), Person=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=EMPTY (1; 100%).
DET tokens may have the following values of Form:
Ecl (1; 100% of non-empty Form): gach
SCONJ
1 SCONJ token (1% of all SCONJ tokens) has a non-empty value of Form.
SCONJ tokens may have the following values of Form:
Len (1; 100% of non-empty Form): dhá
Relations with Agreement in Form
The 10 most frequent relations where parent and child node agree in Form:
VERB –[conj]–> VERB (21; 55%),
NOUN –[vocative]–> NOUN (2; 67%),
NUM –[nmod]–> NUM (2; 100%),
NUM –[nummod]–> NUM (2; 67%),
PART –[fixed]–> PART (2; 67%),
VERB –[xcomp:pred]–> NUM (2; 100%),
NOUN –[advcl]–> ADJ (1; 100%),
NOUN –[ccomp]–> NOUN (1; 100%),
NOUN –[xcomp:pred]–> PROPN (1; 100%),
NUM –[conj]–> NUM (1; 100%).
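Agreement counts of this kind can be sketched by comparing the Form value of each node with that of its parent. The example below assumes that a relation is counted only when both parent and child carry a non-empty Form, and that the percentage is the share of such relations in which the two values are equal. The actual denominator used for the figures above is not stated on this page, so this is an assumption, and the function names and the sample sentence are hypothetical.

```python
from collections import Counter


def form_of(feats):
    """Return the Form value from a CoNLL-U FEATS string, or None if absent."""
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        if name == "Form":
            return value
    return None


def agreement_counts(sentence):
    """Count Form agreement per (parent UPOS, deprel, child UPOS).

    sentence: the tab-separated CoNLL-U token lines of one sentence.
    Returns (agree, both): relations with equal Form values, and all
    relations in which both nodes have a non-empty Form.
    """
    rows = {}
    for line in sentence.splitlines():
        cols = line.split("\t")
        if len(cols) == 10 and cols[0].isdigit():
            rows[cols[0]] = cols
    agree, both = Counter(), Counter()
    for cols in rows.values():
        parent = rows.get(cols[6])  # None for the root, whose HEAD is 0
        if parent is None:
            continue
        child_form, parent_form = form_of(cols[5]), form_of(parent[5])
        if child_form is None or parent_form is None:
            continue
        key = (parent[3], cols[7], cols[3])
        both[key] += 1
        if child_form == parent_form:
            agree[key] += 1
    return agree, both


# Hypothetical three-token sentence; the annotations are illustrative only.
SENTENCE = "\n".join([
    "\t".join(["1", "bhí", "bí", "VERB", "_", "Form=Len", "0", "root", "_", "_"]),
    "\t".join(["2", "thug", "tabhair", "VERB", "_", "Form=Len", "1", "conj", "_", "_"]),
    "\t".join(["3", "bhfuil", "bí", "VERB", "_", "Form=Ecl", "1", "conj", "_", "_"]),
])

agree, both = agreement_counts(SENTENCE)
for key, n in both.items():
    parent_upos, deprel, child_upos = key
    print(f"{parent_upos} -[{deprel}]-> {child_upos} ({agree[key]}; {100 * agree[key] / n:.0f}%)")
```

For a whole treebank, the same function would be applied sentence by sentence and the counters summed.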