Treebank Statistics: UD_Irish-IDT: Features: Form
This feature is language-specific.
It occurs with 7 different values: Direct
, Ecl
, Emp
, HPref
, Indirect
, Len
, VF
.
Some words have combined values of the feature; 5 combinations have been observed: Direct|Emp
, Ecl|Emp
, Ecl|Indirect
, Ecl|VF
, Emp|Len
.
19802 tokens (17%) have a non-empty value of Form
.
4858 types (32%) occur at least once with a non-empty value of Form
.
2813 lemmas (32%) occur at least once with a non-empty value of Form
.
The feature is used with 13 part-of-speech tags: NOUN (10043; 9% instances), VERB (4511; 4% instances), PART (2333; 2% instances), PROPN (1373; 1% instances), ADJ (926; 1% instances), NUM (261; 0% instances), AUX (180; 0% instances), PRON (66; 0% instances), DET (61; 0% instances), ADP (33; 0% instances), SCONJ (7; 0% instances), ADV (6; 0% instances), X (2; 0% instances).
NOUN
10043 NOUN tokens (30% of all NOUN
tokens) have a non-empty value of Form
.
The most frequent other feature values with which NOUN
and Form
co-occurred: VerbForm=EMPTY (8138; 81%), Case=Nom (6629; 66%), Number=Sing (6534; 65%), Definite=EMPTY (5203; 52%).
NOUN
tokens may have the following values of Form
:
Ecl
(2546; 25% of non-emptyForm
): bhfeidhm, dtí, gcuid, gceist, gcás, gcomhairle, mbliana, gcónaí, ndiaidh, gcúirtEmp
(4; 0% of non-emptyForm
): Roinnse, achomharcsa, leithscéalsa, liostasaEmp,Len
(5; 0% of non-emptyForm
): thuairimse, chroíse, ghrúpa-san, mháthairseHPref
(517; 5% of non-emptyForm
): haghaidh, haois, heagraíochtaí, hathruithe, hinstitiúidí, húdaráis, healaíona, hoíche, háite, hAirteagalLen
(6971; 69% of non-emptyForm
): chur, dhéanamh, bheith, chuid, chéile, thabhairt, bhliain, chomhairle, fhorbairt, fháil
Paradigm tuairim | Ecl | Emp,Len | Len |
---|---|---|---|
Definite=Def|Number=Sing | dtuairim | thuairimse | thuairim |
Definite=Def|Number=Plur | dtuairimí | thuairimí | |
Number=Sing | thuairim | ||
Number=Plur | thuairimí |
VERB
4511 VERB tokens (51% of all VERB
tokens) have a non-empty value of Form
.
The most frequent other feature values with which VERB
and Form
co-occurred: Mood=Ind (3932; 87%), Person=EMPTY (3877; 86%).
VERB
tokens may have the following values of Form
:
Direct
(519; 12% of non-emptyForm
): atá, atáim, atáthar, ata, atáid, táDirect,Emp
(1; 0% of non-emptyForm
): atáimseEcl
(1234; 27% of non-emptyForm
): bhfuil, mbeadh, mbeidh, mbíonn, n-áirítear, mbaineann, ndéantar, dtagraítear, dtiocfadh, mbíodhEcl,Emp
(5; 0% of non-emptyForm
): bhfeicimidne, bhféadfainnse, gcaithfeadsa, gceapaimse, mbínnseEmp
(10; 0% of non-emptyForm
): deirimse, Creidimidne, Feicimse, Tabharfadsa, Táimse, adeirimse, cloisimse, déarfainnse, nílirseEmp,Len
(4; 0% of non-emptyForm
): bhainfinnse, fheadarsa, thuigeadarsan, thángas-saHPref
(7; 0% of non-emptyForm
): habair, haithneodh, hiarradh, héilítear, híocaigí, húsáideadhLen
(2731; 61% of non-emptyForm
): bhí, bheidh, thug, tháinig, chuir, bhaineann, bheadh, bhíonn, bhíodh, chuaigh
Paradigm bí | Direct | Direct,Emp | Ecl | Ecl,Emp | Emp | Len |
---|---|---|---|---|---|---|
Aspect=Hab|Mood=Ind|Number=Plur|Person=1|Tense=Pres | mbímid | bhímid | ||||
Aspect=Hab|Mood=Ind|Polarity=Neg|Tense=Pres | mbíonn | bhíonn | ||||
Aspect=Hab|Mood=Ind|PronType=Rel|Tense=Pres | bhíos | |||||
Aspect=Hab|Mood=Ind|Tense=Pres | mbíonn | bhíonn, bhíos | ||||
Aspect=Imp|Number=Sing|Person=1|Tense=Past | mbínn | mbínnse | ||||
Aspect=Imp|Number=Plur|Person=1|Tense=Past | mbímis | |||||
Aspect=Imp|Number=Plur|Person=3|Polarity=Neg|Tense=Past | bhídís | |||||
Aspect=Imp|Number=Plur|Person=3|Tense=Past | bhídís | |||||
Aspect=Imp|Polarity=Neg|Tense=Past | mbíodh | bhíodh | ||||
Aspect=Imp|Tense=Past | mbíodh | bhíodh | ||||
Dialect=Munster|Mood=Ind|Number=Plur|Person=3|Tense=Pres | bhfuilid | |||||
Mood=Cnd | mbeadh | bheadh | ||||
Mood=Cnd,Int | mbeadh | |||||
Mood=Cnd|Number=Sing|Person=1 | mbeinn | Bheinn | ||||
Mood=Cnd|Number=Sing|Person=2 | bheifeá, bheitheá | |||||
Mood=Cnd|Number=Sing|Person=2|Polarity=Neg | mbeifeá | |||||
Mood=Cnd|Number=Plur|Person=1 | mbeimis | bheimís | ||||
Mood=Cnd|Number=Plur|Person=3 | mbeidís | bheidís | ||||
Mood=Cnd|Number=Plur|Person=3|Polarity=Neg | mbeidís | |||||
Mood=Cnd|Person=0 | bheifí | |||||
Mood=Cnd|Polarity=Neg | mbeadh | bheadh | ||||
Mood=Ind|Number=Sing|Person=1|PronType=Rel|Tense=Pres | atáim | atáimse | ||||
Mood=Ind|Number=Sing|Person=1|Tense=Past | bhíos | |||||
Mood=Ind|Number=Sing|Person=1|Tense=Pres | Táimse | |||||
Mood=Ind|Number=Sing|Person=2|Polarity=Neg|Tense=Pres | nílirse | |||||
Mood=Ind|Number=Plur|Person=1|Polarity=Neg|Tense=Pres | bhfuilimíd | |||||
Mood=Ind|Number=Plur|Person=1|Tense=Fut | mbeimid | bheimid | ||||
Mood=Ind|Number=Plur|Person=1|Tense=Past | Bhíomar | |||||
Mood=Ind|Number=Plur|Person=1|Tense=Pres | bhfuilimid | |||||
Mood=Ind|Number=Plur|Person=3|PronType=Rel|Tense=Pres | atáid | |||||
Mood=Ind|Number=Plur|Person=3|Tense=Past | bhíodar | |||||
Mood=Ind|Number=Plur|Person=3|Tense=Pres | bhfuilid | |||||
Mood=Ind|Person=0|Polarity=Neg|Tense=Fut | mbeifear | |||||
Mood=Ind|Person=0|PronType=Rel|Tense=Pres | atáthar | |||||
Mood=Ind|Person=0|Tense=Past | bhíothas | |||||
Mood=Ind|Person=0|Tense=Pres | bhfuiltear | |||||
Mood=Ind|Polarity=Neg|Tense=Fut | mbeidh | bheidh | ||||
Mood=Ind|Polarity=Neg|Tense=Pres | bhfuil | |||||
Mood=Ind|PronType=Rel|Tense=Fut | bheas, bhéas | |||||
Mood=Ind|PronType=Rel|Tense=Pres | atá, tá | |||||
Mood=Ind|PronType=Rel|Tense=Pres|Typo=Yes | ata | |||||
Mood=Ind|Tense=Fut | mbeidh | bheidh, bhéidh | ||||
Mood=Ind|Tense=Past | bhí | |||||
Mood=Ind|Tense=Pres | bhfuil |
PART
2333 PART tokens (33% of all PART
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PART
and Form
co-occurred: PronType=Rel (2314; 99%), PartType=Vb (2160; 93%).
PART
tokens may have the following values of Form
:
Direct
(1839; 79% of non-emptyForm
): a, nach, nár, do, náEcl,Indirect
(2; 0% of non-emptyForm
): n-aIndirect
(473; 20% of non-emptyForm
): a, ina, lena, ar, nach, dá, inar, faoina, DA, goLen
(11; 0% of non-emptyForm
): Mhic, MhacVF
(8; 0% of non-emptyForm
): ab, b’
Paradigm a | Direct | Ecl,Indirect | Indirect |
---|---|---|---|
a, do | n-a | a, go |
PROPN
1373 PROPN tokens (24% of all PROPN
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PROPN
and Form
co-occurred: Definite=Def (1370; 100%), Number=Sing (1304; 95%), Gender=Masc (716; 52%).
PROPN
tokens may have the following values of Form
:
Ecl
(223; 16% of non-emptyForm
): mBaile, nGall, gCoimisiún, nGaeilge, gConamara, gClár, nDún, nGaillimh, bhFrainc, gCeathrúHPref
(179; 13% of non-emptyForm
): hÉireann, hEorpa, hÉirinn, hEaglaise, hAlban, hAoine, h-Íde, hOstaire, hAlbain, hAthbheochanaLen
(971; 71% of non-emptyForm
): Bhaile, Ghaeltacht, Ghaeilge, Ghaeltachta, Chathair, Mháire, Chiarraí, Dhún, Shráid, Choiste
Paradigm Gaeltacht | Ecl | Len |
---|---|---|
Case=Gen|NounType=Strong|Number=Plur | nGaeltachtaí | Ghaeltachtaí |
Case=Gen|Number=Sing | Ghaeltachta | |
Case=Nom|Number=Sing | nGaeltacht | Ghaeltacht |
Number=Sing | Ghaeltacht |
ADJ
926 ADJ tokens (14% of all ADJ
tokens) have a non-empty value of Form
.
The most frequent other feature values with which ADJ
and Form
co-occurred: VerbForm=EMPTY (920; 99%), NounType=EMPTY (833; 90%), Degree=EMPTY (612; 66%), Case=Nom (539; 58%), Number=Sing (509; 55%).
ADJ
tokens may have the following values of Form
:
Ecl
(3; 0% of non-emptyForm
): bhfurast, dtréan, gcéannaHPref
(118; 13% of non-emptyForm
): háirithe, hiomlán, hamháin, hidirnáisiúnta, hiondúil, huathoibríoch, hálainn, héifeachtach, hiontach, han-mhaithLen
(805; 87% of non-emptyForm
): mhór, mhaith, chóir, cheart, phoiblí, chéanna, bheag, chultúrtha, fhearr, shóisialta
Paradigm céanna | Ecl | Len |
---|---|---|
Gender=Masc|NounType=Slender|Number=Plur | chéanna | |
Gender=Masc|Number=Sing | gcéanna | chéanna |
Gender=Fem|Number=Sing | chéanna |
Form
seems to be lexical feature of ADJ
. 99% lemmas (293) occur only with one value of Form
.
NUM
261 NUM tokens (13% of all NUM
tokens) have a non-empty value of Form
.
The most frequent other feature values with which NUM
and Form
co-occurred: NumType=Card (152; 58%).
NUM
tokens may have the following values of Form
:
Ecl
(31; 12% of non-emptyForm
): gcéad, dtríú, gceithre, gcúig, gcúigiú, n-aonHPref
(10; 4% of non-emptyForm
): haon, hochtLen
(220; 84% of non-emptyForm
): dhá, chéad, cheithre, dhó, thrí, cheathrú, dheich, mhíle, chúig, dhara
Paradigm céad | Ecl | Len |
---|---|---|
NumType=Card | gcéad | chéad |
NumType=Ord | gcéad | chéad |
AUX
180 AUX tokens (12% of all AUX
tokens) have a non-empty value of Form
.
The most frequent other feature values with which AUX
and Form
co-occurred: VerbForm=Cop (180; 100%), Polarity=EMPTY (162; 90%), Tense=Past (145; 81%).
AUX
tokens may have the following values of Form
:
Ecl
(12; 7% of non-emptyForm
): mbaEcl,VF
(5; 3% of non-emptyForm
): mb’Len
(2; 1% of non-emptyForm
): chanVF
(161; 89% of non-emptyForm
): b’, gurb, gurbh, níorbh, ab, arbh, b’, nárbh, arb, darbh
Paradigm is | Ecl | Ecl,VF | Len | VF |
---|---|---|---|---|
Dialect=Ulster|Polarity=Neg|Tense=Pres | chan | |||
Mood=Cnd | mba | B' | ||
Mood=Int|Tense=Past | arbh | |||
Polarity=Neg|PronType=Rel|Tense=Past | nárbh | |||
Polarity=Neg|Tense=Past | níorbh, nárbh | |||
Polarity=Neg|Tense=Pres | Chan | |||
PronType=Rel|Tense=Past | ab | |||
Tense=Past | mba | mb' | b', gurbh, b’, arb, darbh | |
Tense=Pres | gurb, darb |
PRON
66 PRON tokens (2% of all PRON
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PRON
and Form
co-occurred: Gender=EMPTY (54; 82%), Number=EMPTY (43; 65%), Person=EMPTY (43; 65%), PronType=Dem (34; 52%).
PRON
tokens may have the following values of Form
:
HPref
(16; 24% of non-emptyForm
): hé, hiad, híLen
(49; 74% of non-emptyForm
): shin, fhéin, thú, cheachtar, shoin, thusaVF
(1; 2% of non-emptyForm
): cérbh
DET
61 DET tokens (1% of all DET
tokens) have a non-empty value of Form
.
The most frequent other feature values with which DET
and Form
co-occurred: Case=EMPTY (61; 100%), Gender=EMPTY (60; 98%), Number=EMPTY (59; 97%), Person=EMPTY (59; 97%), Poss=EMPTY (59; 97%), PronType=EMPTY (50; 82%), Definite=Def (48; 79%).
DET
tokens may have the following values of Form
:
Ecl
(31; 51% of non-emptyForm
): ngach, n-a, n-uileHPref
(10; 16% of non-emptyForm
): haonLen
(20; 33% of non-emptyForm
): chuile, chaon, dh’
ADP
33 ADP tokens (0% of all ADP
tokens) have a non-empty value of Form
.
The most frequent other feature values with which ADP
and Form
co-occurred: PronType=EMPTY (28; 85%), Person=3 (22; 67%), Number=Sing (17; 52%).
ADP
tokens may have the following values of Form
:
Ecl
(1; 3% of non-emptyForm
): dtíHPref
(2; 6% of non-emptyForm
): hairLen
(30; 91% of non-emptyForm
): dhá, thríd, dhíobh, dhó, dhóibh, dhe, dho, dhom, dhuit, thrí
SCONJ
7 SCONJ tokens (0% of all SCONJ
tokens) have a non-empty value of Form
.
SCONJ
tokens may have the following values of Form
:
Len
(1; 14% of non-emptyForm
): dháVF
(6; 86% of non-emptyForm
): arb, murab
ADV
6 ADV tokens (0% of all ADV
tokens) have a non-empty value of Form
.
ADV
tokens may have the following values of Form
:
Len
(6; 100% of non-emptyForm
): bheith
X
2 X tokens (1% of all X
tokens) have a non-empty value of Form
.
The most frequent other feature values with which X
and Form
co-occurred: Foreign=Yes (2; 100%).
X
tokens may have the following values of Form
:
Ecl
(1; 50% of non-emptyForm
): nAllHPref
(1; 50% of non-emptyForm
): hamazon
Relations with Agreement in Form
The 10 most frequent relations where parent and child node agree in Form
:
ADJ –[conj]–> ADJ (17; 55%),
ADJ –[advcl]–> ADJ (4; 100%),
NOUN –[ccomp]–> ADJ (3; 60%),
ADJ –[obl]–> NUM (2; 100%),
PRON –[vocative]–> NOUN (2; 67%),
VERB –[csubj:cop]–> VERB (2; 67%),
ADJ –[csubj:cleft]–> NOUN (1; 100%),
ADJ –[parataxis]–> ADJ (1; 100%),
ADJ –[vocative]–> NOUN (1; 100%),
NOUN –[xcomp:pred]–> PROPN (1; 100%).