Treebank Statistics: UD_Irish-Cadhan: Features: Form
This feature is language-specific.
It occurs with 7 different values: Direct
, Ecl
, Emp
, HPref
, Indirect
, Len
, VF
.
Some words have combined values of the feature; 3 combinations have been observed: Direct|Len
, Ecl|Emp
, Emp|Len
.
854 tokens (18%) have a non-empty value of Form
.
567 types (31%) occur at least once with a non-empty value of Form
.
364 lemmas (33%) occur at least once with a non-empty value of Form
.
The feature is used with 11 part-of-speech tags: NOUN (378; 8% instances), VERB (241; 5% instances), PART (91; 2% instances), PROPN (38; 1% instances), ADJ (31; 1% instances), ADP (22; 0% instances), NUM (21; 0% instances), AUX (18; 0% instances), PRON (12; 0% instances), DET (1; 0% instances), SCONJ (1; 0% instances).
NOUN
378 NOUN tokens (33% of all NOUN
tokens) have a non-empty value of Form
.
The most frequent other feature values with which NOUN
and Form
co-occurred: VerbForm=EMPTY (329; 87%), Number=Sing (275; 73%), Definite=Def (242; 64%), Case=Nom (240; 63%), Gender=Masc (214; 57%).
NOUN
tokens may have the following values of Form
:
Ecl
(117; 31% of non-emptyForm
): bhfearann, bhflaitheas, ndeireadh, bhfear, ccoir, dtaobh, gcomhnaidhe, gcédna, gcéill, n-áitEcl,Emp
(3; 1% of non-emptyForm
): Ndíasa, mbreithirsean, natharsaEmp
(4; 1% of non-emptyForm
): ainmsean, monairc-si, sonsan, tsáoghailsiEmp,Len
(1; 0% of non-emptyForm
): chroidhe-seHPref
(21; 6% of non-emptyForm
): hoidhche, Hiudaighe, Híudaidhe, h-ala, h-anfa, h-éin, haicmeadha, haimsir, haimsire, haithrigheLen
(232; 61% of non-emptyForm
): bheith, fhios, chur, dhéanamh, shaoghail, thoil, thabhairt, thighearna, thús, bhocsa
Paradigm duine | Ecl | Len |
---|---|---|
Case=Dat|Number=Plur | dhaoinibh | |
Case=Gen|NounType=Strong|Number=Plur | ndáoine | |
Case=Nom|Number=Sing | nduine | dhuine |
VERB
241 VERB tokens (56% of all VERB
tokens) have a non-empty value of Form
.
The most frequent other feature values with which VERB
and Form
co-occurred: Polarity=EMPTY (219; 91%), Mood=Ind (197; 82%), Number=EMPTY (184; 76%), Person=EMPTY (172; 71%), Tense=Past (159; 66%).
VERB
tokens may have the following values of Form
:
Direct
(8; 3% of non-emptyForm
): atá, tá, a-ta, átaEcl
(56; 23% of non-emptyForm
): bhfuil, mbeadh, ngairthear, ngoirthear, ttugadh, ttugais, bhfacadar, bhfhuilim, bhfuair, bhfuairsiodEmp,Len
(1; 0% of non-emptyForm
): ghlacadarsanHPref
(1; 0% of non-emptyForm
): háitigheadhLen
(175; 73% of non-emptyForm
): bhí, thug, bheadh, bhíodh, chuir, Dhearc, bhi, chualas, fhuil, fhág
Paradigm bí | Direct | Ecl | Len |
---|---|---|---|
Aspect=Hab|Mood=Ind|Number=Plur|Person=3|Tense=Pres | bhíd | ||
Aspect=Hab|Mood=Ind|Polarity=Neg|Tense=Pres | bhíonn | ||
Aspect=Imp|Number=Plur|Person=3|Tense=Past | bhídís | ||
Aspect=Imp|Tense=Past | mbiodh | bhíodh | |
Mood=Cnd | mbeadh, mbeath | bheadh | |
Mood=Cnd|Number=Sing|Person=1 | mbéinn | ||
Mood=Cnd|Number=Plur|Person=3 | mbéidís | ||
Mood=Cnd|Polarity=Neg | bheadh, bhiadh | ||
Mood=Ind|Number=Sing|Person=1|Tense=Past | Bhíos | ||
Mood=Ind|Number=Plur|Person=1|Tense=Past | Bhamar | ||
Mood=Ind|Polarity=Neg|Tense=Pres | fhuil | ||
Mood=Ind|PronType=Rel|Tense=Fut | bhias | ||
Mood=Ind|PronType=Rel|Tense=Pres | atá, tá, a-ta, áta | ||
Mood=Ind|Tense=Fut | bheidh, bhéas | ||
Mood=Ind|Tense=Past | bhí, bhi | ||
Mood=Ind|Tense=Pres | bhfuil, bhfhuilim | ||
Mood=Sub|Tense=Pres | mbeith |
PART
91 PART tokens (29% of all PART
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PART
and Form
co-occurred: Polarity=EMPTY (90; 99%), PronType=Rel (88; 97%), PartType=Vb (81; 89%).
PART
tokens may have the following values of Form
:
Direct
(62; 68% of non-emptyForm
): a, do, noch, roDirect,Len
(3; 3% of non-emptyForm
): dhoIndirect
(23; 25% of non-emptyForm
): a, d’á, da, dá, ‘nar, ar, dar, fa’r, far, lér’Len
(2; 2% of non-emptyForm
): dhoVF
(1; 1% of non-emptyForm
): dob
Paradigm a | Direct | Direct,Len | Indirect | Len |
---|---|---|---|---|
PartType=Inf | dho | |||
PartType=Vb|PronType=Rel | a, do, noch, ro | dho | a |
PROPN
38 PROPN tokens (17% of all PROPN
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PROPN
and Form
co-occurred: Definite=Def (38; 100%), Foreign=EMPTY (36; 95%), Number=Sing (34; 89%), Gender=Masc (26; 68%).
PROPN
tokens may have the following values of Form
:
Ecl
(10; 26% of non-emptyForm
): n-Éirinn, bhFailghe, bhFréamhainn, gConnachtaibh, mBaile, n-Áird, nAodh, nAssardha, neabhraHPref
(2; 5% of non-emptyForm
): hAodh, hÉireannLen
(26; 68% of non-emptyForm
): Bheannchair, Bhuck, Dhia, Dhía, Chairbre, Chesar, Chill, Chomhghaill, Chomhghall, Chriosd
Paradigm Éire | Ecl | HPref |
---|---|---|
Case=Dat | n-Éirinn | |
Case=Gen | hÉireann |
Form
seems to be lexical feature of PROPN
. 93% lemmas (25) occur only with one value of Form
.
ADJ
31 ADJ tokens (15% of all ADJ
tokens) have a non-empty value of Form
.
The most frequent other feature values with which ADJ
and Form
co-occurred: Degree=EMPTY (21; 68%), Number=Sing (21; 68%), Case=Nom (16; 52%).
ADJ
tokens may have the following values of Form
:
HPref
(3; 10% of non-emptyForm
): haereach, haireach, holcLen
(28; 90% of non-emptyForm
): mhór, ghloin, mhaith, shuthoin, bheg, bhig, bhuidhe, bhán, chatharmaigh, cheart
Form
seems to be lexical feature of ADJ
. 100% lemmas (23) occur only with one value of Form
.
ADP
22 ADP tokens (3% of all ADP
tokens) have a non-empty value of Form
.
The most frequent other feature values with which ADP
and Form
co-occurred: Number=Sing (15; 68%), Gender=EMPTY (14; 64%).
ADP
tokens may have the following values of Form
:
Len
(22; 100% of non-emptyForm
): dhe, dho, dhochum, dhom, dhá, dhíom, dhó, dhamh, dhamhsa, dhi
NUM
21 NUM tokens (35% of all NUM
tokens) have a non-empty value of Form
.
The most frequent other feature values with which NUM
and Form
co-occurred: NumType=Card (18; 86%).
NUM
tokens may have the following values of Form
:
Ecl
(3; 14% of non-emptyForm
): naon, náon, ttríLen
(18; 86% of non-emptyForm
): dhá, chéad, mhíle, thrí, cheithre, chúig, dhó, fhichid, sheachtmhoghad, tri
Paradigm trí | Ecl | Len |
---|---|---|
ttrí | thrí, tri |
AUX
18 AUX tokens (22% of all AUX
tokens) have a non-empty value of Form
.
The most frequent other feature values with which AUX
and Form
co-occurred: Polarity=EMPTY (18; 100%), VerbForm=Cop (18; 100%), PronType=EMPTY (17; 94%).
AUX
tokens may have the following values of Form
:
Ecl
(1; 6% of non-emptyForm
): mbaVF
(17; 94% of non-emptyForm
): gurab, darab, dob, dárab
Paradigm is | Ecl | VF |
---|---|---|
PronType=Rel|Tense=Past | dob | |
Tense=Past | mba | gurab, dárab |
Tense=Pres | gurab, darab |
PRON
12 PRON tokens (6% of all PRON
tokens) have a non-empty value of Form
.
The most frequent other feature values with which PRON
and Form
co-occurred: PronType=EMPTY (11; 92%), Number=Sing (9; 75%), Person=3 (8; 67%), Gender=Masc (7; 58%).
PRON
tokens may have the following values of Form
:
HPref
(8; 67% of non-emptyForm
): hé, hiádLen
(4; 33% of non-emptyForm
): fhéin, mhé, shoin, thú
DET
1 DET tokens (0% of all DET
tokens) have a non-empty value of Form
.
The most frequent other feature values with which DET
and Form
co-occurred: Case=EMPTY (1; 100%), Definite=Def (1; 100%), Gender=EMPTY (1; 100%), Number=EMPTY (1; 100%), Person=EMPTY (1; 100%), Poss=EMPTY (1; 100%), PronType=EMPTY (1; 100%).
DET
tokens may have the following values of Form
:
Ecl
(1; 100% of non-emptyForm
): gach
SCONJ
1 SCONJ tokens (1% of all SCONJ
tokens) have a non-empty value of Form
.
SCONJ
tokens may have the following values of Form
:
Len
(1; 100% of non-emptyForm
): dhá
Relations with Agreement in Form
The 10 most frequent relations where parent and child node agree in Form
:
VERB –[conj]–> VERB (21; 55%),
NOUN –[vocative]–> NOUN (2; 67%),
NUM –[nmod]–> NUM (2; 100%),
NUM –[nummod]–> NUM (2; 67%),
PART –[fixed]–> PART (2; 67%),
VERB –[xcomp:pred]–> NUM (2; 100%),
NOUN –[advcl]–> ADJ (1; 100%),
NOUN –[ccomp]–> NOUN (1; 100%),
NOUN –[xcomp:pred]–> PROPN (1; 100%),
NUM –[conj]–> NUM (1; 100%).