home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-IDT: Features: Form

This feature is language-specific. It occurs with 7 different values: Direct, Ecl, Emp, HPref, Indirect, Len, VF. Some words have combined values of the feature; 5 combinations have been observed: Direct|Emp, Ecl|Emp, Ecl|Indirect, Ecl|VF, Emp|Len.

19802 tokens (17%) have a non-empty value of Form. 4857 types (32%) occur at least once with a non-empty value of Form. 2811 lemmas (32%) occur at least once with a non-empty value of Form. The feature is used with 12 part-of-speech tags: NOUN (10099; 9% instances), VERB (4511; 4% instances), PART (2334; 2% instances), PROPN (1318; 1% instances), ADJ (926; 1% instances), NUM (261; 0% instances), AUX (180; 0% instances), PRON (66; 0% instances), DET (61; 0% instances), ADP (33; 0% instances), SCONJ (7; 0% instances), ADV (6; 0% instances).

NOUN

10099 NOUN tokens (30% of all NOUN tokens) have a non-empty value of Form.

The most frequent other feature values with which NOUN and Form co-occurred: VerbForm=EMPTY (8199; 81%), Case=Nom (6670; 66%), Number=Sing (6588; 65%), Definite=EMPTY (5174; 51%).

NOUN tokens may have the following values of Form:

Paradigm tuairimEclEmp,LenLen
Definite=Def|Number=Singdtuairimthuairimsethuairim
Definite=Def|Number=Plurdtuairimíthuairimí
Number=Singthuairim
Number=Plurthuairimí

VERB

4511 VERB tokens (51% of all VERB tokens) have a non-empty value of Form.

The most frequent other feature values with which VERB and Form co-occurred: Mood=Ind (3932; 87%), Person=EMPTY (3876; 86%).

VERB tokens may have the following values of Form:

Paradigm DirectDirect,EmpEclEcl,EmpEmpLen
Aspect=Hab|Mood=Ind|Number=Plur|Person=1|Tense=Presmbímidbhímid
Aspect=Hab|Mood=Ind|Polarity=Neg|Tense=Presmbíonnbhíonn
Aspect=Hab|Mood=Ind|PronType=Rel|Tense=Presbhíos
Aspect=Hab|Mood=Ind|Tense=Presmbíonnbhíonn, bhíos
Aspect=Imp|Number=Sing|Person=1|Tense=Pastmbínnmbínnse
Aspect=Imp|Number=Plur|Person=1|Tense=Pastmbímis
Aspect=Imp|Number=Plur|Person=3|Polarity=Neg|Tense=Pastbhídís
Aspect=Imp|Number=Plur|Person=3|Tense=Pastbhídís
Aspect=Imp|Polarity=Neg|Tense=Pastmbíodhbhíodh
Aspect=Imp|Tense=Pastmbíodhbhíodh
Dialect=Munster|Mood=Ind|Number=Plur|Person=3|Tense=Presbhfuilid
Mood=Cndmbeadhbheadh
Mood=Cnd,Intmbeadh
Mood=Cnd|Number=Sing|Person=1mbeinnBheinn
Mood=Cnd|Number=Sing|Person=2bheifeá, bheitheá
Mood=Cnd|Number=Sing|Person=2|Polarity=Negmbeifeá
Mood=Cnd|Number=Plur|Person=1mbeimisbheimís
Mood=Cnd|Number=Plur|Person=3mbeidísbheidís
Mood=Cnd|Number=Plur|Person=3|Polarity=Negmbeidís
Mood=Cnd|Person=0bheifí
Mood=Cnd|Polarity=Negmbeadhbheadh
Mood=Ind|Number=Sing|Person=1|PronType=Rel|Tense=Presatáimatáimse
Mood=Ind|Number=Sing|Person=1|Tense=Pastbhíos
Mood=Ind|Number=Sing|Person=1|Tense=PresTáimse
Mood=Ind|Number=Sing|Person=2|Polarity=Neg|Tense=Presnílirse
Mood=Ind|Number=Plur|Person=1|Polarity=Neg|Tense=Presbhfuilimíd
Mood=Ind|Number=Plur|Person=1|Tense=Futmbeimidbheimid
Mood=Ind|Number=Plur|Person=1|Tense=PastBhíomar
Mood=Ind|Number=Plur|Person=1|Tense=Presbhfuilimid
Mood=Ind|Number=Plur|Person=3|PronType=Rel|Tense=Presatáid
Mood=Ind|Number=Plur|Person=3|Tense=Pastbhíodar
Mood=Ind|Number=Plur|Person=3|Tense=Presbhfuilid
Mood=Ind|Person=0|Polarity=Neg|Tense=Futmbeifear
Mood=Ind|Person=0|PronType=Rel|Tense=Presatáthar
Mood=Ind|Person=0|Tense=Pastbhíothas
Mood=Ind|Person=0|Tense=Presbhfuiltear
Mood=Ind|Polarity=Neg|Tense=Futmbeidhbheidh
Mood=Ind|Polarity=Neg|Tense=Presbhfuil
Mood=Ind|PronType=Rel|Tense=Futbheas, bhéas
Mood=Ind|PronType=Rel|Tense=Presatá, tá
Mood=Ind|PronType=Rel|Tense=Pres|Typo=Yesata
Mood=Ind|Tense=Futmbeidhbheidh, bhéidh
Mood=Ind|Tense=Pastbhí
Mood=Ind|Tense=Presbhfuil

PART

2334 PART tokens (33% of all PART tokens) have a non-empty value of Form.

The most frequent other feature values with which PART and Form co-occurred: PronType=Rel (2315; 99%), PartType=Vb (2161; 93%).

PART tokens may have the following values of Form:

Paradigm aDirectEcl,IndirectIndirect
a, don-aa, go

PROPN

1318 PROPN tokens (24% of all PROPN tokens) have a non-empty value of Form.

The most frequent other feature values with which PROPN and Form co-occurred: Definite=Def (1282; 97%), Number=Sing (1266; 96%), Gender=Masc (666; 51%).

PROPN tokens may have the following values of Form:

Paradigm GaeltachtEclLen
Case=Gen|Definite=Def|NounType=Strong|Number=PlurnGaeltachtaíGhaeltachtaí
Case=Gen|Number=SingGhaeltachta
Case=Nom|Definite=Def|Number=SingnGaeltachtGhaeltacht
Definite=Def|Number=SingGhaeltacht

Form seems to be lexical feature of PROPN. 91% lemmas (359) occur only with one value of Form.

ADJ

926 ADJ tokens (14% of all ADJ tokens) have a non-empty value of Form.

The most frequent other feature values with which ADJ and Form co-occurred: VerbForm=EMPTY (921; 99%), NounType=EMPTY (832; 90%), Degree=EMPTY (613; 66%), Case=Nom (541; 58%), Number=Sing (511; 55%).

ADJ tokens may have the following values of Form:

Paradigm céannaEclLen
Gender=Masc|NounType=Slender|Number=Plurchéanna
Gender=Masc|Number=Singgcéannachéanna
Gender=Fem|Number=Singchéanna

Form seems to be lexical feature of ADJ. 99% lemmas (293) occur only with one value of Form.

NUM

261 NUM tokens (13% of all NUM tokens) have a non-empty value of Form.

The most frequent other feature values with which NUM and Form co-occurred: NumType=Card (149; 57%).

NUM tokens may have the following values of Form:

Paradigm céadEclLen
NumType=Cardgcéadchéad
NumType=Ordgcéadchéad

AUX

180 AUX tokens (12% of all AUX tokens) have a non-empty value of Form.

The most frequent other feature values with which AUX and Form co-occurred: VerbForm=Cop (180; 100%), Polarity=EMPTY (162; 90%), Tense=Past (145; 81%).

AUX tokens may have the following values of Form:

Paradigm isEclEcl,VFLenVF
Dialect=Ulster|Polarity=Neg|Tense=Preschan
Mood=CndmbaB'
Mood=Int|Tense=Pastarbh
Polarity=Neg|PronType=Rel|Tense=Pastnárbh
Polarity=Neg|Tense=Pastníorbh, nárbh
Polarity=Neg|Tense=PresChan
PronType=Rel|Tense=Pastab
Tense=Pastmbamb'b', gurbh, b’, arb, darbh
Tense=Presgurb, darb

PRON

66 PRON tokens (2% of all PRON tokens) have a non-empty value of Form.

The most frequent other feature values with which PRON and Form co-occurred: Gender=EMPTY (54; 82%), Number=EMPTY (43; 65%), Person=EMPTY (43; 65%), PronType=Dem (34; 52%).

PRON tokens may have the following values of Form:

DET

61 DET tokens (1% of all DET tokens) have a non-empty value of Form.

The most frequent other feature values with which DET and Form co-occurred: Case=EMPTY (61; 100%), Gender=EMPTY (60; 98%), Number=EMPTY (59; 97%), Person=EMPTY (59; 97%), Poss=EMPTY (59; 97%), PronType=EMPTY (50; 82%), Definite=Def (48; 79%).

DET tokens may have the following values of Form:

ADP

33 ADP tokens (0% of all ADP tokens) have a non-empty value of Form.

The most frequent other feature values with which ADP and Form co-occurred: PronType=EMPTY (28; 85%), Person=3 (22; 67%), Number=Sing (17; 52%).

ADP tokens may have the following values of Form:

SCONJ

7 SCONJ tokens (0% of all SCONJ tokens) have a non-empty value of Form.

SCONJ tokens may have the following values of Form:

ADV

6 ADV tokens (0% of all ADV tokens) have a non-empty value of Form.

ADV tokens may have the following values of Form:

Relations with Agreement in Form

The 10 most frequent relations where parent and child node agree in Form: ADJ –[conj]–> ADJ (17; 55%), ADJ –[advcl]–> ADJ (4; 100%), NOUN –[ccomp]–> ADJ (3; 60%), ADJ –[obl]–> NUM (2; 100%), PRON –[vocative]–> NOUN (2; 67%), VERB –[csubj:cop]–> VERB (2; 67%), ADJ –[csubj:cleft]–> NOUN (1; 100%), ADJ –[parataxis]–> ADJ (1; 100%), ADJ –[vocative]–> NOUN (1; 100%), NOUN –[xcomp:pred]–> PROPN (1; 100%).