Treebank Statistics: UD_Irish-IDT: POS Tags: DET
There are 19 DET lemmas (0%), 38 DET types (0%) and 10284 DET tokens (9%).
Out of 17 observed tags, DET ranks 14th by number of lemmas, 12th by number of types and 4th by number of tokens.
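Counts of this kind can be derived from the treebank's CoNLL-U files. The following is a minimal sketch, not the script that generated this page. It assumes the standard ten-column CoNLL-U layout (FORM in column 2, LEMMA in column 3, UPOS in column 4) and treats a type as a distinct, case-sensitive word form, which may differ from how the official statistics define it. The function name `pos_counts` and the sample sentences are hypothetical and hand-made for illustration.

```python
def pos_counts(conllu_text, upos="DET"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        # Skip malformed lines, multiword-token ranges ("1-2") and empty nodes ("1.1").
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:
            types.add(cols[1])   # FORM
            lemmas.add(cols[2])  # LEMMA
            tokens += 1
    return len(lemmas), len(types), tokens


# Hand-made illustrative sample (not taken from the treebank).
SAMPLE = "\n".join([
    "# text = an siopa sin",
    "1\tan\tan\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tsiopa\tsiopa\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tsin\tsin\tDET\t_\t_\t2\tdet\t_\t_",
    "",
    "# text = na páistí",
    "1\tna\tan\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tpáistí\tpáiste\tNOUN\t_\t_\t0\troot\t_\t_",
])

print(pos_counts(SAMPLE))  # (2, 3, 3)
```

In the sample, "an" and "na" share the lemma "an", so three DET tokens yield three types but only two lemmas.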
The 10 most frequent DET lemmas: an, a, seo, sin, eile, aon, gach, do, mo, uile
The 10 most frequent DET types: an, na, a, seo, sin, eile, aon, gach, do, mo
The 10 most frequent ambiguous lemmas: an (DET 7265, PART 57, X 1), a (PART 4074, DET 736, PRON 23, ADV 7, NUM 3, X 3, NOUN 1), seo (DET 564, PRON 145), sin (DET 425, PRON 414, INTJ 1), aon (DET 286, NUM 20, NOUN 4), gach (DET 232, NOUN 1), do (ADP 1447, PART 316, DET 175), uile (DET 59, NOUN 5, ADJ 1), ár (DET 42, NOUN 5), cibé (DET 18, PRON 2)
The 10 most frequent ambiguous types: an (DET 4548, PART 28, AUX 5, ADP 1, X 1), na (DET 2540, ADP 1), a (PART 4050, DET 745, PRON 23, ADV 7, ADP 2, X 2, NOUN 1), seo (DET 547, PRON 125, AUX 1), sin (DET 403, PRON 346, AUX 1), aon (DET 259, NUM 12, NOUN 3), gach (DET 199, NOUN 1), do (ADP 514, DET 111, PART 40), d’ (ADP 172, PART 138, DET 55), uile (DET 42, NOUN 5)
- an
  - DET 4548: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
  - PART 28: Cén áit an bhfágfaidh mé iad seo , a Tom .
  - AUX 5: Nó cúig an ea ?
  - ADP 1: Má imríonn siad faoi mar is féidir leo , is dóigh liom go bhfillfidh siad ar Staid Semple , Lá ‘ le Pádraig , ach is ina lámha féin a bheidh sé an ag imirt nó i measc an tslua a bheidh siad .
  - X 1: AR 10 Feabhra 1968 , cúig lá tar éis do William Conor bás a fháil , d’ fhoilsigh an Belfast Telegraph cartún le Rowel Friers ( 1920- ) dar teideal ‘ The End of an Era , ‘ in ómós don phrionsa fir a bhí ar lár .
- na
- a
  - PART 4050: Ach beo bocht a bheadh ar an té nach mbeadh aige ach preátaí tura .
  - DET 745: Meatachán scanraithe agus a lámha cáidheach le roidealach an bhóthair .
  - PRON 23: Níl de dhíth ach aon fhocal amháin , sin a bhfuil .
  - ADV 7: Tá an teicníocht inste a roghnaíonn sé thar a bheith éifeachtach .
  - ADP 2: ’ Séard atá De Róiste a rá anois ná gur cuireadh ina choinne go raibh baint aige le Saor Éire agus gur thug sin leithscéal dóibh é a bhriseadh .
  - X 2: ’ Cad a bhí ann ach go raibh sé ‘ cut off with a shilling ‘ , agus tugadh an scilling dó !
  - NOUN 1: Sa Deibhí bíonn seacht siolla sa líne freisin , ach bíonn siolla sa bhreis i bhfocal deireanach b ar a , agus in d ar c .
- seo
  - DET 547: An dtuigtear fós sa tír seo cé chomh mór de athrú is a bhí ansin ?
  - PRON 125: Duradh gur seo ceann dena fadhbanna is mó atá sa cheantar .
  - AUX 1: Cothú sa Duine Tá cúig chéim i gceist le cothú sa duine : Daoibhse atá ar bheagán Shakespeare , nó daoibhse a d’ fhág cúrsaí staidéir níos mó ná cúpla lá ó shin , seo é , go gonta , scéal traigéideach marfach Phrionsa na Danmhairge .
- sin
  - DET 403: Thug na páistí ruaig amháin ar an siopa sin .
  - PRON 346: Cé nár dhúirt tú é bhí ‘ fhios agam gur thuig tú sin i do chroí .
  - AUX 1: ’ Tógann sé deich mbliana ar an spéis léitheoireachta theacht in inmhe agus caithfear díriú ar pháistí atá ag sroichint na léitheoireachta ar bhun neamhspleách , mar sin an áit a gcailltear iad faoi láthair .
- aon
  - DET 259: Ní raibh aon ghá le cuireadh .
  - NUM 12: Is ionann sin agus a rá nach féidir na breoslaí seo a úsáid ach aon uair amháin .
  - NOUN 3: Is mar gheall ar infheistíocht ón Údarás agus ó Choimisiún Forbartha an Iarthair a tharla an bisiú seo mar aon le cinneadh na dtáirgeoirí bogadh go dtí táirgeadh orgánach breisluacha .
- gach
- do
- d’
- uile
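The ambiguity lists above group each word form by the tags it receives. A minimal sketch of that grouping follows, assuming the input is a sequence of (form, UPOS) pairs already extracted from the treebank. The function name `ambiguous_forms` and the four sample pairs are hypothetical and only illustrate the idea.

```python
from collections import Counter, defaultdict


def ambiguous_forms(tagged):
    """Map each form seen with more than one UPOS tag to its tag counts."""
    by_form = defaultdict(Counter)
    for form, upos in tagged:
        by_form[form][upos] += 1
    # Keep only forms that occur with at least two different tags.
    return {form: counts for form, counts in by_form.items() if len(counts) > 1}


# Hand-made (form, UPOS) pairs for illustration only.
tagged = [("sin", "DET"), ("sin", "PRON"), ("sin", "DET"), ("gach", "DET")]

result = ambiguous_forms(tagged)
for form, counts in result.items():
    print(form, counts.most_common())  # sin [('DET', 2), ('PRON', 1)]
```

The same function yields ambiguous lemmas if it is given (lemma, UPOS) pairs instead.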
Morphology
The form / lemma ratio of DET is 2.000000 (the average of all parts of speech is 1.651212).
The highest number of forms (7) was observed with the lemma “an”: ‘n, ‘na, a, a’, an, na, un.
The second highest number of forms (4) was observed with the lemma “do”: d’, dh’, do, d’.
The third highest number of forms (3) was observed with the lemma “a”: a, n-a, á.
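The per-lemma form counts can be sketched as a grouping of (form, lemma) pairs. The example below uses only the forms listed above for the lemmas “an” and “a”. The helper name `forms_by_lemma` is hypothetical. The reported ratio of 2.0 is consistent with dividing the 38 DET types by the 19 DET lemmas, though that derivation is an assumption here rather than something the page states.

```python
from collections import defaultdict


def forms_by_lemma(pairs):
    """Group distinct word forms under their lemma."""
    forms = defaultdict(set)
    for form, lemma in pairs:
        forms[lemma].add(form)
    return forms


# Forms listed on this page for the lemmas "an" and "a".
pairs = [(f, "an") for f in ["‘n", "‘na", "a", "a’", "an", "na", "un"]]
pairs += [(f, "a") for f in ["a", "n-a", "á"]]

forms = forms_by_lemma(pairs)
ranked = sorted(forms, key=lambda lemma: len(forms[lemma]), reverse=True)
print([(lemma, len(forms[lemma])) for lemma in ranked])  # [('an', 7), ('a', 3)]

# Assumed derivation of the ratio: DET types divided by DET lemmas.
print(38 / 19)  # 2.0
```

Note that the form “a” is counted once under each lemma, since forms are grouped per lemma rather than globally.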
DET occurs with 10 features: PronType (8953; 87% instances), Number (8338; 81% instances), Definite (7523; 73% instances), Gender (2428; 24% instances), Case (2352; 23% instances), Person (1073; 10% instances), Poss (1073; 10% instances), Form (61; 1% instances), Dialect (42; 0% instances), Typo (3; 0% instances)
DET occurs with 20 feature-value pairs: Case=Gen, Definite=Def, Dialect=Munster, Dialect=Ulster, Form=Ecl, Form=HPref, Form=Len, Gender=Fem, Gender=Fem,Masc, Gender=Masc, Number=Plur, Number=Sing, Person=1, Person=2, Person=3, Poss=Yes, PronType=Art, PronType=Dem, PronType=Ind, Typo=Yes
DET occurs with 27 feature combinations.
The most frequent feature combination is Definite=Def|Number=Sing|PronType=Art (3826 tokens).
Examples: an, ‘n, a, a’, na
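Feature combinations such as the one above are written in the CoNLL-U FEATS notation: `Name=Value` pairs joined by `|`, with multiple values for one feature joined by a comma and `_` standing for no features. A small parser for that notation might look as follows. The function name `parse_feats` is hypothetical, and this is a sketch rather than the code behind these statistics.

```python
def parse_feats(feats):
    """Parse a CoNLL-U FEATS string into {feature: [values]}."""
    if feats == "_":
        return {}  # "_" marks a token without morphological features
    parsed = {}
    for pair in feats.split("|"):
        name, _, value = pair.partition("=")
        parsed[name] = value.split(",")  # e.g. "Fem,Masc" -> ["Fem", "Masc"]
    return parsed


combo = parse_feats("Definite=Def|Number=Sing|PronType=Art")
print(combo)  # {'Definite': ['Def'], 'Number': ['Sing'], 'PronType': ['Art']}
print(parse_feats("Gender=Fem,Masc"))  # {'Gender': ['Fem', 'Masc']}
```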
Relations
DET nodes are attached to their parents using 13 different relations: det (9185; 89% instances), nmod:poss (1070; 10% instances), fixed (10; 0% instances), conj (5; 0% instances), flat:name (4; 0% instances), obj (2; 0% instances), obl (2; 0% instances), advcl (1; 0% instances), advmod (1; 0% instances), discourse (1; 0% instances), dislocated (1; 0% instances), nsubj (1; 0% instances), obl:tmod (1; 0% instances)
Parents of DET nodes belong to 12 different parts of speech: NOUN (9273; 90% instances), PROPN (884; 9% instances), NUM (36; 0% instances), PRON (24; 0% instances), X (18; 0% instances), ADJ (13; 0% instances), DET (10; 0% instances), VERB (10; 0% instances), ADP (9; 0% instances), SCONJ (4; 0% instances), ADV (2; 0% instances), PART (1; 0% instances)
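As a consistency check, the relation counts and the parent part-of-speech counts above should each sum to the 10284 DET tokens, and the percentages follow from rounding each share to the nearest whole number. The sketch below copies the counts from this page. The variable names are hypothetical, and rounding to the nearest integer is an assumption that happens to reproduce the figures shown.

```python
# Counts copied from the two lists above.
relations = {
    "det": 9185, "nmod:poss": 1070, "fixed": 10, "conj": 5, "flat:name": 4,
    "obj": 2, "obl": 2, "advcl": 1, "advmod": 1, "discourse": 1,
    "dislocated": 1, "nsubj": 1, "obl:tmod": 1,
}
parent_pos = {
    "NOUN": 9273, "PROPN": 884, "NUM": 36, "PRON": 24, "X": 18, "ADJ": 13,
    "DET": 10, "VERB": 10, "ADP": 9, "SCONJ": 4, "ADV": 2, "PART": 1,
}

total = sum(relations.values())
print(total, sum(parent_pos.values()))  # 10284 10284

# Assumed rounding rule: nearest whole percent.
shares = {rel: round(100 * n / total) for rel, n in relations.items()}
print(shares["det"], shares["nmod:poss"], shares["fixed"])  # 89 10 0
```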
10183 (99%) DET nodes are leaves.
92 (1%) DET nodes have one child.
6 (0%) DET nodes have two children.
3 (0%) DET nodes have three or more children.
The highest child degree of a DET node is 3.
Children of DET nodes are attached using 17 different relations: punct (25; 22% instances), fixed (24; 21% instances), case (17; 15% instances), det (10; 9% instances), nmod (8; 7% instances), cc (6; 5% instances), obl:prep (6; 5% instances), acl:relcl (3; 3% instances), cop (3; 3% instances), appos (2; 2% instances), conj (2; 2% instances), mark:prt (2; 2% instances), advmod (1; 1% instances), amod (1; 1% instances), ccomp (1; 1% instances), nsubj (1; 1% instances), parataxis (1; 1% instances)
Children of DET nodes belong to 15 different parts of speech: ADP (33; 29% instances), PUNCT (25; 22% instances), ADJ (14; 12% instances), NOUN (12; 11% instances), DET (10; 9% instances), CCONJ (6; 5% instances), AUX (3; 3% instances), PRON (2; 2% instances), VERB (2; 2% instances), ADV (1; 1% instances), NUM (1; 1% instances), PART (1; 1% instances), PROPN (1; 1% instances), SCONJ (1; 1% instances), X (1; 1% instances)
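The child statistics above can be cross-checked against each other. Because the highest child degree is 3, the three nodes with "three or more children" have exactly three, so the total number of children is fixed by the degree distribution and should match both child breakdowns. The sketch below uses the counts from this page, and the variable names are hypothetical.

```python
# Number of DET nodes by child degree (0 = leaf), copied from this page.
degree_counts = {0: 10183, 1: 92, 2: 6, 3: 3}

nodes = sum(degree_counts.values())
children = sum(degree * n for degree, n in degree_counts.items())
print(nodes, children)  # 10284 113

# Child counts by relation and by part of speech, in the order listed above.
child_relations = [25, 24, 17, 10, 8, 6, 6, 3, 3, 2, 2, 2, 1, 1, 1, 1, 1]
child_pos = [33, 25, 14, 12, 10, 6, 3, 2, 2, 1, 1, 1, 1, 1, 1]
print(sum(child_relations), sum(child_pos))  # 113 113
```

All three totals agree at 113 children, and the node total equals the 10284 DET tokens.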