Treebank Statistics: UD_Irish-TwittIrish: POS Tags: DET
There are 46 DET lemmas (0%), 63 DET types (0%) and 2791 DET tokens (6%).
Out of 17 observed tags, the rank of DET is: 14 in number of lemmas, 13 in number of types and 5 in number of tokens.
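The lemma, type and token counts above can be reproduced from a CoNLL-U file along the following lines. This is a minimal sketch, not the script that generated this page: the function name `pos_counts` and the sample sentence are hypothetical, and lowercasing word forms before counting types is an assumption.

```python
# Hypothetical CoNLL-U fragment for illustration only (not taken from the treebank).
# Columns: ID, FORM, LEMMA, UPOS, XPOS, FEATS, HEAD, DEPREL, DEPS, MISC.
SAMPLE = "\n".join([
    "# text = an fear agus na fir",
    "1\tan\tan\tDET\t_\t_\t2\tdet\t_\t_",
    "2\tfear\tfear\tNOUN\t_\t_\t0\troot\t_\t_",
    "3\tagus\tagus\tCCONJ\t_\t_\t5\tcc\t_\t_",
    "4\tna\tan\tDET\t_\t_\t5\tdet\t_\t_",
    "5\tfir\tfear\tNOUN\t_\t_\t2\tconj\t_\t_",
])


def pos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens, all_tokens) for one UPOS tag."""
    lemmas, types = set(), set()
    tokens = total = 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if "-" in cols[0] or "." in cols[0]:
            continue  # skip multiword-token ranges and empty nodes
        total += 1
        if cols[3] == upos:
            tokens += 1
            types.add(cols[1].lower())  # assumption: types are case-insensitive
            lemmas.add(cols[2])
    return len(lemmas), len(types), tokens, total


n_lemmas, n_types, n_tokens, n_all = pos_counts(SAMPLE, "DET")
print(n_lemmas, n_types, n_tokens, n_all)
```

On the real treebank, the token percentage would then be `100 * n_tokens / n_all`.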
The 10 most frequent DET lemmas: an, a, seo, mo, eile, do, the, sin, na, aon
The 10 most frequent DET types: an, na, a, seo, eile, mo, the, do, sin, aon
The 10 most frequent ambiguous lemmas: an (DET 1701, PART 121, ADV 1, NUM 1), a (PART 663, DET 233, ADP 3, NOUN 2, ADJ 1, ADV 1, INTJ 1, NUM 1, PRON 1), seo (DET 152, PRON 83, VERB 2), mo (DET 104, INTJ 1, PART 1), do (ADP 483, DET 69, PART 36, AUX 6, PROPN 3, VERB 3, NUM 1, PUNCT 1), sin (PRON 107, DET 62, VERB 4, ADP 1), aon (DET 56, NOUN 1, NUM 1), gach (DET 56, NOUN 1), cúpla (NOUN 20, DET 7, PROPN 1), no (DET 6, CCONJ 1, INTJ 1, VERB 1)
The 10 most frequent ambiguous types: an (DET 1033, PART 23, ADJ 9, AUX 4, ADV 2, NOUN 2), na (DET 628, ADP 2, CCONJ 2, PART 1, SCONJ 1), a (PART 703, DET 159, ADP 9, NOUN 2, X 2, ADV 1, PRON 1), seo (DET 146, PRON 64, X 1), mo (DET 86, ADJ 2), do (ADP 149, DET 60, AUX 4, PART 3, VERB 1), sin (PRON 81, DET 60, VERB 2), aon (DET 47, NOUN 1), gach (DET 40, NOUN 1), d’ (ADP 24, PART 15, DET 6, PUNCT 1)
- an
  - DET 1033: @user1176 maith an fear ! CF Abú !
  - PART 23: @user187 an labhraítear Coirnis ??
  - ADJ 9: Bhi an agoid Slan le Sean an mhaith . Caithfear leanacht den obair .
  - AUX 4: @user1067 ca bhfuil tú ? an dtig liom mo fóin cluasa a fháil ?
  - ADV 2: @user1241 @user1010 @user1705 An-cheomhar an seo ar maidin ( anois díreach ) ach beidh sé ina lá geal níos déanaí . Samhradh indiach !
  - NOUN 2: @user505 @user660 @user241 @user1163 is ait an socrú sin , gan meas a bheith acu ar rud a bhfuil an -dúil agat féin ann .
- na
  - DET 628: 9 lá go dtí Féile na Mí . Hup ! @user138 @user987 @user412 #RáthChairn
  - ADP 2: @user1376 is fearr bas na naire ! #spioraidnascoile #colaisteeoinabu
  - CCONJ 2: Nil aon duine nios Gaelai na Barack Obama . Mana Gaeilge chunn tosaigh i DC . @user1704 @user1648 http://t.co/Rp92ijvs
  - PART 1: @user1679 na bhrea liom a bheith Ann
  - SCONJ 1: Fugitive Gaelach ar champas ANOIS !! Bígí ag faire amach . Is é an leid inniu na scaif dearg :D
- a
  - PART 703: @user1424 Cad a mholfá faoi mo bhealach féin ?
  - DET 159: “ Ar scáth a chéile a mhairimíd . ” #Gaeilge #IrishStateVisit
  - ADP 9: @user68 @user1046 Mar ni tweetionn me go minic a chroi x
  - NOUN 2: @user891 Lá fada fós romhat ? Gan o ( fada ) ná a ( fada ) . Aaaah !
  - X 2: Anocht ar an clár beidh muid ag labhairt le @user1373 faoi a bhanna ceoil Rofi James Bígí ag eisteacht
  - ADV 1: @user300 an-chuid dos na focail nua sa nGaeilge thar a bheith áicbheaird nach bhfuil ?
  - PRON 1: An scéal is déanaí maidir le seirbhís aeir oileáin Árann AGUS a bhfuil i ndán do Thuaisceart Éireann , anocht ar Nuacht TG4 @ 7 pm .
- seo
- mo
- do
  - ADP 149: RT @user692 : Ádh mór do gach duine ag déanamh scruidithe amárach :)
  - DET 60: Ní fhuil aon tinteán mar do thinteán féin . https://t.co/rex6gfYSw2
  - AUX 4: @user1221 すみません、わかりません 。。。 I do n’t get that , but … bí ag caint as Gaeilge , le dó thoil :D
  - PART 3: @user115 Tá sé fós ann mar do rinneas ‘ like ‘ air le mó chuntas príomháideach ar facebook . P .
  - VERB 1: @user1245 grma ! I should really do it more ! :) conas atá ag éirí leat dolly , long time no see !?
- sin
- aon
- gach
- d’
  - ADP 24: Tá An Hobad 75 bliana d’ aois inniu . Beithlá shona don leabhar .
  - PART 15: @user283 Agus d’ oibrigh sé go seoigh . Cóta Mór Gogol ina steillebheatha !
  - DET 6: @user651 Scríobhas é seo inniu ! I d’ honóir ;) http://t.co/3pXk4Vf0
  - PUNCT 1: @user1562 Smaoinigh ar Jill Stein le d’ thoil . Léigh fuithi anseo : http://t.co/uhBrdT3Z
Morphology
The form / lemma ratio of DET is 1.369565 (the average of all parts of speech is 1.212231).
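The reported ratio is consistent with dividing the number of DET types (63) by the number of DET lemmas (46), both given at the top of this page. A minimal check, assuming that is how the figure is derived (the helper name `form_lemma_ratio` is hypothetical):

```python
# Counts reported on this page for DET in UD_Irish-TwittIrish.
DET_TYPES = 63
DET_LEMMAS = 46


def form_lemma_ratio(n_forms, n_lemmas):
    """Average number of distinct word forms per lemma."""
    return n_forms / n_lemmas


ratio = form_lemma_ratio(DET_TYPES, DET_LEMMAS)
print(f"{ratio:.6f}")  # 1.369565
```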
The 1st highest number of forms (8) was observed with the lemma “an”: ‘n, a, am, an, ar, a’, n, na.
The 2nd highest number of forms (6) was observed with the lemma “aon”: ain, aon, h-aon, haon, n-aon, t-aon.
The 3rd highest number of forms (4) was observed with the lemma “gach”: achan, gach, ghach, ngach.
DET does not occur with any features.
Relations
DET nodes are attached to their parents using 16 different relations: det (2426; 87% instances), det:poss (329; 12% instances), conj (5; 0% instances), flat (5; 0% instances), root (5; 0% instances), nmod (4; 0% instances), fixed (3; 0% instances), obl (3; 0% instances), nsubj (2; 0% instances), obj (2; 0% instances), parataxis:sentence (2; 0% instances), amod (1; 0% instances), cop (1; 0% instances), flat:foreign (1; 0% instances), flat:name (1; 0% instances), xcomp:pred (1; 0% instances)
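The percentages in the relation list are each count's share of all 2791 DET tokens, rounded to a whole percent. The sketch below recomputes them from the counts listed above; the variable names are hypothetical, and rounding to the nearest integer is an assumption that matches the figures shown.

```python
# Relation counts for DET nodes, copied from the list above.
RELATION_COUNTS = {
    "det": 2426, "det:poss": 329, "conj": 5, "flat": 5, "root": 5,
    "nmod": 4, "fixed": 3, "obl": 3, "nsubj": 2, "obj": 2,
    "parataxis:sentence": 2, "amod": 1, "cop": 1, "flat:foreign": 1,
    "flat:name": 1, "xcomp:pred": 1,
}

total = sum(RELATION_COUNTS.values())  # should equal the DET token count

# Share of each relation, rounded to a whole percent (assumed rounding rule).
shares = {rel: round(100 * n / total) for rel, n in RELATION_COUNTS.items()}

for rel, n in RELATION_COUNTS.items():
    print(f"{rel} ({n}; {shares[rel]}% instances)")
```

The counts sum to 2791, so the relation list accounts for every DET token.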
Parents of DET nodes belong to 15 different parts of speech: NOUN (2157; 77% instances), PROPN (501; 18% instances), PRON (35; 1% instances), X (28; 1% instances), NUM (17; 1% instances), VERB (16; 1% instances), ADP (12; 0% instances), ADJ (11; 0% instances), (5; 0% instances), CCONJ (2; 0% instances), DET (2; 0% instances), SYM (2; 0% instances), ADV (1; 0% instances), PART (1; 0% instances), SCONJ (1; 0% instances)
2726 (98%) DET nodes are leaves.
48 (2%) DET nodes have one child.
7 (0%) DET nodes have two children.
10 (0%) DET nodes have three or more children.
The highest child degree of a DET node is 9.
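Child degree here is the number of nodes whose HEAD points at a given DET node. The following is a minimal sketch of how such a histogram could be tallied for one sentence; the function `degree_histogram` and the sample rows are hypothetical and not taken from the treebank.

```python
from collections import Counter


def degree_histogram(rows, upos):
    """rows: (id, head, upos) triples for one sentence.

    Returns a Counter mapping child degree -> number of nodes with that tag.
    """
    # Count how many nodes name each id as their head.
    n_children = Counter(head for _, head, _ in rows)
    # A Counter returns 0 for ids that never appear as a head (leaves).
    return Counter(n_children[node_id] for node_id, _, tag in rows if tag == upos)


# Hypothetical sentence: node 1 (DET) is the root with three children,
# node 4 (DET) is a leaf attached to node 5.
ROWS = [
    (1, 0, "DET"),
    (2, 1, "ADP"),
    (3, 1, "PUNCT"),
    (4, 5, "DET"),
    (5, 1, "NOUN"),
]

hist = degree_histogram(ROWS, "DET")
print(dict(hist))

# The four degree classes reported above cover every DET token.
assert 2726 + 48 + 7 + 10 == 2791
```

Summing the per-sentence histograms over the whole treebank would give the leaf, one-child, two-child and three-or-more counts.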
Children of DET nodes are attached using 26 different relations: fixed (18; 16% instances), case (17; 15% instances), punct (16; 15% instances), nmod (15; 14% instances), vocative:mention (7; 6% instances), cc (6; 5% instances), advmod (3; 3% instances), parataxis:url (3; 3% instances), amod (2; 2% instances), det (2; 2% instances), nsubj (2; 2% instances), obl:prep (2; 2% instances), parataxis:rt (2; 2% instances), parataxis:sentence (2; 2% instances), xcomp (2; 2% instances), acl:relcl (1; 1% instances), advcl (1; 1% instances), aux (1; 1% instances), conj (1; 1% instances), goeswith (1; 1% instances), mark:prt (1; 1% instances), obl (1; 1% instances), parataxis (1; 1% instances), parataxis:hashtag (1; 1% instances), vocative (1; 1% instances), xcomp:pred (1; 1% instances)
Children of DET nodes belong to 13 different parts of speech: ADP (28; 25% instances), NOUN (23; 21% instances), PUNCT (16; 15% instances), PROPN (11; 10% instances), ADJ (9; 8% instances), CCONJ (6; 5% instances), SYM (5; 5% instances), ADV (3; 3% instances), VERB (3; 3% instances), DET (2; 2% instances), X (2; 2% instances), AUX (1; 1% instances), PART (1; 1% instances)