Treebank Statistics: UD_Irish-TwittIrish: POS Tags: NUM
There are 313 NUM
lemmas (3%), 329 NUM
types (3%) and 962 NUM
tokens (2%).
Out of 17 observed tags, the rank of NUM
is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.
The 10 most frequent NUM
lemmas: 2, 1, céad, dó, 3, 0, míle, 8, 4, 7
The 10 most frequent NUM
types: 2, 1, dhá, 0, 3, 8, míle, 4, chéad, 10
The 10 most frequent ambiguous lemmas: céad (NUM 35, ADJ 7, ADV 1), dó (NUM 33, ADP 1), míle (NUM 22, NOUN 2), 4 (NUM 24, ADP 1, NOUN 1), 7 (NUM 23, CCONJ 8, NOUN 1, PROPN 1), 10 (NUM 22, PROPN 2), 6 (NUM 22, NOUN 1), 9 (NUM 12, NOUN 1), dara (NUM 12, ADJ 1), 24 (NUM 11, PROPN 1)
The 10 most frequent ambiguous types: 2 (NUM 53, PRON 1), 1 (NUM 49, ADJ 1), dhá (NUM 18, ADP 3), 4 (NUM 23, ADP 1, NOUN 1), chéad (NUM 21, ADJ 3, ADV 1), 10 (NUM 22, PROPN 2), 7 (NUM 22, CCONJ 8, PROPN 1), 6 (NUM 21, NOUN 1), 24 (NUM 11, PROPN 1), 9 (NUM 11, NOUN 1)
- 2
- 1
- dhá
- 4
- NUM 23: Dochreidthe ! 4 bonn óir anois ag na hÉireannaigh . #paralmypicsire
- ADP 1: RT @user1731 : @user1419 Crash Course Sat 13th Sept in Coláiste Feirste . 4 further info , please con Glór na Móna or Ionad Uíbh Eachach http:…
- NOUN 1: Beidh na Leatóirí Salainn ar na bealtaí go léir ó 4 in , Déardaoin , 31 Eanair Tuilleadh eolais https://t.co/3KyXqBCEUN Ná glac go bhfuil na bóithre saor ó shioc . https://t.co/8LKeLgFg0S
- chéad
- NUM 21: Tá mé tar éis an tine a lasadh don chéad uair . #fuar
- ADJ 3: Cuireadh chuig seoladh oifigiúil an chéad leabhar idirghníomhach , i nGaelige , http://t.co/6Jq2xHikfW
- ADV 1: @user241 @user39 @user673 Tá sin i mBarcelóna is cosúil gur ansin a chéad cuireadh na ceamaraí sin ! ( An i 1984 a deineadh é ? )
- 10
- 7
- NUM 22: Ceol @user1280 anocht 7 pm ó #tradfest an Chlocháin i mí Aibreáin . @user397 #irishmusic
- CCONJ 8: @user358 Ha ! No , ní inniu - i bhfad an iomarca daoine ann . Ag breathnú ar an #gaa 7 ag tógáil go dea-réidh é ! Bain sult as do thuras ! :)
- PROPN 1: An scéal is déanaí maidir le seirbhís aeir oileáin Árann AGUS a bhfuil i ndán do Thuaisceart Éireann , anocht ar Nuacht TG4 @ 7 pm .
- 6
- 24
- 9
Morphology
The form / lemma ratio of NUM
is 1.051118 (the average of all parts of speech is 1.212231).
The 1st highest number of forms (5) was observed with the lemma “céad”: chead, chèad, chéad, céad, gcéad.
The 2nd highest number of forms (4) was observed with the lemma “dó”: dhá, dhó, dá, dó.
The 3rd highest number of forms (3) was observed with the lemma “1”: 1, 1ú, 2.
NUM
does not occur with any features.
Relations
NUM
nodes are attached to their parents using 19 different relations: nmod (284; 30% instances), nummod (223; 23% instances), obl:tmod (178; 19% instances), flat (76; 8% instances), amod (57; 6% instances), parataxis (38; 4% instances), conj (29; 3% instances), obl (16; 2% instances), appos (12; 1% instances), root (12; 1% instances), parataxis:sentence (10; 1% instances), obj (6; 1% instances), nsubj (5; 1% instances), parataxis:url (4; 0% instances), flat:name (3; 0% instances), xcomp:pred (3; 0% instances), compound (2; 0% instances), nmod:tmod (2; 0% instances), vocative:mention (2; 0% instances)
Parents of NUM
nodes belong to 12 different parts of speech: NOUN (498; 52% instances), PROPN (157; 16% instances), NUM (140; 15% instances), VERB (113; 12% instances), SYM (19; 2% instances), (12; 1% instances), ADJ (8; 1% instances), X (7; 1% instances), ADV (4; 0% instances), PRON (2; 0% instances), ADP (1; 0% instances), PART (1; 0% instances)
473 (49%) NUM
nodes are leaves.
288 (30%) NUM
nodes have one child.
134 (14%) NUM
nodes have two children.
67 (7%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 14.
Children of NUM
nodes are attached using 29 different relations: punct (232; 29% instances), case (165; 20% instances), nmod (125; 15% instances), flat (101; 12% instances), mark:prt (30; 4% instances), conj (20; 2% instances), vocative:mention (18; 2% instances), cc (17; 2% instances), det (16; 2% instances), parataxis (15; 2% instances), obl (10; 1% instances), parataxis:sentence (8; 1% instances), advmod (7; 1% instances), appos (7; 1% instances), nummod (7; 1% instances), obl:tmod (6; 1% instances), xcomp:pred (5; 1% instances), parataxis:url (4; 0% instances), amod (3; 0% instances), compound (3; 0% instances), cop (3; 0% instances), nsubj (3; 0% instances), parataxis:hashtag (3; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), csubj:cleft (1; 0% instances), nmod:tmod (1; 0% instances), parataxis:rt (1; 0% instances), vocative (1; 0% instances)
Children of NUM
nodes belong to 16 different parts of speech: PUNCT (232; 29% instances), ADP (165; 20% instances), NUM (140; 17% instances), NOUN (85; 10% instances), PROPN (72; 9% instances), PART (29; 4% instances), CCONJ (17; 2% instances), DET (17; 2% instances), X (17; 2% instances), ADJ (10; 1% instances), ADV (8; 1% instances), VERB (8; 1% instances), SYM (7; 1% instances), AUX (3; 0% instances), PRON (3; 0% instances), SCONJ (1; 0% instances)