Treebank Statistics: UD_Irish-TwittIrish: POS Tags: PROPN
There are 3600 PROPN
lemmas (34%), 3813 PROPN
types (30%) and 7181 PROPN
tokens (15%).
Out of 17 observed tags, the rank of PROPN
is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.
The 10 most frequent PROPN
lemmas: gaeilge, @user241, @user1140, @user263, @user288, @user27, Éire, Nollaig, @user412, @user635
The 10 most frequent PROPN
types: gaeilge, @user241, @user1140, @user263, @user288, @user27, @user412, @user635, Ghaeilge, nollaig
The 10 most frequent ambiguous lemmas: gaeilge (NOUN 1, PROPN 1), @user241 (PROPN 102, NOUN 1), @user27 (PROPN 59, VERB 2), Éire (PROPN 52, NOUN 1), @user635 (PROPN 43, NOUN 1), gaeltacht (PROPN 17, NOUN 10), gael (PROPN 6, NOUN 2), @user187 (PROPN 30, NOUN 2), @user660 (PROPN 28, VERB 2), dia (NOUN 11, PROPN 5)
The 10 most frequent ambiguous types: @user241 (PROPN 102, NOUN 1), @user27 (PROPN 59, VERB 2), @user635 (PROPN 43, NOUN 1), Ghaeilge (PROPN 40, X 2, NOUN 1), @user187 (PROPN 30, NOUN 2), @user660 (PROPN 28, VERB 2), Dé (NOUN 26, PROPN 24), Lá (PROPN 23, NOUN 21), @user1478 (PROPN 20, NOUN 1), Gaeltachta (PROPN 19, NOUN 4)
- @user241
- @user27
- @user635
- Ghaeilge
- PROPN 40: @user271 … dóibh siúd gan Ghaeilge . Ach ní don chuid eile againn .
- X 2: @user1722 @user255 an tUasal Mhic Alastar ina bhiogóid frith Ghaeilge ? Ní doigh liom é .
- NOUN 1: RT @user1416 : Acmhainn Nua Oideachais do Bhéaltriail Ghaeilge na hArdteist https://t.co/tdmRNBjRMi #Gaeilge @user1592 @user552 @user277 …
- @user187
- @user660
- Dé
- Lá
- @user1478
- Gaeltachta
Morphology
The form / lemma ratio of PROPN
is 1.059167 (the average of all parts of speech is 1.212231).
The 1st highest number of forms (11) was observed with the lemma “Gaeilge”: Ga, Gae, Gaeil, Gaelainne, Gaelg, Ghaeilge, gaeilge, lán-Ghaeilge, nG, ngaeilge, nglan-Ghaeilge.
The 2nd highest number of forms (8) was observed with the lemma “Éire”: EIRE, h-eireann, hÉir, hÉireann, hÉirinn, Éire, Éireann, Éirinn.
The 3rd highest number of forms (6) was observed with the lemma “Gaeltacht”: Gaeltacht, Gaeltachta, Gaeltachtaí, Ghaeltacht, Ghaeltachta, nGaeltacht.
PROPN
does not occur with any features.
Relations
PROPN
nodes are attached to their parents using 28 different relations: vocative:mention (2897; 40% instances), nmod (1707; 24% instances), obl (717; 10% instances), flat:name (382; 5% instances), conj (251; 3% instances), root (233; 3% instances), nsubj (229; 3% instances), parataxis (145; 2% instances), flat (113; 2% instances), appos (102; 1% instances), obj (84; 1% instances), parataxis:sentence (80; 1% instances), vocative (66; 1% instances), compound (53; 1% instances), amod (36; 1% instances), obl:tmod (29; 0% instances), parataxis:hashtag (23; 0% instances), xcomp:pred (15; 0% instances), flat:foreign (6; 0% instances), advcl (3; 0% instances), ccomp (3; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), list (1; 0% instances), nmod:poss (1; 0% instances), obl:prep (1; 0% instances), parataxis:url (1; 0% instances), xcomp (1; 0% instances)
Parents of PROPN
nodes belong to 17 different parts of speech: NOUN (2810; 39% instances), VERB (1701; 24% instances), PROPN (1667; 23% instances), ADJ (363; 5% instances), (233; 3% instances), PRON (129; 2% instances), NUM (72; 1% instances), INTJ (63; 1% instances), X (33; 0% instances), ADV (26; 0% instances), PART (23; 0% instances), SYM (21; 0% instances), ADP (19; 0% instances), DET (11; 0% instances), AUX (6; 0% instances), PUNCT (3; 0% instances), CCONJ (1; 0% instances)
4303 (60%) PROPN
nodes are leaves.
1551 (22%) PROPN
nodes have one child.
673 (9%) PROPN
nodes have two children.
654 (9%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 14.
Children of PROPN
nodes are attached using 43 different relations: case (1333; 23% instances), nmod (878; 15% instances), punct (713; 12% instances), flat:name (497; 8% instances), det (489; 8% instances), conj (273; 5% instances), vocative:mention (254; 4% instances), cc (154; 3% instances), amod (147; 2% instances), parataxis:url (131; 2% instances), parataxis (115; 2% instances), parataxis:hashtag (98; 2% instances), flat (92; 2% instances), parataxis:rt (89; 2% instances), parataxis:sentence (86; 1% instances), obl (63; 1% instances), xcomp (58; 1% instances), advmod (54; 1% instances), appos (50; 1% instances), discourse:emo (33; 1% instances), obl:tmod (32; 1% instances), compound (29; 0% instances), obl:prep (29; 0% instances), xcomp:pred (29; 0% instances), acl:relcl (25; 0% instances), case:voc (25; 0% instances), nsubj (22; 0% instances), discourse (21; 0% instances), cop (19; 0% instances), vocative (17; 0% instances), advcl (11; 0% instances), nummod (8; 0% instances), csubj:cleft (6; 0% instances), mark (6; 0% instances), flat:foreign (5; 0% instances), det:poss (4; 0% instances), goeswith (4; 0% instances), mark:prt (4; 0% instances), obj (4; 0% instances), acl (3; 0% instances), aux (1; 0% instances), compound:prt (1; 0% instances), list (1; 0% instances)
Children of PROPN
nodes belong to 17 different parts of speech: PROPN (1667; 28% instances), ADP (1339; 23% instances), PUNCT (713; 12% instances), DET (501; 8% instances), NOUN (437; 7% instances), SYM (257; 4% instances), PART (186; 3% instances), NUM (157; 3% instances), ADJ (155; 3% instances), CCONJ (155; 3% instances), X (115; 2% instances), VERB (90; 2% instances), ADV (54; 1% instances), PRON (38; 1% instances), INTJ (23; 0% instances), AUX (20; 0% instances), SCONJ (6; 0% instances)