home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Irish-TwittIrish: POS Tags: PROPN

There are 3600 PROPN lemmas (34%), 3813 PROPN types (30%) and 7181 PROPN tokens (15%). Out of 17 observed tags, the rank of PROPN is: 1 in number of lemmas, 2 in number of types and 2 in number of tokens.

The 10 most frequent PROPN lemmas: gaeilge, @user241, @user1140, @user263, @user288, @user27, Éire, Nollaig, @user412, @user635

The 10 most frequent PROPN types: gaeilge, @user241, @user1140, @user263, @user288, @user27, @user412, @user635, Ghaeilge, nollaig

The 10 most frequent ambiguous lemmas: gaeilge (NOUN 1, PROPN 1), @user241 (PROPN 102, NOUN 1), @user27 (PROPN 59, VERB 2), Éire (PROPN 52, NOUN 1), @user635 (PROPN 43, NOUN 1), gaeltacht (PROPN 17, NOUN 10), gael (PROPN 6, NOUN 2), @user187 (PROPN 30, NOUN 2), @user660 (PROPN 28, VERB 2), dia (NOUN 11, PROPN 5)

The 10 most frequent ambiguous types: @user241 (PROPN 102, NOUN 1), @user27 (PROPN 59, VERB 2), @user635 (PROPN 43, NOUN 1), Ghaeilge (PROPN 40, X 2, NOUN 1), @user187 (PROPN 30, NOUN 2), @user660 (PROPN 28, VERB 2), (NOUN 26, PROPN 24), (PROPN 23, NOUN 21), @user1478 (PROPN 20, NOUN 1), Gaeltachta (PROPN 19, NOUN 4)

Morphology

The form / lemma ratio of PROPN is 1.059167 (the average of all parts of speech is 1.212231).

The 1st highest number of forms (11) was observed with the lemma “Gaeilge”: Ga, Gae, Gaeil, Gaelainne, Gaelg, Ghaeilge, gaeilge, lán-Ghaeilge, nG, ngaeilge, nglan-Ghaeilge.

The 2nd highest number of forms (8) was observed with the lemma “Éire”: EIRE, h-eireann, hÉir, hÉireann, hÉirinn, Éire, Éireann, Éirinn.

The 3rd highest number of forms (6) was observed with the lemma “Gaeltacht”: Gaeltacht, Gaeltachta, Gaeltachtaí, Ghaeltacht, Ghaeltachta, nGaeltacht.

PROPN does not occur with any features.

Relations

PROPN nodes are attached to their parents using 28 different relations: vocative:mention (2897; 40% instances), nmod (1707; 24% instances), obl (717; 10% instances), flat:name (382; 5% instances), conj (251; 3% instances), root (233; 3% instances), nsubj (229; 3% instances), parataxis (145; 2% instances), flat (113; 2% instances), appos (102; 1% instances), obj (84; 1% instances), parataxis:sentence (80; 1% instances), vocative (66; 1% instances), compound (53; 1% instances), amod (36; 1% instances), obl:tmod (29; 0% instances), parataxis:hashtag (23; 0% instances), xcomp:pred (15; 0% instances), flat:foreign (6; 0% instances), advcl (3; 0% instances), ccomp (3; 0% instances), acl (1; 0% instances), csubj (1; 0% instances), list (1; 0% instances), nmod:poss (1; 0% instances), obl:prep (1; 0% instances), parataxis:url (1; 0% instances), xcomp (1; 0% instances)

Parents of PROPN nodes belong to 17 different parts of speech: NOUN (2810; 39% instances), VERB (1701; 24% instances), PROPN (1667; 23% instances), ADJ (363; 5% instances), (233; 3% instances), PRON (129; 2% instances), NUM (72; 1% instances), INTJ (63; 1% instances), X (33; 0% instances), ADV (26; 0% instances), PART (23; 0% instances), SYM (21; 0% instances), ADP (19; 0% instances), DET (11; 0% instances), AUX (6; 0% instances), PUNCT (3; 0% instances), CCONJ (1; 0% instances)

4303 (60%) PROPN nodes are leaves.

1551 (22%) PROPN nodes have one child.

673 (9%) PROPN nodes have two children.

654 (9%) PROPN nodes have three or more children.

The highest child degree of a PROPN node is 14.

Children of PROPN nodes are attached using 43 different relations: case (1333; 23% instances), nmod (878; 15% instances), punct (713; 12% instances), flat:name (497; 8% instances), det (489; 8% instances), conj (273; 5% instances), vocative:mention (254; 4% instances), cc (154; 3% instances), amod (147; 2% instances), parataxis:url (131; 2% instances), parataxis (115; 2% instances), parataxis:hashtag (98; 2% instances), flat (92; 2% instances), parataxis:rt (89; 2% instances), parataxis:sentence (86; 1% instances), obl (63; 1% instances), xcomp (58; 1% instances), advmod (54; 1% instances), appos (50; 1% instances), discourse:emo (33; 1% instances), obl:tmod (32; 1% instances), compound (29; 0% instances), obl:prep (29; 0% instances), xcomp:pred (29; 0% instances), acl:relcl (25; 0% instances), case:voc (25; 0% instances), nsubj (22; 0% instances), discourse (21; 0% instances), cop (19; 0% instances), vocative (17; 0% instances), advcl (11; 0% instances), nummod (8; 0% instances), csubj:cleft (6; 0% instances), mark (6; 0% instances), flat:foreign (5; 0% instances), det:poss (4; 0% instances), goeswith (4; 0% instances), mark:prt (4; 0% instances), obj (4; 0% instances), acl (3; 0% instances), aux (1; 0% instances), compound:prt (1; 0% instances), list (1; 0% instances)

Children of PROPN nodes belong to 17 different parts of speech: PROPN (1667; 28% instances), ADP (1339; 23% instances), PUNCT (713; 12% instances), DET (501; 8% instances), NOUN (437; 7% instances), SYM (257; 4% instances), PART (186; 3% instances), NUM (157; 3% instances), ADJ (155; 3% instances), CCONJ (155; 3% instances), X (115; 2% instances), VERB (90; 2% instances), ADV (54; 1% instances), PRON (38; 1% instances), INTJ (23; 0% instances), AUX (20; 0% instances), SCONJ (6; 0% instances)