Treebank Statistics: UD_Scottish_Gaelic-ARCOSG: POS Tags: PROPN
There are 1070 PROPN lemmas (17%), 1177 PROPN types (14%) and 4315 PROPN tokens (5%).
Out of 17 observed tags, the rank of PROPN is: 2 in number of lemmas, 2 in number of types and 9 in number of tokens.
The 10 most frequent PROPN lemmas: [Name], Alba, [Placename], gàidhlig, Iain, Yugoslavia, Dòmhnall, MacLeish, Uibhist, Malpas
The 10 most frequent PROPN types: [Name], [Placename], Alba, Iain, Yugoslavia, Gàidhlig, Dòmhnall, Ghàidhlig, MacLeish, Malpas
The 10 most frequent ambiguous lemmas: gaidheal (PROPN 11, NOUN 1), dòmhnallach (ADJ 1, PROPN 1), Mac (PART 44, PROPN 4), celtic (PROPN 3, NOUN 1), sgitheanach (ADJ 1, PROPN 1), BBC (NOUN 3, PROPN 3), a (PART 3253, DET 592, PRON 427, ADP 279, ADV 142, ADJ 51, SCONJ 7, X 6, INTJ 4, PROPN 3, CCONJ 2), CalMac (PROPN 2, NOUN 1), IRA (NOUN 4, PROPN 2), albannach (ADJ 11, PROPN 1)
The 10 most frequent ambiguous types: [Name] (PROPN 286, ADJ 3), Nis (PROPN 28, INTJ 3), Eilean (NOUN 15, PROPN 13), Roinn (PROPN 11, NOUN 5), Dòmhnallach (PROPN 9, ADJ 1), h-Eileanan (PROPN 9, NOUN 5), Ceann (PROPN 8, NOUN 2), Fionn (PROPN 7, NOUN 1), Siar (PROPN 7, ADJ 5), Ailean (PROPN 5, NOUN 1)
- [Name]
- Nis
- Eilean
- Roinn
- Dòmhnallach
- PROPN 9: uill còmhla ri mi tha a’s an stiùidio ana-seo an-dràsta tha Murchadh Dòmhnallach
- ADJ 1: An déidh do an sgeulachd a dhol an clò , lorgadh dà innse eile : aithris neo-iomlan a recòrd an Dr Calum MacGilleathain nach maireann do Choimisiún Béaloideasa Eireann o an sgeulaiche ainmeil nach maireann Donnchadh Dòmhnallach ( Donnchadh mac Dhòmhnaill ‘ic Dhonnchaidh ) an Uibhist a Deas ; agus aithris làmh-sgrìobhta ann an Cruinneachaidhe a chaidh a toirt sìos le Dòmhnall Iain Dòmhnallach , mac Dhonnchaidh a dh’ainmich mi , o bhràthair Dhonnchaidh , Niall mac Dhòmhnaill ‘ic Dhonnchaidh nach maireann .
- h-Eileanan
- Ceann
- Fionn
- Siar
- Ailean
Morphology
The form / lemma ratio of PROPN is 1.100000 (the average of all parts of speech is 1.317448).
The 1st highest number of forms (7) was observed with the lemma “Alba”: Alba, Albainn, dh’Alba, dh’Alba, dh’Albainn, h-Alba, h-Albann.
The 2nd highest number of forms (4) was observed with the lemma “Iain”: Iain, dh’Iain, dh’Iain, lain.
The 3rd highest number of forms (3) was observed with the lemma “Astràilia”: Astràilia, dh’Astràilia, dh’Astràilia.
PROPN occurs with 8 features: NounType (4315; 100% instances), Case (1721; 40% instances), Gender (1212; 28% instances), Number (112; 3% instances), CleftType (79; 2% instances), Abbr (48; 1% instances), Foreign (32; 1% instances), Typo (17; 0% instances)
PROPN occurs with 21 feature-value pairs: Abbr=Yes, Case=Dat, Case=Gen, Case=Nom, Case=Voc, CleftType=Nom, CleftType=Obl, Foreign=Yes, Gender=Fem, Gender=Masc, NounType=Chr, NounType=Eth, NounType=Glt, NounType=Nau, NounType=Nos, NounType=Org, NounType=Prs, NounType=Top, Number=Plur, Number=Sing, Typo=Yes
PROPN occurs with 59 feature combinations.
The most frequent feature combination is NounType=Prs (1409 tokens).
Examples: [Name], MacLeish, Malpas, Iain, Dalgleish, Aitken, Johnson, MacStay, Cooper, Nicol
Relations
PROPN nodes are attached to their parents using 22 different relations: nmod (1133; 26% instances), flat:name (642; 15% instances), obl (629; 15% instances), nsubj (565; 13% instances), root (437; 10% instances), conj (260; 6% instances), xcomp:pred (223; 5% instances), appos (115; 3% instances), vocative (93; 2% instances), obj (68; 2% instances), nmod:unmarked (55; 1% instances), dislocated (26; 1% instances), ccomp (18; 0% instances), parataxis (14; 0% instances), advcl (13; 0% instances), nsubj:pass (10; 0% instances), obl:agent (5; 0% instances), flat (4; 0% instances), reparandum (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), dep (1; 0% instances)
Parents of PROPN nodes belong to 12 different parts of speech: NOUN (1520; 35% instances), VERB (1155; 27% instances), PROPN (1033; 24% instances), (437; 10% instances), PART (73; 2% instances), PRON (43; 1% instances), ADJ (37; 1% instances), ADV (6; 0% instances), NUM (6; 0% instances), X (3; 0% instances), DET (1; 0% instances), INTJ (1; 0% instances)
1458 (34%) PROPN nodes are leaves.
1635 (38%) PROPN nodes have one child.
766 (18%) PROPN nodes have two children.
456 (11%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 10.
Children of PROPN nodes are attached using 34 different relations: case (1326; 28% instances), flat:name (609; 13% instances), det (520; 11% instances), conj (368; 8% instances), xcomp:pred (349; 7% instances), nmod (314; 7% instances), cc (218; 5% instances), punct (217; 5% instances), advmod (147; 3% instances), amod (144; 3% instances), cop (101; 2% instances), nmod:unmarked (97; 2% instances), csubj:cleft (79; 2% instances), appos (61; 1% instances), case:voc (50; 1% instances), acl:relcl (38; 1% instances), advcl (23; 0% instances), xcomp (18; 0% instances), nsubj (17; 0% instances), parataxis (17; 0% instances), discourse (13; 0% instances), flat (12; 0% instances), mark (12; 0% instances), mark:prt (12; 0% instances), dep (8; 0% instances), nmod:poss (4; 0% instances), reparandum (4; 0% instances), nummod (3; 0% instances), obj (3; 0% instances), vocative (2; 0% instances), ccomp (1; 0% instances), csubj:cop (1; 0% instances), obl:unmarked (1; 0% instances), orphan (1; 0% instances)
Children of PROPN nodes belong to 16 different parts of speech: ADP (1326; 28% instances), PROPN (1033; 22% instances), NOUN (532; 11% instances), DET (520; 11% instances), VERB (237; 5% instances), CCONJ (218; 5% instances), PUNCT (217; 5% instances), ADV (162; 3% instances), ADJ (154; 3% instances), PRON (136; 3% instances), PART (109; 2% instances), AUX (101; 2% instances), X (13; 0% instances), SCONJ (12; 0% instances), INTJ (11; 0% instances), NUM (9; 0% instances)