PROPN
: proper noun
A proper noun is a noun (or nominal content word) that is the name (or part of the name) of a specific individual, place, or object.
Acronyms of proper nouns, such as ӨФ and БҰҰ, should be tagged PROPN
.
Examples
- [kk] Исфаһан, Оксана, Шыңғыс
- [kk] Сараево, Алматы
- [kk] ӨФ, БҰҰ “RF, UN”
Treebank Statistics (UD_Kazakh)
There are 147 PROPN
lemmas (8%), 182 PROPN
types (6%) and 316 PROPN
tokens (5%).
Out of 16 observed tags, the rank of PROPN
is: 4 in number of lemmas, 4 in number of types and 6 in number of tokens.
The 10 most frequent PROPN
lemmas: Иран, Азамат, Айгүл, Ерназар, Астана, АҚШ, Бразилия, Қазақстан, Ақбілек, Төстік
The 10 most frequent PROPN
types: Иран, Азамат, Айгүл, Астанаға, _, АҚШ, Ерназардың, Азаматтың, Бразилия, Иранды
The 10 most frequent ambiguous lemmas: КСРО (PROPN 3, NOUN 2), салжұқ (NOUN 2, PROPN 1)
The 10 most frequent ambiguous types: _ (AUX 154, PART 76, NOUN 75, ADJ 72, VERB 29, PRON 23, CONJ 13, ADV 7, ADP 7, PROPN 5, NUM 4, PUNCT 1), КСРО (NOUN 2, PROPN 2), Темір (NOUN 1, PROPN 1), салжұқтар (NOUN 1, PROPN 1)
- _
- AUX 154: Иран — діни _ _ .
- PART 76: Қазірде орыстан оқыған балалардан артық жақсы кісі шыға _ _ тұр .
- NOUN 75: Иран — діни _ _ .
- ADJ 72: Жер беті суы _ _ .
- VERB 29: Құлдық пен құл саудасына , қандай түрде _ _ , тыйым салынады .
- PRON 23: Сіздің атыңыз _ _ ?
- CONJ 13: Ол _ _ , _ _ емес .
- ADV 7: — Бәйбіше _ _ ?
- ADP 7: Неке , тек екі жақтың өзара еркін және толық келісімі _ _ қиылады .
- PROPN 5: Баяғыда біреу той жасапты , тойға көп кісі жиналыпты , _ _ келіпті .
- NUM 4: Қала халқы _ _ .
- PUNCT 1: Халқының ұлттық құрамы : парсылар ( 51% ) , әзірбайжандар ( 27% ) , күрдтер ( 5% ) , арабтар , түрікмендер , белуджилер , армяндар , еврейлер , _ _ _
- КСРО
- Темір
- салжұқтар
Morphology
The form / lemma ratio of PROPN
is 1.238095 (the average of all parts of speech is 1.549647).
The 1st highest number of forms (5) was observed with the lemma “Азамат”: _, АЗАМАТ, Азамат, Азаматты, Азаматтың.
The 2nd highest number of forms (5) was observed with the lemma “Иран”: Иран, Иранда, Иранды, Иранның, Иранға.
The 3rd highest number of forms (3) was observed with the lemma “Айгүл”: Айгүл, Айгүлден, Айгүлдің.
PROPN
occurs with 2 features: Case (19; 6% instances), Gender (4; 1% instances)
PROPN
occurs with 6 feature-value pairs: Case=Acc
, Case=Gen
, Case=Loc
, Case=Nom
, Gender=Fem
, Gender=Msc
PROPN
occurs with 8 feature combinations.
The most frequent feature combination is _
(297 tokens).
Examples: Иран, Азамат, Айгүл, Астанаға, АҚШ, Ерназардың, _, Азаматтың, Бразилия, Иранды
Relations
PROPN
nodes are attached to their parents using 12 different relations: nmod:poss (106; 34% instances), nsubj (65; 21% instances), conj (49; 16% instances), nmod (40; 13% instances), name (21; 7% instances), appos (11; 3% instances), dobj (11; 3% instances), compound (4; 1% instances), amod (3; 1% instances), remnant (2; 1% instances), root (2; 1% instances), vocative (2; 1% instances)
Parents of PROPN
nodes belong to 8 different parts of speech: NOUN (148; 47% instances), VERB (90; 28% instances), PROPN (63; 20% instances), ADJ (6; 2% instances), ADV (4; 1% instances), AUX (2; 1% instances), ROOT (2; 1% instances), PRON (1; 0% instances)
228 (72%) PROPN
nodes are leaves.
41 (13%) PROPN
nodes have one child.
25 (8%) PROPN
nodes have two children.
22 (7%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 13.
Children of PROPN
nodes are attached using 19 different relations: punct (62; 30% instances), conj (44; 21% instances), name (25; 12% instances), amod (17; 8% instances), appos (12; 6% instances), cc (12; 6% instances), compound (5; 2% instances), advmod (4; 2% instances), case (4; 2% instances), cop (4; 2% instances), nsubj (4; 2% instances), kk-dep/acl:relcl (3; 1% instances), det (2; 1% instances), nummod (2; 1% instances), acl (1; 0% instances), discourse (1; 0% instances), nmod (1; 0% instances), nmod:poss (1; 0% instances), remnant (1; 0% instances)
Children of PROPN
nodes belong to 13 different parts of speech: PROPN (63; 31% instances), PUNCT (61; 30% instances), NOUN (27; 13% instances), ADJ (16; 8% instances), CONJ (12; 6% instances), NUM (5; 2% instances), ADP (4; 2% instances), PART (4; 2% instances), VERB (4; 2% instances), AUX (3; 1% instances), PRON (3; 1% instances), DET (2; 1% instances), ADV (1; 0% instances)
PROPN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]