NOUN: noun
Nouns inflect for case, number and possession, and take nominal morphology. Words from other parts of speech, such as adjectives, may be derived into nouns.
Proper nouns are annotated as PROPN rather than NOUN.
Examples
- [kk] қыз “girl”
- [kk] үй “house”
- [kk] ағаш “tree”
Treebank Statistics (UD_Kazakh)
There are 778 NOUN lemmas (42%), 1262 NOUN types (44%) and 1859 NOUN tokens (30%).
Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
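The lemma, type and token counts above can be reproduced from a CoNLL-U file by collecting the LEMMA and FORM columns of every token tagged NOUN. The following is a minimal sketch, assuming the standard ten-column CoNLL-U layout. The `SAMPLE` fragment and the `pos_counts` helper are hypothetical: they are not part of the UD tooling, and the sample annotations are invented for illustration rather than taken from UD_Kazakh.

```python
# Hypothetical CoNLL-U fragment for illustration only (not from UD_Kazakh).
SAMPLE = "\n".join([
    "# sent_id = demo-1",
    "1\tелде\tел\tNOUN\t_\tCase=Loc\t3\tnmod\t_\t_",
    "2\tбала\tбала\tNOUN\t_\t_\t3\tnsubj\t_\t_",
    "3\tтұр\tтұр\tVERB\t_\t_\t0\troot\t_\t_",
    "",
    "# sent_id = demo-2",
    "1\tел\tел\tNOUN\t_\t_\t2\tnsubj\t_\t_",
    "2\tелде\tел\tNOUN\t_\tCase=Loc\t0\troot\t_\t_",
])


def pos_counts(conllu_text, upos):
    """Return (distinct lemmas, distinct forms, token count) for one UPOS tag."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # sentence separators and comment lines
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        form, lemma, tag = cols[1], cols[2], cols[3]
        if tag == upos:
            lemmas.add(lemma)
            types.add(form)
            tokens += 1
    return len(lemmas), len(types), tokens


print(pos_counts(SAMPLE, "NOUN"))  # (2, 3, 4)
```

Forms are compared case-sensitively here, which is an assumption about how types are counted.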
The 10 most frequent NOUN lemmas: ел, орыс, жыл, жер, ғасыр, ж., адам, бала, мемлекет, қазақ
The 10 most frequent NOUN types: _, ж., орыс, қазақ, ел, әулеті, ғасырдың, елде, мал, кісі
The 10 most frequent ambiguous lemmas: қала (NOUN 17, VERB 1), мал (NOUN 16, VERB 1), бас (NOUN 14, VERB 9), ат (NOUN 9, VERB 4), бай (NOUN 8, ADJ 2), жақ (NOUN 7, VERB 1), ана (NOUN 6, DET 1), арт (NOUN 6, VERB 1), іш (NOUN 6, VERB 3), ет (NOUN 5, VERB 3)
The 10 most frequent ambiguous types: _ (AUX 154, PART 76, NOUN 75, ADJ 72, VERB 29, PRON 23, CONJ 13, ADV 7, ADP 7, PROPN 5, NUM 4, PUNCT 1), жылы (NOUN 5, ADJ 1), бай (NOUN 2, ADJ 2), млн. (NOUN 3, NUM 2), Батыс (NOUN 2, ADJ 2), КСРО (NOUN 2, PROPN 2), сайлау (NOUN 2, VERB 1), ұлы (NOUN 2, ADJ 1), Темір (NOUN 1, PROPN 1), ар (ADJ 1, NOUN 1)
- _
- AUX 154: Иран — діни _ _ .
- PART 76: Қазірде орыстан оқыған балалардан артық жақсы кісі шыға _ _ тұр .
- NOUN 75: Иран — діни _ _ .
- ADJ 72: Жер беті суы _ _ .
- VERB 29: Құлдық пен құл саудасына , қандай түрде _ _ , тыйым салынады .
- PRON 23: Сіздің атыңыз _ _ ?
- CONJ 13: Ол _ _ , _ _ емес .
- ADV 7: — Бәйбіше _ _ ?
- ADP 7: Неке , тек екі жақтың өзара еркін және толық келісімі _ _ қиылады .
- PROPN 5: Баяғыда біреу той жасапты , тойға көп кісі жиналыпты , _ _ келіпті .
- NUM 4: Қала халқы _ _ .
- PUNCT 1: Халқының ұлттық құрамы : парсылар ( 51% ) , әзірбайжандар ( 27% ) , күрдтер ( 5% ) , арабтар , түрікмендер , белуджилер , армяндар , еврейлер , _ _ _
Morphology
The form / lemma ratio of NOUN is 1.622108 (the average of all parts of speech is 1.549647).
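The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given in the treebank statistics. A quick check, assuming that "forms" here means the same thing as "types" (the page does not state how the ratio is computed):

```python
noun_types = 1262   # NOUN types reported in the treebank statistics
noun_lemmas = 778   # NOUN lemmas reported in the treebank statistics

ratio = noun_types / noun_lemmas
print(f"{ratio:.6f}")  # 1.622108
```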
The highest number of forms (17) was observed with the lemma “бала”: _, Балаларды, Балалардың, бала, балалар, балалардан, балалармен, балаларына, балаларынан, балама, баламды, баласы, баласын, баласына, балаға, балаң, балаңа.
The 2nd highest number of forms (16) was observed with the lemma “ел”: _, ел, елге, елде, елдегі, елден, елдер, елдерден, елдерді, елдері, елдерімен, елді, елдің, елі, елінің, еліңе.
The 3rd highest number of forms (11) was observed with the lemma “жер”: жер, жерге, жерде, жерді, жері, жерін, жерінде, жеріне, жерінен, жеріңді, жеріңе.
NOUN occurs with 4 features: kk-feat/Case (314; 17% instances), kk-feat/Number[psor] (98; 5% instances), kk-feat/Person[psor] (98; 5% instances), kk-feat/Number (29; 2% instances)
NOUN occurs with 14 feature-value pairs: Case=Abl, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Number=Plur, Number[psor]=Plur, Number[psor]=Plur,Sing, Number[psor]=Sing, Person[psor]=1, Person[psor]=2, Person[psor]=3
NOUN occurs with 30 feature combinations.
The most frequent feature combination is _ (1545 tokens).
Examples: _, ж., орыс, әулеті, ел, ғасырдың, елде, парсы, тілдерін, ғасырда
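Feature counts of this kind come from the FEATS column, where each token carries either `_` or a `|`-separated list of `Name=Value` pairs. The sketch below shows one way to tally them. The `parse_feats` helper and the `noun_feats` list are hypothetical and serve only as an illustration; they are not the script that produced the figures above.

```python
from collections import Counter


def parse_feats(feats):
    """Turn a FEATS string such as 'Case=Gen|Number=Plur' into a dict."""
    if feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values of five NOUN tokens (illustration only).
noun_feats = [
    "Case=Loc",
    "_",
    "Case=Gen|Number[psor]=Sing|Person[psor]=3",
    "_",
    "Number=Plur",
]

# Count how many tokens carry each feature name.
feature_counts = Counter()
for feats in noun_feats:
    feature_counts.update(parse_feats(feats).keys())

for name, count in feature_counts.most_common():
    share = round(100 * count / len(noun_feats))
    print(f"{name}: {count} ({share}%)")
```

Applied to the reported figures, 314 Case-marked tokens out of 1859 NOUN tokens gives roughly 17%, which matches the percentage shown for Case.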
Relations
NOUN nodes are attached to their parents using 24 different relations: kk-dep/nmod (497; 27% instances), kk-dep/nsubj (395; 21% instances), kk-dep/nmod:poss (321; 17% instances), kk-dep/dobj (280; 15% instances), kk-dep/conj (128; 7% instances), kk-dep/root (67; 4% instances), kk-dep/compound (54; 3% instances), kk-dep/appos (24; 1% instances), kk-dep/remnant (17; 1% instances), kk-dep/amod (14; 1% instances), kk-dep/advcl (13; 1% instances), kk-dep/name (9; 0% instances), kk-dep/parataxis (8; 0% instances), kk-dep/ccomp (7; 0% instances), kk-dep/nummod (6; 0% instances), kk-dep/iobj (5; 0% instances), kk-dep/xcomp (3; 0% instances), kk-dep/acl:relcl (2; 0% instances), kk-dep/advmod (2; 0% instances), kk-dep/nmod:own (2; 0% instances), kk-dep/vocative (2; 0% instances), kk-dep/acl (1; 0% instances), kk-dep/csubj (1; 0% instances), kk-dep/dobj:caus (1; 0% instances)
Parents of NOUN nodes belong to 12 different parts of speech: VERB (1017; 55% instances), NOUN (623; 34% instances), ADJ (88; 5% instances), ROOT (67; 4% instances), PROPN (27; 1% instances), PRON (13; 1% instances), NUM (12; 1% instances), ADV (8; 0% instances), AUX (1; 0% instances), CONJ (1; 0% instances), DET (1; 0% instances), PUNCT (1; 0% instances)
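Both distributions above can be derived from the HEAD and DEPREL columns: the relation is read off the NOUN token itself, and the parent's part of speech is looked up through the head index, with head 0 counted as ROOT. A minimal sketch under those assumptions follows; the `sentence` data and the `noun_attachments` helper are hypothetical.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head, deprel) tuples; head 0 marks the root.
sentence = [
    (1, "NOUN", 2, "nmod:poss"),
    (2, "NOUN", 4, "nsubj"),
    (3, "NOUN", 4, "dobj"),
    (4, "VERB", 0, "root"),
    (5, "PUNCT", 4, "punct"),
]


def noun_attachments(sentence):
    """Count incoming relations and parent POS tags for NOUN tokens."""
    upos_by_id = {tok_id: upos for tok_id, upos, _, _ in sentence}
    relations, parent_pos = Counter(), Counter()
    for _, upos, head, deprel in sentence:
        if upos != "NOUN":
            continue
        relations[deprel] += 1
        # A token attached to the artificial root has no parent POS.
        parent_pos["ROOT" if head == 0 else upos_by_id[head]] += 1
    return relations, parent_pos


relations, parent_pos = noun_attachments(sentence)
print(relations.most_common())
print(parent_pos.most_common())
```

In a real treebank the counters would be accumulated across all sentences before computing percentages.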
769 (41%) NOUN nodes are leaves.
669 (36%) NOUN nodes have one child.
234 (13%) NOUN nodes have two children.
187 (10%) NOUN nodes have three or more children.
The highest child degree of a NOUN node is 19.
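The four child-count buckets above partition the NOUN tokens: 769 + 669 + 234 + 187 = 1859. A child degree is simply the number of tokens whose HEAD points at a given node. The sketch below illustrates the bucketing on a hypothetical sentence; `child_degree_buckets` is an illustrative helper, not part of the UD tooling.

```python
from collections import Counter

# Hypothetical sentence as (id, upos, head) tuples; head 0 marks the root.
sentence = [
    (1, "ADJ", 3),
    (2, "NOUN", 3),
    (3, "NOUN", 5),
    (4, "NOUN", 5),
    (5, "VERB", 0),
]


def child_degree_buckets(sentence):
    """Bucket NOUN tokens by how many children they have."""
    # Number of dependents per head id (the artificial root is ignored).
    children = Counter(head for _, _, head in sentence if head != 0)
    buckets = Counter()
    for tok_id, upos, _ in sentence:
        if upos != "NOUN":
            continue
        n = children[tok_id]  # a Counter returns 0 for ids with no dependents
        if n == 0:
            buckets["leaf"] += 1
        elif n == 1:
            buckets["one child"] += 1
        elif n == 2:
            buckets["two children"] += 1
        else:
            buckets["three or more children"] += 1
    return buckets


buckets = child_degree_buckets(sentence)
print(dict(buckets))  # {'leaf': 2, 'two children': 1}
```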
Children of NOUN nodes are attached using 25 different relations: kk-dep/nmod:poss (431; 22% instances), kk-dep/amod (326; 17% instances), kk-dep/punct (251; 13% instances), kk-dep/conj (131; 7% instances), kk-dep/det (114; 6% instances), kk-dep/cop (93; 5% instances), kk-dep/acl:relcl (87; 5% instances), kk-dep/nmod (68; 4% instances), kk-dep/nsubj (67; 3% instances), kk-dep/cc (61; 3% instances), kk-dep/compound (59; 3% instances), kk-dep/case (55; 3% instances), kk-dep/nummod (49; 3% instances), kk-dep/appos (31; 2% instances), kk-dep/advmod (24; 1% instances), kk-dep/acl (22; 1% instances), kk-dep/remnant (17; 1% instances), kk-dep/advcl (14; 1% instances), kk-dep/parataxis (10; 1% instances), kk-dep/aux (5; 0% instances), kk-dep/name (5; 0% instances), kk-dep/discourse (4; 0% instances), kk-dep/csubj (3; 0% instances), kk-dep/ccomp (1; 0% instances), kk-dep/dobj (1; 0% instances)
Children of NOUN nodes belong to 15 different parts of speech: NOUN (623; 32% instances), PUNCT (243; 13% instances), ADJ (232; 12% instances), VERB (152; 8% instances), NUM (151; 8% instances), PROPN (148; 8% instances), DET (114; 6% instances), AUX (64; 3% instances), CONJ (60; 3% instances), ADP (55; 3% instances), PRON (42; 2% instances), PART (24; 1% instances), ADV (18; 1% instances), INTJ (2; 0% instances), SCONJ (1; 0% instances)