This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.

NOUN: noun

Definition

Nouns are a part of speech typically denoting a person, place, thing, animal or idea.

The NOUN tag is intended for common nouns only. See PROPN for proper nouns and PRON for pronouns.

Russian nouns have the lexical feature ru-feat/Gender. They also inflect for ru-feat/Number and ru-feat/Case.

A verbal noun can be derived productively from almost every verb (e.g. есть “to eat” → поедание “eating”). In other languages a corresponding form may be called a gerund and tagged VERB, but in Russian it is tagged NOUN. It is always neuter and has the full number-case inflectional paradigm.

Examples


Treebank Statistics (UD_Russian)

There are 6400 NOUN lemmas (33%), 11538 NOUN types (38%) and 27252 NOUN tokens (27%). Out of 16 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.
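
The three counts can be read as follows: tokens are running words tagged NOUN, types are distinct word forms, and lemmas are distinct dictionary forms. The following is a minimal sketch of that counting, assuming the treebank has already been read into (form, lemma, upos) triples. The function name `noun_counts` and the sample rows are illustrative, and details such as case folding in the original statistics script are not known.

```python
def noun_counts(rows, tag="NOUN"):
    """Return (lemmas, types, tokens) for one tag.

    rows: iterable of (form, lemma, upos) triples (assumed input format).
    """
    forms, lemmas, tokens = set(), set(), 0
    for form, lemma, upos in rows:
        if upos != tag:
            continue
        tokens += 1          # every running word counts as a token
        forms.add(form)      # distinct word forms are the types
        lemmas.add(lemma)    # distinct dictionary forms are the lemmas
    return len(lemmas), len(forms), tokens


# Toy rows for illustration only; not real treebank data.
sample = [
    ("года", "ГОД", "NOUN"),
    ("году", "ГОД", "NOUN"),
    ("года", "ГОД", "NOUN"),
    ("время", "ВРЕМЯ", "NOUN"),
    ("в", "В", "ADP"),
]
print(noun_counts(sample))
```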

The 10 most frequent NOUN lemmas: ГОД, ВРЕМЯ, ГОРОД, ЧЕЛОВЕК, ЧАСТЬ, РАЙОН, ОБЛАСТЬ, СОСТАВ, НАСЕЛЕНИЕ, РЕКА

The 10 most frequent NOUN types: года, году, время, области, лет, человек, войны, реки, год, км

The 10 most frequent ambiguous lemmas: Г. (NOUN 58, PROPN 1), ЧЛЕН (NOUN 57, ADV 1), ЗЕМЛЯ (NOUN 54, PROPN 1), ОСТРОВ (NOUN 52, PROPN 1), ДОМ (NOUN 44, PROPN 1), АВГУСТ (NOUN 41, PROPN 6), ВОСТОК (NOUN 41, PROPN 1), СЛОВО (NOUN 36, PROPN 1), УЛИЦА (NOUN 31, ADV 1), ПРЕЗИДЕНТ (NOUN 29, PROPN 1)
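
A lemma is listed as ambiguous here when it occurs with NOUN and with at least one other tag. A hedged sketch of how such a list could be collected, again assuming (form, lemma, upos) triples as input; `ambiguous_lemmas` is a hypothetical helper, not part of any UD tool:

```python
from collections import Counter, defaultdict


def ambiguous_lemmas(rows, tag="NOUN"):
    """Map each lemma seen with `tag` and at least one other tag to its tag counts."""
    by_lemma = defaultdict(Counter)
    for _form, lemma, upos in rows:
        by_lemma[lemma][upos] += 1
    return {
        lemma: dict(tags)
        for lemma, tags in by_lemma.items()
        if tag in tags and len(tags) > 1
    }


# Toy rows for illustration only; the counts are not the treebank's.
sample = [
    ("дом", "ДОМ", "NOUN"),
    ("дома", "ДОМ", "NOUN"),
    ("Дом", "ДОМ", "PROPN"),
    ("год", "ГОД", "NOUN"),
]
print(ambiguous_lemmas(sample))
```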

The 10 most frequent ambiguous types: мм (NOUN 27, ADJ 5), м (NOUN 23, ADJ 12), No (NOUN 21, PART 1), дома (NOUN 13, ADV 3), основном (NOUN 16, ADJ 1), начала (NOUN 12, VERB 5), б (NOUN 6, ADJ 1), типа (ADP 14, NOUN 11), а (CONJ 261, ADV 4, NOUN 1), начало (NOUN 9, VERB 1)

Morphology

The form / lemma ratio of NOUN is 1.802812 (the average of all parts of speech is 1.591757).
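
The reported ratio is consistent with dividing the number of NOUN types by the number of NOUN lemmas given at the top of this section. That reading is inferred from the figures, not stated by the page. A quick check:

```python
# Counts reported above for UD_Russian.
noun_types = 11538
noun_lemmas = 6400

# Distinct forms per lemma; agrees with the reported 1.802812 to six decimals.
ratio = noun_types / noun_lemmas
print(f"{ratio:.7f}")  # prints 1.8028125
```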

The highest number of forms (10) was observed with the lemma “АКТЕР”: актер, актера, актеров, актеры, актёр, актёра, актёрами, актёров, актёром, актёры (the count includes both е and ё spellings).

The same number of forms (10) was observed with the lemma “ГОД”: год, года, годам, годами, годах, годов, годом, году, годы, лет.

The same number of forms (10) was observed with the lemma “ФИЛЬМ”: фильм, фильма, фильмам, фильмами, фильмах, фильме, фильмов, фильмом, фильму, фильмы.

NOUN occurs with 4 features: ru-feat/Animacy (27197; 100% instances), ru-feat/Case (27197; 100% instances), ru-feat/Gender (27197; 100% instances), ru-feat/Number (27197; 100% instances)

NOUN occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Par, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 73 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing (3017 tokens). Examples: года, города, мира, века, декабря, района, сентября, января, марта, июня
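
A feature combination is written as Name=Value pairs joined by `|`, as in the CoNLL-U FEATS column, where `_` stands for an empty feature set. A small sketch that splits such a string into a dictionary; the helper name `parse_feats` is illustrative:

```python
def parse_feats(feats):
    """Split a 'Name=Value|Name=Value' string into a dict; '_' means no features."""
    if not feats or feats == "_":
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = "Animacy=Inan|Case=Gen|Gender=Masc|Number=Sing"
print(parse_feats(combo))
```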

Relations

NOUN nodes are attached to their parents using 32 different relations: ru-dep/nmod (15477; 57% instances), ru-dep/nsubj (3160; 12% instances), ru-dep/dobj (2224; 8% instances), ru-dep/conj (2038; 7% instances), ru-dep/appos (1095; 4% instances), ru-dep/root (873; 3% instances), ru-dep/iobj (756; 3% instances), ru-dep/nsubjpass (551; 2% instances), ru-dep/advmod (496; 2% instances), ru-dep/list (138; 1% instances), ru-dep/remnant (103; 0% instances), ru-dep/parataxis (85; 0% instances), ru-dep/nummod:gov (55; 0% instances), ru-dep/mwe (42; 0% instances), ru-dep/ccomp (36; 0% instances), ru-dep/acl:relcl (32; 0% instances), ru-dep/acl (21; 0% instances), ru-dep/amod (14; 0% instances), ru-dep/xcomp (13; 0% instances), ru-dep/advcl (12; 0% instances), ru-dep/goeswith (6; 0% instances), ru-dep/cop (5; 0% instances), ru-dep/discourse (5; 0% instances), ru-dep/compound (3; 0% instances), ru-dep/nummod (3; 0% instances), ru-dep/case (2; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/csubj (1; 0% instances), ru-dep/foreign (1; 0% instances), ru-dep/mark (1; 0% instances), ru-dep/name (1; 0% instances), ru-dep/nummod:entity (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech: NOUN (12359; 45% instances), VERB (11974; 44% instances), ROOT (873; 3% instances), ADJ (699; 3% instances), PROPN (469; 2% instances), ADP (418; 2% instances), NUM (223; 1% instances), ADV (127; 0% instances), SYM (48; 0% instances), PRON (26; 0% instances), DET (22; 0% instances), PUNCT (9; 0% instances), CONJ (4; 0% instances), AUX (1; 0% instances)
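
Both distributions above come from looking at each NOUN node's incoming relation and at the tag of its head, with head 0 standing for the artificial ROOT. The sketch below shows that bookkeeping on a single hand-built sentence. The tuple layout (id, upos, head, deprel), the function name `noun_attachments`, and the example tree are assumptions made for illustration, not treebank data.

```python
from collections import Counter


def noun_attachments(sentence):
    """Count incoming relations and parent tags for NOUN nodes in one sentence.

    sentence: list of (id, upos, head, deprel); head 0 is the artificial root.
    """
    upos_by_id = {tok_id: upos for tok_id, upos, _head, _rel in sentence}
    relations, parents = Counter(), Counter()
    for _tok_id, upos, head, deprel in sentence:
        if upos != "NOUN":
            continue
        relations[deprel] += 1
        parents["ROOT" if head == 0 else upos_by_id[head]] += 1
    return relations, parents


# Hand-built toy tree for "В городе жили люди"; illustrative only.
sentence = [
    (1, "ADP", 2, "case"),
    (2, "NOUN", 3, "nmod"),
    (3, "VERB", 0, "root"),
    (4, "NOUN", 3, "nsubj"),
]
relations, parents = noun_attachments(sentence)
print(relations, parents)
```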

4395 (16%) NOUN nodes are leaves.

8965 (33%) NOUN nodes have one child.

7962 (29%) NOUN nodes have two children.

5930 (22%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 55.
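
The four child-degree buckets above partition the NOUN tokens, which gives a quick consistency check against the token count reported at the top of this section. A short sketch of that arithmetic, using only the figures already given:

```python
# Child-degree buckets reported above for UD_Russian NOUN nodes.
buckets = {
    "leaves": 4395,
    "one child": 8965,
    "two children": 7962,
    "three or more": 5930,
}

total = sum(buckets.values())  # equals the reported NOUN token count, 27252
shares = {name: round(100 * n / total) for name, n in buckets.items()}
print(total, shares)
```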

Children of NOUN nodes are attached using 39 different relations: ru-dep/nmod (10613; 22% instances), ru-dep/amod (10400; 21% instances), ru-dep/case (8340; 17% instances), ru-dep/punct (6150; 13% instances), ru-dep/appos (2447; 5% instances), ru-dep/conj (2181; 4% instances), ru-dep/det (1270; 3% instances), ru-dep/cc (1267; 3% instances), ru-dep/acl (1020; 2% instances), ru-dep/nummod:gov (840; 2% instances), ru-dep/nsubj (781; 2% instances), ru-dep/goeswith (518; 1% instances), ru-dep/acl:relcl (508; 1% instances), ru-dep/nummod (471; 1% instances), ru-dep/advmod (386; 1% instances), ru-dep/cop (364; 1% instances), ru-dep/list (338; 1% instances), ru-dep/discourse (159; 0% instances), ru-dep/remnant (145; 0% instances), ru-dep/parataxis (126; 0% instances), ru-dep/iobj (99; 0% instances), ru-dep/nummod:entity (89; 0% instances), ru-dep/mwe (71; 0% instances), ru-dep/compound (55; 0% instances), ru-dep/mark (51; 0% instances), ru-dep/cc:preconj (49; 0% instances), ru-dep/advcl (41; 0% instances), ru-dep/neg (32; 0% instances), ru-dep/dobj (17; 0% instances), ru-dep/ccomp (14; 0% instances), ru-dep/csubj (3; 0% instances), ru-dep/aux (2; 0% instances), ru-dep/auxpass (2; 0% instances), ru-dep/dep (2; 0% instances), ru-dep/name (2; 0% instances), ru-dep/xcomp (2; 0% instances), ru-dep/foreign (1; 0% instances), ru-dep/nsubjpass (1; 0% instances), ru-dep/vocative (1; 0% instances)

Children of NOUN nodes belong to 16 different parts of speech: NOUN (12359; 25% instances), ADJ (10760; 22% instances), ADP (8647; 18% instances), PUNCT (6363; 13% instances), PROPN (3320; 7% instances), VERB (2036; 4% instances), NUM (1562; 3% instances), DET (1438; 3% instances), CONJ (1299; 3% instances), ADV (527; 1% instances), PRON (234; 0% instances), PART (177; 0% instances), SYM (67; 0% instances), SCONJ (51; 0% instances), AUX (16; 0% instances), X (2; 0% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 16510 NOUN lemmas (37%), 41520 NOUN types (35%) and 271242 NOUN tokens (25%). Out of 17 observed tags, the rank of NOUN is: 1 in number of lemmas, 1 in number of types and 1 in number of tokens.

The 10 most frequent NOUN lemmas: это, год, то, человек, время, все, страна, дело, работа, система

The 10 most frequent NOUN types: это, года, лет, время, все, году, того, том, этом, то

The 10 most frequent ambiguous lemmas: это (NOUN 5199, PART 686, PROPN 13), год (NOUN 4798, PROPN 11), то (NOUN 3298, SCONJ 1121, PART 222, PROPN 3), человек (NOUN 2652, PROPN 11), время (NOUN 1764, PROPN 8), все (NOUN 1697, PART 366, PROPN 6), страна (NOUN 1614, PROPN 3, X 1), дело (NOUN 1229, PROPN 2), система (NOUN 1076, PROPN 13), жизнь (NOUN 1009, PROPN 8)

The 10 most frequent ambiguous types: это (NOUN 2606, PART 663, DET 359, ADJ 31), все (DET 907, NOUN 858, PART 337, ADJ 237), того (NOUN 953, DET 107, ADJ 47), том (NOUN 891, DET 333, ADJ 51), этом (NOUN 783, DET 475, ADJ 13, PART 1), то (SCONJ 1106, NOUN 609, PART 222, DET 205, ADJ 33), тем (NOUN 577, SCONJ 88, DET 67, ADJ 47), этого (NOUN 524, DET 474, ADJ 15, PART 1), раз (NOUN 533, SCONJ 28, ADV 4), всего (NOUN 305, PART 157, DET 105, ADV 47, ADJ 13)

Morphology

The form / lemma ratio of NOUN is 2.514839 (the average of all parts of speech is 2.665758).

The highest number of forms (15) was observed with the lemma “тоннель”: тоннеле, тоннелей, тоннели, тоннель, тоннелю, тоннеля, тоннелям, тоннелями, тоннелях, туннеле, туннелем, туннель, туннелю, туннеля, туннелями.

The second-highest number of forms (14) was observed with the lemma “год”: г, г., гг, гг., год, года, годам, годами, годах, годов, годом, году, годы, лет.

The third-highest number of forms (13) was observed with the lemma “век”: в, в., вв, век, века, векам, веками, веках, веке, веков, веком, веку, полвека.

NOUN occurs with 4 features: ru-feat/Animacy (271177; 100% instances), ru-feat/Number (271151; 100% instances), ru-feat/Case (271147; 100% instances), ru-feat/Gender (269867; 99% instances)
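
The 100% figures here are rounded: none of the four counts reaches the 271242 NOUN tokens reported for this treebank. A brief sketch that makes the gap explicit, using only the numbers above (the variable names are illustrative):

```python
noun_tokens = 271242  # NOUN tokens reported for UD_Russian-SynTagRus
with_feature = {
    "Animacy": 271177,
    "Number": 271151,
    "Case": 271147,
    "Gender": 269867,
}

# Number of NOUN tokens that lack each feature.
missing = {name: noun_tokens - count for name, count in with_feature.items()}

for name, count in with_feature.items():
    coverage = 100 * count / noun_tokens
    print(f"{name}: {coverage:.2f}% covered, {missing[name]} without")
```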

NOUN occurs with 15 feature-value pairs: Animacy=Anim, Animacy=Inan, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Case=Par, Case=Voc, Gender=Fem, Gender=Masc, Gender=Neut, Number=Plur, Number=Sing

NOUN occurs with 94 feature combinations. The most frequent feature combination is Animacy=Inan|Case=Gen|Gender=Fem|Number=Sing (20593 tokens). Examples: страны, жизни, экономики, власти, работы, системы, науки, стороны, войны, воды

Relations

NOUN nodes are attached to their parents using 26 different relations: ru-dep/nmod (146168; 54% instances), ru-dep/nsubj (47268; 17% instances), ru-dep/dobj (27427; 10% instances), ru-dep/conj (21090; 8% instances), ru-dep/root (7329; 3% instances), ru-dep/parataxis (6031; 2% instances), ru-dep/nsubjpass (5789; 2% instances), ru-dep/appos (2342; 1% instances), ru-dep/advmod (2188; 1% instances), ru-dep/nmod:agent (1440; 1% instances), ru-dep/nummod:gov (1067; 0% instances), ru-dep/iobj (1054; 0% instances), ru-dep/advcl (853; 0% instances), ru-dep/mwe (387; 0% instances), ru-dep/dep (255; 0% instances), ru-dep/compound (198; 0% instances), ru-dep/acl:relcl (138; 0% instances), ru-dep/acl (113; 0% instances), ru-dep/expl (32; 0% instances), ru-dep/nummod:entity (16; 0% instances), ru-dep/name (15; 0% instances), ru-dep/amod (14; 0% instances), ru-dep/mark (12; 0% instances), ru-dep/vocative (10; 0% instances), ru-dep/nummod (5; 0% instances), ru-dep/auxpass (1; 0% instances)

Parents of NOUN nodes belong to 14 different parts of speech: VERB (136584; 50% instances), NOUN (104231; 38% instances), ADJ (11873; 4% instances), ROOT (7329; 3% instances), ADV (4138; 2% instances), PROPN (3125; 1% instances), NUM (1878; 1% instances), PRON (982; 0% instances), SYM (463; 0% instances), SCONJ (309; 0% instances), PART (172; 0% instances), CONJ (119; 0% instances), X (37; 0% instances), INTJ (2; 0% instances)

43351 (16%) NOUN nodes are leaves.

91791 (34%) NOUN nodes have one child.

80964 (30%) NOUN nodes have two children.

55136 (20%) NOUN nodes have three or more children.

The highest child degree of a NOUN node is 18.

Children of NOUN nodes are attached using 32 different relations: ru-dep/amod (92675; 20% instances), ru-dep/nmod (88697; 19% instances), ru-dep/case (81486; 18% instances), ru-dep/punct (74927; 16% instances), ru-dep/det (21358; 5% instances), ru-dep/conj (20687; 5% instances), ru-dep/cc (14592; 3% instances), ru-dep/advmod (9534; 2% instances), ru-dep/nummod (8287; 2% instances), ru-dep/appos (7116; 2% instances), ru-dep/parataxis (6950; 2% instances), ru-dep/acl:relcl (6017; 1% instances), ru-dep/nsubj (5905; 1% instances), ru-dep/nummod:gov (4705; 1% instances), ru-dep/advcl (2772; 1% instances), ru-dep/cop (2066; 0% instances), ru-dep/dep (1630; 0% instances), ru-dep/neg (1412; 0% instances), ru-dep/mark (1128; 0% instances), ru-dep/foreign (908; 0% instances), ru-dep/acl (905; 0% instances), ru-dep/mwe (382; 0% instances), ru-dep/compound (271; 0% instances), ru-dep/nmod:agent (205; 0% instances), ru-dep/iobj (85; 0% instances), ru-dep/name (78; 0% instances), ru-dep/nummod:entity (49; 0% instances), ru-dep/dobj (43; 0% instances), ru-dep/aux (36; 0% instances), ru-dep/discourse (21; 0% instances), ru-dep/vocative (2; 0% instances), ru-dep/expl (1; 0% instances)

Children of NOUN nodes belong to 17 different parts of speech: NOUN (104231; 23% instances), ADJ (86202; 19% instances), ADP (81485; 18% instances), PUNCT (74927; 16% instances), VERB (23226; 5% instances), DET (21358; 5% instances), PROPN (17757; 4% instances), CONJ (13986; 3% instances), NUM (11519; 3% instances), ADV (6885; 2% instances), PART (6837; 2% instances), PRON (2612; 1% instances), SCONJ (1924; 0% instances), AUX (1534; 0% instances), SYM (257; 0% instances), X (169; 0% instances), INTJ (21; 0% instances)


NOUN in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]