This is part of archived UD v1 documentation. See http://universaldependencies.org/ for the current version.
home ru/pos issue tracker

PART: particle

Definition

Particles are function words that must be associated with another word or phrase to impart meaning and that do not satisfy definitions of other universal parts of speech (e.g. adpositions, coordinating conjunctions, subordinating conjunctions or auxiliary verbs). Particles may encode grammatical categories such as negation, mood, tense etc. Russian particles are not inflected.

Note that response words such as да  “yes”, нет  “no”, etc. are considered particles in the PDT tagset but they should be retagged as interjections under the UD standard.

Examples


Treebank Statistics (UD_Russian)

There are 27 PART lemmas (0%), 27 PART types (0%) and 919 PART tokens (1%). Out of 16 observed tags, the rank of PART is: 10 in number of lemmas, 10 in number of types and 12 in number of tokens.

The 10 most frequent PART lemmas: НЕ, ЖЕ, И, ТОЛЬКО, ЛИШЬ, ДАЖЕ, ЭТО, НИ, ИМЕННО, БЫ

The 10 most frequent PART types: не, же, и, только, лишь, даже, это, ни, именно, бы

The 10 most frequent ambiguous lemmas: НЕ (PART 438, CONJ 2), И (CONJ 2260, PART 89, ADV 4, PROPN 2, NUM 1), ТОЛЬКО (PART 79, CONJ 1), ЭТО (PRON 147, PART 28), НИ (PART 23, CONJ 6), ИМЕННО (PART 18, ADV 2), БЫ (PART 10, ADV 1), ПРОСТО (PART 9, ADV 2), ЛИ (PART 6, PROPN 4), ВСЁ (PRON 24, ADV 13, PART 3)

The 10 most frequent ambiguous types: не (PART 424, CONJ 2), и (CONJ 2245, PART 89, ADV 3), только (PART 78, CONJ 1), это (PRON 55, PART 25, DET 23), ни (PART 21, CONJ 6), именно (PART 10, ADV 2), бы (PART 10, ADV 1), просто (PART 9, ADV 1), Да (CONJ 2, PART 2), все (DET 41, PRON 6, ADV 5, PART 2)

Morphology

The form / lemma ratio of PART is 1.000000 (the average of all parts of speech is 1.591757).

The 1st highest number of forms (2) was observed with the lemma “ВСЁ”: все, всё.

The 2nd highest number of forms (1) was observed with the lemma “NO”: No.

The 3rd highest number of forms (1) was observed with the lemma “NON”: Non.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 10 different relations: neg (434; 47% instances), discourse (370; 40% instances), mwe (36; 4% instances), advmod (29; 3% instances), cop (28; 3% instances), cc:preconj (12; 1% instances), goeswith (7; 1% instances), appos (1; 0% instances), conj (1; 0% instances), nsubj (1; 0% instances)

Parents of PART nodes belong to 15 different parts of speech: VERB (385; 42% instances), NOUN (177; 19% instances), DET (80; 9% instances), ADJ (77; 8% instances), ADV (66; 7% instances), PRON (39; 4% instances), NUM (27; 3% instances), PART (19; 2% instances), CONJ (16; 2% instances), PROPN (16; 2% instances), AUX (7; 1% instances), ADP (4; 0% instances), SCONJ (3; 0% instances), SYM (2; 0% instances), PUNCT (1; 0% instances)

888 (97%) PART nodes are leaves.

27 (3%) PART nodes have one child.

3 (0%) PART nodes have two children.

1 (0%) PART nodes have three or more children.

The highest child degree of a PART node is 4.

Children of PART nodes are attached using 7 different relations: mwe (15; 41% instances), advmod (5; 14% instances), goeswith (5; 14% instances), neg (5; 14% instances), punct (4; 11% instances), discourse (2; 5% instances), nmod (1; 3% instances)

Children of PART nodes belong to 5 different parts of speech: PART (19; 51% instances), ADV (10; 27% instances), PUNCT (6; 16% instances), DET (1; 3% instances), PROPN (1; 3% instances)


Treebank Statistics (UD_Russian-SynTagRus)

There are 102 PART lemmas (0%), 110 PART types (0%) and 33748 PART tokens (3%). Out of 17 observed tags, the rank of PART is: 9 in number of lemmas, 10 in number of types and 9 in number of tokens.

The 10 most frequent PART lemmas: не, и, же, только, бы, даже, вот, ли, лишь, это

The 10 most frequent PART types: не, и, же, только, бы, даже, вот, ли, лишь, именно

The 10 most frequent ambiguous lemmas: не (PART 13474, VERB 27), и (CONJ 24088, PART 4409, X 3, PROPN 2), только (PART 2010, SCONJ 45), ли (PART 709, PROPN 3), это (NOUN 5199, PART 686, PROPN 13), просто (PART 615, ADV 35), ни (CONJ 491, PART 455, PROPN 37, NOUN 1), ведь (PART 427, SCONJ 109), все (NOUN 1697, PART 366, PROPN 6), да (PART 301, CONJ 295, PROPN 1)

The 10 most frequent ambiguous types: не (PART 12853, VERB 27), и (CONJ 21962, PART 4371, X 3), только (PART 1890, SCONJ 28), это (NOUN 2606, PART 663, DET 359, ADJ 31), просто (PART 577, ADJ 24, ADV 16), ни (CONJ 463, PART 419), ведь (PART 253, SCONJ 68), все (DET 907, NOUN 858, PART 337, ADJ 237), да (CONJ 180, PART 85), то (SCONJ 1106, NOUN 609, PART 222, DET 205, ADJ 33)

Morphology

The form / lemma ratio of PART is 1.078431 (the average of all parts of speech is 2.665758).

The 1st highest number of forms (4) was observed with the lemma “это”: этим, это, этого, этом.

The 2nd highest number of forms (2) was observed with the lemma “бы”: б, бы.

The 3rd highest number of forms (2) was observed with the lemma “все”: все, всё.

PART does not occur with any features.

Relations

PART nodes are attached to their parents using 22 different relations: advmod (17028; 50% instances), neg (13474; 40% instances), aux (1369; 4% instances), cop (636; 2% instances), parataxis (474; 1% instances), mwe (258; 1% instances), root (246; 1% instances), appos (62; 0% instances), conj (52; 0% instances), dep (45; 0% instances), name (37; 0% instances), auxpass (20; 0% instances), nmod (17; 0% instances), nsubj (11; 0% instances), advcl (10; 0% instances), amod (3; 0% instances), acl:relcl (1; 0% instances), ccomp (1; 0% instances), expl (1; 0% instances), mark (1; 0% instances), nmod:agent (1; 0% instances), nsubjpass (1; 0% instances)

Parents of PART nodes belong to 15 different parts of speech: VERB (14372; 43% instances), NOUN (6837; 20% instances), ADV (3788; 11% instances), ADJ (3434; 10% instances), PRON (1084; 3% instances), PART (1080; 3% instances), DET (1019; 3% instances), PROPN (556; 2% instances), CONJ (479; 1% instances), NUM (454; 1% instances), SCONJ (375; 1% instances), ROOT (246; 1% instances), SYM (18; 0% instances), X (4; 0% instances), INTJ (2; 0% instances)

30252 (90%) PART nodes are leaves.

2987 (9%) PART nodes have one child.

334 (1%) PART nodes have two children.

175 (1%) PART nodes have three or more children.

The highest child degree of a PART node is 6.

Children of PART nodes are attached using 24 different relations: punct (1978; 46% instances), advmod (978; 23% instances), neg (669; 16% instances), nsubj (159; 4% instances), name (102; 2% instances), mwe (90; 2% instances), conj (79; 2% instances), parataxis (66; 2% instances), cc (55; 1% instances), amod (28; 1% instances), nmod (25; 1% instances), dep (13; 0% instances), mark (12; 0% instances), advcl (9; 0% instances), discourse (5; 0% instances), dobj (3; 0% instances), aux (2; 0% instances), cop (2; 0% instances), det (2; 0% instances), iobj (2; 0% instances), acl (1; 0% instances), acl:relcl (1; 0% instances), case (1; 0% instances), nsubjpass (1; 0% instances)

Children of PART nodes belong to 15 different parts of speech: PUNCT (1978; 46% instances), PART (1080; 25% instances), ADV (654; 15% instances), NOUN (172; 4% instances), PROPN (93; 2% instances), VERB (82; 2% instances), SCONJ (72; 2% instances), PRON (48; 1% instances), CONJ (44; 1% instances), ADJ (38; 1% instances), ADP (12; 0% instances), INTJ (5; 0% instances), DET (2; 0% instances), NUM (2; 0% instances), AUX (1; 0% instances)


PART in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]