This page pertains to UD version 2.

Treebank Statistics: UD_Russian-Taiga: POS Tags: ADP

There are 127 ADP lemmas (0%), 149 ADP types (0%) and 140442 ADP tokens (8%). Out of 17 observed tags, the rank of ADP is: 11 in number of lemmas, 13 in number of types and 5 in number of tokens.
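Counts of this kind can be reproduced from the CoNLL-U columns FORM, LEMMA and UPOS. The following is a minimal sketch, not the script that generated this page: the function name `upos_counts` and the four-token sample are invented for illustration, and it assumes that types are distinct word forms compared case-sensitively.

```python
# Illustrative CoNLL-U fragment (invented for this sketch, not taken from the treebank).
SAMPLE = """\
1\tВ\tв\tADP\t_\t_\t2\tcase\t_\t_
2\tдоме\tдом\tNOUN\t_\t_\t0\troot\t_\t_
3\tв\tв\tADP\t_\t_\t4\tcase\t_\t_
4\tсаду\tсад\tNOUN\t_\t_\t2\tconj\t_\t_
"""


def upos_counts(conllu_text, upos):
    """Return (lemmas, types, tokens) for one UPOS tag.

    Assumption: a "type" is a distinct FORM string, compared case-sensitively.
    """
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if len(cols) != 10 or not cols[0].isdigit():
            continue  # skip multiword-token ranges (1-2) and empty nodes (1.1)
        if cols[3] == upos:
            lemmas.add(cols[2])
            types.add(cols[1])
            tokens += 1
    return len(lemmas), len(types), tokens


print(upos_counts(SAMPLE, "ADP"))
```

In the sample, the two ADP tokens share one lemma but differ in capitalization, so under the case-sensitive assumption they count as two types.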

The 10 most frequent ADP lemmas: в, на, с, к, по, из, у, от, за, о

The 10 most frequent ADP types: в, на, с, к, по, из, у, от, за, о

The 10 most frequent ambiguous lemmas: в (ADP 42221, X 12, NOUN 11), на (ADP 19188, PART 11, VERB 6, X 5, PUNCT 1), с (ADP 14257, X 31, PART 27, NOUN 6), к (ADP 7652, X 18, NOUN 17), по (ADP 6481, X 15), из (ADP 6311, X 3), у (ADP 5641, X 38, NOUN 8, INTJ 2), от (ADP 4910, X 2, PART 1), за (ADP 4739, X 3, VERB 2), о (ADP 3988, INTJ 379, X 72, NOUN 26)

The 10 most frequent ambiguous types: в (ADP 36450, NOUN 13, X 11, VERB 1), на (ADP 17651, X 6, PART 5, VERB 4, ADV 1, PRON 1, PUNCT 1), с (ADP 13301, X 32, PART 27, NOUN 5, ADV 1), к (ADP 7137, X 16, NOUN 10, ADV 2), по (ADP 5799, X 9, ADV 2, ADJ 1), из (ADP 5928, X 3), у (ADP 4597, X 38, NOUN 11), от (ADP 4623, X 2, PART 1, PRON 1, VERB 1), за (ADP 4324, X 7, VERB 2, ADV 1), о (ADP 3763, X 64, INTJ 53, NOUN 22, PART 1)

Morphology

The form / lemma ratio of ADP is 1.173228 (the average of all parts of speech is 2.706171).
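The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given at the top of this page. A short check, assuming that this is how the ratio is defined (the variable names are illustrative):

```python
# Counts copied from the summary at the top of this page.
adp_lemmas = 127
adp_types = 149

# Assumption: form / lemma ratio = distinct forms (types) / distinct lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")  # prints 1.173228
```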

The highest number of forms (6) was observed with the lemma “из-за”: и, из, из–за, из-за, иза, изза.

The second highest number of forms (4) was observed with the lemma “к”: а, в, к, къ.

The third highest number of forms (4) was observed with the lemma “на”: а, н, на, на́.

ADP occurs with 3 features: ExtPos (2750; 2% instances), Abbr (54; 0% instances), Typo (26; 0% instances)

ADP occurs with 8 feature-value pairs: Abbr=Yes, ExtPos=ADJ, ExtPos=ADP, ExtPos=ADV, ExtPos=CCONJ, ExtPos=SCONJ, ExtPos=VERB, Typo=Yes

ADP occurs with 9 feature combinations. The most frequent feature combination is _ (137612 tokens). Examples: в, на, с, к, из, по, у, от, за, о

Relations

ADP nodes are attached to their parents using 23 different relations: case (138508; 99% instances), advmod (798; 1% instances), fixed (734; 1% instances), parataxis:discourse (127; 0% instances), cc (66; 0% instances), mark (49; 0% instances), conj (43; 0% instances), appos (24; 0% instances), root (23; 0% instances), nmod (18; 0% instances), obl (15; 0% instances), parataxis (10; 0% instances), nsubj (7; 0% instances), orphan (5; 0% instances), list (4; 0% instances), amod (2; 0% instances), discourse (2; 0% instances), reparandum (2; 0% instances), advcl (1; 0% instances), compound (1; 0% instances), dep (1; 0% instances), obj (1; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 17 different parts of speech: NOUN (108831; 77% instances), PRON (16626; 12% instances), PROPN (7445; 5% instances), DET (2816; 2% instances), ADJ (2284; 2% instances), VERB (994; 1% instances), NUM (368; 0% instances), ADP (356; 0% instances), X (354; 0% instances), ADV (262; 0% instances), PART (30; 0% instances), SYM (27; 0% instances), no parent, i.e. root nodes (23; 0% instances), SCONJ (14; 0% instances), INTJ (8; 0% instances), CCONJ (3; 0% instances), AUX (1; 0% instances)
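Relation and parent statistics like these come from the HEAD and DEPREL columns of CoNLL-U. The sketch below handles a single sentence; the function name `attachment_stats` and the sample sentence are invented for illustration. A node attached as root has HEAD 0 and therefore no parent part of speech, which would explain why the 23-instance entry in the list above is not one of the UPOS tags: it matches the 23 root attachments.

```python
from collections import Counter

# Illustrative one-sentence CoNLL-U fragment (invented, not from the treebank).
SAMPLE = """\
1\tНа\tна\tADP\t_\t_\t2\tcase\t_\t_
2\tстоле\tстол\tNOUN\t_\t_\t0\troot\t_\t_
3\tпо\tпо\tADP\t_\t_\t4\tcase\t_\t_
4\tнему\tон\tPRON\t_\t_\t2\tnmod\t_\t_
"""


def attachment_stats(sentence, upos):
    """Count incoming relations and parent UPOS for nodes tagged `upos`.

    Works on one sentence only, because token IDs restart in every sentence.
    """
    rows = [line.split("\t") for line in sentence.splitlines()
            if line and not line.startswith("#")]
    rows = [r for r in rows if len(r) == 10 and r[0].isdigit()]
    pos_by_id = {r[0]: r[3] for r in rows}

    relations, parent_pos = Counter(), Counter()
    for r in rows:
        if r[3] == upos:
            relations[r[7]] += 1
            # HEAD 0 (root) is not a token, so its parent POS is the empty string.
            parent_pos[pos_by_id.get(r[6], "")] += 1
    return relations, parent_pos


relations, parent_pos = attachment_stats(SAMPLE, "ADP")
print(dict(relations), dict(parent_pos))
```

For a whole treebank, the same counters would be accumulated sentence by sentence.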

137939 (98%) ADP nodes are leaves.

1407 (1%) ADP nodes have one child.

804 (1%) ADP nodes have two children.

292 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 8.
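As a consistency check, the four child-degree counts above should add up to the total number of ADP tokens, and the percentages should follow from rounding each share. A small sketch, assuming that percentages are rounded to the nearest integer (the dictionary keys are illustrative labels):

```python
# Child-degree counts copied from the lines above.
degree_counts = {"0": 137939, "1": 1407, "2": 804, "3+": 292}

total = sum(degree_counts.values())
print(total)  # prints 140442, the number of ADP tokens

# Assumption: each percentage is the share rounded to the nearest integer.
shares = {degree: round(100 * count / total)
          for degree, count in degree_counts.items()}
print(shares)  # prints {'0': 98, '1': 1, '2': 1, '3+': 0}
```

Rounding also explains why both the one-child and the two-children groups show 1% although the former is almost twice as large.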

Children of ADP nodes are attached using 20 different relations: fixed (3507; 87% instances), punct (390; 10% instances), conj (52; 1% instances), cc (27; 1% instances), advmod (20; 0% instances), goeswith (11; 0% instances), parataxis (8; 0% instances), list (7; 0% instances), nsubj (7; 0% instances), appos (5; 0% instances), nmod (4; 0% instances), det (3; 0% instances), parataxis:discourse (3; 0% instances), case (2; 0% instances), acl:relcl (1; 0% instances), advcl (1; 0% instances), cop (1; 0% instances), iobj (1; 0% instances), obl (1; 0% instances), reparandum (1; 0% instances)

Children of ADP nodes belong to 15 different parts of speech: NOUN (2198; 54% instances), PUNCT (390; 10% instances), ADP (356; 9% instances), DET (329; 8% instances), PRON (307; 8% instances), ADJ (161; 4% instances), SCONJ (114; 3% instances), PART (80; 2% instances), VERB (42; 1% instances), ADV (32; 1% instances), CCONJ (25; 1% instances), X (12; 0% instances), SYM (3; 0% instances), NUM (2; 0% instances), AUX (1; 0% instances)