home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: ADP

There are 89 ADP lemmas (1%), 94 ADP types (1%) and 16130 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 9 in number of lemmas, 9 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: of, in, to, for, on, with, at, from, by, as

The 10 most frequent ADP types: of, in, to, for, on, with, at, from, by, as

The 10 most frequent ambiguous lemmas: of (ADP 4105, SCONJ 84, ADV 23), in (ADP 2859, SCONJ 83, ADV 17, X 5, NOUN 1), to (PART 2198, ADP 1307, SCONJ 26, X 1), for (ADP 1030, SCONJ 96), on (ADP 959, SCONJ 20, ADV 13, ADJ 2, NOUN 1), with (ADP 820, SCONJ 52), at (ADP 669, ADV 18, SCONJ 13), from (ADP 641, SCONJ 22), by (ADP 538, SCONJ 92, ADV 1), as (ADP 479, SCONJ 275, ADV 138)

The 10 most frequent ambiguous types: of (ADP 4098, SCONJ 84, ADV 15), in (ADP 2570, SCONJ 73, ADV 17, X 5), to (PART 2078, ADP 1287, SCONJ 26, DET 1, VERB 1, X 1), for (ADP 939, SCONJ 92, ADV 1), on (ADP 906, SCONJ 20, ADV 13, ADJ 2, DET 1, NOUN 1), with (ADP 789, SCONJ 50), at (ADP 624, ADV 15, SCONJ 13), from (ADP 597, SCONJ 22), by (ADP 498, SCONJ 81, ADV 1), as (ADP 446, SCONJ 239, ADV 131)

Morphology

The form / lemma ratio of ADP is 1.056180 (the average of all parts of speech is 1.226279).

The 1st highest number of forms (3) was observed with the lemma “versus”: v, versus, vs..

The 2nd highest number of forms (2) was observed with the lemma “as”: a, as.

The 3rd highest number of forms (2) was observed with the lemma “for”: for, fro.

ADP occurs with 3 features: Typo (16; 0% instances), Abbr (2; 0% instances), Foreign (1; 0% instances)

ADP occurs with 3 feature-value pairs: Abbr=Yes, Foreign=Yes, Typo=Yes

ADP occurs with 4 feature combinations. The most frequent feature combination is _ (16111 tokens). Examples: of, in, to, for, on, with, at, from, by, as

Relations

ADP nodes are attached to their parents using 17 different relations: case (14914; 92% instances), compound:prt (640; 4% instances), fixed (398; 2% instances), obl (98; 1% instances), conj (25; 0% instances), nmod (16; 0% instances), reparandum (12; 0% instances), root (6; 0% instances), obj (4; 0% instances), acl:relcl (3; 0% instances), advcl:relcl (3; 0% instances), compound (3; 0% instances), dep (3; 0% instances), ccomp (2; 0% instances), amod (1; 0% instances), flat (1; 0% instances), parataxis (1; 0% instances)

Parents of ADP nodes belong to 16 different parts of speech: NOUN (10598; 66% instances), PROPN (2677; 17% instances), VERB (787; 5% instances), PRON (782; 5% instances), NUM (496; 3% instances), ADJ (275; 2% instances), ADV (177; 1% instances), ADP (104; 1% instances), SCONJ (98; 1% instances), DET (82; 1% instances), X (22; 0% instances), SYM (18; 0% instances), INTJ (6; 0% instances), (6; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

15900 (99%) ADP nodes are leaves.

197 (1%) ADP nodes have one child.

27 (0%) ADP nodes have two children.

6 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 6.

Children of ADP nodes are attached using 14 different relations: punct (101; 37% instances), fixed (82; 30% instances), conj (33; 12% instances), cc (22; 8% instances), advmod (7; 3% instances), cop (7; 3% instances), nsubj (7; 3% instances), case (4; 1% instances), obj (4; 1% instances), obl (3; 1% instances), aux (1; 0% instances), ccomp (1; 0% instances), dep (1; 0% instances), discourse (1; 0% instances)

Children of ADP nodes belong to 12 different parts of speech: ADP (104; 38% instances), PUNCT (101; 37% instances), CCONJ (22; 8% instances), ADV (12; 4% instances), NOUN (11; 4% instances), AUX (8; 3% instances), PRON (4; 1% instances), PROPN (4; 1% instances), ADJ (3; 1% instances), PART (2; 1% instances), VERB (2; 1% instances), INTJ (1; 0% instances)