home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_English-GUM: POS Tags: ADP

There are 91 ADP lemmas (1%), 101 ADP types (0%) and 24424 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 9 in number of lemmas, 9 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: of, in, to, for, on, with, at, from, by, as

The 10 most frequent ADP types: of, in, to, for, on, with, at, from, by, as

The 10 most frequent ambiguous lemmas: of (ADP 6241, SCONJ 149), in (ADP 4102, SCONJ 105, ADV 41, X 6, NOUN 1), to (PART 3667, ADP 2085, SCONJ 48, X 1), for (ADP 1665, SCONJ 167), on (ADP 1485, SCONJ 33, ADV 31, ADJ 5), with (ADP 1217, SCONJ 72), at (ADP 1044, SCONJ 13), from (ADP 915, SCONJ 40), by (ADP 810, SCONJ 129, ADV 1), as (ADP 698, SCONJ 412, ADV 212)

The 10 most frequent ambiguous types: of (ADP 6207, SCONJ 149), in (ADP 3705, SCONJ 101, ADV 41, X 6), to (PART 3452, ADP 2060, SCONJ 48, DET 1, NUM 1, VERB 1, X 1), for (ADP 1536, SCONJ 163, ADV 1), on (ADP 1411, ADV 31, SCONJ 31, ADJ 5, DET 1), with (ADP 1172, SCONJ 69), at (ADP 969, SCONJ 13), from (ADP 861, SCONJ 39), by (ADP 752, SCONJ 117, ADV 1), as (ADP 653, SCONJ 358, ADV 200)

Morphology

The form / lemma ratio of ADP is 1.109890 (the average of all parts of speech is 1.248450).

The 1st highest number of forms (5) was observed with the lemma “versus”: v, v., versus, vs, vs..

The 2nd highest number of forms (3) was observed with the lemma “of”: a, of, ta.

The 3rd highest number of forms (2) was observed with the lemma “as”: a, as.

ADP occurs with 4 features: ExtPos (243; 1% instances), Typo (24; 0% instances), Abbr (12; 0% instances), Foreign (1; 0% instances)

ADP occurs with 6 feature-value pairs: Abbr=Yes, ExtPos=ADP, ExtPos=ADV, ExtPos=SCONJ, Foreign=Yes, Typo=Yes

ADP occurs with 7 feature combinations. The most frequent feature combination is _ (24144 tokens). Examples: of, in, to, for, on, with, at, from, by, as

Relations

ADP nodes are attached to their parents using 21 different relations: case (22390; 92% instances), compound:prt (1040; 4% instances), fixed (611; 3% instances), obl (164; 1% instances), advmod (61; 0% instances), reparandum (40; 0% instances), conj (32; 0% instances), mark (32; 0% instances), nmod (16; 0% instances), root (9; 0% instances), compound (6; 0% instances), acl:relcl (4; 0% instances), advcl (4; 0% instances), amod (4; 0% instances), advcl:relcl (3; 0% instances), ccomp (2; 0% instances), dep (2; 0% instances), flat (1; 0% instances), obl:agent (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 17 different parts of speech: NOUN (15865; 65% instances), PROPN (3746; 15% instances), PRON (1458; 6% instances), VERB (1332; 5% instances), NUM (695; 3% instances), ADJ (463; 2% instances), ADV (395; 2% instances), ADP (183; 1% instances), DET (166; 1% instances), SYM (39; 0% instances), X (30; 0% instances), SCONJ (23; 0% instances), INTJ (12; 0% instances), (9; 0% instances), AUX (6; 0% instances), CCONJ (1; 0% instances), PART (1; 0% instances)

23958 (98%) ADP nodes are leaves.

371 (2%) ADP nodes have one child.

82 (0%) ADP nodes have two children.

13 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 6.

Children of ADP nodes are attached using 17 different relations: fixed (245; 42% instances), punct (217; 37% instances), cc (29; 5% instances), conj (28; 5% instances), cop (16; 3% instances), nsubj (15; 3% instances), advmod (13; 2% instances), obl (6; 1% instances), discourse (5; 1% instances), case (3; 1% instances), obj (2; 0% instances), reparandum (2; 0% instances), advcl (1; 0% instances), aux (1; 0% instances), ccomp (1; 0% instances), compound (1; 0% instances), mark (1; 0% instances)

Children of ADP nodes belong to 12 different parts of speech: PUNCT (217; 37% instances), ADP (183; 31% instances), NOUN (100; 17% instances), CCONJ (30; 5% instances), AUX (17; 3% instances), PRON (12; 2% instances), ADV (11; 2% instances), INTJ (5; 1% instances), PART (4; 1% instances), ADJ (3; 1% instances), VERB (3; 1% instances), PROPN (1; 0% instances)