home edit page issue tracker

This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDT: POS Tags: ADP

There are 114 ADP lemmas (0%), 133 ADP types (0%) and 145944 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 7 in number of lemmas, 9 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: v, na, z, s, o, do, k, pro, za, po

The 10 most frequent ADP types: v, na, o, z, s, do, ve, k, pro, za

The 10 most frequent ambiguous lemmas: v (ADP 36265, NOUN 9, ADJ 1), z (ADP 11656, NOUN 19), s (ADP 11169, NOUN 135, PART 21, X 3, ADJ 1), o (ADP 10328, PUNCT 100, NOUN 10, ADJ 3, INTJ 2), do (ADP 7414, PROPN 11, VERB 3, NOUN 2), k (ADP 7084, NOUN 19), po (ADP 3847, NOUN 6), podle (ADP 3564, ADV 1), u (ADP 2378, NOUN 4), bez (ADP 1147, NOUN 1)

The 10 most frequent ambiguous types: v (ADP 26490, NOUN 4, ADJ 3), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4), z (ADP 8838, NOUN 18), s (ADP 8728, NOUN 381, PART 21, ADJ 9), do (ADP 6970, PROPN 11, NOUN 2, VERB 2), k (ADP 5508, NOUN 10, ADJ 1), po (ADP 3165, NOUN 6), podle (ADP 2143, ADV 1), při (ADP 2004, NOUN 1), u (ADP 2039, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.166667 (the average of all parts of speech is 2.181221).

The 1st highest number of forms (3) was observed with the lemma “a”: a, ala, à.

The 2nd highest number of forms (3) was observed with the lemma “k”: k, ke, ku.

The 3rd highest number of forms (3) was observed with the lemma “nad”: n, nad, nade.

ADP occurs with 7 features: AdpType (145944; 100% instances), Case (145304; 100% instances), Foreign (592; 0% instances), NameType (71; 0% instances), Abbr (23; 0% instances), Style (6; 0% instances), Aspect (1; 0% instances)

ADP occurs with 20 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, NameType=Com, NameType=Geo, NameType=Geo,Giv,Sur, NameType=Oth, NameType=Pro, NameType=Sur, Style=Coll, Style=Rare

ADP occurs with 33 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (50236 tokens). Examples: v, na, o, po, při, za

Relations

ADP nodes are attached to their parents using 17 different relations: case (143859; 99% instances), fixed (1474; 1% instances), flat:foreign (191; 0% instances), flat (99; 0% instances), nmod (82; 0% instances), obl (81; 0% instances), conj (34; 0% instances), nsubj (34; 0% instances), mark (32; 0% instances), root (17; 0% instances), dep (13; 0% instances), appos (12; 0% instances), advmod (5; 0% instances), obj (5; 0% instances), orphan (4; 0% instances), advmod:emph (1; 0% instances), amod (1; 0% instances)

Parents of ADP nodes belong to 14 different parts of speech: NOUN (113647; 78% instances), PROPN (15513; 11% instances), PRON (6182; 4% instances), DET (4495; 3% instances), ADJ (2105; 1% instances), NUM (2094; 1% instances), ADP (1118; 1% instances), ADV (493; 0% instances), VERB (121; 0% instances), SYM (113; 0% instances), AUX (27; 0% instances), PART (18; 0% instances), (17; 0% instances), INTJ (1; 0% instances)

143118 (98%) ADP nodes are leaves.

1899 (1%) ADP nodes have one child.

860 (1%) ADP nodes have two children.

67 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 8.

Children of ADP nodes are attached using 20 different relations: fixed (3414; 88% instances), flat:foreign (191; 5% instances), punct (90; 2% instances), nmod (64; 2% instances), cc (26; 1% instances), conj (23; 1% instances), amod (19; 0% instances), case (13; 0% instances), appos (12; 0% instances), dep (12; 0% instances), acl:relcl (4; 0% instances), advmod (3; 0% instances), advmod:emph (3; 0% instances), det (3; 0% instances), orphan (3; 0% instances), acl (2; 0% instances), obl:arg (2; 0% instances), nummod (1; 0% instances), obl (1; 0% instances), xcomp (1; 0% instances)

Children of ADP nodes belong to 13 different parts of speech: NOUN (2410; 62% instances), ADP (1118; 29% instances), PROPN (173; 4% instances), PUNCT (90; 2% instances), ADJ (43; 1% instances), CCONJ (25; 1% instances), ADV (11; 0% instances), VERB (8; 0% instances), DET (3; 0% instances), NUM (3; 0% instances), PART (1; 0% instances), PRON (1; 0% instances), X (1; 0% instances)