
This page pertains to UD version 2.

Treebank Statistics: UD_Czech: POS Tags: ADP

There are 113 ADP lemmas (0% of all lemmas), 132 ADP types (0% of all types) and 145943 ADP tokens (10% of all tokens). Out of 17 observed tags, ADP ranks 7th in number of lemmas, 9th in number of types and 4th in number of tokens.
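Counts of this kind could be reproduced from a CoNLL-U file roughly as follows. This is a minimal sketch, not necessarily how this page was generated: the function name `upos_counts` and the toy rows are hypothetical, and counting types as written (without case folding) is an assumption.

```python
def upos_counts(conllu_text, upos="ADP"):
    """Return (lemmas, types, tokens) for one UPOS tag in CoNLL-U text."""
    lemmas, types, tokens = set(), set(), 0
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # Keep only regular words: skip multiword ranges (1-2) and empty nodes (1.1).
        if len(cols) != 10 or not cols[0].isdigit():
            continue
        if cols[3] == upos:          # column 4 is UPOS
            tokens += 1
            types.add(cols[1])       # column 2 is FORM (assumption: counted as written)
            lemmas.add(cols[2])      # column 3 is LEMMA
    return len(lemmas), len(types), tokens


# Toy rows for illustration only; they are not taken from UD_Czech.
ROWS = [
    ("1", "ve", "v", "ADP", "_", "AdpType=Voc|Case=Loc", "2", "case", "_", "_"),
    ("2", "městě", "město", "NOUN", "_", "_", "0", "root", "_", "_"),
    ("3", "a", "a", "CCONJ", "_", "_", "5", "cc", "_", "_"),
    ("4", "v", "v", "ADP", "_", "AdpType=Prep|Case=Loc", "5", "case", "_", "_"),
    ("5", "lese", "les", "NOUN", "_", "_", "2", "conj", "_", "_"),
]
SAMPLE = "\n".join("\t".join(row) for row in ROWS)

print(upos_counts(SAMPLE))  # (1, 2, 2): one lemma "v", two types "ve" and "v", two tokens
```

In the toy sample the forms "ve" and "v" share the lemma "v", which is why the type count exceeds the lemma count.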

The 10 most frequent ADP lemmas: v, na, z, s, o, do, k, pro, za, po

The 10 most frequent ADP types: v, na, o, z, s, do, ve, k, pro, za

The 10 most frequent ambiguous lemmas: v (ADP 36265, NOUN 9, ADJ 1), z (ADP 11656, NOUN 19), s (ADP 11169, NOUN 90, PART 21, X 3, ADJ 1), o (ADP 10328, PUNCT 100, NOUN 10, ADJ 3, INTJ 2), do (ADP 7414, PROPN 11, VERB 3, NOUN 2), k (ADP 7084, NOUN 15), po (ADP 3847, NOUN 6), podle (ADP 3564, ADV 1), u (ADP 2378, NOUN 4), bez (ADP 1147, NOUN 1)

The 10 most frequent ambiguous types: v (ADP 26490, NOUN 4, ADJ 3), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4), z (ADP 8838, NOUN 18), s (ADP 8728, NOUN 381, PART 21, ADJ 9), do (ADP 6970, PROPN 11, NOUN 2, VERB 2), k (ADP 5508, NOUN 10, ADJ 1), po (ADP 3165, NOUN 6), podle (ADP 2143, ADV 1), při (ADP 2004, NOUN 1), u (ADP 2039, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.168142 (the average of all parts of speech is 2.181829).
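The reported ratio is consistent with dividing the number of ADP types by the number of ADP lemmas given above. That this is exactly how the page computes it is an assumption; the variable names are illustrative.

```python
# Figures copied from this page.
adp_types = 132
adp_lemmas = 113

# Assumption: form / lemma ratio = distinct forms divided by distinct lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")  # 1.168142
```

A ratio this close to 1 reflects that Czech adpositions barely inflect; most of the extra forms are vocalized variants such as "ke" and "ku" for "k".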

The lemma with the highest number of forms (3) is “a”: a, ala, à.

Ranked 2nd, also with 3 forms, is the lemma “k”: k, ke, ku.

Ranked 3rd, also with 3 forms, is the lemma “nad”: n, nad, nade.

ADP occurs with 7 features: AdpType (145943; 100% instances), Case (145304; 100% instances), Foreign (592; 0% instances), NameType (71; 0% instances), Abbr (23; 0% instances), Style (6; 0% instances), Aspect (1; 0% instances)

ADP occurs with 20 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Yes, NameType=Com, NameType=Geo, NameType=Geo,Giv,Sur, NameType=Oth, NameType=Pro, NameType=Sur, Style=Coll, Style=Rare

ADP occurs with 32 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (50236 tokens). Examples: v, na, o, po, při, za
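A feature combination is the full FEATS string of a token, with pairs joined by "|". The sketch below shows one way such strings could be split into pairs and tallied; the helper `parse_feats` and the short `observed` list are hypothetical and only illustrate the format.

```python
from collections import Counter


def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":  # "_" marks an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


# Hypothetical FEATS values; the pairs themselves appear in the list above.
observed = [
    "AdpType=Prep|Case=Loc",
    "AdpType=Prep|Case=Loc",
    "AdpType=Voc|Case=Gen",
    "_",
]

combo_counts = Counter(observed)  # one entry per feature combination
feature_counts = Counter(name for f in observed for name in parse_feats(f))

print(combo_counts.most_common(1))  # [('AdpType=Prep|Case=Loc', 2)]
print(feature_counts["AdpType"], feature_counts["Case"])  # 3 3
```

A multi-valued feature such as NameType=Geo,Giv,Sur stays a single pair here, with the comma-separated string as its value.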

Relations

ADP nodes are attached to their parents using 14 different relations: case (143858; 99% instances), fixed (1479; 1% instances), flat:foreign (438; 0% instances), advmod (51; 0% instances), nmod (41; 0% instances), mark (32; 0% instances), conj (15; 0% instances), dep (10; 0% instances), root (10; 0% instances), flat (3; 0% instances), nsubj (2; 0% instances), obj (2; 0% instances), appos (1; 0% instances), orphan (1; 0% instances)
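As a quick consistency check, the relation counts above add up to the 145943 ADP tokens, and the percentages match the counts when rounded to the nearest integer. The rounding rule is an assumption inferred from the figures.

```python
# Parent-relation counts for ADP, copied from the list above.
relation_counts = {
    "case": 143858, "fixed": 1479, "flat:foreign": 438, "advmod": 51,
    "nmod": 41, "mark": 32, "conj": 15, "dep": 10, "root": 10,
    "flat": 3, "nsubj": 2, "obj": 2, "appos": 1, "orphan": 1,
}

total = sum(relation_counts.values())

# Assumption: percentages are rounded to the nearest whole number.
percent = {rel: round(100 * n / total) for rel, n in relation_counts.items()}

print(total)  # 145943
print(percent["case"], percent["fixed"], percent["flat:foreign"])  # 99 1 0
```

This also explains the "0% instances" entries: they are nonzero counts whose share falls below 0.5%.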

Parents of ADP nodes belong to 14 different parts of speech: NOUN (112938; 77% instances), PROPN (16536; 11% instances), PRON (6182; 4% instances), DET (4497; 3% instances), NUM (2093; 1% instances), ADJ (1882; 1% instances), ADP (1107; 1% instances), ADV (491; 0% instances), VERB (92; 0% instances), SYM (85; 0% instances), PART (28; 0% instances), no tag, i.e. the artificial root (10; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

143258 (98%) ADP nodes are leaves.

1852 (1%) ADP nodes have one child.

825 (1%) ADP nodes have two children.

8 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 5.
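The child degree of a node is the number of tokens whose HEAD points to it. A minimal sketch of how a leaf / one child / two children breakdown could be derived is shown below; the function name and the toy sentence (the compound preposition "na rozdíl od" followed by a pronoun) are illustrative and not taken from the treebank.

```python
from collections import Counter

# Toy sentence as (id, form, upos, head, deprel) tuples, for illustration only.
SENT = [
    (1, "na", "ADP", 4, "case"),
    (2, "rozdíl", "NOUN", 1, "fixed"),
    (3, "od", "ADP", 1, "fixed"),
    (4, "nás", "PRON", 0, "root"),
]


def child_degree_histogram(sentence, upos="ADP"):
    """Map child degree -> number of nodes with that tag and degree."""
    # Count how many tokens name each id as their head.
    children = Counter(head for _, _, _, head, _ in sentence)
    # A Counter returns 0 for ids that never occur as a head (leaves).
    return Counter(children[i] for i, _, tag, _, _ in sentence if tag == upos)


hist = child_degree_histogram(SENT)
print(hist[0], hist[2])  # 1 1: "od" is a leaf, "na" has two fixed children
print(max(hist))         # 2: highest child degree in the toy sentence
```

The toy sentence mirrors the dominant pattern reported on this page, where the non-leaf ADP nodes are mostly first words of multiword prepositions with `fixed` children.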

Children of ADP nodes are attached using 15 different relations: fixed (3420; 97% instances), punct (41; 1% instances), flat:foreign (17; 0% instances), cc (11; 0% instances), conj (11; 0% instances), dep (10; 0% instances), nmod (8; 0% instances), det (3; 0% instances), acl (2; 0% instances), advmod (2; 0% instances), obl:arg (2; 0% instances), advmod:emph (1; 0% instances), case (1; 0% instances), obl (1; 0% instances), orphan (1; 0% instances)

Children of ADP nodes belong to 10 different parts of speech: NOUN (2339; 66% instances), ADP (1107; 31% instances), PUNCT (41; 1% instances), PROPN (12; 0% instances), CCONJ (11; 0% instances), ADJ (10; 0% instances), ADV (4; 0% instances), DET (3; 0% instances), NUM (2; 0% instances), VERB (2; 0% instances)