home cs/pos edit page issue tracker

ADP: adposition

Definition

Czech has only prepositions but no postpositions or circumpositions. They occur before a complement noun phrase (noun, pronoun) and they form a single structure with the complement to express its grammatical and semantic relation to another unit within a clause.

Some prepositions take the form of fixed multiword expressions, e.g. na rozdíl od  “in contrast to”, v souvislosti s  “in connection with”. The component words are then still tagged according to their basic use (na  is ADP, rozdíl  is NOUN, etc.) and their status as multiword expressions are accounted for in the syntactic annotation.

Examples

References


Treebank Statistics (UD_Czech)

There are 114 ADP lemmas (0%), 132 ADP types (0%) and 145943 ADP tokens (10%). Out of 17 observed tags, the rank of ADP is: 8 in number of lemmas, 9 in number of types and 5 in number of tokens.

The 10 most frequent ADP lemmas: v, na, z, s, o, do, k, pro, za, po

The 10 most frequent ADP types: v, na, o, z, s, do, ve, k, pro, za

The 10 most frequent ambiguous lemmas: v (ADP 36265, NOUN 9, ADJ 1), z (ADP 11656, NOUN 19), s (ADP 11169, NOUN 90, PART 21, X 3, ADJ 1), o (ADP 10328, PUNCT 100, NOUN 10, ADJ 3, INTJ 2), do (ADP 7414, PROPN 11, VERB 3, NOUN 2), k (ADP 7084, NOUN 15), po (ADP 3847, NOUN 5), podle (ADP 3564, ADV 1), u (ADP 2378, NOUN 4), bez (ADP 1147, NOUN 1)

The 10 most frequent ambiguous types: v (ADP 26490, NOUN 4, ADJ 3), o (ADP 9669, ADJ 110, PUNCT 99, NOUN 4), z (ADP 8838, NOUN 18), s (ADP 8728, NOUN 381, PART 21, ADJ 9), do (ADP 6970, PROPN 11, NOUN 2, VERB 2), k (ADP 5508, NOUN 10, ADJ 1), po (ADP 3165, NOUN 6), podle (ADP 2143, ADV 1), při (ADP 2004, NOUN 1), u (ADP 2039, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.157895 (the average of all parts of speech is 2.195970).

The 1st highest number of forms (3) was observed with the lemma “a”: a, ala, à

The 2nd highest number of forms (3) was observed with the lemma “k”: k, ke, ku

The 3rd highest number of forms (3) was observed with the lemma “nad”: n, nad, nade

ADP occurs with 6 features: cs-feat/AdpType (145943; 100% instances), cs-feat/Case (145304; 100% instances), cs-feat/Foreign (592; 0% instances), cs-feat/NameType (71; 0% instances), cs-feat/Abbr (23; 0% instances), cs-feat/Aspect (1; 0% instances)

ADP occurs with 18 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Aspect=Imp, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Foreign=Foreign, NameType=Com, NameType=Geo, NameType=Geo,Giv,Sur, NameType=Oth, NameType=Pro, NameType=Sur

ADP occurs with 31 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (50236 tokens). Examples: v, na, o, po, při, za

Relations

ADP nodes are attached to their parents using 16 different relations: cs-dep/case (143813; 99% instances), cs-dep/mwe (1439; 1% instances), cs-dep/foreign (251; 0% instances), cs-dep/nmod (228; 0% instances), cs-dep/conj (68; 0% instances), cs-dep/advmod (52; 0% instances), cs-dep/mark (43; 0% instances), cs-dep/root (11; 0% instances), cs-dep/dep (10; 0% instances), cs-dep/appos (9; 0% instances), cs-dep/cc (9; 0% instances), cs-dep/dobj (3; 0% instances), cs-dep/name (3; 0% instances), cs-dep/nsubj (2; 0% instances), cs-dep/advcl (1; 0% instances), cs-dep/advmod:emph (1; 0% instances)

Parents of ADP nodes belong to 15 different parts of speech: NOUN (112841; 77% instances), PROPN (16533; 11% instances), PRON (10549; 7% instances), NUM (2094; 1% instances), ADJ (1964; 1% instances), ADP (1110; 1% instances), ADV (504; 0% instances), VERB (125; 0% instances), DET (92; 0% instances), SYM (85; 0% instances), PART (31; 0% instances), ROOT (11; 0% instances), CONJ (2; 0% instances), INTJ (1; 0% instances), SCONJ (1; 0% instances)

143275 (98%) ADP nodes are leaves.

1790 (1%) ADP nodes have one child.

835 (1%) ADP nodes have two children.

43 (0%) ADP nodes have three or more children.

The highest child degree of a ADP node is 11.

Children of ADP nodes are attached using 15 different relations: cs-dep/mwe (3321; 91% instances), cs-dep/case (84; 2% instances), cs-dep/punct (61; 2% instances), cs-dep/nmod (40; 1% instances), cs-dep/cc (27; 1% instances), cs-dep/conj (24; 1% instances), cs-dep/parataxis (19; 1% instances), cs-dep/appos (18; 0% instances), cs-dep/foreign (17; 0% instances), cs-dep/dep (11; 0% instances), cs-dep/dobj (7; 0% instances), cs-dep/advmod (3; 0% instances), cs-dep/acl (2; 0% instances), cs-dep/advmod:emph (1; 0% instances), cs-dep/nsubj (1; 0% instances)

Children of ADP nodes belong to 11 different parts of speech: NOUN (2382; 66% instances), ADP (1110; 31% instances), PUNCT (61; 2% instances), CONJ (20; 1% instances), ADJ (18; 0% instances), PRON (15; 0% instances), PROPN (15; 0% instances), ADV (8; 0% instances), VERB (5; 0% instances), NUM (1; 0% instances), SCONJ (1; 0% instances)


ADP in other languages: [bg] [cs] [de] [el] [en] [es] [eu] [fa] [fi] [fr] [ga] [he] [hu] [it] [ja] [ko] [sv] [u]