
This page pertains to UD version 2.

Treebank Statistics: UD_Czech-PDTC: POS Tags: ADP

There are 95 ADP lemmas (0%), 112 ADP types (0%) and 320013 ADP tokens (9%). Out of 17 observed tags, the rank of ADP is: 9 in number of lemmas, 11 in number of types and 4 in number of tokens.

The 10 most frequent ADP lemmas: v, na, z, s, o, do, za, k, pro, po

The 10 most frequent ADP types: v, na, z, o, s, do, za, ve, pro, k

The 10 most frequent ambiguous lemmas: v (ADP 76000, NOUN 9), z (ADP 26208, NOUN 20), s (ADP 25778, X 607, NOUN 34), o (ADP 21080, NOUN 106, X 2), do (ADP 16388, PROPN 11, X 3, NOUN 2), k (ADP 13437, NOUN 9), pro (ADP 11851, X 4, ADJ 2), podle (ADP 5306, ADV 1), u (ADP 5299, NOUN 3), bez (ADP 1832, NOUN 2)

The 10 most frequent ambiguous types: v (ADP 55190, NOUN 9), z (ADP 20296, NOUN 20), o (ADP 20204, ADJ 114, NOUN 106, X 2), s (ADP 20009, X 607, NOUN 368, ADJ 7), do (ADP 15607, PROPN 11, X 3, NOUN 2), pro (ADP 11078, X 4, ADJ 1), k (ADP 10677, NOUN 13), po (ADP 6967, NOUN 6), podle (ADP 3305, ADV 1), u (ADP 4719, NOUN 3)

Morphology

The form / lemma ratio of ADP is 1.178947 (the average of all parts of speech is 2.169184).
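The ratio above appears to be the number of distinct ADP word forms (types) divided by the number of distinct ADP lemmas. A minimal check under that assumption, using the counts reported at the top of this page (the variable names are illustrative only):

```python
# Counts reported on this page for ADP in UD_Czech-PDTC.
adp_types = 112   # distinct word forms tagged ADP
adp_lemmas = 95   # distinct lemmas tagged ADP

# Assumed definition: form / lemma ratio = types / lemmas.
ratio = adp_types / adp_lemmas
print(f"{ratio:.6f}")  # 1.178947
```

A value close to 1 means that most adpositions have a single surface form; the extra forms come mainly from vocalized variants such as "ke" and "ku" for "k".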

The 1st highest number of forms (3) was observed with the lemma “k”: k, ke, ku.

The 2nd highest number of forms (3) was observed with the lemma “nad”: n, nad, nade.

The 3rd highest number of forms (3) was observed with the lemma “před”: př, před, přede.

ADP occurs with 6 features: AdpType (320011; 100% instances), Case (319441; 100% instances), ExtPos (6425; 2% instances), Abbr (32; 0% instances), Style (21; 0% instances), Typo (1; 0% instances)

ADP occurs with 14 feature-value pairs: Abbr=Yes, AdpType=Comprep, AdpType=Prep, AdpType=Voc, Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, ExtPos=ADP, ExtPos=ADV, Style=Coll, Typo=Yes

ADP occurs with 30 feature combinations. The most frequent feature combination is AdpType=Prep|Case=Loc (101360 tokens). Examples: v, na, o, po, při
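A feature combination such as AdpType=Prep|Case=Loc is the content of the FEATS column in a CoNLL-U file: feature=value pairs joined by a vertical bar, or an underscore when no features are present. The sketch below splits such a string into its pairs; the helper name `parse_feats` is hypothetical and not part of any UD tool.

```python
def parse_feats(feats):
    """Split a CoNLL-U FEATS string into a {feature: value} dict."""
    if feats == "_":          # underscore marks an empty feature set
        return {}
    return dict(pair.split("=", 1) for pair in feats.split("|"))


combo = parse_feats("AdpType=Prep|Case=Loc")
print(combo)  # {'AdpType': 'Prep', 'Case': 'Loc'}
```

Counting whole FEATS strings yields the feature combinations, while counting the individual pairs yields the feature-value pairs listed above.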

Relations

ADP nodes are attached to their parents using 20 different relations: case (316765; 99% instances), fixed (2988; 1% instances), mark (128; 0% instances), conj (48; 0% instances), nmod (29; 0% instances), root (17; 0% instances), obl (13; 0% instances), dep (4; 0% instances), acl:relcl (3; 0% instances), cc (3; 0% instances), reparandum (3; 0% instances), advmod (2; 0% instances), appos (2; 0% instances), csubj:pass (2; 0% instances), acl (1; 0% instances), advcl (1; 0% instances), ccomp (1; 0% instances), obj (1; 0% instances), parataxis (1; 0% instances), xcomp (1; 0% instances)

Parents of ADP nodes belong to 16 different parts of speech: NOUN (247610; 77% instances), PROPN (30209; 9% instances), PRON (14041; 4% instances), DET (11470; 4% instances), NUM (7940; 2% instances), ADJ (2862; 1% instances), ADP (2163; 1% instances), X (1655; 1% instances), ADV (1309; 0% instances), SYM (551; 0% instances), VERB (162; 0% instances), no part of speech, i.e. the artificial root node (17; 0% instances), PART (10; 0% instances), AUX (9; 0% instances), CCONJ (3; 0% instances), INTJ (2; 0% instances)

313482 (98%) ADP nodes are leaves.

4898 (2%) ADP nodes have one child.

1585 (0%) ADP nodes have two children.

48 (0%) ADP nodes have three or more children.

The highest child degree of an ADP node is 6.
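The child-degree figures above can in principle be recomputed from the treebank's CoNLL-U files by counting, for every ADP node, how many other nodes name it as their head. The following is only a sketch of that idea, not the script that generated this page; the two-sentence sample and the function name `adp_child_degrees` are made up for illustration.

```python
from collections import Counter

# Hypothetical CoNLL-U sample (not taken from UD_Czech-PDTC).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = "\n".join([
    "1\tv\tv\tADP\t_\tAdpType=Prep|Case=Loc\t2\tcase\t_\t_",
    "2\tPraze\tPraha\tPROPN\t_\t_\t0\troot\t_\t_",
    "",
    "1\tna\tna\tADP\t_\t_\t4\tcase\t_\t_",
    "2\trozdíl\trozdíl\tNOUN\t_\t_\t1\tfixed\t_\t_",
    "3\tod\tod\tADP\t_\t_\t1\tfixed\t_\t_",
    "4\tPetra\tPetr\tPROPN\t_\t_\t0\troot\t_\t_",
])


def adp_child_degrees(conllu_text):
    """Map child degree -> number of ADP nodes with that many children."""
    degrees = Counter()
    for block in conllu_text.strip().split("\n\n"):
        rows = [line.split("\t") for line in block.splitlines()
                if line and not line.startswith("#")]
        # Keep basic word lines only (skip ranges like "1-2" and empty nodes like "1.1").
        rows = [r for r in rows if r[0].isdigit()]
        # Number of children per head ID within this sentence.
        n_children = Counter(r[6] for r in rows)
        for r in rows:
            if r[3] == "ADP":
                degrees[n_children[r[0]]] += 1
    return degrees


degrees = adp_child_degrees(SAMPLE)
print(sorted(degrees.items()))  # [(0, 2), (2, 1)]
```

In the sample, "v" and "od" are leaves, while "na" heads the fixed expression "na rozdíl od" and so has two children. On this page, the four reported counts (313482, 4898, 1585 and 48) add up to 320013, the total number of ADP tokens.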

Children of ADP nodes are attached using 20 different relations: fixed (7983; 97% instances), punct (80; 1% instances), conj (38; 0% instances), cop (37; 0% instances), cc (34; 0% instances), nsubj (29; 0% instances), obl (11; 0% instances), advmod (10; 0% instances), case (10; 0% instances), advcl (4; 0% instances), advmod:emph (4; 0% instances), mark (4; 0% instances), nmod (4; 0% instances), aux (3; 0% instances), det (3; 0% instances), acl:relcl (1; 0% instances), advcl:pred (1; 0% instances), discourse (1; 0% instances), obj (1; 0% instances), obl:arg (1; 0% instances)

Children of ADP nodes belong to 14 different parts of speech: NOUN (5876; 71% instances), ADP (2163; 26% instances), PUNCT (80; 1% instances), AUX (40; 0% instances), CCONJ (34; 0% instances), ADV (14; 0% instances), VERB (12; 0% instances), DET (11; 0% instances), ADJ (6; 0% instances), NUM (6; 0% instances), PROPN (5; 0% instances), SCONJ (5; 0% instances), PART (4; 0% instances), PRON (3; 0% instances)