Treebank Statistics: UD_Upper_Sorbian-UFAL: POS Tags: X
There are 166 X
lemmas (5%), 166 X
types (4%) and 199 X
tokens (2%).
Out of 16 observed tags, the rank of X
is: 7 in number of lemmas, 7 in number of types and 13 in number of tokens.
The 10 most frequent X
lemmas: a, i, vitis, al, backen, o, apg, au, b, c
The 10 most frequent X
types: a, i, Vitis, al, backen, o, APG, H, au, b
The 10 most frequent ambiguous lemmas: a (CCONJ 337, X 5), c (X 2, PROPN 1), dr (X 2, NOUN 1), mj (VERB 2, X 2), und (X 2, PROPN 1), center (NOUN 1, X 1), d (ADJ 1, X 1), et (CCONJ 2, X 1), institut (NOUN 16, X 1), k (ADP 48, X 1)
The 10 most frequent ambiguous types: a (CCONJ 337, X 4), H (X 2, PROPN 1), dr (X 2, NOUN 1), mj (VERB 2, X 2, ADJ 1), n (DET 37, ADP 2, X 2), und (X 2, PROPN 1), INSTITUT (NOUN 1, X 1), et (CCONJ 2, X 1), k (ADP 41, X 1), m (NOUN 2, X 1)
- a
- H
- dr
- X 2: Su w germanšćinje sydom ablawtowych rjadow , znutřka kotrychž wokal so po krutym prawidle ablawtuje ( prěnjotna přičina za to su mj . dr . sćěhowace konsonanty ) .
- NOUN 1: Zjawny přednošk w Serbskim instituće na Dwórnišćowej 6 ( PD dr . Sönke Friedreich , ISGV Drježdźany ) : “ Vom Urlaub erzählen . Zur Erfahrungsgeschichte des Reisens in der DDR “
- mj
- VERB 2: Zo by so GNU - licenca dodźeržała , dyrbi so při tym 3 - 5 tamnišich t . mj . hłownych awtorow mjenować .
- X 2: Su w germanšćinje sydom ablawtowych rjadow , znutřka kotrychž wokal so po krutym prawidle ablawtuje ( prěnjotna přičina za to su mj . dr . sćěhowace konsonanty ) .
- ADJ 1: W času persiskeho knjejstwa bu aramejšćina mócnarstwowa rěč , t . mj . mócnarstwowa aramejšćina .
- n
- DET 37: Klinowe pismo docpě wo 2700 př . n . l . swoju dokonjanosć .
- ADP 2: Stolica abo hłowne město je politiski , husto tež stawizniski centrum kraja abo stata a tuž zwjetša sydło najwyšich politiskich institucijow , kaž n . př . knježerstwa , sejma abo monarcha .
- X 2: Lokalne adwerby móža akuzatiwny n měć , hdyž směr zwuraznjeja , na př . tie “ tam , tamle “ - tien “ tam “ .
- und
- X 2: Pod titlom “ Wandel gestalten - Geschichten und Strategien um Identitäts - und Landschaftswandel in der Lausitz “ wuhotuje Serbski institut na Gižkojskim kuble we Wikach pola Drjowka fachowu konferencu , na kotrejž wobdźěli so 16 referentow a referentkow z wobłukow wědomosće , hospodarstwa a regionalneho zarjadnistwa .
- PROPN 1: Knižna premjera zwjazka 48 Spisow Serbskeho instituta “ Stätten und Stationen religiösen Wirkens “ w Smolerjec kniharni
- INSTITUT
- et
- k
- m
Morphology
The form / lemma ratio of X
is 1.000000 (the average of all parts of speech is 1.418889).
The 1st highest number of forms (1) was observed with the lemma “100px”: 100px.
The 2nd highest number of forms (1) was observed with the lemma “100x200px”: 100x200px.
The 3rd highest number of forms (1) was observed with the lemma “a”: a.
X
occurs with 1 features: Abbr (8; 4% instances)
X
occurs with 1 feature-value pairs: Abbr=Yes
X
occurs with 2 feature combinations.
The most frequent feature combination is _
(191 tokens).
Examples: a, i, Vitis, al, backen, o, H, au, b, c
Relations
X
nodes are attached to their parents using 17 different relations: conj (48; 24% instances), flat (46; 23% instances), nmod (44; 22% instances), appos (22; 11% instances), nsubj (7; 4% instances), parataxis (6; 3% instances), dep (4; 2% instances), list (4; 2% instances), compound (3; 2% instances), fixed (3; 2% instances), obl (3; 2% instances), advmod:emph (2; 1% instances), flat:foreign (2; 1% instances), obj (2; 1% instances), amod (1; 1% instances), dep:alt (1; 1% instances), root (1; 1% instances)
Parents of X
nodes belong to 7 different parts of speech: X (94; 47% instances), NOUN (75; 38% instances), VERB (15; 8% instances), PROPN (9; 5% instances), ADV (3; 2% instances), ADJ (2; 1% instances), (1; 1% instances)
74 (37%) X
nodes are leaves.
61 (31%) X
nodes have one child.
26 (13%) X
nodes have two children.
38 (19%) X
nodes have three or more children.
The highest child degree of a X
node is 15.
Children of X
nodes are attached using 14 different relations: punct (122; 42% instances), flat (46; 16% instances), conj (42; 14% instances), appos (23; 8% instances), advmod (13; 4% instances), cc (13; 4% instances), case (8; 3% instances), amod (5; 2% instances), advmod:emph (4; 1% instances), nummod (4; 1% instances), fixed (3; 1% instances), list (3; 1% instances), nmod (3; 1% instances), dep (1; 0% instances)
Children of X
nodes belong to 11 different parts of speech: PUNCT (122; 42% instances), X (94; 32% instances), ADV (16; 6% instances), CCONJ (13; 4% instances), NOUN (13; 4% instances), ADP (11; 4% instances), ADJ (7; 2% instances), NUM (5; 2% instances), PROPN (5; 2% instances), VERB (3; 1% instances), SCONJ (1; 0% instances)