Treebank Statistics: UD_German-GSD: POS Tags: PROPN
There are 16458 PROPN
lemmas (36%), 17019 PROPN
types (31%) and 31184 PROPN
tokens (11%).
Out of 16 observed tags, the rank of PROPN
is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent PROPN
lemmas: von, Deutschland, de, of, US, Berlin, the, deutsch, SPD, Weltkrieg
The 10 most frequent PROPN
types: von, Deutschland, de, of, US, Berlin, the, SPD, St., für
The 10 most frequent ambiguous lemmas: von (ADP 3418, PROPN 206, X 4), de (PROPN 126, ADP 9, PRON 1), of (PROPN 129, ADP 2), US (PROPN 108, NOUN 3), the (PROPN 39, X 1), deutsch (ADJ 181, PROPN 79, NOUN 7), SPD (PROPN 71, NOUN 1), Weltkrieg (PROPN 70, NOUN 1), Frankreich (PROPN 69, NOUN 1), St. (PROPN 68, NOUN 5)
The 10 most frequent ambiguous types: von (ADP 3264, PROPN 203), de (PROPN 125, ADP 9, PRON 1), of (PROPN 129, ADP 2), US (PROPN 108, NOUN 3), the (PROPN 39, X 1), SPD (PROPN 71, NOUN 1), St. (PROPN 68, NOUN 5), für (ADP 1448, PROPN 59, NOUN 3, ADV 2, ADJ 1), Frankreich (PROPN 58, NOUN 1), la (PROPN 23, X 1)
- von
- de
- of
- US
- the
- SPD
- St.
- für
- ADP 1448: Es war für mich Ausgangspunkt zu einer Parfümkreation .
- PROPN 59: 1985 übernahm er eine Rolle in dem Film Was für ein Genie .
- NOUN 3: Typisch für das Chuanqi ist eine dramatische Liebesgeschichte , die persönliche Schicksale und Machtkämpfe in der Politik darstellt .
- ADV 2: Zunächst wurde es als Jugend - , Kultur - und Standesamt genutzt , da die Räumlichkeiten in der Hauptstraße 11-13 zu beengt für alle Ämter der Stadt Langenfeld waren .
- ADJ 1: Die Osteoporose ( von ostoun , Knochen ‘ und poros , Furt , Pore ‘ ) ist eine häufige Alters - Erkrankung des Knochens , die ihn für Brüche ( Frakturen ) anfälliger macht .
- Frankreich
- la
Morphology
The form / lemma ratio of PROPN
is 1.034087 (the average of all parts of speech is 1.185142).
The 1st highest number of forms (5) was observed with the lemma “deutsch”: Deutsch, Deutsche, Deutschen, Deutscher, Deutsches.
The 2nd highest number of forms (5) was observed with the lemma “neu”: Neu, Neuen, Neuer, neue, neues.
The 3rd highest number of forms (4) was observed with the lemma “Archiv”: Archiv, Archive, Archiven, Archives.
PROPN
occurs with 11 features: Number (28389; 91% instances), Case (28339; 91% instances), Gender (27341; 88% instances), Foreign (1183; 4% instances), NumType (427; 1% instances), VerbForm (63; 0% instances), Mood (52; 0% instances), Person (47; 0% instances), Tense (47; 0% instances), Poss (13; 0% instances), Polarity (2; 0% instances)
PROPN
occurs with 23 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Foreign=Yes
, Gender=Fem
, Gender=Masc
, Gender=Neut
, Mood=Imp
, Mood=Ind
, Mood=Sub
, NumType=Card
, Number=Plur
, Number=Sing
, Person=1
, Person=3
, Polarity=Neg
, Poss=Yes
, Tense=Past
, Tense=Pres
, VerbForm=Fin
, VerbForm=Inf
, VerbForm=Part
PROPN
occurs with 63 feature combinations.
The most frequent feature combination is Case=Nom|Gender=Masc|Number=Sing
(5959 tokens).
Examples: de, Hans, Paul, Peter, Johann, Wilhelm, Juli, Karl, August, Helmut
Relations
PROPN
nodes are attached to their parents using 28 different relations: nmod (6643; 21% instances), flat (6268; 20% instances), appos (5128; 16% instances), obl (3092; 10% instances), nsubj (3019; 10% instances), conj (2557; 8% instances), amod (1924; 6% instances), case (684; 2% instances), obj (546; 2% instances), nsubj:pass (379; 1% instances), root (293; 1% instances), iobj (152; 0% instances), dep (144; 0% instances), compound (139; 0% instances), nummod (54; 0% instances), advmod (35; 0% instances), acl (29; 0% instances), cop (22; 0% instances), xcomp (20; 0% instances), parataxis (18; 0% instances), aux (11; 0% instances), det (9; 0% instances), punct (7; 0% instances), advcl (4; 0% instances), ccomp (4; 0% instances), cc (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances)
Parents of PROPN
nodes belong to 15 different parts of speech: PROPN (14801; 47% instances), NOUN (8779; 28% instances), VERB (6325; 20% instances), ADJ (650; 2% instances), (293; 1% instances), ADP (154; 0% instances), PRON (69; 0% instances), NUM (49; 0% instances), ADV (32; 0% instances), CCONJ (9; 0% instances), X (8; 0% instances), AUX (6; 0% instances), DET (5; 0% instances), PART (3; 0% instances), SCONJ (1; 0% instances)
11929 (38%) PROPN
nodes are leaves.
8140 (26%) PROPN
nodes have one child.
5289 (17%) PROPN
nodes have two children.
5826 (19%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 26.
Children of PROPN
nodes are attached using 30 different relations: case (7669; 19% instances), flat (7099; 17% instances), punct (6365; 16% instances), det (5535; 14% instances), nmod (3077; 8% instances), conj (2554; 6% instances), appos (2438; 6% instances), amod (2428; 6% instances), cc (1445; 4% instances), advmod (571; 1% instances), acl (443; 1% instances), cop (272; 1% instances), nsubj (261; 1% instances), compound (252; 1% instances), nummod (207; 1% instances), dep (153; 0% instances), obj (48; 0% instances), det:poss (43; 0% instances), nsubj:pass (37; 0% instances), parataxis (19; 0% instances), advcl (17; 0% instances), fixed (13; 0% instances), aux (11; 0% instances), mark (11; 0% instances), compound:prt (4; 0% instances), xcomp (4; 0% instances), ccomp (3; 0% instances), iobj (3; 0% instances), obl (2; 0% instances), expl (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PROPN (14801; 36% instances), ADP (6959; 17% instances), PUNCT (6345; 15% instances), DET (5554; 14% instances), NOUN (2475; 6% instances), CCONJ (1444; 4% instances), ADJ (917; 2% instances), NUM (872; 2% instances), PRON (459; 1% instances), VERB (420; 1% instances), ADV (407; 1% instances), AUX (247; 1% instances), X (59; 0% instances), PART (15; 0% instances), SCONJ (11; 0% instances)