Treebank Statistics: UD_German: POS Tags: PROPN
There are 16475 PROPN
lemmas (36%), 17041 PROPN
types (31%) and 31467 PROPN
tokens (11%).
Out of 15 observed tags, the rank of PROPN
is: 2 in number of lemmas, 2 in number of types and 5 in number of tokens.
The 10 most frequent PROPN
lemmas: d, von, Deutschland, de, of, Berlin, the, deutsch, US, SPD
The 10 most frequent PROPN
types: von, Deutschland, de, of, der, Berlin, the, US, SPD, St.
The 10 most frequent ambiguous lemmas: d (PROPN 251, SCONJ 19, X 3, ADJ 2, ADP 1, AUX 1, DET 1, NOUN 1), von (ADP 3417, PROPN 206, X 4), de (PROPN 126, ADP 9), of (PROPN 129, ADP 2), the (PROPN 39, X 1), deutsch (ADJ 179, PROPN 79, NOUN 7), US (PROPN 76, NOUN 1), Weltkrieg (PROPN 70, NOUN 1), Frankreich (PROPN 69, NOUN 1), St. (PROPN 67, NOUN 5)
The 10 most frequent ambiguous types: von (ADP 3264, PROPN 203), de (PROPN 126, ADP 9), of (PROPN 129, ADP 2), der (DET 8314, PRON 469, PROPN 97, ADP 1), the (PROPN 39, X 1), US (PROPN 76, NOUN 1), St. (PROPN 67, NOUN 5), für (ADP 1448, PROPN 59, NOUN 3, ADV 2, ADJ 1), Frankreich (PROPN 58, NOUN 1), la (PROPN 23, X 1)
- von
- de
- of
- der
- DET 8314: Für uns war vor allem der Hausbesuch das Highlight dieses Optikers .
- PRON 469: Wer also eine entspannende Massage sucht der ist hier genau richtig .
- PROPN 97: Hier geht seinem Eigennamen die Bezeichnung Herr der Kronen voraus .
- ADP 1: Chris Patten , der als Parteivorsitzender maßgeblich an dem Wahlsieg der britischen `` Tories ‘’ in dem April dieses Jahres beteiligt war , gilt als erfahrener Politiker .
- the
- US
- St.
- für
- ADP 1448: Es war für mich Ausgangspunkt zu einer Parfümkreation .
- PROPN 59: 1985 übernahm er eine Rolle in dem Film Was für ein Genie .
- NOUN 3: Typisch für das Chuanqi ist eine dramatische Liebesgeschichte , die persönliche Schicksale und Machtkämpfe in der Politik darstellt .
- ADV 2: Zunächst wurde es als Jugend - , Kultur - und Standesamt genutzt , da die Räumlichkeiten in der Hauptstraße 11-13 zu beengt für alle Ämter der Stadt Langenfeld waren .
- ADJ 1: Die Osteoporose ( von ostoun , Knochen ‘ und poros , Furt , Pore ‘ ) ist eine häufige Alters - Erkrankung des Knochens , die ihn für Brüche ( Frakturen ) anfälliger macht .
- Frankreich
- la
Morphology
The form / lemma ratio of PROPN
is 1.034355 (the average of all parts of speech is 1.186689).
The 1st highest number of forms (7) was observed with the lemma “d”: d, das, dem, den, der, des, die.
The 2nd highest number of forms (5) was observed with the lemma “deutsch”: Deutsch, Deutsche, Deutschen, Deutscher, Deutsches.
The 3rd highest number of forms (5) was observed with the lemma “neu”: Neu, Neuen, Neuer, neue, neues.
PROPN
occurs with 4 features: Case (9887; 31% instances), Number (9887; 31% instances), Gender (4750; 15% instances), Polarity (7; 0% instances)
PROPN
occurs with 11 feature-value pairs: Case=Acc
, Case=Dat
, Case=Gen
, Case=Nom
, Gender=Fem
, Gender=Masc
, Gender=Masc,Neut
, Gender=Neut
, Number=Plur
, Number=Sing
, Polarity=Neg
PROPN
occurs with 32 feature combinations.
The most frequent feature combination is _
(21573 tokens).
Examples: von, of, de, Deutschland, der, the, Berlin, US, St., für
Relations
PROPN
nodes are attached to their parents using 29 different relations: nmod (6919; 22% instances), flat (6290; 20% instances), appos (5145; 16% instances), obl (3107; 10% instances), nsubj (3077; 10% instances), conj (2571; 8% instances), amod (1926; 6% instances), case (717; 2% instances), obj (557; 2% instances), nsubj:pass (381; 1% instances), root (295; 1% instances), iobj (155; 0% instances), dep (99; 0% instances), nummod (54; 0% instances), advmod (37; 0% instances), acl (29; 0% instances), cop (22; 0% instances), xcomp (20; 0% instances), parataxis (18; 0% instances), det (12; 0% instances), aux (11; 0% instances), punct (8; 0% instances), advcl (5; 0% instances), ccomp (5; 0% instances), compound (3; 0% instances), cc (1; 0% instances), expl (1; 0% instances), fixed (1; 0% instances), mark (1; 0% instances)
Parents of PROPN
nodes belong to 16 different parts of speech: PROPN (15086; 48% instances), NOUN (8706; 28% instances), VERB (6402; 20% instances), ADJ (653; 2% instances), (295; 1% instances), ADP (156; 0% instances), PRON (53; 0% instances), NUM (50; 0% instances), ADV (32; 0% instances), CCONJ (9; 0% instances), X (8; 0% instances), AUX (6; 0% instances), DET (5; 0% instances), PART (3; 0% instances), PUNCT (2; 0% instances), SCONJ (1; 0% instances)
13570 (43%) PROPN
nodes are leaves.
6662 (21%) PROPN
nodes have one child.
4858 (15%) PROPN
nodes have two children.
6377 (20%) PROPN
nodes have three or more children.
The highest child degree of a PROPN
node is 26.
Children of PROPN
nodes are attached using 30 different relations: case (7691; 18% instances), punct (7326; 17% instances), flat (7237; 17% instances), det (5535; 13% instances), nmod (3080; 7% instances), conj (2560; 6% instances), appos (2445; 6% instances), amod (2431; 6% instances), cc (1448; 3% instances), advmod (573; 1% instances), acl (443; 1% instances), cop (273; 1% instances), nsubj (262; 1% instances), compound (231; 1% instances), nummod (207; 0% instances), obj (48; 0% instances), det:poss (43; 0% instances), nsubj:pass (37; 0% instances), dep (34; 0% instances), parataxis (19; 0% instances), advcl (17; 0% instances), fixed (13; 0% instances), aux (11; 0% instances), mark (11; 0% instances), compound:prt (4; 0% instances), xcomp (4; 0% instances), ccomp (3; 0% instances), iobj (3; 0% instances), obl (2; 0% instances), expl (1; 0% instances)
Children of PROPN
nodes belong to 15 different parts of speech: PROPN (15086; 36% instances), PUNCT (7328; 17% instances), ADP (6959; 17% instances), DET (5525; 13% instances), NOUN (2537; 6% instances), CCONJ (1442; 3% instances), ADJ (914; 2% instances), NUM (884; 2% instances), VERB (420; 1% instances), ADV (399; 1% instances), AUX (247; 1% instances), PRON (166; 0% instances), X (59; 0% instances), PART (15; 0% instances), SCONJ (11; 0% instances)