Treebank Statistics: UD_Polish-PUD: POS Tags: X
There are 59 X lemmas (1%), 59 X types (1%) and 77 X tokens (0%).
Out of 15 observed tags, the rank of X is: 7 in number of lemmas, 8 in number of types and 15 in number of tokens.
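A minimal sketch of how lemma, type and token counts per tag, and the rank of a tag, could be derived from a CoNLL-U file. The sample rows are invented for illustration and are not taken from UD_Polish-PUD. The helper names `pos_counts` and `rank` are hypothetical, and treating types as case-sensitive word forms is an assumption.

```python
from collections import defaultdict

# Toy CoNLL-U fragment, invented for illustration (not from UD_Polish-PUD).
# Columns: ID FORM LEMMA UPOS XPOS FEATS HEAD DEPREL DEPS MISC
SAMPLE = """\
1\tEl\tEl\tX\t_\tForeign=Yes\t0\troot\t_\t_
2\tde\tde\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_
3\tDe\tde\tX\t_\tForeign=Yes\t1\tflat:foreign\t_\t_
4\trok\trok\tNOUN\t_\t_\t1\tnmod\t_\t_
5\t2004\t2004\tX\t_\tNumForm=Digit\t4\tamod\t_\t_
"""

def pos_counts(conllu):
    """Return {upos: (n_lemmas, n_types, n_tokens)} for a CoNLL-U string."""
    lemmas, types, tokens = defaultdict(set), defaultdict(set), defaultdict(int)
    for line in conllu.splitlines():
        if not line or line.startswith("#"):
            continue  # skip blank lines and sentence-level comments
        cols = line.split("\t")
        if not cols[0].isdigit():
            continue  # skip multiword ranges (1-2) and empty nodes (1.1)
        form, lemma, upos = cols[1], cols[2], cols[3]
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: types are case-sensitive forms
        tokens[upos] += 1
    return {p: (len(lemmas[p]), len(types[p]), tokens[p]) for p in tokens}

def rank(counts, upos, column):
    """1-based rank of `upos` by one column (0 lemmas, 1 types, 2 tokens)."""
    ordered = sorted(counts, key=lambda p: counts[p][column], reverse=True)
    return ordered.index(upos) + 1

counts = pos_counts(SAMPLE)
```

On the toy sample, `counts["X"]` holds the lemma, type and token counts for X, and `rank(counts, "X", 2)` gives its rank by token count.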
The 10 most frequent X lemmas: of, the, de, 2004, 2016, El, von, ‘ya, 1165, 1918
The 10 most frequent X types: of, the, de, 2004, 2016, El, Von, ‘Ya, 1.165, 1918
The 10 most frequent ambiguous lemmas: de (X 4, ADP 1), 2004 (X 2, ADJ 1), 2016 (X 2, ADJ 1), 1918 (ADJ 1, X 1), 1991 (ADJ 1, X 1), 1992 (ADJ 2, X 1), 1994 (ADJ 1, X 1), 1997 (ADJ 2, X 1), 2008 (ADJ 1, X 1), 2013 (ADJ 3, X 1)
The 10 most frequent ambiguous types: de (X 3, ADP 1), 2004 (X 2, ADJ 1), 2016 (X 2, ADJ 1), 1918 (ADJ 1, X 1), 1991 (ADJ 1, X 1), 1992 (ADJ 2, X 1), 1994 (ADJ 1, X 1), 1997 (ADJ 2, X 1), 2008 (ADJ 1, X 1), 2013 (ADJ 3, X 1)
- de
- 2004
- 2016
- 1918
  - ADJ 1: W lipcu 1918 podpisano traktat francusko - monakijski , zapewniający Monako ograniczoną ochronę ze strony Francji .
  - X 1: Mimo pozostawania Islandii pod polityczną kontrolą Danii do dużo późniejszych czasów ( 1918 ) , w języku islandzkim występuje bardzo niewiele wpływów i zapożyczeń z duńskiego .
- 1991
- 1992
- 1994
- 1997
- 2008
- 2013
  - ADJ 3: W 2013 r . wystąpiła poza sezonem w CTV Montreal jako gościnna prezenterka pogody .
  - X 1: Rząd federalny proaktywnie raportuje łączne poziomy premii za wyniki i dodatków dla każdego działu , ale najnowsze dane opublikowane w Internecie dotyczą lat 2013 - 2014 , czyli są przestarzałe o dwa lata .
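The ambiguity lists above can be read as "lemmas (or types) that occur with X and with at least one other tag". A minimal sketch of that selection follows, assuming the input is a list of (lemma, UPOS) pairs. The pairs are invented, and the function name `ambiguous` is hypothetical.

```python
from collections import Counter, defaultdict

# Invented (lemma, UPOS) pairs; not the real treebank counts.
ROWS = [
    ("de", "X"), ("de", "X"), ("de", "ADP"),
    ("2013", "ADJ"), ("2013", "X"),
    ("the", "X"),
]

def ambiguous(rows, tag="X"):
    """Return {lemma: {upos: count}} for lemmas seen with `tag` and another tag."""
    by_lemma = defaultdict(Counter)
    for lemma, upos in rows:
        by_lemma[lemma][upos] += 1
    return {
        lemma: dict(tags)
        for lemma, tags in by_lemma.items()
        if tag in tags and len(tags) > 1
    }

result = ambiguous(ROWS)
```

Passing word forms instead of lemmas would give the corresponding list of ambiguous types.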
Morphology
The form / lemma ratio of X is 1.000000 (the average of all parts of speech is 1.436503).
The 1st highest number of forms (1) was observed with the lemma “‘ya”: ‘Ya.
The 2nd highest number of forms (1) was observed with the lemma “1165”: 1.165.
The 3rd highest number of forms (1) was observed with the lemma “1918”: 1918.
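One plausible reading of the form / lemma ratio is the average number of distinct word forms per lemma. That reading is consistent with 59 types over 59 lemmas giving 1.0, but it is an assumption about how the statistic is computed. The sketch below uses invented pairs and a hypothetical function name.

```python
from collections import defaultdict

def form_lemma_ratio(pairs):
    """Average number of distinct forms per lemma over (lemma, form) pairs."""
    forms = defaultdict(set)
    for lemma, form in pairs:
        forms[lemma].add(form)  # assumption: forms are compared case-sensitively
    return sum(len(f) for f in forms.values()) / len(forms)

# Invented example: "de" has two forms, "1918" has one.
ratio = form_lemma_ratio([("de", "de"), ("de", "De"), ("1918", "1918")])
```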
X occurs with 2 features: Foreign (54; 70% instances), NumForm (21; 27% instances)
X occurs with 2 feature-value pairs: Foreign=Yes, NumForm=Digit
X occurs with 3 feature combinations.
The most frequent feature combination is Foreign=Yes (54 tokens).
Examples: of, the, de, El, Von, ‘Ya, A, Breaking, Buck, Century
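The feature figures can be reproduced from the FEATS column of a CoNLL-U file. The sketch below counts feature names, feature-value pairs and whole combinations. It assumes the empty value `_` counts as a combination of its own, which would explain 3 combinations arising from 2 pairs. The input list is invented and `feature_stats` is a hypothetical name.

```python
from collections import Counter

# Invented FEATS values for four X tokens; "_" means no features.
FEATS = ["Foreign=Yes", "Foreign=Yes", "NumForm=Digit", "_"]

def feature_stats(feats_column):
    """Count feature names, feature=value pairs and full combinations."""
    names, pairs, combos = Counter(), Counter(), Counter()
    for feats in feats_column:
        combos[feats] += 1  # the whole FEATS string is one combination
        if feats == "_":
            continue
        for pair in feats.split("|"):  # CoNLL-U separates features with "|"
            name, _, _value = pair.partition("=")
            names[name] += 1
            pairs[pair] += 1
    return names, pairs, combos

names, pairs, combos = feature_stats(FEATS)
```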
Relations
X nodes are attached to their parents using 11 different relations: flat:foreign (24; 31% instances), flat (15; 19% instances), amod (12; 16% instances), conj (7; 9% instances), nmod (6; 8% instances), obl (5; 6% instances), nsubj (3; 4% instances), fixed (2; 3% instances), appos (1; 1% instances), iobj (1; 1% instances), nmod:arg (1; 1% instances)
Parents of X nodes belong to 6 different parts of speech: X (30; 39% instances), NOUN (27; 35% instances), PROPN (11; 14% instances), VERB (5; 6% instances), ADJ (2; 3% instances), ADP (2; 3% instances)
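A minimal sketch of how the incoming relations and parent parts of speech of X nodes could be tallied for one sentence. The sentence is an invented list of (ID, UPOS, HEAD, DEPREL) tuples, and `incoming` is a hypothetical helper name.

```python
from collections import Counter

# One invented sentence as (id, upos, head, deprel); head 0 is the root.
SENT = [
    (1, "NOUN", 0, "root"),
    (2, "X", 1, "nmod"),
    (3, "X", 2, "flat:foreign"),
    (4, "X", 2, "flat:foreign"),
    (5, "PUNCT", 2, "punct"),
]

def incoming(sentence, tag="X"):
    """Count the relations and parent UPOS tags of all nodes tagged `tag`."""
    upos_of = {node_id: upos for node_id, upos, _, _ in sentence}
    rels, parents = Counter(), Counter()
    for _node_id, upos, head, deprel in sentence:
        if upos != tag:
            continue
        rels[deprel] += 1
        parents[upos_of.get(head, "ROOT")] += 1  # head 0 has no UPOS
    return rels, parents

rels, parents = incoming(SENT)
```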
21 (27%) X nodes are leaves.
40 (52%) X nodes have one child.
6 (8%) X nodes have two children.
10 (13%) X nodes have three or more children.
The highest child degree of an X node is 5.
Children of X nodes are attached using 8 different relations: punct (32; 38% instances), flat:foreign (22; 26% instances), flat (19; 22% instances), conj (6; 7% instances), cc (3; 4% instances), case (1; 1% instances), mark (1; 1% instances), nmod:flat (1; 1% instances)
Children of X nodes belong to 7 different parts of speech: PUNCT (32; 38% instances), X (30; 35% instances), PROPN (17; 20% instances), CCONJ (3; 4% instances), ADP (1; 1% instances), NOUN (1; 1% instances), SCONJ (1; 1% instances)
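The child-side figures (leaves, child degree, child relations and child parts of speech) could be computed along the following lines. This is a sketch over one invented sentence of (ID, UPOS, HEAD, DEPREL) tuples. The name `outgoing` and the bucketing of degrees into 0, 1, 2 and "3 or more" are illustrative choices.

```python
from collections import Counter

# One invented sentence as (id, upos, head, deprel); head 0 is the root.
SENT = [
    (1, "NOUN", 0, "root"),
    (2, "X", 1, "nmod"),
    (3, "X", 2, "flat:foreign"),
    (4, "X", 2, "flat:foreign"),
    (5, "PUNCT", 2, "punct"),
]

def outgoing(sentence, tag="X"):
    """Return (degree histogram, max degree, child relations, child UPOS)."""
    upos_of = {node_id: upos for node_id, upos, _, _ in sentence}
    # Start every `tag` node at degree 0 so that leaves are counted too.
    degree = {node_id: 0 for node_id, upos, _, _ in sentence if upos == tag}
    child_rels, child_pos = Counter(), Counter()
    for _node_id, upos, head, deprel in sentence:
        if upos_of.get(head) == tag:
            degree[head] += 1
            child_rels[deprel] += 1
            child_pos[upos] += 1
    # Bucket 3 stands for "three or more children".
    histogram = Counter(min(d, 3) for d in degree.values())
    return histogram, max(degree.values()), child_rels, child_pos

histogram, max_degree, child_rels, child_pos = outgoing(SENT)
```

In the toy sentence, node 2 has three children and nodes 3 and 4 are leaves, so `histogram` has two leaves and one node in the "three or more" bucket.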