Treebank Statistics: UD_Lithuanian-ALKSNIS: POS Tags: PROPN
There are 559 PROPN lemmas (6%), 702 PROPN types (4%) and 1593 PROPN tokens (2%).
Out of 17 observed tags, the rank of PROPN is: 4 in number of lemmas, 4 in number of types and 11 in number of tokens.
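Lemma, type and token counts of this kind can be tallied from a CoNLL-U file. The following is a minimal sketch, not the script that produced this page. The sample sentences and their annotation are invented for illustration, the names `read_tokens`, `pos_statistics` and `rank` are hypothetical, and it assumes that types are case-sensitive word forms and that tied tags share a rank.

```python
from collections import defaultdict

# Toy CoNLL-U fragment, hand-annotated for illustration only
# (not copied from the treebank).
SAMPLE = (
    "1\tLietuvos\tLietuva\tPROPN\t_\tCase=Gen|Gender=Fem|Number=Sing\t2\tnmod\t_\t_\n"
    "2\tsostinė\tsostinė\tNOUN\t_\tCase=Nom|Gender=Fem|Number=Sing\t0\troot\t_\t_\n"
    "3\tVilnius\tVilnius\tPROPN\t_\tCase=Nom|Gender=Masc|Number=Sing\t2\tappos\t_\t_\n"
    "\n"
    "1\tLietuvoje\tLietuva\tPROPN\t_\tCase=Loc|Gender=Fem|Number=Sing\t2\tobl\t_\t_\n"
    "2\tgyvenu\tgyventi\tVERB\t_\t_\t0\troot\t_\t_\n"
)


def read_tokens(conllu_text):
    """Yield (form, lemma, upos) for each basic token line."""
    for line in conllu_text.splitlines():
        if not line or line.startswith("#"):
            continue  # skip sentence breaks and comment lines
        cols = line.split("\t")
        # Skip multiword-token ranges (e.g. 1-2) and empty nodes (e.g. 1.1).
        if not cols[0].isdigit():
            continue
        yield cols[1], cols[2], cols[3]


def pos_statistics(conllu_text):
    """Map each UPOS tag to (number of lemmas, number of types, number of tokens)."""
    lemmas = defaultdict(set)
    types = defaultdict(set)
    tokens = defaultdict(int)
    for form, lemma, upos in read_tokens(conllu_text):
        lemmas[upos].add(lemma)
        types[upos].add(form)  # assumption: forms are compared case-sensitively
        tokens[upos] += 1
    return {u: (len(lemmas[u]), len(types[u]), tokens[u]) for u in tokens}


def rank(stats, tag, column):
    """1-based rank of `tag` in a column (0 = lemmas, 1 = types, 2 = tokens)."""
    value = stats[tag][column]
    return 1 + sum(1 for counts in stats.values() if counts[column] > value)


stats = pos_statistics(SAMPLE)
print(stats["PROPN"])           # (lemmas, types, tokens) for PROPN
print(rank(stats, "PROPN", 2))  # rank of PROPN by token count
```

In the toy sample, Lietuvos and Lietuvoje count as two types of the single lemma Lietuva, which is the distinction the lemma and type figures above rely on.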
The 10 most frequent PROPN lemmas: Lietuva, Europa, Vilnius, Kaunas, Šengenas, Kalėdos, Marcinkevičienė, Glaveckas, Mažuolis, Rusija
The 10 most frequent PROPN types: Lietuvos, Europos, Lietuvoje, Kauno, Vilniaus, Lietuva, Šengeno, Lietuvai, Vilnius, Kalėdų
The 10 most frequent ambiguous lemmas: Kalėdos (PROPN 14, NOUN 1), Iglesias (PROPN 1, X 1), klausimas (NOUN 63, PROPN 1)
The 10 most frequent ambiguous types: Kalėdų (PROPN 10, NOUN 1), Gintaras (PROPN 2, NOUN 1), Greta (PROPN 2, ADP 1), Iglesias (PROPN 1, X 1), Seimo (NOUN 28, PROPN 1), Vidutinė (ADJ 1, PROPN 1), klausimai (NOUN 8, PROPN 1)
- Kalėdų
- Gintaras
- Greta
  - PROPN 2: 1990 m . mirė Švedijoje gimusi kino žvaigždė Greta Garbo .
  - ADP 1: Greta įprastinių parametrų matavome šiuos naujus parametrus : latencijos ( Lp 300 ) ir amplitudės ( Ap 300 ) santykį , atpažinimo laiką ( IT ) , kognityvinio komplekso statumą ( SCC ) , P 300 bangos energiją , P 300 bangos nusileidžiančiosios dalies greitį ( SDS ) bei greičio parametrus A ir B .
- Iglesias
- Seimo
  - NOUN 28: Seimo komitetai svarsto šį pasiūlymą “ , - žino direktorė .
  - PROPN 1: Pasiūlymų labai daug įvairių buvo išnagrinėta ne tik Vyriausybėje , bet ir Seimo komitetuose , tai ką , jie ( „ NDX Energija “ - red . past . ) neturi teisės atnešti , jie yra viena iš kompanijų , kuri dalyvauja elektros versle .
- Vidutinė
- klausimai
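An ambiguous type, as used in the list above, is a word form that occurs with PROPN and with at least one other tag. A minimal sketch of that check follows. The observation list is hypothetical, with the Greta counts chosen to match the PROPN 2, ADP 1 figures reported above, and `ambiguous_types` is an illustrative name rather than part of any UD tool.

```python
from collections import Counter, defaultdict

# Hypothetical (form, UPOS) observations; only the Greta counts mirror the page.
observations = [
    ("Greta", "PROPN"),
    ("Greta", "PROPN"),
    ("Greta", "ADP"),
    ("Lietuvos", "PROPN"),
    ("komitetai", "NOUN"),
]


def ambiguous_types(pairs, tag):
    """Return forms seen with `tag` and at least one other tag, with per-tag counts."""
    tags_by_form = defaultdict(Counter)
    for form, upos in pairs:
        tags_by_form[form][upos] += 1
    return {
        form: dict(counts)
        for form, counts in tags_by_form.items()
        if tag in counts and len(counts) > 1
    }


print(ambiguous_types(observations, "PROPN"))
```

Applying the same function to (lemma, UPOS) pairs instead of (form, UPOS) pairs would give the ambiguous lemmas.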
Morphology
The form / lemma ratio of PROPN is 1.255814 (the average of all parts of speech is 2.065341).
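The reported ratio is consistent with dividing the number of PROPN types by the number of PROPN lemmas. The page does not spell out the formula, so treat this short check as an inference from the figures rather than a documented definition.

```python
# Counts reported at the top of this page.
propn_types = 702
propn_lemmas = 559

# Assumption: "forms" in the form / lemma ratio are the distinct word types.
ratio = propn_types / propn_lemmas
print(f"{ratio:.6f}")  # 1.255814
```

A ratio close to 1 means that most proper-noun lemmas are attested in only one or two inflected forms, well below the 2.065341 average across all parts of speech.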
The 1st highest number of forms (8) was observed with the lemma “Lietuva”: LIETUVA, LIETUVOJE, LIETUVOS, Lietuva, Lietuvai, Lietuvoje, Lietuvos, Lietuvą.
The 2nd highest number of forms (5) was observed with the lemma “Europa”: EUROPOS, Europa, Europoje, Europos, Europą.
The 3rd highest number of forms (5) was observed with the lemma “Glaveckas”: GLAVECKAS, Glaveckas, Glavecko, Glavecku, Glavecką.
PROPN occurs with 3 features: Gender (1574; 99% instances), Number (1573; 99% instances), Case (1572; 99% instances)
PROPN occurs with 10 feature-value pairs: Case=Acc, Case=Dat, Case=Gen, Case=Ins, Case=Loc, Case=Nom, Gender=Fem, Gender=Masc, Number=Plur, Number=Sing
PROPN occurs with 22 feature combinations.
The most frequent feature combination is Case=Gen|Gender=Fem|Number=Sing (522 tokens).
Examples: Lietuvos, Europos, LIETUVOS, EUROPOS, Marcinkevičienės, Rusijos, Baltijos, Trejybės, Latvijos, Prancūzijos
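The feature, feature-value pair and feature-combination counts above can be derived from the FEATS column of each PROPN token. The sketch below uses a hypothetical list of FEATS strings. It assumes that an empty FEATS value (`_`) is not counted as a combination, which the page does not confirm.

```python
from collections import Counter

# Hypothetical FEATS values of a few PROPN tokens ("_" means no features).
feats_column = [
    "Case=Gen|Gender=Fem|Number=Sing",
    "Case=Gen|Gender=Fem|Number=Sing",
    "Case=Nom|Gender=Masc|Number=Sing",
    "_",
]


def parse_feats(feats):
    """Turn 'Case=Gen|Number=Sing' into {'Case': 'Gen', 'Number': 'Sing'}."""
    if feats == "_":
        return {}
    return dict(item.split("=", 1) for item in feats.split("|"))


features = Counter()      # e.g. Case
pairs = Counter()         # e.g. Case=Gen
combinations = Counter()  # e.g. Case=Gen|Gender=Fem|Number=Sing

for feats in feats_column:
    parsed = parse_feats(feats)
    if not parsed:
        continue  # assumption: tokens without features form no combination
    features.update(parsed.keys())
    pairs.update(f"{name}={value}" for name, value in parsed.items())
    # Sort by feature name so equal feature sets always map to the same key.
    combinations["|".join(f"{n}={v}" for n, v in sorted(parsed.items()))] += 1

print(len(features), len(pairs), len(combinations))
print(combinations.most_common(1))
```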
Relations
PROPN nodes are attached to their parents using 14 different relations: nmod (883; 55% instances), obl (146; 9% instances), nsubj (141; 9% instances), conj (126; 8% instances), flat (110; 7% instances), obl:arg (79; 5% instances), root (73; 5% instances), obj (20; 1% instances), parataxis (9; 1% instances), nsubj:pass (2; 0% instances), advcl (1; 0% instances), appos (1; 0% instances), ccomp (1; 0% instances), iobj (1; 0% instances)
Parents of PROPN nodes belong to 10 different parts of speech: NOUN (976; 61% instances), VERB (264; 17% instances), PROPN (245; 15% instances), no parent, i.e. the node is the root (73; 5% instances), ADJ (15; 1% instances), X (12; 1% instances), ADV (3; 0% instances), PRON (3; 0% instances), DET (1; 0% instances), PART (1; 0% instances)
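The relation and parent counts above can be tallied from the HEAD and DEPREL columns. The following sketch works on a single hypothetical sentence given as `(id, upos, head, deprel)` tuples. The `ROOT` label for nodes with head 0 and the helper `percent` are conventions of this example, not of the page.

```python
from collections import Counter

# Hypothetical sentence: (id, upos, head, deprel); head 0 marks the root.
tokens = [
    (1, "PROPN", 2, "nmod"),
    (2, "NOUN", 0, "root"),
    (3, "PROPN", 2, "nmod"),
    (4, "CCONJ", 5, "cc"),
    (5, "PROPN", 3, "conj"),
]

upos_of = {tok_id: upos for tok_id, upos, _, _ in tokens}

# Relation that attaches each PROPN node to its parent.
relations = Counter(deprel for _, upos, _, deprel in tokens if upos == "PROPN")

# Part of speech of each PROPN node's parent ("ROOT" when there is none).
parents = Counter(
    upos_of.get(head, "ROOT") for _, upos, head, _ in tokens if upos == "PROPN"
)


def percent(count, total):
    """Share of `count` in `total`, rounded to a whole percent."""
    return round(100 * count / total)


print(relations)
print(parents)
print(percent(883, 1593))  # nmod share among the 1593 PROPN tokens
```

For a full treebank the `upos_of` lookup would have to be rebuilt for every sentence, since token ids restart at 1.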
1057 (66%) PROPN nodes are leaves.
312 (20%) PROPN nodes have one child.
140 (9%) PROPN nodes have two children.
84 (5%) PROPN nodes have three or more children.
The highest child degree of a PROPN node is 8.
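The child degree of a node is the number of tokens whose HEAD points to it. A minimal sketch of the distribution above, for one hypothetical sentence given as `(id, upos, head)` tuples, with `child_degrees` as an illustrative name:

```python
from collections import Counter

# Hypothetical sentence: (id, upos, head); head 0 marks the root.
tokens = [
    (1, "PROPN", 2),
    (2, "NOUN", 0),
    (3, "PROPN", 2),
    (4, "PUNCT", 3),
    (5, "CCONJ", 6),
    (6, "PROPN", 3),
]


def child_degrees(sentence, tag):
    """Map a child count to the number of `tag` nodes with that many children."""
    children_per_head = Counter(head for _, _, head in sentence)
    # A Counter returns 0 for ids that never occur as a head, i.e. leaves.
    return Counter(
        children_per_head[tok_id] for tok_id, upos, _ in sentence if upos == tag
    )


degrees = child_degrees(tokens, "PROPN")
print(degrees[0], "leaves")
print(max(degrees), "is the highest child degree")
```

Summing such per-sentence counters over the whole treebank would give the leaf, one-child, two-children and three-or-more figures. The four counts reported above add up to the 1593 PROPN tokens.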
Children of PROPN nodes are attached using 21 different relations: nmod (314; 35% instances), punct (230; 26% instances), conj (142; 16% instances), case (59; 7% instances), cc (53; 6% instances), advmod:emph (18; 2% instances), det (14; 2% instances), acl (13; 1% instances), amod (9; 1% instances), nsubj (8; 1% instances), appos (7; 1% instances), acl:relcl (6; 1% instances), mark (6; 1% instances), obl (5; 1% instances), cop (3; 0% instances), flat (3; 0% instances), obl:arg (3; 0% instances), advmod (2; 0% instances), parataxis (2; 0% instances), csubj (1; 0% instances), nummod (1; 0% instances)
Children of PROPN nodes belong to 15 different parts of speech: PROPN (245; 27% instances), PUNCT (230; 26% instances), X (146; 16% instances), NOUN (75; 8% instances), ADP (59; 7% instances), CCONJ (53; 6% instances), VERB (22; 2% instances), PART (18; 2% instances), DET (14; 2% instances), PRON (11; 1% instances), ADJ (10; 1% instances), SCONJ (6; 1% instances), NUM (5; 1% instances), AUX (3; 0% instances), ADV (2; 0% instances)