Treebank Statistics: UD_Ligurian-GLT: POS Tags: NUM
There are 31 NUM
lemmas (2%), 34 NUM
types (2%) and 42 NUM
tokens (1%).
Out of 16 observed tags, the rank of NUM
is: 7 in number of lemmas, 9 in number of types and 14 in number of tokens.
The 10 most frequent NUM
lemmas: doî, eutto, 1929, quaranta, quattro, trenta, 1746, 1815, 1847, 1874
The 10 most frequent NUM
types: doe, doî, eutto, 1929, quaranta, 1746, 1815, 1847, 1874, 1892
The 10 most frequent ambiguous lemmas: doî (NUM 6, NOUN 1), un (DET 116, PRON 18, NUM 1), çinqueçento (NOUN 1, NUM 1)
The 10 most frequent ambiguous types: doe (NUM 3, NOUN 1), un (DET 76, PRON 12, NUM 1), Çinqueçento (NOUN 1, NUM 1)
- doe
- un
- DET 76: Ciù un pö no me piggia un açidente .
- PRON 12: No pe caxo , un de i versci o minsuña o Balilla .
- NUM 1: Ne sei pròpio seguo che feua cà , lonxi da i vòstri euggi e da quelli de sò moæ , infiâ inte unna famiggia ch’ ei conosciuo giusto pe caxo pe un meise ò doî , a se mantegnià delongo quello ch’ a l’ é oua , e che un giorno no aviei da pentî ve à lagrime e sangue d’ ëse ve ne destaccou in raxon d’ un guägno ch’ o no l’ é manco seguo ?
- Çinqueçento
- NOUN 1: Da a meitæ de o trexento à i primmi de o Çinqueçento no gh’ é stæto gexa , cappella , ötöio da Saña fin à a Fransa che no seggian stæti decoræ segondo o dexidëio e e poscibilitæ de ciascheduña communitæ .
- NUM 1: O l’ é o “ Basso de Zena “ , un tòcco de o Çinqueçento ch’ o representava a çittæ à l’ indefeua .
Morphology
The form / lemma ratio of NUM
is 1.096774 (the average of all parts of speech is 1.323735).
The 1st highest number of forms (2) was observed with the lemma “doî”: doe, doî.
The 2nd highest number of forms (2) was observed with the lemma “quattro”: quattr’, quattro.
The 3rd highest number of forms (2) was observed with the lemma “trenta”: trent’, trenta.
NUM
occurs with 2 features: NumType (42; 100% instances), Gender (7; 17% instances)
NUM
occurs with 3 feature-value pairs: Gender=Fem
, Gender=Masc
, NumType=Card
NUM
occurs with 3 feature combinations.
The most frequent feature combination is NumType=Card
(35 tokens).
Examples: eutto, 1929, quaranta, doe, doî, 1746, 1815, 1847, 1874, 1892
Relations
NUM
nodes are attached to their parents using 7 different relations: nummod (19; 45% instances), nmod (11; 26% instances), obl (7; 17% instances), obj (2; 5% instances), conj (1; 2% instances), flat (1; 2% instances), root (1; 2% instances)
Parents of NUM
nodes belong to 5 different parts of speech: NOUN (27; 64% instances), VERB (8; 19% instances), PROPN (5; 12% instances), (1; 2% instances), X (1; 2% instances)
20 (48%) NUM
nodes are leaves.
3 (7%) NUM
nodes have one child.
17 (40%) NUM
nodes have two children.
2 (5%) NUM
nodes have three or more children.
The highest child degree of a NUM
node is 5.
Children of NUM
nodes are attached using 9 different relations: case (19; 41% instances), det (18; 39% instances), punct (3; 7% instances), acl:relcl (1; 2% instances), advmod (1; 2% instances), cc (1; 2% instances), conj (1; 2% instances), nmod (1; 2% instances), parataxis (1; 2% instances)
Children of NUM
nodes belong to 7 different parts of speech: ADP (19; 41% instances), DET (18; 39% instances), PUNCT (3; 7% instances), ADV (2; 4% instances), VERB (2; 4% instances), CCONJ (1; 2% instances), NOUN (1; 2% instances)