forked from UniversalDependencies/UD_Spanish-AnCora
-
Notifications
You must be signed in to change notification settings - Fork 0
/
Copy patheval.log
80 lines (80 loc) · 6.21 KB
/
eval.log
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
Running the following version of UD tools:
commit 78ce4b21495c6e4c17a7b07925bec1267d833d14
Author: Dan Zeman <[email protected]>
Date: Sun May 5 09:21:16 2024 +0200
Evaluating the following revision of UD_Spanish-AnCora:
commit 4a9119b5c1b562bfd8bd9e711defd290b2ae9136
Merge: 22f0f0e b210633
Author: Dan Zeman <[email protected]>
Size: counted 560137 of 560137 words (nodes).
Size: min(0, log((N/1000)**2)) = 12.6563627933323.
Size: maximum value 13.815511 is for 1000000 words or more.
Split: Found more than 10000 training words.
Split: Found at least 10000 development words.
Split: Found at least 10000 test words.
Lemmas: source of annotation (from README) factor is 0.8.
Universal POS tags: 17 out of 17 found in the corpus.
Universal POS tags: source of annotation (from README) factor is 0.8.
Features: 382699 out of 560137 total words have one or more features.
Features: source of annotation (from README) factor is 0.8.
Universal relations: 33 out of 37 found in the corpus.
Universal relations: source of annotation (from README) factor is 0.8.
Udapi:
TOTAL 1375
Udapi: found 1375 bugs.
Udapi: worst expected case (threshold) is one bug per 10 words. There are 560137 words.
Genres: found 1 out of 17 known.
/net/work/people/zeman/unidep/tools/validate.py --lang es --max-err=10 UD_Spanish-AnCora/es_ancora-ud-dev.conllu
[Line 25 Sent 3LB-CAST-111_C-2-s1 Node 18]: [L3 Warning fixed-gap] Gaps in fixed expression [18, 20] 'a * máximo'
[Line 2476 Sent 3LB-CAST-c1-2-s10 Node 28]: [L3 Warning fixed-gap] Gaps in fixed expression [28, 30] 'a * menos'
[Line 2540 Sent 3LB-CAST-c2-6-s1 Node 15]: [L3 Warning fixed-gap] Gaps in fixed expression [15, 17] 'de * todo'
[Line 4636 Sent 3LB-CAST-dc1-15-s9 Node 1]: [L3 Warning fixed-gap] Gaps in fixed expression [1, 3, 4] 'A * igual que'
[Line 4673 Sent 3LB-CAST-dc1-15-s10 Node 13]: [L3 Warning fixed-gap] Gaps in fixed expression [13, 15] 'a * momento'
[Line 4859 Sent 3LB-CAST-dc1-3-s5 Node 1]: [L3 Warning fixed-gap] Gaps in fixed expression [1, 3, 4] 'A * igual que'
[Line 7340 Sent 3LB-CAST-r2-7-s13 Node 2]: [L3 Warning fixed-gap] Gaps in fixed expression [2, 4] 'a * final'
[Line 17222 Sent CESS-CAST-A-20000618-14736-s7 Node 31]: [L3 Warning fixed-gap] Gaps in fixed expression [31, 33, 34] 'a * tiempo que'
[Line 17806 Sent CESS-CAST-A-20000620-15780-s7 Node 44]: [L3 Warning fixed-gap] Gaps in fixed expression [44, 46, 47] 'a * tiempo que'
[Line 19380 Sent CESS-CAST-A-20001117-13026-s9 Node 45]: [L3 Warning fixed-gap] Gaps in fixed expression [45, 47, 48] 'a * menos durante'
...suppressing further errors regarding Warning
Warnings: 25
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang es --max-err=10 UD_Spanish-AnCora/es_ancora-ud-test.conllu
[Line 3954 Sent 3LB-CAST-dc10-2-s10 Node 26]: [L3 Warning fixed-gap] Gaps in fixed expression [26, 28] 'de * todo'
[Line 9215 Sent 3LB-CAST-t6-4-s10 Node 21]: [L3 Warning fixed-gap] Gaps in fixed expression [21, 23, 24] 'a * margen de'
[Line 10241 Sent CESS-CAST-A-20000123-15801-s6 Node 38]: [L3 Warning fixed-gap] Gaps in fixed expression [38, 40] 'a * menos'
[Line 11248 Sent CESS-CAST-A-20000213-10585-s6 Node 21]: [L3 Warning fixed-gap] Gaps in fixed expression [21, 23, 24] 'a * final en'
[Line 11928 Sent CESS-CAST-A-20000221-16816-s3 Node 25]: [L3 Warning fixed-gap] Gaps in fixed expression [25, 27, 28] 'a * igual que'
[Line 11944 Sent CESS-CAST-A-20000221-16816-s3 Node 39]: [L3 Warning fixed-gap] Gaps in fixed expression [39, 41] 'a * respecto'
[Line 12866 Sent CESS-CAST-A-20000317-13928-s3 Node 16]: [L3 Warning fixed-gap] Gaps in fixed expression [16, 18, 19] 'de * orden de'
[Line 14297 Sent CESS-CAST-A-20000415-12000-s7 Node 39]: [L3 Warning fixed-gap] Gaps in fixed expression [39, 41] 'a * respecto'
[Line 23174 Sent CESS-CAST-AA-20000204-3836-s18 Node 17]: [L3 Warning fixed-gap] Gaps in fixed expression [17, 19, 20] 'a * tiempo que'
[Line 26533 Sent CESS-CAST-AA-20000307-5124-s3 Node 37]: [L3 Warning fixed-gap] Gaps in fixed expression [37, 39, 40] 'a * margen de'
...suppressing further errors regarding Warning
Warnings: 22
*** PASSED ***
/net/work/people/zeman/unidep/tools/validate.py --lang es --max-err=10 UD_Spanish-AnCora/es_ancora-ud-train.conllu
[Line 28731 Sent 3LB-CAST-a14-3-s5 Node 13]: [L3 Warning fixed-gap] Gaps in fixed expression [13, 15] 'de * todo'
[Line 32508 Sent 3LB-CAST-a20-1-s4 Node 16]: [L3 Warning fixed-gap] Gaps in fixed expression [16, 17, 18, 19, 20, 22] 'a el fin y a * cabo'
[Line 38579 Sent 3LB-CAST-c2-2-s5 Node 18]: [L3 Warning fixed-gap] Gaps in fixed expression [18, 20] 'de * todo'
[Line 60061 Sent 3LB-CAST-dc10-8-s9 Node 1]: [L3 Warning fixed-gap] Gaps in fixed expression [1, 2, 3, 4, 5, 7] 'A el fin y a * cabo'
[Line 80700 Sent 3LB-CAST-n1-15-s6 Node 74]: [L3 Warning fixed-gap] Gaps in fixed expression [74, 76] 'de * todo'
[Line 83555 Sent 3LB-CAST-r2-0-s3 Node 16]: [L3 Warning fixed-gap] Gaps in fixed expression [16, 17, 18, 19, 20, 22] 'a el fin y a * cabo'
[Line 117938 Sent CESS-CAST-A-20000221-17317-s9 Node 51]: [L3 Warning fixed-gap] Gaps in fixed expression [51, 53, 54] 'de * orden de'
[Line 140682 Sent CESS-CAST-A-20000520-16210-s1 Node 23]: [L3 Warning fixed-gap] Gaps in fixed expression [23, 25] 'de * todo'
[Line 140814 Sent CESS-CAST-A-20000520-16210-s4 Node 12]: [L3 Warning fixed-gap] Gaps in fixed expression [12, 14] 'de * todo'
[Line 184810 Sent CESS-CAST-A-20001121-16182-s6 Node 76]: [L3 Warning fixed-gap] Gaps in fixed expression [76, 78] 'de * todo'
...suppressing further errors regarding Warning
Warnings: 17
*** PASSED ***
Validity: 1
(weight=0.0769230769230769) * (score{features}=0.8) = 0.0615384615384615
(weight=0.0769230769230769) * (score{genres}=0.0588235294117647) = 0.00452488687782805
(weight=0.0769230769230769) * (score{lemmas}=0.8) = 0.0615384615384615
(weight=0.256410256410256) * (score{size}=0.916098087018308) = 0.23489694538931
(weight=0.0512820512820513) * (score{split}=1) = 0.0512820512820513
(weight=0.0769230769230769) * (score{tags}=0.8) = 0.0615384615384615
(weight=0.307692307692308) * (score{udapi}=0.975452433958121) = 0.300139210448653
(weight=0.0769230769230769) * (score{udeprels}=0.713513513513514) = 0.0548856548856549
(TOTAL score=0.830344133498881) * (availability=1) * (validity=1) = 0.830344133498881
STARS = 4
UD_Spanish-AnCora 0.830344133498881 4