KaiquanMah commited on
Commit
9bf4b87
·
verified ·
1 Parent(s): 6d6ccd8

Upload 2 files

Browse files
predictions-zeroshot/round8-fewshot-1exampleeach-k-knownintent-restoos-100oossentences/classification_report_llama3.2_3b_banking_full.txt ADDED
@@ -0,0 +1,388 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Run: 1notoos
2
+ Overall Accuracy: 100.00%
3
+ Overall F1: 100.00%
4
+ precision recall f1-score support
5
+
6
+ oos 1.00 1.00 1.00 100
7
+
8
+ accuracy 1.00 100
9
+ macro avg 1.00 1.00 1.00 100
10
+ weighted avg 1.00 1.00 1.00 100
11
+
12
+ predicted
13
+ oos 100
14
+ Name: count, dtype: int64
15
+
16
+ Classification report saved to classification_report_llama3.2:3b_banking1notoos.txt
17
+
18
+
19
+ ================================================================================
20
+
21
+
22
+ Run: 5notoos
23
+ Overall Accuracy: 15.00%
24
+ Overall F1: 26.09%
25
+ precision recall f1-score support
26
+
27
+ oos 1.00 0.15 0.26 100
28
+ virtual_card_not_working 0.00 nan 0.00 0
29
+ why_verify_identity 0.00 nan 0.00 0
30
+
31
+ accuracy 0.15 100
32
+ macro avg 0.33 0.15 0.09 100
33
+ weighted avg 1.00 0.15 0.26 100
34
+
35
+ predicted
36
+ virtual_card_not_working 83
37
+ oos 15
38
+ why_verify_identity 2
39
+ Name: count, dtype: int64
40
+
41
+ Classification report saved to classification_report_llama3.2:3b_banking5notoos.txt
42
+
43
+
44
+
45
+
46
+
47
+
48
+ ================================================================================
49
+
50
+
51
+
52
+
53
+ Run: 10notoos
54
+ Overall Accuracy: 43.00%
55
+ Overall F1: 60.14%
56
+ precision recall f1-score support
57
+
58
+ oos 1.00 0.43 0.60 100
59
+ unable_to_verify_identity 0.00 nan 0.00 0
60
+ virtual_card_not_working 0.00 nan 0.00 0
61
+ transfer_timing 0.00 nan 0.00 0
62
+ verify_my_identity 0.00 nan 0.00 0
63
+
64
+ accuracy 0.43 100
65
+ macro avg 0.20 0.43 0.12 100
66
+ weighted avg 1.00 0.43 0.60 100
67
+
68
+ predicted
69
+ oos 43
70
+ virtual_card_not_working 22
71
+ verify_my_identity 21
72
+ transfer_timing 9
73
+ unable_to_verify_identity 5
74
+ Name: count, dtype: int64
75
+
76
+ Classification report saved to classification_report_llama3.2:3b_banking10notoos.txt
77
+
78
+
79
+ ================================================================================
80
+
81
+
82
+ Run: 15notoos
83
+ Overall Accuracy: 4.00%
84
+ Overall F1: 7.69%
85
+ precision recall f1-score support
86
+
87
+ unable_to_verify_identity 0.00 nan 0.00 0
88
+ virtual_card_not_working 0.00 nan 0.00 0
89
+ topping_up_by_card 0.00 nan 0.00 0
90
+ transfer_timing 0.00 nan 0.00 0
91
+ oos 1.00 0.04 0.08 100
92
+ verify_my_identity 0.00 nan 0.00 0
93
+
94
+ accuracy 0.04 100
95
+ macro avg 0.17 0.04 0.01 100
96
+ weighted avg 1.00 0.04 0.08 100
97
+
98
+ predicted
99
+ virtual_card_not_working 62
100
+ topping_up_by_card 23
101
+ unable_to_verify_identity 5
102
+ transfer_timing 4
103
+ oos 4
104
+ verify_my_identity 2
105
+ Name: count, dtype: int64
106
+
107
+ Classification report saved to classification_report_llama3.2:3b_banking15notoos.txt
108
+
109
+
110
+ ================================================================================
111
+
112
+
113
+ Run: 20notoos
114
+ Overall Accuracy: 13.00%
115
+ Overall F1: 23.01%
116
+ precision recall f1-score support
117
+
118
+ topping_up_by_card 0.00 nan 0.00 0
119
+ oos 1.00 0.13 0.23 100
120
+ virtual_card_not_working 0.00 nan 0.00 0
121
+ unable_to_verify_identity 0.00 nan 0.00 0
122
+ top_up_failed 0.00 nan 0.00 0
123
+ transfer_into_account 0.00 nan 0.00 0
124
+ top_up_by_card_charge 0.00 nan 0.00 0
125
+ verify_my_identity 0.00 nan 0.00 0
126
+ transfer_timing 0.00 nan 0.00 0
127
+
128
+ accuracy 0.13 100
129
+ macro avg 0.11 0.13 0.03 100
130
+ weighted avg 1.00 0.13 0.23 100
131
+
132
+ predicted
133
+ topping_up_by_card 49
134
+ virtual_card_not_working 19
135
+ oos 13
136
+ unable_to_verify_identity 12
137
+ top_up_by_card_charge 2
138
+ verify_my_identity 2
139
+ top_up_failed 1
140
+ transfer_into_account 1
141
+ transfer_timing 1
142
+ Name: count, dtype: int64
143
+
144
+ Classification report saved to classification_report_llama3.2:3b_banking20notoos.txt
145
+
146
+
147
+
148
+ ================================================================================
149
+
150
+
151
+ Run: 30notoos
152
+ Overall Accuracy: 10.00%
153
+ Overall F1: 18.18%
154
+ precision recall f1-score support
155
+
156
+ supported_cards_and_currencies 0.00 nan 0.00 0
157
+ topping_up_by_card 0.00 nan 0.00 0
158
+ pin_blocked 0.00 nan 0.00 0
159
+ unable_to_verify_identity 0.00 nan 0.00 0
160
+ virtual_card_not_working 0.00 nan 0.00 0
161
+ verify_my_identity 0.00 nan 0.00 0
162
+ oos 1.00 0.10 0.18 100
163
+ top_up_failed 0.00 nan 0.00 0
164
+ top_up_by_card_charge 0.00 nan 0.00 0
165
+ terminate_account 0.00 nan 0.00 0
166
+
167
+ accuracy 0.10 100
168
+ macro avg 0.10 0.10 0.02 100
169
+ weighted avg 1.00 0.10 0.18 100
170
+
171
+ predicted
172
+ topping_up_by_card 47
173
+ pin_blocked 16
174
+ oos 10
175
+ virtual_card_not_working 9
176
+ supported_cards_and_currencies 5
177
+ unable_to_verify_identity 4
178
+ verify_my_identity 4
179
+ top_up_failed 2
180
+ top_up_by_card_charge 2
181
+ terminate_account 1
182
+ Name: count, dtype: int64
183
+
184
+ Classification report saved to classification_report_llama3.2:3b_banking30notoos.txt
185
+
186
+
187
+
188
+ ================================================================================
189
+
190
+
191
+ Run: 35notoos
192
+ Overall Accuracy: 2.00%
193
+ Overall F1: 3.92%
194
+ precision recall f1-score support
195
+
196
+ order_physical_card 0.00 nan 0.00 0
197
+ passcode_forgotten 0.00 nan 0.00 0
198
+ topping_up_by_card 0.00 nan 0.00 0
199
+ pin_blocked 0.00 nan 0.00 0
200
+ verify_my_identity 0.00 nan 0.00 0
201
+ virtual_card_not_working 0.00 nan 0.00 0
202
+ oos 1.00 0.02 0.04 100
203
+ pending_card_payment 0.00 nan 0.00 0
204
+ lost_or_stolen_phone 0.00 nan 0.00 0
205
+
206
+ accuracy 0.02 100
207
+ macro avg 0.11 0.02 0.00 100
208
+ weighted avg 1.00 0.02 0.04 100
209
+
210
+ predicted
211
+ order_physical_card 71
212
+ passcode_forgotten 8
213
+ virtual_card_not_working 8
214
+ pin_blocked 5
215
+ topping_up_by_card 3
216
+ oos 2
217
+ verify_my_identity 1
218
+ pending_card_payment 1
219
+ lost_or_stolen_phone 1
220
+ Name: count, dtype: int64
221
+
222
+ Classification report saved to classification_report_llama3.2:3b_banking35notoos.txt
223
+
224
+
225
+
226
+
227
+
228
+ ================================================================================
229
+
230
+
231
+ Run: 40notoos
232
+ Overall Accuracy: 2.00%
233
+ Overall F1: 3.92%
234
+ precision recall f1-score support
235
+
236
+ getting_virtual_card 0.00 nan 0.00 0
237
+ get_physical_card 0.00 nan 0.00 0
238
+ order_physical_card 0.00 nan 0.00 0
239
+ verify_my_identity 0.00 nan 0.00 0
240
+ unable_to_verify_identity 0.00 nan 0.00 0
241
+ oos 1.00 0.02 0.04 100
242
+ virtual_card_not_working 0.00 nan 0.00 0
243
+ lost_or_stolen_card 0.00 nan 0.00 0
244
+
245
+ accuracy 0.02 100
246
+ macro avg 0.12 0.02 0.00 100
247
+ weighted avg 1.00 0.02 0.04 100
248
+
249
+ predicted
250
+ get_physical_card 67
251
+ getting_virtual_card 17
252
+ order_physical_card 9
253
+ oos 2
254
+ virtual_card_not_working 2
255
+ verify_my_identity 1
256
+ unable_to_verify_identity 1
257
+ lost_or_stolen_card 1
258
+ Name: count, dtype: int64
259
+
260
+ Classification report saved to classification_report_llama3.2:3b_banking40notoos.txt
261
+
262
+
263
+
264
+
265
+ ================================================================================
266
+
267
+ Run: 50notoos
268
+ Overall Accuracy: 4.00%
269
+ Overall F1: 7.69%
270
+ precision recall f1-score support
271
+
272
+ order_physical_card 0.00 nan 0.00 0
273
+ get_physical_card 0.00 nan 0.00 0
274
+ getting_virtual_card 0.00 nan 0.00 0
275
+ direct_debit_payment_not_recognised 0.00 nan 0.00 0
276
+ edit_personal_details 0.00 nan 0.00 0
277
+ verify_my_identity 0.00 nan 0.00 0
278
+ pin_blocked 0.00 nan 0.00 0
279
+ oos 1.00 0.04 0.08 100
280
+ virtual_card_not_working 0.00 nan 0.00 0
281
+ lost_or_stolen_card 0.00 nan 0.00 0
282
+
283
+ accuracy 0.04 100
284
+ macro avg 0.10 0.04 0.01 100
285
+ weighted avg 1.00 0.04 0.08 100
286
+
287
+ predicted
288
+ order_physical_card 43
289
+ get_physical_card 25
290
+ getting_virtual_card 15
291
+ edit_personal_details 6
292
+ oos 4
293
+ pin_blocked 2
294
+ virtual_card_not_working 2
295
+ direct_debit_payment_not_recognised 1
296
+ verify_my_identity 1
297
+ lost_or_stolen_card 1
298
+ Name: count, dtype: int64
299
+
300
+ Classification report saved to classification_report_llama3.2:3b_banking50notoos.txt
301
+
302
+
303
+ ================================================================================
304
+
305
+ Run: 60notoos
306
+ Overall Accuracy: 2.00%
307
+ Overall F1: 3.92%
308
+ precision recall f1-score support
309
+
310
+ order_physical_card 0.00 nan 0.00 0
311
+ getting_spare_card 0.00 nan 0.00 0
312
+ getting_virtual_card 0.00 nan 0.00 0
313
+ get_physical_card 0.00 nan 0.00 0
314
+ card_swallowed 0.00 nan 0.00 0
315
+ change_pin 0.00 nan 0.00 0
316
+ contactless_not_working 0.00 nan 0.00 0
317
+ verify_my_identity 0.00 nan 0.00 0
318
+ unable_to_verify_identity 0.00 nan 0.00 0
319
+ oos 1.00 0.02 0.04 100
320
+ pin_blocked 0.00 nan 0.00 0
321
+ virtual_card_not_working 0.00 nan 0.00 0
322
+ lost_or_stolen_card 0.00 nan 0.00 0
323
+
324
+ accuracy 0.02 100
325
+ macro avg 0.08 0.02 0.00 100
326
+ weighted avg 1.00 0.02 0.04 100
327
+
328
+ predicted
329
+ order_physical_card 36
330
+ getting_virtual_card 35
331
+ getting_spare_card 7
332
+ get_physical_card 7
333
+ contactless_not_working 5
334
+ card_swallowed 2
335
+ oos 2
336
+ verify_my_identity 1
337
+ change_pin 1
338
+ unable_to_verify_identity 1
339
+ pin_blocked 1
340
+ virtual_card_not_working 1
341
+ lost_or_stolen_card 1
342
+ Name: count, dtype: int64
343
+
344
+ Classification report saved to classification_report_llama3.2:3b_banking60notoos.txt
345
+
346
+
347
+ ================================================================================
348
+
349
+ Run: 70notoos
350
+ Overall Accuracy: 2.00%
351
+ Overall F1: 3.92%
352
+ precision recall f1-score support
353
+
354
+ card_acceptance 0.00 nan 0.00 0
355
+ order_physical_card 0.00 nan 0.00 0
356
+ card_linking 0.00 nan 0.00 0
357
+ card_arrival 0.00 nan 0.00 0
358
+ card_not_working 0.00 nan 0.00 0
359
+ edit_personal_details 0.00 nan 0.00 0
360
+ contactless_not_working 0.00 nan 0.00 0
361
+ unable_to_verify_identity 0.00 nan 0.00 0
362
+ Refund_not_showing_up 0.00 nan 0.00 0
363
+ oos 1.00 0.02 0.04 100
364
+ lost_or_stolen_card 0.00 nan 0.00 0
365
+
366
+ accuracy 0.02 100
367
+ macro avg 0.09 0.02 0.00 100
368
+ weighted avg 1.00 0.02 0.04 100
369
+
370
+ predicted
371
+ order_physical_card 36
372
+ card_acceptance 23
373
+ card_arrival 20
374
+ card_not_working 9
375
+ edit_personal_details 4
376
+ card_linking 2
377
+ oos 2
378
+ contactless_not_working 1
379
+ unable_to_verify_identity 1
380
+ Refund_not_showing_up 1
381
+ lost_or_stolen_card 1
382
+ Name: count, dtype: int64
383
+
384
+ Classification report saved to classification_report_llama3.2:3b_banking70notoos.txt
385
+
386
+
387
+
388
+ ================================================================================
predictions-zeroshot/round8-fewshot-1exampleeach-k-knownintent-restoos-100oossentences/classification_report_llama3.2_3b_stackoverflow_full.txt ADDED
@@ -0,0 +1,299 @@
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ Run: 1notoos
2
+ Overall Accuracy: 100.00%
3
+ Overall F1: 100.00%
4
+ precision recall f1-score support
5
+
6
+ oos 1.00 1.00 1.00 100
7
+
8
+ accuracy 1.00 100
9
+ macro avg 1.00 1.00 1.00 100
10
+ weighted avg 1.00 1.00 1.00 100
11
+
12
+ predicted
13
+ oos 100
14
+ Name: count, dtype: int64
15
+
16
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow1notoos.txt
17
+
18
+
19
+
20
+ ================================================================================
21
+
22
+
23
+
24
+ Run: 2notoos
25
+ Overall Accuracy: 94.00%
26
+ Overall F1: 96.91%
27
+ precision recall f1-score support
28
+
29
+ oos 1.00 0.94 0.97 100
30
+ magento 0.00 nan 0.00 0
31
+
32
+ accuracy 0.94 100
33
+ macro avg 0.50 0.94 0.48 100
34
+ weighted avg 1.00 0.94 0.97 100
35
+
36
+ predicted
37
+ oos 94
38
+ magento 6
39
+ Name: count, dtype: int64
40
+
41
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow2notoos.txt
42
+
43
+ ================================================================================
44
+
45
+ Run: 4notoos
46
+ Overall Accuracy: 81.00%
47
+ Overall F1: 89.50%
48
+ precision recall f1-score support
49
+
50
+ oos 1.00 0.81 0.90 100
51
+ linq 0.00 nan 0.00 0
52
+ drupal 0.00 nan 0.00 0
53
+ magento 0.00 nan 0.00 0
54
+
55
+ accuracy 0.81 100
56
+ macro avg 0.25 0.81 0.22 100
57
+ weighted avg 1.00 0.81 0.90 100
58
+
59
+ predicted
60
+ oos 81
61
+ linq 8
62
+ magento 8
63
+ drupal 3
64
+ Name: count, dtype: int64
65
+
66
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow4notoos.txt
67
+
68
+
69
+ ================================================================================
70
+
71
+ Run: 6notoos
72
+ Overall Accuracy: 25.00%
73
+ Overall F1: 40.00%
74
+ precision recall f1-score support
75
+
76
+ magento 0.00 nan 0.00 0
77
+ oos 1.00 0.25 0.40 100
78
+ linq 0.00 nan 0.00 0
79
+ drupal 0.00 nan 0.00 0
80
+ ajax 0.00 nan 0.00 0
81
+
82
+ accuracy 0.25 100
83
+ macro avg 0.20 0.25 0.08 100
84
+ weighted avg 1.00 0.25 0.40 100
85
+
86
+ predicted
87
+ magento 53
88
+ oos 25
89
+ drupal 14
90
+ linq 7
91
+ ajax 1
92
+ Name: count, dtype: int64
93
+
94
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow6notoos.txt
95
+
96
+
97
+ ================================================================================
98
+
99
+ Run: 8notoos
100
+ Overall Accuracy: 17.00%
101
+ Overall F1: 29.06%
102
+ precision recall f1-score support
103
+
104
+ magento 0.00 nan 0.00 0
105
+ drupal 0.00 nan 0.00 0
106
+ linq 0.00 nan 0.00 0
107
+ ajax 0.00 nan 0.00 0
108
+ oos 1.00 0.17 0.29 100
109
+ sharepoint 0.00 nan 0.00 0
110
+
111
+ accuracy 0.17 100
112
+ macro avg 0.17 0.17 0.05 100
113
+ weighted avg 1.00 0.17 0.29 100
114
+
115
+ predicted
116
+ magento 46
117
+ oos 17
118
+ sharepoint 12
119
+ linq 11
120
+ drupal 10
121
+ ajax 4
122
+ Name: count, dtype: int64
123
+
124
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow8notoos.txt
125
+
126
+
127
+ ================================================================================
128
+
129
+ Run: 10notoos
130
+ Overall Accuracy: 61.00%
131
+ Overall F1: 75.78%
132
+ precision recall f1-score support
133
+
134
+ oos 1.00 0.61 0.76 100
135
+ sharepoint 0.00 nan 0.00 0
136
+ linq 0.00 nan 0.00 0
137
+ drupal 0.00 nan 0.00 0
138
+ magento 0.00 nan 0.00 0
139
+ ajax 0.00 nan 0.00 0
140
+
141
+ accuracy 0.61 100
142
+ macro avg 0.17 0.61 0.13 100
143
+ weighted avg 1.00 0.61 0.76 100
144
+
145
+ predicted
146
+ oos 61
147
+ sharepoint 21
148
+ linq 7
149
+ magento 6
150
+ ajax 3
151
+ drupal 2
152
+ Name: count, dtype: int64
153
+
154
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow10notoos.txt
155
+
156
+
157
+ ================================================================================
158
+
159
+
160
+ Run: 12notoos
161
+ Overall Accuracy: 11.00%
162
+ Overall F1: 19.82%
163
+ precision recall f1-score support
164
+
165
+ magento 0.00 nan 0.00 0
166
+ sharepoint 0.00 nan 0.00 0
167
+ oos 1.00 0.11 0.20 100
168
+ drupal 0.00 nan 0.00 0
169
+ bash 0.00 nan 0.00 0
170
+ linq 0.00 nan 0.00 0
171
+ ajax 0.00 nan 0.00 0
172
+
173
+ accuracy 0.11 100
174
+ macro avg 0.14 0.11 0.03 100
175
+ weighted avg 1.00 0.11 0.20 100
176
+
177
+ predicted
178
+ sharepoint 39
179
+ magento 33
180
+ oos 11
181
+ drupal 8
182
+ bash 4
183
+ linq 3
184
+ ajax 2
185
+ Name: count, dtype: int64
186
+
187
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow12notoos.txt
188
+
189
+
190
+ ================================================================================
191
+
192
+ Run: 14notoos
193
+ Overall Accuracy: 8.00%
194
+ Overall F1: 14.81%
195
+ precision recall f1-score support
196
+
197
+ visual-studio 0.00 nan 0.00 0
198
+ magento 0.00 nan 0.00 0
199
+ linq 0.00 nan 0.00 0
200
+ oos 1.00 0.08 0.15 100
201
+ drupal 0.00 nan 0.00 0
202
+ sharepoint 0.00 nan 0.00 0
203
+ cocoa 0.00 nan 0.00 0
204
+ ajax 0.00 nan 0.00 0
205
+ bash 0.00 nan 0.00 0
206
+
207
+ accuracy 0.08 100
208
+ macro avg 0.11 0.08 0.02 100
209
+ weighted avg 1.00 0.08 0.15 100
210
+
211
+ predicted
212
+ visual-studio 48
213
+ sharepoint 15
214
+ magento 13
215
+ oos 8
216
+ drupal 6
217
+ linq 4
218
+ cocoa 4
219
+ ajax 1
220
+ bash 1
221
+ Name: count, dtype: int64
222
+
223
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow14notoos.txt
224
+
225
+
226
+ ================================================================================
227
+
228
+ Run: 16notoos
229
+ Overall Accuracy: 7.00%
230
+ Overall F1: 13.08%
231
+ precision recall f1-score support
232
+
233
+ visual-studio 0.00 nan 0.00 0
234
+ drupal 0.00 nan 0.00 0
235
+ cocoa 0.00 nan 0.00 0
236
+ ajax 0.00 nan 0.00 0
237
+ oos 1.00 0.07 0.13 100
238
+ sharepoint 0.00 nan 0.00 0
239
+ magento 0.00 nan 0.00 0
240
+ excel 0.00 nan 0.00 0
241
+ bash 0.00 nan 0.00 0
242
+
243
+ accuracy 0.07 100
244
+ macro avg 0.11 0.07 0.01 100
245
+ weighted avg 1.00 0.07 0.13 100
246
+
247
+ predicted
248
+ drupal 52
249
+ cocoa 19
250
+ oos 7
251
+ visual-studio 6
252
+ sharepoint 6
253
+ ajax 5
254
+ magento 2
255
+ excel 2
256
+ bash 1
257
+ Name: count, dtype: int64
258
+
259
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow16notoos.txt
260
+
261
+
262
+ ================================================================================
263
+
264
+ Run: 18notoos
265
+ Overall Accuracy: 3.00%
266
+ Overall F1: 5.83%
267
+ precision recall f1-score support
268
+
269
+ apache 0.00 nan 0.00 0
270
+ drupal 0.00 nan 0.00 0
271
+ excel 0.00 nan 0.00 0
272
+ ajax 0.00 nan 0.00 0
273
+ visual-studio 0.00 nan 0.00 0
274
+ oos 1.00 0.03 0.06 100
275
+ sharepoint 0.00 nan 0.00 0
276
+ cocoa 0.00 nan 0.00 0
277
+ magento 0.00 nan 0.00 0
278
+
279
+ accuracy 0.03 100
280
+ macro avg 0.11 0.03 0.01 100
281
+ weighted avg 1.00 0.03 0.06 100
282
+
283
+ predicted
284
+ drupal 41
285
+ apache 20
286
+ excel 17
287
+ sharepoint 6
288
+ ajax 4
289
+ visual-studio 4
290
+ oos 3
291
+ cocoa 3
292
+ magento 2
293
+ Name: count, dtype: int64
294
+
295
+ Classification report saved to classification_report_llama3.2:3b_stackoverflow18notoos.txt
296
+
297
+
298
+
299
+ ================================================================================