Starting fine-tuning evaluation...
====================================================================================================
Sample 1/40
Positive: 0.8836, Negative: 0.0052
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 2/40
Positive: 0.7760, Negative: -0.0524
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 3/40
Positive: 0.8126, Negative: 0.2996
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 4/40
Positive: 0.7417, Negative: -0.1425
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 5/40
Positive: 0.7156, Negative: 0.3208
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 6/40
Positive: 0.4373, Negative: 0.0201
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 7/40
Positive: 0.6209, Negative: -0.0475
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 8/40
Positive: 0.6389, Negative: -0.0110
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 9/40
Positive: 0.6835, Negative: -0.0322
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 10/40
Positive: 0.7110, Negative: -0.0592
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 11/40
Positive: 0.8333, Negative: 0.3832
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 12/40
Positive: 0.8234, Negative: 0.3786
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 13/40
Positive: 0.6846, Negative: 0.3276
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 14/40
Positive: 0.7523, Negative: 0.1574
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 15/40
Positive: 0.4928, Negative: 0.2356
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 16/40
Positive: 0.8164, Negative: 0.0141
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 17/40
Positive: 0.5266, Negative: 0.0799
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 18/40
Positive: 0.8020, Negative: 0.2003
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 19/40
Positive: 0.7023, Negative: -0.0496
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 20/40
Positive: 0.7398, Negative: -0.0339
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 21/40
Positive: 0.6253, Negative: 0.3056
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 22/40
Positive: 0.4634, Negative: -0.0055
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 23/40
Positive: 0.4318, Negative: 0.3263
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 24/40
Positive: 0.7547, Negative: -0.0265
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 25/40
Positive: 0.6421, Negative: 0.4205
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 26/40
Positive: 0.6076, Negative: 0.2016
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 27/40
Positive: 0.7400, Negative: 0.6232
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 28/40
Positive: 0.6000, Negative: -0.0592
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 29/40
Positive: 0.6903, Negative: 0.5531
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 30/40
Positive: 0.4140, Negative: 0.2203
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 31/40
Positive: 0.6745, Negative: 0.6913
Result: ✗ Wrong
------------------------------------------------------------------------------------------
Sample 32/40
Positive: 0.8263, Negative: 0.4778
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 33/40
Positive: 0.7925, Negative: 0.3103
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 34/40
Positive: 0.7583, Negative: 0.1287
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 35/40
Positive: 0.8496, Negative: 0.1525
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 36/40
Positive: 0.7250, Negative: 0.5422
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 37/40
Positive: 0.8378, Negative: -0.0069
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 38/40
Positive: 0.5784, Negative: 0.2563
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 39/40
Positive: 0.6282, Negative: 0.3806
Result: ✓ Correct
------------------------------------------------------------------------------------------
Sample 40/40
Positive: 0.5763, Negative: 0.3881
Result: ✓ Correct
------------------------------------------------------------------------------------------
====================================================================================================
Pairwise Accuracy: 39/40 = 97.50%
====================================================================================================
Retrieval Task Testing...
Retrieval Test 1:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.7410 ← correct
2. sim=0.1145
3. sim=0.0799
------------------------------------------------------------------------------------------
Retrieval Test 2:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.6482 ← correct
2. sim=0.0599
3. sim=0.0550
------------------------------------------------------------------------------------------
Retrieval Test 3:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.6007 ← correct
2. sim=0.1302
3. sim=0.0835
------------------------------------------------------------------------------------------
Retrieval Test 4:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.8003 ← correct
2. sim=0.1457
3. sim=0.0883
------------------------------------------------------------------------------------------
Retrieval Test 5:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.7396 ← correct
2. sim=0.1031
3. sim=0.0846
------------------------------------------------------------------------------------------
Retrieval Test 6:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.6588 ← correct
2. sim=0.1011
3. sim=0.0529
------------------------------------------------------------------------------------------
Retrieval Test 7:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.8398 ← correct
2. sim=0.1840
3. sim=0.1325
------------------------------------------------------------------------------------------
Retrieval Test 8:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.5624 ← correct
2. sim=0.1171
3. sim=0.1068
------------------------------------------------------------------------------------------
Retrieval Test 9:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.5426 ← correct
2. sim=0.1341
3. sim=0.1120
------------------------------------------------------------------------------------------
Retrieval Test 10:
Correct Document Rank: 1/5
Top-1: ✓
Top-3:
1. sim=0.7565 ← correct
2. sim=0.1497
3. sim=0.1364
------------------------------------------------------------------------------------------
Testing completed! 🚀