Skip to content

Actions: sbintuitions/flexeval

Run tests

Actions

Loading...
Loading

Show workflow options

Create status badge

Loading
397 workflow runs
397 workflow runs

Filter by Event

Filter by Status

Filter by Branch

Filter by Actor

Add separate_reasoning_and_content
Run tests #397: Pull request #124 synchronize by ryosukeiri
January 31, 2025 09:34 9m 4s delete_previous_think_tag
January 31, 2025 09:34 9m 4s
Add separate_reasoning_and_content
Run tests #396: Pull request #124 synchronize by ryosukeiri
January 31, 2025 09:32 9m 0s delete_previous_think_tag
January 31, 2025 09:32 9m 0s
Merge pull request #126 from sbintuitions/fix-repetition-count
Run tests #395: Commit 620d1d5 pushed by ryokan0123
January 31, 2025 07:18 9m 25s main
January 31, 2025 07:18 9m 25s
Fix get_most_repeated_pattern
Run tests #394: Pull request #126 synchronize by butsugiri
January 31, 2025 06:44 9m 52s fix-repetition-count
January 31, 2025 06:44 9m 52s
Fix get_most_repeated_pattern
Run tests #393: Pull request #126 opened by butsugiri
January 31, 2025 06:37 9m 32s fix-repetition-count
January 31, 2025 06:37 9m 32s
Add new metric functions: LLMGEvalScore and ChatLLMGEvalScore
Run tests #392: Pull request #125 synchronize by m-ast
January 31, 2025 06:35 9m 15s feat/geval_score
January 31, 2025 06:35 9m 15s
Add new metric functions: LLMGEvalScore and ChatLLMGEvalScore
Run tests #391: Pull request #125 opened by m-ast
January 31, 2025 05:47 9m 3s feat/geval_score
January 31, 2025 05:47 9m 3s
Add separate_reasoning_and_content
Run tests #390: Pull request #124 synchronize by ryosukeiri
January 31, 2025 05:26 8m 58s delete_previous_think_tag
January 31, 2025 05:26 8m 58s
Add separate_reasoning_and_content
Run tests #389: Pull request #124 synchronize by ryosukeiri
January 31, 2025 05:21 8m 55s delete_previous_think_tag
January 31, 2025 05:21 8m 55s
Add separate_reasoning_and_content
Run tests #388: Pull request #124 synchronize by ryosukeiri
January 30, 2025 09:30 9m 39s delete_previous_think_tag
January 30, 2025 09:30 9m 39s
Add separate_reasoning_and_content
Run tests #387: Pull request #124 synchronize by ryosukeiri
January 30, 2025 09:26 9m 10s delete_previous_think_tag
January 30, 2025 09:26 9m 10s
Add separate_reasoning_and_content
Run tests #386: Pull request #124 opened by ryosukeiri
January 30, 2025 09:14 9m 14s delete_previous_think_tag
January 30, 2025 09:14 9m 14s
Merge pull request #123 from sbintuitions/add_repetition
Run tests #385: Commit 6f71c47 pushed by ryokan0123
January 29, 2025 09:26 14m 48s main
January 29, 2025 09:26 14m 48s
Implement RepetitionCount metric
Run tests #384: Pull request #123 synchronize by ryokan0123
January 29, 2025 03:34 9m 19s add_repetition
January 29, 2025 03:34 9m 19s
Implement RepetitionCount metric
Run tests #383: Pull request #123 synchronize by ryokan0123
January 29, 2025 01:52 9m 13s add_repetition
January 29, 2025 01:52 9m 13s
Implement RepetitionCount metric
Run tests #382: Pull request #123 opened by ryokan0123
January 29, 2025 01:38 14m 59s add_repetition
January 29, 2025 01:38 14m 59s
Merge pull request #122 from sbintuitions/update_bleu
Run tests #381: Commit fe065b6 pushed by ryokan0123
January 16, 2025 07:09 9m 6s main
January 16, 2025 07:09 9m 6s
Set effective_order=True for computing sentence-level bleu
Run tests #380: Pull request #122 synchronize by ryokan0123
January 16, 2025 06:48 9m 3s update_bleu
January 16, 2025 06:48 9m 3s
Set effective_order=True for computing sentence-level bleu
Run tests #379: Pull request #122 opened by ryokan0123
January 16, 2025 06:26 8m 57s update_bleu
January 16, 2025 06:26 8m 57s
Merge pull request #121 from sbintuitions/direct_template
Run tests #378: Commit 8ead327 pushed by ryokan0123
January 15, 2025 00:50 9m 26s main
January 15, 2025 00:50 9m 26s
Simplify prompt_template in configs
Run tests #377: Pull request #121 synchronize by ryokan0123
January 15, 2025 00:02 9m 46s direct_template
January 15, 2025 00:02 9m 46s
Simplify prompt_template in configs
Run tests #376: Pull request #121 synchronize by ryokan0123
January 14, 2025 15:12 9m 25s direct_template
January 14, 2025 15:12 9m 25s
Simplify prompt_template in configs
Run tests #375: Pull request #121 synchronize by ryokan0123
January 14, 2025 14:38 9m 28s direct_template
January 14, 2025 14:38 9m 28s
Simplify prompt_template in configs
Run tests #374: Pull request #121 synchronize by ryokan0123
January 14, 2025 09:23 9m 11s direct_template
January 14, 2025 09:23 9m 11s
Simplify prompt_template in configs
Run tests #373: Pull request #121 synchronize by ryokan0123
January 14, 2025 09:17 9m 27s direct_template
January 14, 2025 09:17 9m 27s