Fix report generation when multiple datasets are involved #221

mariamedp · 2024-12-12T13:15:07Z

Fixed name in report generation files, was causing the reports to be overriden instead of creating one for each eval flow executed.
Added small refactoring to reduce code duplications.

After the change, files for all eval flows executed are created. Eval flow name is used now in the file name.
Example with web classification use case:

…s-multiple-datasets

mariamedp · 2024-12-17T15:33:29Z

llmops/common/prompt_eval.py

-            combined_metrics_df.to_csv(
-                f"{report_dir}/{run_dataset.name}_metrics.csv"
-                )
+            fname_base = f"{report_dir}/{evaluator.name}"


This is the fix. evaluator.name is used for the file name instead of run_dataset.name.

This is because run_dataset is only a temporary variable that is used in a loop some lines above to iterate through all existing datasets. Hence, run_dataset here always has the same value since it's after the loop has run and has iterated through the whole list.

mariamedp added 6 commits July 29, 2024 13:29

workflow_dispatch in web classification

72c8ade

workflow_dispatch in all usecases CI workflows

8ac1c85

Merge remote-tracking branch 'template/main' into fix/download-result…

bfe08b9

…s-multiple-datasets

Fix flow eval artifact name

789367a

workflow_dispatch webclass ci

43e3081

Execution in azure

e31e8f8

mariamedp commented Dec 17, 2024

View reviewed changes

mariamedp marked this pull request as ready for review December 17, 2024 15:33

mariamedp requested review from ritesh-modi and noraabiakar December 17, 2024 15:33

mariamedp added 2 commits December 17, 2024 19:06

Cleanup

cd0434f

Cleanup

d4cb12d

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix report generation when multiple datasets are involved #221

Fix report generation when multiple datasets are involved #221

mariamedp commented Dec 12, 2024 •

edited

Loading

mariamedp Dec 17, 2024

Fix report generation when multiple datasets are involved #221

Are you sure you want to change the base?

Fix report generation when multiple datasets are involved #221

Conversation

mariamedp commented Dec 12, 2024 • edited Loading

mariamedp Dec 17, 2024

Choose a reason for hiding this comment

mariamedp commented Dec 12, 2024 •

edited

Loading