Log all hyper-parameters to mlflow #529

guybartal · 2024-05-06T00:06:51Z

closes #480

This PR includes

Log all hyper parameters to mlflow
Config refactoring - lowercase for all attributes as those are not constants (match coding conventions)
Print azure ml monitoring URL right after its creation to allow easy monitoring (ctrl + left click)
Fix issues with experiment and job names, allowing azure ml commands open experiment and mlflow runs automatically, while locally we create those manually.
Remove unused experiment settings from .env sample file
Hide warning azureml warnings by using CliV2AnonymousEnvironment as Azure ML environment name
Temporary solution for wrong json format in Q&A Gen step with current CI generation model version by removing all "..." strings from the model response

WIP

Example

example for printing azure ml job monitoring runs:

vscode ➜ /workspaces/rag-experiment-accelerator (guy/fix-mlflow-tags) $ make azureml

»»» 🧩  running on Azure ML...
python3 azureml/pipeline.py  --data_dir ./data   --config_path ./config.json
2024-05-06 08:07:50,635 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: index_name_prefix = surface
2024-05-06 08:07:50,635 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: experiment_name = search types and chunk sizes
2024-05-06 08:07:50,635 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: job_name = search types with different chunk sizes
2024-05-06 08:07:50,635 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: job_description = experimenting with different search types and chunking strategies
2024-05-06 08:07:50,635 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: preprocess = False
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: chunking = {'chunk_size': [1000, 1500], 'overlap_size': [100], 'generate_title': False, 'generate_summary': False, 'override_content_with_summary': False}
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: embedding_models = [{'type': 'sentence-transformer', 'model_name': 'all-mpnet-base-v2'}]
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: ef_construction = [400]
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: ef_search = [400]
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: language = {'analyzers': {'analyzer_name': 'en.microsoft', 'index_analyzer_name': '', 'search_analyzer_name': '', 'char_filters': [], 'tokenizers': [], 'token_filters': []}, 'query_language': 'en-us'}
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: rerank = True
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: rerank_type = crossencoder
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: llm_re_rank_threshold = 3
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: cross_encoder_at_k = 4
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: crossencoder_model = cross-encoder/stsb-roberta-base
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: search_types = ['search_for_match_semantic', 'search_for_manual_hybrid']
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: retrieve_num_of_documents = 5
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: metric_types = ['cosine']
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: azure_oai_chat_deployment_name = gpt-35-turbo
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: azure_oai_eval_deployment_name = gpt-35-turbo
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: openai_temperature = 0
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: search_relevancy_threshold = 0.8
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: data_formats = all
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: eval_data_jsonl_file_path = ./artifacts/eval_data.jsonl
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: chunking_strategy = basic
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: chain_of_thoughts = True
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: hyde = disabled
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: query_expansion = False
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: min_query_expansion_related_question_similarity_score = 90
2024-05-06 08:07:50,636 - DEBUG - rag_experiment_accelerator.config.config - Configuration setting: azure_document_intelligence_model = prebuilt-read
2024-05-06 08:07:54,206 - INFO - __main__ - Starting pipeline for index: surface_p-0_cs-1000_o-100_efc-400_efs-400_sp-0_t-0_s-0_oc-0_all-mpnet-base-v2
Class AutoDeleteSettingSchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Class AutoDeleteConditionSchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Class BaseAutoDeleteSettingSchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Class IntellectualPropertySchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Class ProtectionLevelSchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Class BaseIntellectualPropertySchema: This is an experimental class, and may change at any time. Please see https://aka.ms/azuremlexperimental for more information.
Uploading rag-experiment-accelerator (28.15 MBs): 100%|████████████| 28145635/28145635 [00:28<00:00, 977345.83it/s]


Uploading config.json (< 1 MB): 100%|█████████████████████████████████████████| 1.85k/1.85k [00:00<00:00, 20.6kB/s]


2024-05-06 08:09:20,884 - INFO - __main__ - Pipeline job started...
Index name: surface_p-0_cs-1000_o-100_efc-400_efs-400_sp-0_t-0_s-0_oc-0_all-mpnet-base-v2
Monitoring url: https://ml.azure.com/runs/sleepy_egg_f3n0qvqf5f?wsid=/subscriptions/xxxx-xxxxx-xxxxxxx-xxxxxx/resourcegroups/xxxx/workspaces/xxxx&tid=xxxx-xxxxx-xxxxxxx-xxxxxx
2024-05-06 08:09:20,884 - INFO - __main__ - Starting pipeline for index: surface_p-0_cs-1500_o-100_efc-400_efs-400_sp-0_t-0_s-0_oc-0_all-mpnet-base-v2
2024-05-06 08:09:56,348 - INFO - __main__ - Pipeline job started...
Index name: surface_p-0_cs-1500_o-100_efc-400_efs-400_sp-0_t-0_s-0_oc-0_all-mpnet-base-v2
Monitoring url: https://ml.azure.com/runs/red_box_7sxtp59ftl?wsid=/subscriptions/xxxx-xxxxx-xxxxxxx-xxxxxx/resourcegroups/rg-guyb/workspaces/xxxx/workspaces/xxxx&tid=xxxx-xxxxx-xxxxxxx-xxxxxx

Comparing experiment runs

You can easily compare metrices and hyper parameters between experiment runs using Azure ML / ML Flow UI:

…query expansion

The `expand_to_multiple_questions` flag in the config file was set to `false`. This commit updates the flag to `true` to enable the feature of splitting complex queries into multiple questions. This change allows the Language Model to generate multiple queries and retrieve relevant documents accordingly.

guybartal added 2 commits May 6, 2024 00:05

refactor config and mlflow logging

9c968ab

sort imports

7d0da20

guybartal changed the title ~~Fix mlflow logging~~ Log all hyper-parameters to mlflow May 6, 2024

guybartal added 2 commits May 6, 2024 06:12

fix test_index

ce9a896

remove not needed parameters from mlflow

f92eee4

guybartal requested review from tanya-borisova, ritesh-modi and julia-meshcheryakova May 6, 2024 16:16

guybartal mentioned this pull request May 7, 2024

Add RAGAS evaluation metrics #504

Closed

guybartal added 4 commits May 9, 2024 07:18

fix for weird response which breaks json format "..."

340235f

fix missing arguments

d76ff43

fix issue in similarity score for filtering non related questions on …

e257966

…query expansion

Merge branch 'development' into guy/fix-mlflow-tags

aff8a56

guybartal mentioned this pull request May 9, 2024

Separate each search type into its own mlflow run to allow comparison using Azure ML / mlflow #540

Open

guybartal enabled auto-merge May 9, 2024 11:02

guybartal disabled auto-merge May 9, 2024 11:02

guybartal and others added 8 commits May 9, 2024 15:30

fix empty experiment_name field, and fix deep copy issue

1bc3bcf

Merge branch 'development' into guy/fix-mlflow-tags

85fdc0e

change properties to lowercase

02ce3cc

fix bug in title and summary generation

87dc0ea

fix generate title and summary issue

0bb752d

Merge branch 'development' into guy/fix-mlflow-tags

5d464e2

Merge branch 'development' into guy/fix-mlflow-tags

0818816

guybartal self-assigned this May 16, 2024

guybartal changed the base branch from development to prerelease May 18, 2024 12:48

guybartal merged commit 618007a into prerelease May 18, 2024
3 checks passed

guybartal mentioned this pull request May 18, 2024

Prerelease #556

Draft

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Log all hyper-parameters to mlflow #529

Log all hyper-parameters to mlflow #529

guybartal commented May 6, 2024 •

edited

Log all hyper-parameters to mlflow #529

Log all hyper-parameters to mlflow #529

Conversation

guybartal commented May 6, 2024 • edited

This PR includes

WIP

Example

Comparing experiment runs

guybartal commented May 6, 2024 •

edited