
Add support for shortening dimensions for AOAI embeddings #502

Open

tomconte wants to merge 9 commits into development

Conversation

@tomconte (Member) commented Apr 22, 2024

This is a proposal to add support for shortening dimensions when using the newer Azure OpenAI embedding models, e.g. text-embedding-3-large.

Also includes a fix for the AML experiment name in eval.py (it was using the index prefix instead of the experiment name).

Fixes #503

Note: I believe CI fails because of #452, but I did run the CI steps manually in the dev container.
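For reference, shortening works by passing the optional dimensions parameter to the embeddings API. Below is a minimal sketch using the openai Python SDK against an Azure OpenAI deployment; the endpoint, key, API version and deployment name are placeholders, not values from this repository.

    # Minimal sketch, not code from this PR: requesting shortened embeddings
    # from Azure OpenAI via the `dimensions` parameter of the openai SDK.
    from openai import AzureOpenAI

    client = AzureOpenAI(
        azure_endpoint="https://my-resource.openai.azure.com",  # placeholder
        api_key="<key>",                                         # placeholder
        api_version="2024-02-01",  # illustrative; use a version that supports `dimensions`
    )

    response = client.embeddings.create(
        model="text-embedding-3-large",  # deployment name (placeholder)
        input="hello world",
        dimensions=256,  # shorten the native 3072-dim vector down to 256 dimensions
    )

    print(len(response.data[0].embedding))  # 256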

@@ -83,5 +84,5 @@ def from_index_name(cls, index_name: str, config: Any) -> "IndexConfig":
     generate_title=bool(int(cls.__get_index_value(values[7]))),
     generate_summary=bool(int(cls.__get_index_value(values[8]))),
     override_content_with_summary=bool(int(cls.__get_index_value(values[9]))),
-    embedding_model=config._find_embedding_model_by_name(values[10].strip()),
+    embedding_model=config._find_embedding_model_by_name(values[11].strip()),
Member

Why isn't the dimension being returned as part of the IndexConfig?
Does this break "round tripping"?

Member Author

The dimensions are already part of the embedding_model; they are read from the config file. I only added them to the index name so that we can have two indices that use the same model but a different number of dimensions, e.g.

surface_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-0_s-0_oc-0_dim-3072_text-3-large

and

surface_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-0_s-0_oc-0_dim-256_text-3-large
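For illustration, here is a small sketch (a hypothetical helper, not the repository's actual from_index_name logic) of how the dim-<n> segment can be read back out of such a name:

    # Hypothetical sketch, not the repository's actual parser: recover the
    # dimension count from an index name containing a "dim-<n>" segment.
    def parse_dimensions(index_name: str) -> int | None:
        for part in index_name.split("_"):
            if part.startswith("dim-"):
                return int(part.removeprefix("dim-"))
        return None

    name = "surface_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-0_s-0_oc-0_dim-3072_text-3-large"
    print(parse_dimensions(name))  # 3072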

@@ -774,7 +774,7 @@ def evaluate_prompts(
         os.path.join(config.EVAL_DATA_LOCATION, f"sum_{name_suffix}.csv")
     )
     draw_hist_df(sum_df, run_id, client)
-    generate_metrics(config.INDEX_NAME_PREFIX, run_id, client)
+    generate_metrics(config.EXPERIMENT_NAME, run_id, client)
Member

What's the reason for this change?
Is it related to the shortened dimensions, or something else?

Member

Is there a bug related to this change?

@tomconte (Member, Author) commented Apr 25, 2024

This is for an unrelated issue I had when running the process with my config.

The experiment name is set like this in evaluation.py:

    mlflow.set_experiment(config.EXPERIMENT_NAME)

Later, generate_metrics() in eval.py complained that the experiment did not exist, I believe because INDEX_NAME_PREFIX was being passed instead of EXPERIMENT_NAME.
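A minimal sketch of the failure mode, assuming the standard mlflow tracking client (the names mirror the CI run shown later in this thread: "baseline" is the experiment name, "ci" the index prefix):

    # Minimal sketch of the failure mode, assuming the mlflow tracking client.
    # get_experiment_by_name() returns None when no experiment has that name,
    # so looking it up by the index prefix instead of the experiment name fails.
    import mlflow
    from mlflow.tracking import MlflowClient

    mlflow.set_experiment("baseline")  # creates/selects the experiment by name
    client = MlflowClient()

    print(client.get_experiment_by_name("baseline"))  # <Experiment ...>
    print(client.get_experiment_by_name("ci"))        # None -> dict(None) raises TypeError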

Member Author

> Is there a bug related to this change?

I can't see one, no. I can remove this commit until the issue is confirmed.

Member

It's probably fine. I wanted to check that fixing this wouldn't cause its own issues (e.g. that the change in name wouldn't have a back-compat issue for someone using it today). I also tend to think that it's better to avoid too many "while I'm here" bug fixes, unless they're already logged somewhere.

That said, if @ritesh-modi is happy with this, I'm also happy. :)

@tomconte (Member, Author) commented Apr 26, 2024

To clarify, here is the output of the final step, using the CI configuration, on the current development branch:

vscode ➜ /workspaces/rag-experiment-accelerator (development) $ python 04_evaluation.py --data_dir='data-ci' --config_path=.github/workflows/config.json
2024/04/26 09:19:22 INFO mlflow.tracking.fluent: Experiment with name 'baseline' does not exist. Creating a new experiment.
2024-04-26 09:19:24,632 - INFO - rag_experiment_accelerator.run.evaluation - Evaluating Index: ci_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-1_s-1_oc-0_all-mpnet-base-v2
2024-04-26 09:19:24,643 - INFO - rag_experiment_accelerator.artifact.handlers.artifact_handler - Loading artifacts from path: .github/workflows/artifacts/query_data/eval_output_ci_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-1_s-1_oc-0_all-mpnet-base-v2_baseline_baseline_job.jsonl
2024-04-26 09:19:24,645 - INFO - rag_experiment_accelerator.artifact.handlers.artifact_handler - Loaded 6 artifacts from path: .github/workflows/artifacts/query_data/eval_output_ci_p-0_cs-1000_o-200_efc-400_efs-400_sp-0_t-1_s-1_oc-0_all-mpnet-base-v2_baseline_baseline_job.jsonl
Traceback (most recent call last):
  File "/workspaces/rag-experiment-accelerator/04_evaluation.py", line 35, in <module>
    run(
  File "/workspaces/rag-experiment-accelerator/rag_experiment_accelerator/run/evaluation.py", line 36, in run
    eval.evaluate_prompts(
  File "/workspaces/rag-experiment-accelerator/rag_experiment_accelerator/evaluation/eval.py", line 777, in evaluate_prompts
    generate_metrics(config.INDEX_NAME_PREFIX, run_id, client)
  File "/workspaces/rag-experiment-accelerator/rag_experiment_accelerator/evaluation/eval.py", line 377, in generate_metrics
    experiment = dict(client.get_experiment_by_name(experiment_name))
                 ^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
TypeError: 'NoneType' object is not iterable
