
[v2] add similarity_fn in ModelMeta #1759

Open

wants to merge 26 commits into v2.0.0

Conversation

@sam-hey (Contributor) commented Jan 10, 2025

Checklist

  • Run tests locally to make sure nothing is broken using make test.

  • Run the formatter to format the code using make lint.

  • fix: Retrieve name and revision from SentenceTransformers metadata.

  • mv: Move the get_similarity_function method to ModelMeta.

  • fix: Resolve issues with bm25s.

  • feat: Add a max_similarity function to utils.py.

  • feat: Map and load SentenceTransformers similarity functions to MTEB similarity functions.

  • ref: Change max_sim to MaxSim to align with the pylate implementation of similarity_fn_name (lightonai/pylate#77: Update similarity_fn_name = 'cosine' to MaxSim). A sketch of such a max-similarity function follows the checklist.
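For context, a minimal sketch of what such a max-similarity (MaxSim, late-interaction) scoring function could look like; the name max_sim, the tensor shapes, and the torch-based implementation are illustrative and not necessarily the exact code added to utils.py:

import torch

def max_sim(query_embeddings: torch.Tensor, doc_embeddings: torch.Tensor) -> torch.Tensor:
    """Illustrative MaxSim: for each query token, take the best-matching document
    token and sum the scores (late interaction, as in ColBERT/pylate).

    query_embeddings: (num_query_tokens, dim)
    doc_embeddings:   (num_doc_tokens, dim)
    """
    # token-to-token similarity matrix of shape (num_query_tokens, num_doc_tokens)
    token_scores = query_embeddings @ doc_embeddings.T
    # best document token per query token, summed over all query tokens
    return token_scores.max(dim=-1).values.sum()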

@Samoed (Collaborator) left a comment

Overall, I really appreciate your suggestions, but I’m not sure if we should rely on the ModelMeta object for anything during evaluation.

@isaac-chung (Collaborator) left a comment

Added 2 comments as I see that this is meant to be partially complete at this time.

@sam-hey sam-hey changed the base branch from fix_contriever to v2.0.0 January 11, 2025 20:29
@Samoed Samoed changed the title add similarity_fn in ModelMeta [v2] add similarity_fn in ModelMeta Jan 12, 2025
@sam-hey sam-hey marked this pull request as draft January 12, 2025 12:21
@sam-hey (Contributor, author) commented Jan 12, 2025

I made some additional changes, so the PR ended up being a bit larger than originally planned. Please take a look at what I’ve done and share your feedback—I’m looking forward to hearing your thoughts! @Samoed @isaac-chung

Just a heads-up, there’s one failing test: #1777.

Thanks!

@Samoed (Collaborator) left a comment

This currently doesn’t fix the issues with the similarity function, as it focuses only on the retrieval model wrapper. I think the changes should be made in get_model. The model name and revision can be used there, and the type of similarity function can be passed to the loader.

meta = get_model_meta(model_name, revision)
model = meta.load_model(**kwargs)

Also, ModelMeta could be changed to separate the loader and additional loader_kwargs.
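A minimal sketch of the separation being suggested, assuming a simplified ModelMeta; the field and method names here are illustrative rather than mteb's actual definitions:

from dataclasses import dataclass, field
from typing import Any, Callable, Optional

@dataclass
class ModelMeta:
    name: str
    revision: str
    similarity_fn_name: str = "cosine"
    loader: Optional[Callable[..., Any]] = None  # how to construct the model
    loader_kwargs: dict[str, Any] = field(default_factory=dict)  # model-specific extras

    def load_model(self, **overrides: Any) -> Any:
        # name and revision come from the metadata itself, so callers do not repeat them
        kwargs = {**self.loader_kwargs, **overrides}
        return self.loader(self.name, revision=self.revision, **kwargs)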

@sam-hey (Contributor, author) commented Jan 12, 2025

This currently doesn’t fix the issues with the similarity function, as it focuses only on the retrieval model wrapper. I think the changes should be made in get_model. The model name and revision can be used there, and the type of similarity function can be passed to the loader.

meta = get_model_meta(model_name, revision)
model = meta.load_model(**kwargs)

Also ModelMeta can be changed to separate loader and additional loader_kwargs

Yes, you're correct about the changes. Did you notice these changes? I'm using the SentenceTransformers similarity function names and mapping them to the Enum:
https://github.com/embeddings-benchmark/mteb/pull/1759/files#diff-629817399c17e22b713b95ac5146dc9ddd81b3977bef3165c23fa6166534224b

@Samoed (Collaborator) commented Jan 12, 2025

No, I missed that, thanks! But I still think this should be changed so that model_name and revision are not duplicated.

@sam-hey sam-hey marked this pull request as ready for review January 12, 2025 20:09
@KennethEnevoldsen (Contributor)

I am not quite sure of the context of this PR. It would be great to have a short explanation of what the PR is trying to solve. Below are some questions to help get that process started.

fix: Retrieve name and revision from SentenceTransformers metadata.

Can't see where this happens? Why do we want this?

feat: Introduce an Enum to represent all implementations of similarity functions.
feat: Refactor all models to utilize the ScoringFunction Enum.
rm: Eliminate literals used to describe scoring functions.

We generally haven't used enums anywhere else in the repo. Any strong reason to introduce them over the literal?

mv: Move the get_similarity_function method to ModelMeta.
feat: Map and load SentenceTransformers similarity functions to MTEB similarity functions.

Hmm, but why do we need a get_similarity function in the first place? The interface specifies that the model should define the similarity function. Is there anything that I am missing here?

fix: Resolve issues with bm25s.

What issue?

but I don't totally like how ModelMeta is integrated with get_model. For now, all models duplicate model_name and revision. E.g.

I agree. I don't like this either. However, that is quite a large overhaul. We could consider reformatting models similar to tasks (so that we have a model where we can call model.metadata). However, the reason why we have the current system is that we don't want to load the model when only dealing with the metadata. This is solved in tasks using the load_data method. We could redo models in the same way, but currently I don't see a strong reason to do so. Though I would love to be convinced otherwise (however, this is not for this PR).

@Samoed (Collaborator) commented Jan 12, 2025

Ok, I'll create a separate PR for updating how we work with ModelMeta.

@sam-hey (Contributor, author) commented Jan 12, 2025

fix: Retrieve name and revision from SentenceTransformers metadata.
Can't see where this happens? Why do we want this?

Currently, when passing a model the old way, MTEB creates two folders with a missing model name and revision. This happens because a type check in mteb/models/overview.py is on model, not model.model, which is of type SentenceTransformer.
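A hypothetical illustration of the distinction being described (not the actual mteb/models/overview.py code): if the check only looks at the outer object, a wrapper holding a SentenceTransformer on .model slips through and the name/revision are never picked up.

from sentence_transformers import SentenceTransformer

def resolve_sentence_transformer(model):
    """Return the underlying SentenceTransformer, whether `model` is the raw
    model or a wrapper that stores it on `.model` (illustrative only)."""
    if isinstance(model, SentenceTransformer):
        return model
    inner = getattr(model, "model", None)
    if isinstance(inner, SentenceTransformer):
        return inner
    return None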

We generally haven't used enums anywhere else in the repo. Any strong reason to introduce it over the literal?

This might be a subjective preference, but using an Enum reduces the code size and, in my opinion, makes it easier to work with when implementing a new model, as it provides type suggestions from the IDE.

I also believe it makes development in other parts of the project much easier, as the available types are clearly defined and suggested by the IDE.
Personally, I'd also love to see all the different language codes as an Enum.

Hmm, but why do we need a get_similarity_function in the first place?
The interface specifies that the model should specify the similarity function. Is there anything that I’m missing here?

The get_similarity_function maps the Enum in ModelMeta to the actual function that gets called. It’s not practical to define a function directly in ModelMeta, as it would require searching for every function.
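Conceptually, the lookup described above could be as small as the following sketch; the enum members, function names, and signatures are illustrative, not the exact ones in this PR:

from enum import Enum

import numpy as np

class ScoringFunction(Enum):
    COSINE = "cosine"
    DOT_PRODUCT = "dot"

def cos_sim(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    a = a / np.linalg.norm(a, axis=-1, keepdims=True)
    b = b / np.linalg.norm(b, axis=-1, keepdims=True)
    return a @ b.T

def dot_score(a: np.ndarray, b: np.ndarray) -> np.ndarray:
    return a @ b.T

# get_similarity_function is then essentially a dictionary lookup
_SIMILARITY_FUNCTIONS = {
    ScoringFunction.COSINE: cos_sim,
    ScoringFunction.DOT_PRODUCT: dot_score,
}

def get_similarity_function(name: ScoringFunction):
    return _SIMILARITY_FUNCTIONS[name]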

What issue?

The BM25 function call does not match the defined search function.

@KennethEnevoldsen (Contributor)

Ok, I'll create separate PR for updating work with ModelMeta

It might be good to outline it in an issue first before you spend a lot of time on the refactor.

@KennethEnevoldsen (Contributor)

Currently, when passing a model the old way, MTEB creates two folders with a missing model name and revision. This happens because a type check in mteb/models/overview.py is on model, not model.model, which is of type SentenceTransformer.

Not sure what you mean here. What is model in this case? When I run:

import mteb
task = mteb.get_task("BornholmBitextMining") 
model = mteb.get_model("sentence-transformers/all-MiniLM-L6-v2")
bench = mteb.MTEB(tasks=[task])
bench.run(model) # only created one folder

I do not get two folders.

re Enum

I understand where you are coming from. I think this is a more general change across the repo and not one that should happen just here. I would suggest that we stick with Literals, as this will be a breaking change, so there's no reason to add it unless we have a strong reason to. However, do open an issue on Literals vs. Enums (most IDEs, e.g. VS Code, also allow suggestions when using literals).
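For reference, a short sketch of the two options under discussion, with illustrative names; the suggestion above is to keep the Literal form for now and revisit Enums in #1784:

from enum import Enum
from typing import Literal

# Option kept for now: a Literal alias, where the values are plain strings
SimilarityFnName = Literal["cosine", "dot", "max_sim"]

# Option deferred: an Enum, where the values are importable, discoverable members
class ScoringFunction(Enum):
    COSINE = "cosine"
    DOT_PRODUCT = "dot"
    MAX_SIM = "max_sim"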

The get_similarity_function maps the Enum in ModelMeta to the actual function that gets called. It’s not practical to define a function directly in ModelMeta, as it would require searching for every function.

But why not use the model.similarity interface:

def similarity(

This is modelled after sentence-transformers and freely allows the model to implement its own similarity function. Why is there a reason to fetch it?

@sam-hey (Contributor, author) commented Jan 13, 2025

Not sure what you mean here. What is model in this case? When I run:

When I run the following code:

from sentence_transformers import CrossEncoder, SentenceTransformer

from mteb import MTEB

de_name = "average_word_embeddings_komninos"
revision = "21eec43590414cb8e3a6f654857abed0483ae36e"
de = SentenceTransformer(de_name, revision=revision)

ce_revision = "e9ea2688951463fc2791a2ea2ddfce6762900675"
ce = CrossEncoder("cross-encoder/ms-marco-TinyBERT-L-2-v2", revision=ce_revision)

eval = MTEB(tasks=["SciFact"])
# Stage 1: retrieval with the dense encoder
eval.run(
    de,
    output_folder="tests/results/stage1",
    overwrite_results=True,
    save_predictions=True,
    eval_splits=["test"],
)
# Stage 2: rerank the stage-1 predictions with the cross-encoder
eval.run(
    ce,
    eval_splits=["test"],
    output_folder="tests/results/stage2",
    save_predictions=True,
    top_k=10,
    previous_results="tests/results/stage1",
)

Enums

VSCode does not provide autocompletion for Literal types by default when using the Python extension.

[screenshot: VSCode offering no completion suggestions for the Literal-typed attribute]

Accessing Available Types:
To find the available types, you need to manually explore ModelMeta and check the attribute.
This process is inconvenient and hampers productivity. Transitioning to Enums would:

  • Drastically simplify type inspection
  • Improve the ease of adding new models

But why not use the model.similarity interface:

The current implementation uses the similarity() method.
By aligning the approach with SentenceTransformers and incorporating ModelMeta, the implementation becomes more streamlined and consistent.

@KennethEnevoldsen (Contributor) commented Jan 13, 2025

Thanks for the clarification. For future PRs, it would make review easier if what the PR intends to solve is specified in the first comment (it seems like this was discussed elsewhere, in which case do link to it). Besides the Enums (which are being discussed in #1784), I believe this is ready to merge as soon as tests pass.

@sam-hey (Contributor, author) commented Jan 13, 2025

This PR originally had a completely different scope, so you're absolutely correct—it ended up getting quite tangled with multiple different PRs and issues addressing the same topic. My apologies for the confusion.

Only #1777 is failing, but that's expected

@isaac-chung (Collaborator)

Only #1777 is failing, but that's expected

That's fixed in the latest commit in main. Feel free to copy that small commit over.

@KennethEnevoldsen (Contributor)

There seems to be a conflict. @sam-hey, if you resolve this and remove the Enum part, we can merge this in.

This is not at all a hard decision on Enums, but I think with two major merges coming up, this will cause a lot of conflicts (cc. @Samoed, @gowitheflow-1998). We can still do the transition after the merge (outlined in #1784).

@sam-hey (Contributor, author) commented Jan 13, 2025

@KennethEnevoldsen, what do you think about merging it in its current state? I’ve kept the Enum but reverted the changes in all the models.
This should minimize the effort required to merge the new changes while still supporting enums.

Personally, I think this is a good compromise. Backward compatibility is fully maintained.

@isaac-chung (Collaborator)

Model loading test has been fixed here in main as well: #1775

@Samoed (Collaborator) commented Jan 14, 2025

@isaac-chung I've merged your updates into the v2 branch, and on the next CI trigger everything should work.

@sam-hey (Contributor, author) commented Jan 15, 2025

All Enums have been removed. The changes should be ready for merging now.

@Samoed (Collaborator) commented Jan 15, 2025

Sorry, one last change! Maybe apply pytest.importskip here instead of pytest.mark.skip?

@sam-hey (Contributor, author) replied:

I'm totally here to improve my coding, so I really appreciate a thorough review, even if it means I have to redo parts! @Samoed, thanks for the feedback!

And you're absolutely right, it is not so nice: pytest.importskip doesn't exist, as far as I can tell. However, pytest.importorskip is a similar function, but it's not a marker.

We could go with something like this, or even write a custom decorator:

import pytest

try:
    import pylate  # noqa: F401
    pylate_installed = True
except ImportError:
    pylate_installed = False

@pytest.mark.skipif(not pylate_installed, reason="PyLate not installed")

https://docs.pytest.org/en/7.1.x/reference/reference.html#pytest-importorskip
pytest-dev/pytest#9548
https://stackoverflow.com/questions/73750328/make-pytest-ignore-gathering-a-file-if-a-module-is-not-installed

@Samoed (Collaborator) replied:

Yes, I meant importorskip.

@sam-hey (Contributor, author) replied:

Have you reviewed the documentation and the related issue? You can find it here: pytest-dev/pytest#9548.

https://docs.pytest.org/en/7.1.x/reference/reference.html#pytest-importorskip

Using importorskip will work, but only if we split each component into its own file, which isn't very practical.

@Samoed (Collaborator) replied:

You can use it inside a test function.
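A minimal sketch of that suggestion; the test name and assertion are hypothetical:

import pytest

def test_pylate_max_sim():
    # Skips this single test at run time if pylate is not installed,
    # without needing a separate file or a skipif marker.
    pylate = pytest.importorskip("pylate")
    assert pylate is not None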
