
Yes, I had a look at it. I haven't tried e5-mistral-7b-instruct yet, but I'll definitely give it a go. Is there a leaderboard focused only on retrieval, by any chance? I haven't found one so far.


The BEIR project might be what you're looking for: https://github.com/beir-cellar/beir/wiki/Leaderboard



It's interesting that a model as large as e5-mistral shows only marginal performance gains over bge-base and similar models. It could still be useful for longer inputs, though.


e5-mistral is essentially a distillation from GPT-4 into a smaller model. You can see here that they actually have custom prompts for each dataset being tested: https://github.com/microsoft/unilm/blob/16da2f193b9c1dab0a69...

The question would be: if the task hasn't been seen before, what is a good prompt to prepend for it?
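
For anyone who hasn't looked at how the prompt gets attached, here is a rough sketch. It assumes the "Instruct: ... / Query: ..." template shown on the e5-mistral-7b-instruct model card, where only queries get the instruction and documents are embedded as-is. The helper name `build_query` and the task description are made up for illustration; they are not taken from the linked file.

```python
def build_query(task_description: str, query: str) -> str:
    # Assumed template from the model card: a one-sentence task
    # instruction, a newline, then the query text.
    return f"Instruct: {task_description}\nQuery: {query}"


# Hypothetical instruction for a task that has no benchmark-specific prompt.
task = "Given a bug report, retrieve source files that are likely to need changes"

queries = [build_query(task, "login page crashes on empty password")]

# Documents are passed to the encoder unchanged, with no instruction prefix.
documents = ["def login(user, password): ..."]

print(queries[0])
```

So for an unseen task you are effectively guessing a one-sentence description and hoping it lands close to what the model saw during training, which is the part that is hard to evaluate.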

IMO e5-mistral is overfit to MTEB



