Improving Text Embeddings with Large Language Models: Implementation Details

9 Oct 2024

Authors:

(1) Liang Wang, Microsoft Corporation, and Correspondence to ([email protected]);

(2) Nan Yang, Microsoft Corporation, and correspondence to ([email protected]);

(3) Xiaolong Huang, Microsoft Corporation;

(4) Linjun Yang, Microsoft Corporation;

(5) Rangan Majumder, Microsoft Corporation;

(6) Furu Wei, Microsoft Corporation and Correspondence to ([email protected]).

Table of Links

3 Method

4 Experiments

5 Analysis

The model and dataset release information is available at https://github.com/microsoft/ unilm/tree/master/e5.

This paper is available on arxiv under CC0 1.0 DEED license.

Improving Text Embeddings with Large Language Models: Conclusion and References

Improving Text Embeddings with Large Language Models: Test Set Contamination Analysis