microsoft/ms_marco
Viewer • Updated • 1.11M • 21.5k • 243
How to use hltcoe/plaidx-large-rus-tdist-mt5xxl-engeng with Transformers:
# Load model directly
from transformers import AutoTokenizer, HF_ColBERT
tokenizer = AutoTokenizer.from_pretrained("hltcoe/plaidx-large-rus-tdist-mt5xxl-engeng")
model = HF_ColBERT.from_pretrained("hltcoe/plaidx-large-rus-tdist-mt5xxl-engeng")Translate-Distill is a training technique that produces state-of-the-art CLIR dense retrieval model through translation and distillation.
plaidx-large-rus-tdist-mt5xxl-engeng is trained with KL-Divergence from the mt5xxl MonoT5 reranker inferenced on
English MS MARCO training queries and English passages.
To properly load ColBERT-X models from Huggingface Hub, please use the following version of PLAID-X.
pip install PLAID-X==0.3.1
Following code snippet loads the model through Huggingface API.
from colbert.modeling.checkpoint import Checkpoint
from colbert.infra import ColBERTConfig
Checkpoint('hltcoe/plaidx-large-rus-tdist-mt5xxl-engeng', colbert_config=ColBERTConfig())
For full tutorial, please refer to the PLAID-X Jupyter Notebook, which is part of the SIGIR 2023 CLIR Tutorial.
Please cite the following two papers if you use the model.
@inproceedings{colbert-x,
author = {Suraj Nair and Eugene Yang and Dawn Lawrie and Kevin Duh and Paul McNamee and Kenton Murray and James Mayfield and Douglas W. Oard},
title = {Transfer Learning Approaches for Building Cross-Language Dense Retrieval Models},
booktitle = {Proceedings of the 44th European Conference on Information Retrieval (ECIR)},
year = {2022},
url = {https://arxiv.org/abs/2201.08471}
}
@inproceedings{translate-distill,
author = {Eugene Yang and Dawn Lawrie and James Mayfield and Douglas W. Oard and Scott Miller},
title = {Translate-Distill: Learning Cross-Language Dense Retrieval by Translation and Distillation},
booktitle = {Proceedings of the 46th European Conference on Information Retrieval (ECIR)},
year = {2024},
url = {https://arxiv.org/abs/2401.04810}
}