virattt/rag-reranking-gpt-colbert.ipynb

Last active October 30, 2025 02:24


truebit commented Jan 23, 2024

Thanks for sharing, but the query_embedding variable is never assigned.

jsancs commented Jan 23, 2024 (edited)

@truebit If I have understood it correctly, you need to add:

# Add these lines
query = "Your query in string format..."
query_encoding = tokenizer(query, return_tensors='pt', truncation=True, max_length=512)
query_embedding = model(**query_encoding).last_hidden_state.squeeze(0)

# Get score for each document
for document in splits:
    document_encoding = tokenizer(document, return_tensors='pt', truncation=True, max_length=512)
    document_embedding = model(**document_encoding).last_hidden_state

    # Calculate MaxSim score
    score = maxsim(query_embedding.unsqueeze(0), document_embedding)
    ...
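The maxsim function called above is not shown in this thread. As a rough sketch of what a ColBERT-style MaxSim score usually computes, here is a dependency-free version that uses plain Python lists in place of tensors. The use of cosine similarity and of averaging over query tokens are assumptions for illustration, not necessarily what the notebook does; for a fixed query, averaging and summing give the same ranking.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def maxsim(query_tokens, doc_tokens):
    """For each query token, take its best cosine match among the document
    tokens, then average those best matches (assumed aggregation)."""
    best_per_query_token = [
        max(cosine(q, d) for d in doc_tokens) for q in query_tokens
    ]
    return sum(best_per_query_token) / len(best_per_query_token)


# Toy 2-dimensional "token embeddings" (hypothetical values).
query = [[1.0, 0.0], [0.0, 1.0]]
doc_a = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # matches both query tokens
doc_b = [[1.0, 0.0], [1.0, 0.0]]              # matches only the first one

score_a = maxsim(query, doc_a)
score_b = maxsim(query, doc_b)
```

Here doc_a should score higher than doc_b, because every query token finds a close match in doc_a while the second query token finds none in doc_b.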

truebit commented Jan 23, 2024

@Psancs05 thx

virattt commented Jan 23, 2024

Author

Great catch - updated 🙏

jsancs commented Jan 23, 2024

@virattt Do you know the difference between using:
query_embedding = model(**query_encoding).last_hidden_state.squeeze(0)
query_embedding = model(**query_encoding).last_hidden_state.mean(dim=1)

I have tested both, and squeeze(0) seems to return better-quality similar documents (though that may just be the use case I tried).

TripleExclam commented Jan 30, 2024

query_embedding = model(**query_encoding).last_hidden_state.squeeze(0) is correct, since it returns one vector per token, which is what a token-level score like MaxSim compares, whilst
query_embedding = model(**query_encoding).last_hidden_state.mean(dim=1) returns a single vector averaged over all tokens.
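In terms of shapes, for a single query last_hidden_state is (1, seq_len, hidden_size), so squeeze(0) yields (seq_len, hidden_size) while mean(dim=1) yields (1, hidden_size). A small sketch of why the pooled form loses information: two different sets of token vectors can average to the same vector, so a token-level score can no longer tell them apart. The lists below are hypothetical stand-ins for tensors, and mean_pool is an illustrative helper, not part of the notebook.

```python
def mean_pool(token_vectors):
    """Average a list of token vectors into one vector, mirroring what
    .mean(dim=1) does for a single sequence."""
    n = len(token_vectors)
    dim = len(token_vectors[0])
    return [sum(vec[i] for vec in token_vectors) / n for i in range(dim)]


# Two clearly different "queries" at the token level (hypothetical values).
query_a = [[1.0, 0.0], [0.0, 1.0]]
query_b = [[0.5, 0.5], [0.5, 0.5]]

pooled_a = mean_pool(query_a)
pooled_b = mean_pool(query_b)

# The per-token representations differ, but the pooled ones coincide.
tokens_differ = query_a != query_b
pooled_match = pooled_a == pooled_b
```

With per-token vectors each query token can be matched against the document separately; after pooling, only one averaged vector is left to match.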
