Blog

Feb 28, 2026

How Microsoft Trained a 270M-Pair AI to Power Smarter Search

Researchers at Microsoft introduce E5, a text embedding model trained on CCPairs, a curated dataset of 270M web text pairs. Using contrastive learning with in-batch negatives, E5 becomes the first unsupervised model to outperform BM25 on the BEIR benchmark. After fine-tuning, it tops the MTEB leaderboard, beating models 40× larger on retrieval, clustering, classification, and semantic similarity tasks.

Source: HackerNoon →
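The core training idea, contrastive learning with in-batch negatives, scores each query against every passage in the batch: the paired passage is the positive, and everyone else's passages serve as negatives. A minimal NumPy sketch of that loss (an InfoNCE-style objective; the function name, batch shapes, and temperature value are illustrative assumptions, not E5's actual code):

```python
import numpy as np

def in_batch_contrastive_loss(queries, passages, tau=0.05):
    """Contrastive loss where row i of `passages` is the positive for
    row i of `queries`, and all other rows act as in-batch negatives.

    queries, passages: (B, D) arrays of embeddings.
    tau: temperature (illustrative value; smaller = sharper distribution).
    """
    # L2-normalize so the dot product is cosine similarity.
    q = queries / np.linalg.norm(queries, axis=1, keepdims=True)
    p = passages / np.linalg.norm(passages, axis=1, keepdims=True)

    # (B, B) similarity matrix: diagonal entries are positive pairs,
    # off-diagonal entries are the in-batch negatives.
    sim = (q @ p.T) / tau

    # Log-softmax over each row, with a max-shift for numerical stability.
    sim = sim - sim.max(axis=1, keepdims=True)
    log_probs = sim - np.log(np.exp(sim).sum(axis=1, keepdims=True))

    # Cross-entropy: maximize the log-probability of the diagonal (positives).
    return -np.mean(np.diag(log_probs))
```

When query and passage embeddings for true pairs align while mismatched pairs do not, the diagonal dominates each row's softmax and the loss approaches zero; this is what pushes paired texts together and unrelated texts apart during training.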

