Text and Code Embeddings via Contrastive Pre-training
AI Impact Summary
The introduction of a capability to generate text and code embeddings via contrastive pre-training provides a shared embedding space that aligns natural-language and code representations, improving cross-modal similarity tasks. For technical teams, this can boost retrieval accuracy in semantic search, code search, and recommendation workflows, but adopting it may change the embedding dimensions, models, and endpoints used downstream. Businesses that rely on embeddings-based pipelines should plan to re-index their data with the new embeddings and evaluate any latency or throughput implications as they adopt the new capability.
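To make the cross-modal retrieval point concrete, the sketch below ranks code snippets against a natural-language query by cosine similarity in a shared embedding space. The `embed` function here is a toy stand-in (hashed character trigrams) so the example runs without external services; in practice it would be replaced by whatever contrastively pre-trained model or endpoint a team adopts.

```python
import numpy as np

DIM = 256  # toy dimensionality; production embeddings are typically larger

def embed(texts: list[str]) -> np.ndarray:
    """Toy stand-in for a contrastively pre-trained embedding model.

    Hashes character trigrams into fixed-size count vectors so the
    example runs end to end; swap in the real model or endpoint in
    practice.
    """
    out = np.zeros((len(texts), DIM))
    for row, text in enumerate(texts):
        for i in range(len(text) - 2):
            out[row, hash(text[i:i + 3]) % DIM] += 1.0
    return out

def rank_by_similarity(query: str, snippets: list[str]) -> list[tuple[str, float]]:
    """Rank code snippets against a text query by cosine similarity."""
    vectors = embed([query] + snippets)
    # Normalize so that a dot product equals cosine similarity.
    vectors /= np.linalg.norm(vectors, axis=1, keepdims=True)
    scores = vectors[1:] @ vectors[0]
    order = np.argsort(-scores)
    return [(snippets[i], float(scores[i])) for i in order]

if __name__ == "__main__":
    query = "read a file line by line"
    snippets = [
        "with open(path) as f:\n    for line in f:\n        process(line)",
        "sorted(items, key=lambda item: item.score)",
    ]
    for snippet, score in rank_by_similarity(query, snippets):
        print(f"{score:.3f}  {snippet!r}")
```

The key property being exercised is that text and code map into the same vector space, so a single similarity function serves both modalities.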
Business Impact
Improved cross-modal retrieval enables more accurate text-to-code search and recommendations; teams should plan data re-indexing and updates to embedding pipelines to take advantage of the new model.
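A minimal re-indexing sketch follows, assuming a hypothetical `embed_batch` client and a vector store exposing an `upsert` method (both placeholders for whatever stack a team actually runs, not a real library API). The design point it illustrates: vectors from the old and new models live in different spaces, so the whole corpus is re-embedded in batches rather than mixed incrementally.

```python
from typing import Callable, Iterable, Sequence

def reindex(
    documents: Iterable[tuple[str, str]],
    embed_batch: Callable[[Sequence[str]], Sequence[Sequence[float]]],
    index,  # hypothetical vector store with an upsert(pairs) method
    batch_size: int = 64,
) -> None:
    """Re-embed (doc_id, text) pairs with the new model and upsert them.

    `embed_batch` and `index` are hypothetical placeholders. Re-embedding
    the full corpus is required because embeddings from the old and new
    models are not comparable across spaces.
    """
    batch: list[tuple[str, str]] = []
    for doc in documents:
        batch.append(doc)
        if len(batch) == batch_size:
            _flush(batch, embed_batch, index)
            batch.clear()
    if batch:  # flush the final partial batch
        _flush(batch, embed_batch, index)

def _flush(batch, embed_batch, index):
    ids = [doc_id for doc_id, _ in batch]
    vectors = embed_batch([text for _, text in batch])
    index.upsert(list(zip(ids, vectors)))
```

Batching keeps the embedding client efficient, and upserting by document ID makes the re-index idempotent if the job has to be restarted.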
Risk Domains
Source Text
- Date: not specified
- Change type: capability
- Severity: medium