ByT5: Towards a token-free future with pre-trained byte-to-byte models Paper • 2105.13626 • Published May 28, 2021 • 5
pplx-embed Collection Diffusion-Pretrained Dense and Contextual Embeddings • 7 items • Updated Feb 26 • 95
LateOn-Code Collection State-of-the-art late interaction code retrieval models • 6 items • Updated 2 days ago • 17
Article LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling Feb 12 • 52
Article LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family Jan 19 • 88
PyLate Collection State-of-the-art late interaction models trained using PyLate • 5 items • Updated 2 days ago • 4